Abstract: To address the challenges of detecting small infrared moving targets, such as cluttered backgrounds, limited object size, weak feature representation, and low detection ...
Abstract: Referring Video Object Segmentation (R-VOS) demands precise visual comprehension and sophisticated cross-modal reasoning to segment objects in videos based on descriptions from natural ...