Publications

(2024). SaccadeMOT: Enhancing Object Detection and Tracking in Gigapixel Images via Scale-Aware Density Estimation. In ECAI 2024.

Cite

(2024). Sparsely-Supervised Object Tracking. TIP 2024.

Cite DOI

(2024). SaccadeDet: A Novel Dual-Stage Architecture for Rapid and Accurate Detection in Gigapixel Images. In ECML-PKDD 2024.

Cite

(2024). Semantic Enrichment for Video Question Answering with Gated Graph Neural Networks. In ICASSP 2024.

PDF Cite

(2024). GigaHumanDet: Exploring Full-body Detection on Gigapixel-level Images. In AAAI 2024.

Cite

(2023). Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment. In ACM MM 2023.

PDF Cite DOI

(2023). Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering. In ICANN 2023.

PDF Cite DOI