Recent Conference Papers - Visual and Intelligent Learning Lab



Date: 2018-05-15

[1]. Wentao Gu, Yuquan Li, Xinyang Jiang, Zilong Wang, Dongsheng Li, Zehui Li, Zijian Dong, Cairong Zhao*. Joint Adaptation of Uni-modal Foundation Models for Multi-modal Alzheimer's Disease Diagnosis. ICLR 2026, Accepted. [pdf]

[2]. Ding Qi, Jian Li, Shuguang Dou, Zifan Song, Junyao Gao, Yabiao Wang, Chengjie Wang, Cairong Zhao*. Asynchronous Matching with Dynamic Sampling for Multimodal Dataset Distillation. ICLR 2026, Accepted. [pdf]

[3]. Zifan Song, Kaitao Song, Guosheng Hu, Ding Qi, Junyao Gao, Xiaohua Wang, Dongsheng Li, Cairong Zhao*. Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading. ICLR 2026, Accepted. [pdf]

[4]. Yubin Wang, Xinyang Jiang, De Cheng, Xiangqian Zhao, Zilong Wang, Dongsheng Li, Cairong Zhao*. Exploring Interpretability for Visual Prompt Tuning with Cross-layer Concepts. ICLR 2026, Accepted. [pdf][code]

[5]. Xiaowen Zhang, Zijie Yue, Yong Luo, Cairong Zhao, Qijun Chen, Miaojing Shi. Bootstrapping MLLM for Weakly-Supervised Class-Agnostic Object Counting. ICLR 2026, Accepted. [pdf][code]

[6]. Weixun Wan, Xinyang Jiang, Zilong Wang, Bei Li, Cairong Zhao*. Tuning Medical Foundation Models for Inner Ear Temporal CT Analysis with Plug-and-play Domain Knowledge Aggregator. The Fortieth AAAI Conference on Artificial Intelligence (AAAI) 2026, Accepted. [pdf][code]

[7]. Xueyu Chen, Kaitao Song, Zifan Song, Dongsheng Li, Cairong Zhao*. Improving Long-Context Summarization with Multi-Granularity Retrieval Optimization. The Fortieth AAAI Conference on Artificial Intelligence (AAAI) 2026, Accepted. [pdf]

[8]. Yongcheng Li, Xuekuan Wang, Zhifei Zhang, Cairong Zhao*. Dual-Phase Visual-Language Pretraining and Adaptation for Long-Tailed Multi-Label Recognition. The Fortieth AAAI Conference on Artificial Intelligence (AAAI) 2026, Accepted. [pdf][code]

[9]. Ji, Chenhao, Chaohui Yu, Junyao Gao, Fan Wang, and Cairong Zhao*. CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion. In Proceedings of the SIGGRAPH Asia 2025 Conference Papers, pp. 1-12. 2025. [pdf]

[10]. Zhao, J., Jiang, X., Gao, J., Xue, Y., & Zhao, C. (2025). One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models. ICCV 2025. [pdf][code]

[11]. Qi, D., Li, J., Gao, J., Dou, S., Tai, Y., Hu, J., ... & Zhao, C. (2025). Towards Universal Dataset Distillation via Task-Driven Diffusion. In Proceedings of the Computer Vision and Pattern Recognition Conference (pp. 10557-10566). [pdf]

[12]. Gao, J., Sun, Y., Shen, F., Jiang, X., Xing, Z., Chen, K., & Zhao, C. (2025). Faceshot: Bring any character into life. ICLR 2025. [pdf][code]

[13]. Wang, Y., Zou, Z., Ye, X., Tan, X., Ding, E., & Zhao, C. (2025). Uni²Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection. ICLR 2025. [pdf]

[14]. Song, Z., Wang, Y., Zhang, W., Liu, K., Lyu, C., Song, D., ... & Zhao, C. (2024). Alchemistcoder: Harmonizing and eliciting code capability by hindsight tuning on multi-source data. Advances in Neural Information Processing Systems, 37, 2185-2214. [pdf][code]

[15]. Wang, Y., Xu, J., He, Y., Song, Z., Wang, L., Qiao, Y., & Zhao, C. (2024). Does video-text pretraining help open-vocabulary online action detection? Advances in Neural Information Processing Systems, 37, 47908-47930. [pdf][code]

[16]. Qi, D., Li, J., Peng, J., Zhao, B., Dou, S., Li, J., ... & Zhao, C. (2024). Fetch and forge: Efficient dataset condensation for object detection. Advances in Neural Information Processing Systems, 37, 119283-119300. [pdf]

[17]. Ye, W., Ji, C., Chen, Z., Gao, J., Huang, X., Zhang, S. H., ... Zhao, C., & Zhang, G. (2024). Diffpano: Scalable and consistent text to panorama generation with spherical epipolar-aware diffusion. Advances in Neural Information Processing Systems, 37, 1304-1332. [pdf][code]

[18]. Qu, Z., Jiang, X., Yang, Y., Li, D., & Zhao, C. (2024). Online Video Quality Enhancement with Spatial-Temporal Look-up Tables. In European Conference on Computer Vision. [pdf]

[19]. Tu, Y., Zhang, B., Liu, L., Li, Y., Zhang, J., Wang, Y., ... & Zhao, C. (2024, September). Self-supervised feature adaptation for 3d industrial anomaly detection. In European Conference on Computer Vision (pp. 75-91). Cham: Springer Nature Switzerland. [pdf][code]

[20]. Self-supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes. Yuanpeng Tu, Yuxi Li, Boshen Zhang, Liang Liu, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao*. AAAI 2024. [pdf][code]

[21]. Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models. Yubin Wang, Xinyang Jiang, De Cheng, Dongsheng Li, Cairong Zhao*. AAAI 2024. [pdf][code]

[22]. Diverse Person: Customize Your Own Dataset for Text-based Person Search. Zifan Song, Guosheng Hu, Cairong Zhao*. AAAI 2024. [pdf][code]

[23]. Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations. Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao*. International Conference on Machine Learning, 2023. [pdf][code]

[24]. Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao*. Learning with Noisy labels via Self-supervised Adversarial Noisy Masking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023. [pdf][code]

[25]. Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Yabiao Wang, Chengjie Wang, Cairong Zhao*. Learning from Noisy Labels with Decoupled Meta Label Purifier. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023. [pdf][code]

[26]. Junyao Gao, Xinyang Jiang, Huishuai Zhang, Yifan Yang, Shuguang Dou, Dongsheng Li, Duoqian Miao, Cheng Deng, Cairong Zhao*. Similarity Distribution based Membership Inference Attack on Person Re-identification. Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023, oral. [pdf][code]

[27]. Yufeng Jin, Guosheng Hu, Haonan Chen, Duoqian Miao, Liang Hu, Cairong Zhao*. Cross-Modal Distillation for Speaker Recognition. Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023. [pdf][code]

[28]. Shuguang Dou, Xinyang Jiang, Cairong Zhao, Dongsheng Li. EA-HAS-Bench: Energy-aware Hyperparameter and Architecture Search Benchmark. ICLR 2023, spotlight. [pdf][code]

[29]. Chutian Wang, Cairong Zhao*, Guosheng Hu. Multi-Definition Video Deepfake Detection via Semantics Reduction and Cross-Domain Training. IEEE International Conference on Multimedia and Expo, 2022 (Oral). [pdf][code]

[30]. Cairong Zhao*, Shuyang Feng, Brian Nlong Zhao, Zhijun Ding, Jun Wu, Fuming Shen, and Hengtao Shen. Scene Text Image Super-Resolution via Parallelly Contextual Attention Network. In Proceedings of the 29th ACM International Conference on Multimedia, 2021, Oral. [pdf][code]

[31]. Yipeng Chen, Cairong Zhao*, Tianli Sun. Single Image Based Metric Learning via Overlapping Blocks Model for Person Re-Identification. CVPR 2019 Workshop. [pdf]

[32]. Cairong Zhao*, Xuekuan Wang, Yipeng Chen, Can Gao, Wangmeng Zuo, Duoqian Miao. Consistent Iterative Multi-view Transfer Learning for Person Re-identification. ICCV Workshops 2017: 1087-1094. [pdf]
