[1]. Zhao, J., Jiang, X., Gao, J., Xue, Y., & Zhao, C. (2025). One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models. ICCV 2025. [pdf][code]
[2]. Qi, D., Li, J., Gao, J., Dou, S., Tai, Y., Hu, J., ... & Zhao, C. (2025). Towards Universal Dataset Distillation via Task-Driven Diffusion. In Proceedings of the Computer Vision and Pattern Recognition Conference (pp. 10557-10566). [pdf]
[3]. Gao, J., Sun, Y., Shen, F., Jiang, X., Xing, Z., Chen, K., & Zhao, C. (2025). Faceshot: Bring any character into life. ICLR 2025. [pdf][code]
[4]. Wang, Y., Zou, Z., Ye, X., Tan, X., Ding, E., & Zhao, C. (2025). Uni²Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection. ICLR 2025. [pdf]
[5]. Song, Z., Wang, Y., Zhang, W., Liu, K., Lyu, C., Song, D., ... & Zhao, C. (2024). Alchemistcoder: Harmonizing and eliciting code capability by hindsight tuning on multi-source data. Advances in Neural Information Processing Systems, 37, 2185-2214. [pdf][code]
[6]. Wang, Y., Xu, J., He, Y., Song, Z., Wang, L., Qiao, Y., & Zhao, C. (2024). Does video-text pretraining help open-vocabulary online action detection? Advances in Neural Information Processing Systems, 37, 47908-47930. [pdf][code]
[7]. Qi, D., Li, J., Peng, J., Zhao, B., Dou, S., Li, J., ... & Zhao, C. (2024). Fetch and forge: Efficient dataset condensation for object detection. Advances in Neural Information Processing Systems, 37, 119283-119300. [pdf]
[8]. Ye, W., Ji, C., Chen, Z., Gao, J., Huang, X., Zhang, S. H., ... Zhao, C., & Zhang, G. (2024). Diffpano: Scalable and consistent text to panorama generation with spherical epipolar-aware diffusion. Advances in Neural Information Processing Systems, 37, 1304-1332. [pdf][code]
[9]. Qu, Z., Jiang, X., Yang, Y., Li, D., & Zhao, C. (2024). Online Video Quality Enhancement with Spatial-Temporal Look-up Tables. In European Conference on Computer Vision. [pdf]
[10]. Tu, Y., Zhang, B., Liu, L., Li, Y., Zhang, J., Wang, Y., ... & Zhao, C. (2024, September). Self-supervised feature adaptation for 3d industrial anomaly detection. In European Conference on Computer Vision (pp. 75-91). Cham: Springer Nature Switzerland. [pdf][code]
[11]. Self-supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes. Yuanpeng Tu, Yuxi Li, Boshen Zhang, Liang Liu, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao*, AAAI 2024. [pdf][code]
[12]. Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models. Yubin Wang, Xinyang Jiang, De Cheng, Dongsheng Li, Cairong Zhao*, AAAI 2024. [pdf][code]
[13]. Diverse Person: Customize Your Own Dataset for Text-based Person Search. Zifan Song, Guosheng Hu, Cairong Zhao*, AAAI 2024. [pdf][code]
[14]. Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations. Zifan Song, Xiao Gong, Guosheng Hu, Cairong Zhao*. International Conference on Machine Learning, 2023. [pdf][code]
[15]. Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao*. Learning with Noisy labels via Self-supervised Adversarial Noisy Masking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023. [pdf][code]
[16]. Yuanpeng Tu, Boshen Zhang, Yuxi Li, Liang Liu, Jian Li, Yabiao Wang, Chengjie Wang, Cairong Zhao*. Learning from Noisy Labels with Decoupled Meta Label Purifier. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023. [pdf][code]
[17]. Shuguang Dou, Xinyang Jiang, Cairong Zhao, Dongsheng Li. EA-HAS-Bench: Energy-aware Hyperparameter and Architecture Search Benchmark. ICLR 2023, spotlight. [pdf][code]
[18]. Junyao Gao, Xinyang Jiang, Huishuai Zhang, Yifan Yang, Shuguang Dou, Dongsheng Li, Duoqian Miao, Cheng Deng, Cairong Zhao*. Similarity Distribution based Membership Inference Attack on Person Re-identification. Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023, oral. [pdf][code]
[19]. Yufeng Jin, Guosheng Hu, Haonan Chen, Duoqian Miao, Liang Hu, Cairong Zhao*. Cross-Modal Distillation for Speaker Recognition. Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023. [pdf][code]
[20]. Chutian Wang, Cairong Zhao*, Guosheng Hu. Multi-Definition Video Deepfake Detection via Semantics Reduction and Cross-Domain Training. IEEE International Conference on Multimedia and Expo, 2022 (Oral). [pdf][code]
[21]. Cairong Zhao*, Shuyang Feng, Brian Nlong Zhao, Zhijun Ding, Jun Wu, Fuming Shen, and Hengtao Shen. Scene Text Image Super-Resolution via Parallelly Contextual Attention Network. In Proceedings of the 29th ACM International Conference on Multimedia, 2021, Oral. [pdf][code]
[22]. Yipeng Chen, Cairong Zhao*, Tianli Sun. Single Image Based Metric Learning via Overlapping Blocks Model for Person Re-Identification. CVPR 2019 Workshop. [pdf]
[23]. Cairong Zhao*, Xuekuan Wang, Yipeng Chen, Can Gao, Wangmeng Zuo, Duoqian Miao. Consistent Iterative Multi-view Transfer Learning for Person Re-identification. ICCV Workshops 2017: 1087-1094. [pdf]