Select Publications

Conference Papers

Cai Q; Zhang X; Ding H; Tao R, 2023, 'Efficient Information Recognition for Machine-printed Invoices', in 2023 International Conference on Image Processing Computer Vision and Machine Learning ICICML 2023, pp. 913 - 918, http://dx.doi.org/10.1109/ICICML60161.2023.10424949

Wang S; Liu Y; Wang T; Li Y; Zhang X, 2023, 'Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3598 - 3608, http://dx.doi.org/10.1109/ICCV51070.2023.00335

Yu L; Xie T; Zhu Y; Yang T; Zhang X; Zhang C, 2023, 'Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration', in Advances in Neural Information Processing Systems

Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13488 - 13498, http://dx.doi.org/10.1109/CVPR52729.2023.01296

Zhou H; Ge Z; Li Z; Zhang X, 2023, 'MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8514 - 8523, http://dx.doi.org/10.1109/ICCV51070.2023.00785

Chua VYH; Liu H; Perera LPG; Woon FT; Wong J; Zhang X; Khudanpur S; Khong AWH; Dauwels J; Styles SJ, 2023, 'MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4109 - 4113, http://dx.doi.org/10.21437/Interspeech.2023-1446

Zhang Y; Wang T; Zhang X, 2023, 'MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 22056 - 22065, http://dx.doi.org/10.1109/CVPR52729.2023.02112

Wu D; Wang T; Zhang Y; Zhang X; Shen J, 2023, 'OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation', in Proceedings of the IEEE International Conference on Computer Vision, pp. 2749 - 2758, http://dx.doi.org/10.1109/ICCV51070.2023.00259

Liu Y; Yan J; Jia F; Li S; Gao A; Wang T; Zhang X, 2023, 'PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3239 - 3249, http://dx.doi.org/10.1109/ICCV51070.2023.00302

Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2023, 'PQLM - Multilingual Decentralized Portable Quantum Language Model', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095215

Ding X; Chen H; Zhang X; Huang K; Han J; Ding G, 2023, 'Re-parameterizing Your Optimizers rather than Architectures', in 11th International Conference on Learning Representations ICLR 2023

Wu D; Han W; Wang T; Dong X; Zhang X; Shen J, 2023, 'Referring Multi-Object Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 14633 - 14642, http://dx.doi.org/10.1109/CVPR52729.2023.01406

Han Q; Cai Y; Zhang X, 2023, 'RevColV2: Exploring Disentangled Representations in Masked Image Modeling', in Advances in Neural Information Processing Systems

Cai Y; Zhou Y; Han Q; Sun J; Kong X; Li J; Zhang X, 2023, 'Reversible Column Networks', in 11th International Conference on Learning Representations ICLR 2023

Wang X; Chu X; Han C; Zhang X, 2023, 'SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers', in Proceedings 2023 IEEE CVF International Conference on Computer Vision Workshops ICCVW 2023, pp. 731 - 741, http://dx.doi.org/10.1109/ICCVW60793.2023.00081

Qi D; Yang T; Zhang X, 2023, 'Slot-guided Volumetric Object Radiance Fields', in Advances in Neural Information Processing Systems

Zhang X; Zhou Y; Yang G; Chen T, 2023, 'Syntax-Aware Retrieval Augmented Code Generation', in Findings of the Association for Computational Linguistics EMNLP 2023, pp. 1291 - 1302, http://dx.doi.org/10.18653/v1/2023.findings-emnlp.90

Zhang X; Mo S; Wan Z, 2023, 'Traffic sign detection algorithm based on YOLOv5 combined with BIFPN and attention mechanism', in ITOEC 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference, pp. 966 - 970, http://dx.doi.org/10.1109/ITOEC57671.2023.10291927

Zhong Z; Cui J; Yang Y; Wu X; Qi X; Zhang X; Jia J, 2023, 'Understanding Imbalanced Semantic Segmentation Through Neural Collapse', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 19550 - 19559, http://dx.doi.org/10.1109/CVPR52729.2023.01873

Kong X; Zhang X, 2023, 'Understanding Masked Image Modeling via Learning Occlusion Invariant Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6241 - 6251, http://dx.doi.org/10.1109/CVPR52729.2023.00604

Chen Y; Liu J; Zhang X; Qi X; Jia J, 2023, 'VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21674 - 21683, http://dx.doi.org/10.1109/CVPR52729.2023.02076

Zhang X; Sun Z; Sun X; Ji W; Zhang X, 2022, 'Design of a Spring Cold Warning System for Kiwifruit Orchards Based on the Internet of Things', in Advances in Transdisciplinary Engineering, pp. 1286 - 1295, http://dx.doi.org/10.3233/ATDE220999

Wang Y; Zhang X; Yang T; Sun J, 2022, 'Anchor DETR: Query Design for Transformer-Based Object Detection', in Proceedings of the 36th AAAI Conference on Artificial Intelligence AAAI 2022, pp. 2567 - 2575, http://dx.doi.org/10.1609/aaai.v36i3.20158

Zhang P; Kang Z; Yang T; Zhang X; Zheng N; Sun J, 2022, 'LGD: Label-Guided Self-Distillation for Object Detection', in Proceedings of the 36th AAAI Conference on Artificial Intelligence AAAI 2022, pp. 3309 - 3317, http://dx.doi.org/10.1609/aaai.v36i3.20240

Chen Y; Li Y; Zhang X; Sun J; Jia J, 2022, 'Focal Sparse Convolutional Networks for 3D Object Detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5418 - 5427, http://dx.doi.org/10.1109/CVPR52688.2022.00535

Zeng F; Dong B; Zhang Y; Wang T; Zhang X; Wei Y, 2022, 'MOTR: End-to-End Multiple-Object Tracking with Transformer', in Lecture Notes in Computer Science, pp. 659 - 675, http://dx.doi.org/10.1007/978-3-031-19812-0_38

Liu Y; Wang T; Zhang X; Sun J, 2022, 'PETR: Position Embedding Transformation for Multi-view 3D Object Detection', in Lecture Notes in Computer Science, pp. 531 - 548, http://dx.doi.org/10.1007/978-3-031-19812-0_31

Zheng A; Zhang Y; Zhang X; Qi X; Sun J, 2022, 'Progressive End-to-End Object Detection in Crowded Scenes', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 847 - 856, http://dx.doi.org/10.1109/CVPR52688.2022.00093

He YY; Zhang P; Wei XS; Zhang X; Sun J, 2022, 'Relieving Long-tailed Instance Segmentation via Pairwise Class Balance', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6990 - 6999, http://dx.doi.org/10.1109/CVPR52688.2022.00687

Ding X; Chen H; Zhang X; Han J; Ding G, 2022, 'RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 568 - 577, http://dx.doi.org/10.1109/CVPR52688.2022.00066

Huang J; Kong X; Zhang X, 2022, 'Revisiting the Critical Factors of Augmentation-Invariant Representation Learning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 42 - 58, http://dx.doi.org/10.1007/978-3-031-19821-2_3

Ding X; Zhang X; Han J; Ding G, 2022, 'Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 11953 - 11965, http://dx.doi.org/10.1109/CVPR52688.2022.01166

Wen X; Zhao B; Zheng A; Zhang X; Qi X, 2022, 'Self-Supervised Visual Representation Learning with Semantic Grouping', in Advances in Neural Information Processing Systems

Chen L; Chu X; Zhang X; Sun J, 2022, 'Simple Baselines for Image Restoration', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 17 - 33, http://dx.doi.org/10.1007/978-3-031-20071-7_2

Liang Z; Wang T; Zhang X; Sun J; Shen J, 2022, 'Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16886 - 16895, http://dx.doi.org/10.1109/CVPR52688.2022.01640

Qian G; Zhang X; Li G; Zhao C; Chen Y; Zhang X; Ghanem B; Sun J, 2022, 'When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2781 - 2786, http://dx.doi.org/10.1109/CVPRW56347.2022.00314

Ma L; Wang T; Dong B; Yan J; Li X; Zhang X, 2021, 'Implicit Feature Refinement for Instance Segmentation', in MM 2021 Proceedings of the 29th ACM International Conference on Multimedia, pp. 3088 - 3096, http://dx.doi.org/10.1145/3474085.3475449

Ignatov A; Byeoung-Su K; Timofte R; Pouget A; Song F; Li C; Xiao S; Fu Z; Maggioni M; Huang Y; Cheng S; Lu X; Zhou Y; Chen L; Liu D; Zhang X; Fan H; Sun J; Liu S; Kwon M; Lee M; Yoo J; Kang C; Wang S; Huang B; Zhou T; Liu S; Lei L; Feng C; Huang L; Lei Z; Chen F, 2021, 'Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2515 - 2524, http://dx.doi.org/10.1109/CVPRW53098.2021.00285

Fu Z; Sun Y; Zhang X; Stainton S; Barney S; Hogg J; Innes W; Dlay S, 2021, 'MPG-net: Multi-prediction guided network for segmentation of retinal layers in OCT images', in European Signal Processing Conference, pp. 1299 - 1303, http://dx.doi.org/10.23919/Eusipco47968.2020.9287561

Ma N; Zhang X; Liu M; Sun J, 2021, 'Activate or Not: Learning Customized Activation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8028 - 8038, http://dx.doi.org/10.1109/CVPR46437.2021.00794

Wang T; Yang T; Cao J; Zhang X, 2021, 'Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection', in 35th AAAI Conference on Artificial Intelligence AAAI 2021, pp. 2800 - 2808, http://dx.doi.org/10.1609/aaai.v35i4.16385

Zhang X; Yang J; Li X; Liu M; Kang R; Wang R, 2021, 'Deeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection', in Proceedings 2021 7th International Conference on Big Data and Information Analytics BigDIA 2021, pp. 149 - 156, http://dx.doi.org/10.1109/BigDIA53151.2021.9619703

Ding X; Zhang X; Han J; Ding G, 2021, 'Diverse Branch Block: Building a Convolution as an Inception-like Unit', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10881 - 10890, http://dx.doi.org/10.1109/CVPR46437.2021.01074

Chen J; Wang X; Guo Z; Zhang X; Sun J, 2021, 'Dynamic Region-Aware Convolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8060 - 8069, http://dx.doi.org/10.1109/CVPR46437.2021.00797

Wang Y; Qi L; Chen YC; Zhang X; Jia J, 2021, 'Image Synthesis via Semantic Composition', in Proceedings of the IEEE International Conference on Computer Vision, pp. 13729 - 13738, http://dx.doi.org/10.1109/ICCV48922.2021.01349

Kang Z; Zhang P; Zhang X; Sun J; Zheng N, 2021, 'Instance-Conditional Knowledge Distillation for Object Detection', in Advances in Neural Information Processing Systems, pp. 16468 - 16480

Zhang X; Hou P; Zhang X; Sun J, 2021, 'Neural Architecture Search with Random Labels', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10902 - 10911, http://dx.doi.org/10.1109/CVPR46437.2021.01076

Chen L; Yang T; Zhang X; Zhang W; Sun J, 2021, 'Points as Queries: Weakly Semi-supervised Object Detection by Points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8819 - 8828, http://dx.doi.org/10.1109/CVPR46437.2021.00871

Ding X; Zhang X; Ma N; Han J; Ding G; Sun J, 2021, 'RepVGG: Making VGG-style ConvNets Great Again', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13728 - 13737, http://dx.doi.org/10.1109/CVPR46437.2021.01352

Dong B; Zeng F; Wang T; Zhang X; Wei Y, 2021, 'SOLQ: Segmenting Objects by Learning Queries', in Advances in Neural Information Processing Systems, pp. 21898 - 21909

