Select Publications

Conference Papers

Wan R; Zhu Z; Zhang X; Sun J, 2021, 'Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay', in Advances in Neural Information Processing Systems, pp. 6380 - 6391

Chen Q; Wang Y; Yang T; Zhang X; Cheng J; Sun J, 2021, 'You Only Look One-level Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13034 - 13043, http://dx.doi.org/10.1109/CVPR46437.2021.01284

Hu Y; Liang Y; Guo Z; Wan R; Zhang X; Wei Y; Gu Q; Sun J, 2020, 'Angle-Based Search Space Shrinking for Neural Architecture Search', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 119 - 134, http://dx.doi.org/10.1007/978-3-030-58529-7_8

Wang Y; Chen YC; Zhang X; Sun J; Jia J, 2020, 'Attentive normalization for conditional image generation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5093 - 5102, http://dx.doi.org/10.1109/CVPR42600.2020.00514

Chu X; Zheng A; Zhang X; Sun J, 2020, 'Detection in crowded scenes: One proposal, multiple predictions', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 12211 - 12220, http://dx.doi.org/10.1109/CVPR42600.2020.01223

Ma N; Zhang X; Sun J, 2020, 'Funnel Activation for Visual Recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 351 - 368, http://dx.doi.org/10.1007/978-3-030-58621-8_21

Hao M; Liu Y; Zhang X; Sun J, 2020, 'LabelEnc: A New Intermediate Supervision Method for Object Detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 529 - 545, http://dx.doi.org/10.1007/978-3-030-58595-2_32

Cai Y; Wang Z; Luo Z; Yin B; Du A; Wang H; Zhang X; Zhou X; Zhou E; Sun J, 2020, 'Learning Delicate Local Representations for Multi-person Pose Estimation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 455 - 472, http://dx.doi.org/10.1007/978-3-030-58580-8_27

Li Y; Song L; Chen Y; Li Z; Zhang X; Wang X; Sun J, 2020, 'Learning dynamic routing for semantic segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8550 - 8559, http://dx.doi.org/10.1109/CVPR42600.2020.00858

Wang T; Yang T; Danelljan M; Khan FS; Zhang X; Sun J, 2020, 'Learning human-object interaction detection using interaction points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4115 - 4124, http://dx.doi.org/10.1109/CVPR42600.2020.00417

Song L; Li Y; Jiang Z; Li Z; Zhang X; Sun H; Sun J; Zheng N, 2020, 'Rethinking learnable tree filter for generic feature transform', in Advances in Neural Information Processing Systems

Guo Z; Zhang X; Mu H; Heng W; Liu Z; Wei Y; Sun J, 2020, 'Single Path One-Shot Neural Architecture Search with Uniform Sampling', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 544 - 560, http://dx.doi.org/10.1007/978-3-030-58517-4_32

Yan J; Wan R; Zhang X; Zhang W; Wei Y; Sun J, 2020, 'TOWARDS STABILIZING BATCH STATISTICS IN BACKWARD PROPAGATION OF BATCH NORMALIZATION', in 8th International Conference on Learning Representations Iclr 2020

Li Y; Wu W; Liu Z; Zhang C; Zhang X; Yao H; Yin B, 2020, 'Weight-Dependent Gates for Differentiable Neural Network Pruning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 23 - 37, http://dx.doi.org/10.1007/978-3-030-68238-5_3

Ma N; Zhang X; Huang J; Sun J, 2020, 'WeightNet: Revisiting the Design Space of Weight Networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 776 - 792, http://dx.doi.org/10.1007/978-3-030-58555-6_46

Liu Z; Mu H; Zhang X; Guo Z; Yang X; Cheng KT; Sun J, 2019, 'MetaPruning: Meta learning for automatic neural network channel pruning', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3295 - 3304, http://dx.doi.org/10.1109/ICCV.2019.00339

Shao S; Li Z; Zhang T; Peng C; Yu G; Zhang X; Li J; Sun J, 2019, 'Objects365: A large-scale, high-quality dataset for object detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8429 - 8438, http://dx.doi.org/10.1109/ICCV.2019.00852

He Y; Zhu C; Wang J; Savvides M; Zhang X, 2019, 'Bounding box regression with uncertainty for accurate object detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2883 - 2892, http://dx.doi.org/10.1109/CVPR.2019.00300

Hu X; Mu H; Zhang X; Wang Z; Tan T; Sun J, 2019, 'Meta-SR: A magnification-arbitrary network for super-resolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1575 - 1584, http://dx.doi.org/10.1109/CVPR.2019.00167

Chen Y; Yang T; Zhang X; Meng G; Xiao X; Sun J, 2019, 'DetNAS: Backbone search for object detection', in Advances in Neural Information Processing Systems

Peng C; Xiao T; Li Z; Jiang Y; Zhang X; Jia K; Yu G; Sun J, 2018, 'MegDet: A Large Mini-Batch Object Detector', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6181 - 6189, http://dx.doi.org/10.1109/CVPR.2018.00647

Zhang X; Zhou X; Lin M; Sun J, 2018, 'ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6848 - 6856, http://dx.doi.org/10.1109/CVPR.2018.00716

Li Z; Peng C; Yu G; Zhang X; Deng Y; Sun J, 2018, 'DetNet: Design backbone for object detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 339 - 354, http://dx.doi.org/10.1007/978-3-030-01240-3_21

Zhang Z; Zhang X; Peng C; Xue X; Sun J, 2018, 'ExFuse: Enhancing feature fusion for semantic segmentation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 273 - 288, http://dx.doi.org/10.1007/978-3-030-01249-6_17

Yang T; Zhang X; Li Z; Zhang W; Sun J, 2018, 'Metaanchor: Learning to detect objects with customized anchors', in Advances in Neural Information Processing Systems, pp. 320 - 330

Ma N; Zhang X; Zheng HT; Sun J, 2018, 'Shufflenet V2: Practical guidelines for efficient cnn architecture design', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 122 - 138, http://dx.doi.org/10.1007/978-3-030-01264-9_8

He Y; Zhang X; Sun J, 2017, 'Channel Pruning for Accelerating Very Deep Neural Networks', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1398 - 1406, http://dx.doi.org/10.1109/ICCV.2017.155

Peng C; Zhang X; Yu G; Luo G; Sun J, 2017, 'Large kernel matters - Improve semantic segmentation by global convolutional network', in Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition Cvpr 2017, pp. 1743 - 1751, http://dx.doi.org/10.1109/CVPR.2017.189

He K; Zhang X; Ren S; Sun J, 2016, 'Deep residual learning for image recognition', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770 - 778, http://dx.doi.org/10.1109/CVPR.2016.90

He K; Zhang X; Ren S; Sun J, 2016, 'Identity mappings in deep residual networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 630 - 645, http://dx.doi.org/10.1007/978-3-319-46493-0_38

Zhang X; Zou J; Ming X; He K; Sun J, 2015, 'Efficient and accurate approximations of nonlinear convolutional networks', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1984 - 1992, http://dx.doi.org/10.1109/CVPR.2015.7298809

He K; Zhang X; Ren S; Sun J, 2015, 'Delving deep into rectifiers: Surpassing human-level performance on imagenet classification', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1026 - 1034, http://dx.doi.org/10.1109/ICCV.2015.123

He K; Zhang X; Ren S; Sun J, 2014, 'Spatial pyramid pooling in deep convolutional networks for visual recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 346 - 361, http://dx.doi.org/10.1007/978-3-319-10578-9_23

Preprints

Zhang X; Fang F; Gao P; Qin B; Ahmed B; Epps J, 2025, Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation, http://arxiv.org/abs/2505.18516v1

Zhang X; Ahmed B; Epps J, 2025, Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection, http://arxiv.org/abs/2503.06620v1

Zhang X; Liu H; Zhang Q; Ahmed B; Epps J, 2025, SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information, http://arxiv.org/abs/2502.10950v2

Zhang X; Liu D; Xiao T; Xiao C; Szalay T; Shahin M; Ahmed B; Epps J, 2024, Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction, http://arxiv.org/abs/2409.07969v2

Zhang X; Ma J; Shahin M; Ahmed B; Epps J, 2024, Rethinking Mamba in Speech Processing by Self-Supervised Models, http://arxiv.org/abs/2409.07273v1

Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6

Joshi A; Renzella J; Bhattacharyya P; Jha S; Zhang X, 2024, Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy, http://arxiv.org/abs/2405.09854v2

Zhang X; Liu H; Xu K; Zhang Q; Liu D; Ahmed B; Epps J, 2024, When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection, http://arxiv.org/abs/2402.13276v2

Zhang X; Liu D; Liu H; Zhang Q; Meng H; Garcia LP; Chng ES; Yao L, 2024, Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model, http://arxiv.org/abs/2402.10642v2

Li SS; Zhang X; Zhou S; Shu H; Liang R; Liu H; Garcia LP, 2022, PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection, http://arxiv.org/abs/2210.03221v5

Zhang X; Li SS; He Z; Togneri R; Garcia LP, 2022, End-to-End Lyrics Recognition with Self-supervised Learning, http://arxiv.org/abs/2209.12702v4


Back to profile page