Select Publications
Conference Papers
, 2021, 'Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay', in Advances in Neural Information Processing Systems, pp. 6380 - 6391
, 2021, 'You Only Look One-level Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13034 - 13043, http://dx.doi.org/10.1109/CVPR46437.2021.01284
, 2020, 'Angle-Based Search Space Shrinking for Neural Architecture Search', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 119 - 134, http://dx.doi.org/10.1007/978-3-030-58529-7_8
, 2020, 'Attentive normalization for conditional image generation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5093 - 5102, http://dx.doi.org/10.1109/CVPR42600.2020.00514
, 2020, 'Detection in crowded scenes: One proposal, multiple predictions', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 12211 - 12220, http://dx.doi.org/10.1109/CVPR42600.2020.01223
, 2020, 'Funnel Activation for Visual Recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 351 - 368, http://dx.doi.org/10.1007/978-3-030-58621-8_21
, 2020, 'LabelEnc: A New Intermediate Supervision Method for Object Detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 529 - 545, http://dx.doi.org/10.1007/978-3-030-58595-2_32
, 2020, 'Learning Delicate Local Representations for Multi-person Pose Estimation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 455 - 472, http://dx.doi.org/10.1007/978-3-030-58580-8_27
, 2020, 'Learning dynamic routing for semantic segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8550 - 8559, http://dx.doi.org/10.1109/CVPR42600.2020.00858
, 2020, 'Learning human-object interaction detection using interaction points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4115 - 4124, http://dx.doi.org/10.1109/CVPR42600.2020.00417
, 2020, 'Rethinking learnable tree filter for generic feature transform', in Advances in Neural Information Processing Systems
, 2020, 'Single Path One-Shot Neural Architecture Search with Uniform Sampling', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 544 - 560, http://dx.doi.org/10.1007/978-3-030-58517-4_32
, 2020, 'Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization', in 8th International Conference on Learning Representations, ICLR 2020
, 2020, 'Weight-Dependent Gates for Differentiable Neural Network Pruning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 23 - 37, http://dx.doi.org/10.1007/978-3-030-68238-5_3
, 2020, 'WeightNet: Revisiting the Design Space of Weight Networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 776 - 792, http://dx.doi.org/10.1007/978-3-030-58555-6_46
, 2019, 'MetaPruning: Meta learning for automatic neural network channel pruning', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3295 - 3304, http://dx.doi.org/10.1109/ICCV.2019.00339
, 2019, 'Objects365: A large-scale, high-quality dataset for object detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8429 - 8438, http://dx.doi.org/10.1109/ICCV.2019.00852
, 2019, 'Bounding box regression with uncertainty for accurate object detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2883 - 2892, http://dx.doi.org/10.1109/CVPR.2019.00300
, 2019, 'Meta-SR: A magnification-arbitrary network for super-resolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1575 - 1584, http://dx.doi.org/10.1109/CVPR.2019.00167
, 2019, 'DetNAS: Backbone search for object detection', in Advances in Neural Information Processing Systems
, 2018, 'MegDet: A Large Mini-Batch Object Detector', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6181 - 6189, http://dx.doi.org/10.1109/CVPR.2018.00647
, 2018, 'ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6848 - 6856, http://dx.doi.org/10.1109/CVPR.2018.00716
, 2018, 'DetNet: Design backbone for object detection', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 339 - 354, http://dx.doi.org/10.1007/978-3-030-01240-3_21
, 2018, 'ExFuse: Enhancing feature fusion for semantic segmentation', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 273 - 288, http://dx.doi.org/10.1007/978-3-030-01249-6_17
, 2018, 'MetaAnchor: Learning to detect objects with customized anchors', in Advances in Neural Information Processing Systems, pp. 320 - 330
, 2018, 'ShuffleNet V2: Practical guidelines for efficient CNN architecture design', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 122 - 138, http://dx.doi.org/10.1007/978-3-030-01264-9_8
, 2017, 'Channel Pruning for Accelerating Very Deep Neural Networks', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1398 - 1406, http://dx.doi.org/10.1109/ICCV.2017.155
, 2017, 'Large kernel matters - Improve semantic segmentation by global convolutional network', in Proceedings, 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, pp. 1743 - 1751, http://dx.doi.org/10.1109/CVPR.2017.189
, 2016, 'Deep residual learning for image recognition', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770 - 778, http://dx.doi.org/10.1109/CVPR.2016.90
, 2016, 'Identity mappings in deep residual networks', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 630 - 645, http://dx.doi.org/10.1007/978-3-319-46493-0_38
, 2015, 'Efficient and accurate approximations of nonlinear convolutional networks', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1984 - 1992, http://dx.doi.org/10.1109/CVPR.2015.7298809
, 2015, 'Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification', in Proceedings of the IEEE International Conference on Computer Vision, pp. 1026 - 1034, http://dx.doi.org/10.1109/ICCV.2015.123
, 2014, 'Spatial pyramid pooling in deep convolutional networks for visual recognition', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 346 - 361, http://dx.doi.org/10.1007/978-3-319-10578-9_23
Preprints
, 2025, Distinctive Feature Codec: Adaptive Segmentation for Efficient Speech Representation, http://arxiv.org/abs/2505.18516v1
, 2025, Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection, http://arxiv.org/abs/2503.06620v1
, 2025, SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information, http://arxiv.org/abs/2502.10950v2
, 2024, Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction, http://arxiv.org/abs/2409.07969v2
, 2024, Rethinking Mamba in Speech Processing by Self-Supervised Models, http://arxiv.org/abs/2409.07273v1
, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6
, 2024, Striking a Balance between Classical and Deep Learning Approaches in Natural Language Processing Pedagogy, http://arxiv.org/abs/2405.09854v2
, 2024, When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection, http://arxiv.org/abs/2402.13276v2
, 2024, Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model, http://arxiv.org/abs/2402.10642v2
, 2022, PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection, http://arxiv.org/abs/2210.03221v5
, 2022, End-to-End Lyrics Recognition with Self-supervised Learning, http://arxiv.org/abs/2209.12702v4