Select Publications
Conference Papers
, 2023, 'Efficient Information Recognition for Machine-printed Invoices', in 2023 International Conference on Image Processing Computer Vision and Machine Learning ICICML 2023, pp. 913 - 918, http://dx.doi.org/10.1109/ICICML60161.2023.10424949
, 2023, 'Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3598 - 3608, http://dx.doi.org/10.1109/ICCV51070.2023.00335
, 2023, 'Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration', in Advances in Neural Information Processing Systems
, 2023, 'LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13488 - 13498, http://dx.doi.org/10.1109/CVPR52729.2023.01296
, 2023, 'MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception', in Proceedings of the IEEE International Conference on Computer Vision, pp. 8514 - 8523, http://dx.doi.org/10.1109/ICCV51070.2023.00785
, 2023, 'MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4109 - 4113, http://dx.doi.org/10.21437/Interspeech.2023-1446
, 2023, 'MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 22056 - 22065, http://dx.doi.org/10.1109/CVPR52729.2023.02112
, 2023, 'OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation', in Proceedings of the IEEE International Conference on Computer Vision, pp. 2749 - 2758, http://dx.doi.org/10.1109/ICCV51070.2023.00259
, 2023, 'PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images', in Proceedings of the IEEE International Conference on Computer Vision, pp. 3239 - 3249, http://dx.doi.org/10.1109/ICCV51070.2023.00302
, 2023, 'PQLM - Multilingual Decentralized Portable Quantum Language Model', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095215
, 2023, 'Re-parameterizing Your Optimizers rather than Architectures', in 11th International Conference on Learning Representations ICLR 2023
, 2023, 'Referring Multi-Object Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 14633 - 14642, http://dx.doi.org/10.1109/CVPR52729.2023.01406
, 2023, 'RevColV2: Exploring Disentangled Representations in Masked Image Modeling', in Advances in Neural Information Processing Systems
, 2023, 'Reversible Column Networks', in 11th International Conference on Learning Representations ICLR 2023
, 2023, 'SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers', in Proceedings 2023 IEEE/CVF International Conference on Computer Vision Workshops ICCVW 2023, pp. 731 - 741, http://dx.doi.org/10.1109/ICCVW60793.2023.00081
, 2023, 'Slot-guided Volumetric Object Radiance Fields', in Advances in Neural Information Processing Systems
, 2023, 'Syntax-Aware Retrieval Augmented Code Generation', in Findings of the Association for Computational Linguistics EMNLP 2023, pp. 1291 - 1302, http://dx.doi.org/10.18653/v1/2023.findings-emnlp.90
, 2023, 'Traffic sign detection algorithm based on YOLOv5 combined with BIFPN and attention mechanism', in ITOEC 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference, pp. 966 - 970, http://dx.doi.org/10.1109/ITOEC57671.2023.10291927
, 2023, 'Understanding Imbalanced Semantic Segmentation Through Neural Collapse', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 19550 - 19559, http://dx.doi.org/10.1109/CVPR52729.2023.01873
, 2023, 'Understanding Masked Image Modeling via Learning Occlusion Invariant Feature', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6241 - 6251, http://dx.doi.org/10.1109/CVPR52729.2023.00604
, 2023, 'VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21674 - 21683, http://dx.doi.org/10.1109/CVPR52729.2023.02076
, 2022, 'Design of a Spring Cold Warning System for Kiwifruit Orchards Based on the Internet of Things', in Advances in Transdisciplinary Engineering, pp. 1286 - 1295, http://dx.doi.org/10.3233/ATDE220999
, 2022, 'Anchor DETR: Query Design for Transformer-Based Object Detection', in Proceedings of the 36th AAAI Conference on Artificial Intelligence AAAI 2022, pp. 2567 - 2575, http://dx.doi.org/10.1609/aaai.v36i3.20158
, 2022, 'LGD: Label-Guided Self-Distillation for Object Detection', in Proceedings of the 36th AAAI Conference on Artificial Intelligence AAAI 2022, pp. 3309 - 3317, http://dx.doi.org/10.1609/aaai.v36i3.20240
, 2022, 'Focal Sparse Convolutional Networks for 3D Object Detection', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5418 - 5427, http://dx.doi.org/10.1109/CVPR52688.2022.00535
, 2022, 'MOTR: End-to-End Multiple-Object Tracking with Transformer', in Lecture Notes in Computer Science, pp. 659 - 675, http://dx.doi.org/10.1007/978-3-031-19812-0_38
, 2022, 'PETR: Position Embedding Transformation for Multi-view 3D Object Detection', in Lecture Notes in Computer Science, pp. 531 - 548, http://dx.doi.org/10.1007/978-3-031-19812-0_31
, 2022, 'Progressive End-to-End Object Detection in Crowded Scenes', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 847 - 856, http://dx.doi.org/10.1109/CVPR52688.2022.00093
, 2022, 'Relieving Long-tailed Instance Segmentation via Pairwise Class Balance', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6990 - 6999, http://dx.doi.org/10.1109/CVPR52688.2022.00687
, 2022, 'RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 568 - 577, http://dx.doi.org/10.1109/CVPR52688.2022.00066
, 2022, 'Revisiting the Critical Factors of Augmentation-Invariant Representation Learning', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 42 - 58, http://dx.doi.org/10.1007/978-3-031-19821-2_3
, 2022, 'Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 11953 - 11965, http://dx.doi.org/10.1109/CVPR52688.2022.01166
, 2022, 'Self-Supervised Visual Representation Learning with Semantic Grouping', in Advances in Neural Information Processing Systems
, 2022, 'Simple Baselines for Image Restoration', in Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pp. 17 - 33, http://dx.doi.org/10.1007/978-3-031-20071-7_2
, 2022, 'Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 16886 - 16895, http://dx.doi.org/10.1109/CVPR52688.2022.01640
, 2022, 'When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2781 - 2786, http://dx.doi.org/10.1109/CVPRW56347.2022.00314
, 2021, 'Implicit Feature Refinement for Instance Segmentation', in MM 2021 Proceedings of the 29th ACM International Conference on Multimedia, pp. 3088 - 3096, http://dx.doi.org/10.1145/3474085.3475449
, 2021, 'Fast camera image denoising on mobile GPUs with deep learning, mobile AI 2021 challenge: Report', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 2515 - 2524, http://dx.doi.org/10.1109/CVPRW53098.2021.00285
, 2021, 'MPG-net: Multi-prediction guided network for segmentation of retinal layers in OCT images', in European Signal Processing Conference, pp. 1299 - 1303, http://dx.doi.org/10.23919/Eusipco47968.2020.9287561
, 2021, 'Activate or Not: Learning Customized Activation', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8028 - 8038, http://dx.doi.org/10.1109/CVPR46437.2021.00794
, 2021, 'Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection', in 35th AAAI Conference on Artificial Intelligence AAAI 2021, pp. 2800 - 2808, http://dx.doi.org/10.1609/aaai.v35i4.16385
, 2021, 'Deeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection', in Proceedings 2021 7th International Conference on Big Data and Information Analytics BigDIA 2021, pp. 149 - 156, http://dx.doi.org/10.1109/BigDIA53151.2021.9619703
, 2021, 'Diverse Branch Block: Building a Convolution as an Inception-like Unit', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10881 - 10890, http://dx.doi.org/10.1109/CVPR46437.2021.01074
, 2021, 'Dynamic Region-Aware Convolution', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8060 - 8069, http://dx.doi.org/10.1109/CVPR46437.2021.00797
, 2021, 'Image Synthesis via Semantic Composition', in Proceedings of the IEEE International Conference on Computer Vision, pp. 13729 - 13738, http://dx.doi.org/10.1109/ICCV48922.2021.01349
, 2021, 'Instance-Conditional Knowledge Distillation for Object Detection', in Advances in Neural Information Processing Systems, pp. 16468 - 16480
, 2021, 'Neural Architecture Search with Random Labels', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10902 - 10911, http://dx.doi.org/10.1109/CVPR46437.2021.01076
, 2021, 'Points as Queries: Weakly Semi-supervised Object Detection by Points', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8819 - 8828, http://dx.doi.org/10.1109/CVPR46437.2021.00871
, 2021, 'RepVGG: Making VGG-style ConvNets Great Again', in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 13728 - 13737, http://dx.doi.org/10.1109/CVPR46437.2021.01352
, 2021, 'SOLQ: Segmenting Objects by Learning Queries', in Advances in Neural Information Processing Systems, pp. 21898 - 21909