Select Publications
Journal articles
2025, 'Underwater variable zoom: Depth-guided perception network for underwater image enhancement', Expert Systems with Applications, 259, http://dx.doi.org/10.1016/j.eswa.2024.125350
,2024, 'FeatSync: 3D point cloud multiview registration with attention feature-based refinement', Neurocomputing, 600, http://dx.doi.org/10.1016/j.neucom.2024.128088
,2024, 'A Visual Survey of Tunnel Boring Machine (TBM) Performance in Tunneling Excavation: Mainstream Direction, Brief Review and Future Prospects', Applied Sciences (Switzerland), 14, http://dx.doi.org/10.3390/app14114512
,2023, 'Semantic Navigation of PowerPoint-Based Lecture Video for AutoNote Generation', IEEE Transactions on Learning Technologies, 16, pp. 1 - 17, http://dx.doi.org/10.1109/TLT.2022.3216535
,2023, 'Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation', IEEE Transactions on Multimedia, 25, pp. 4052 - 4066, http://dx.doi.org/10.1109/TMM.2022.3171085
,2023, 'InterREC: An Interpretable Method for Referring Expression Comprehension', IEEE Transactions on Multimedia, 25, pp. 9330 - 9342, http://dx.doi.org/10.1109/TMM.2023.3251111
,2023, 'MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection', IEEE Transactions on Multimedia, 25, pp. 4199 - 4212, http://dx.doi.org/10.1109/TMM.2022.3172547
,2021, 'Rethinking feature aggregation for deep RGB-D salient object detection', Neurocomputing, 423, pp. 463 - 473, http://dx.doi.org/10.1016/j.neucom.2020.10.079
,Conference Papers
2024, 'Seeing Text in the Dark: Algorithm and Benchmark', in Proceedings of the 32nd ACM International Conference on Multimedia, ACM, pp. 2870 - 2878, presented at MM '24: The 32nd ACM International Conference on Multimedia, http://dx.doi.org/10.1145/3664647.3680728
,2023, 'Pseudo-label based and transformation-consistent self-ensembling model for underwater life images', in Batista P; Bilas Pachori R (ed.), International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), SPIE, pp. 147 - 147, presented at International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 24 February 2023 - 26 February 2023, http://dx.doi.org/10.1117/12.2681247
,2022, 'StrokePEO: Construction of a Clinical Ontology for Physical Examination of Stroke', in Proceedings - 2022 9th International Conference on Digital Home, ICDH 2022, pp. 218 - 223, http://dx.doi.org/10.1109/ICDH57206.2022.00041
,2019, 'Lecture2Note: Automatic Generation of Lecture Notes from Slide-Based Educational Videos', in 2019 IEEE International Conference on Multimedia and Expo (ICME), IEEE, presented at 2019 IEEE International Conference on Multimedia and Expo (ICME), 08 July 2019 - 12 July 2019, http://dx.doi.org/10.1109/icme.2019.00159
,Preprints
2024, Multimodal Hyperbolic Graph Learning for Alzheimer’s Disease Detection, http://dx.doi.org/10.1101/2024.10.29.24316334
,2024, MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection, http://arxiv.org/abs/2404.17151v1
,2024, Seeing Text in the Dark: Algorithm and Benchmark, http://arxiv.org/abs/2404.08965v3
,2021, What's Wrong with the Bottom-up Methods in Arbitrary-shape Scene Text Detection, http://arxiv.org/abs/2108.01809v2
,