Select Publications
Conference Papers
, 2000, 'Objective and subjective performance measures for voice activity detectors', in 8th Australian International Conference on Speech Science and Technology (SST 00), Canberra, presented at 8th Australian International Conference on Speech Science and Technology (SST 00), Canberra, 05 December 2000 - 07 December 2000
Preprints
, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357
, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6
, 2023, Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method, http://arxiv.org/abs/2311.07037v1
, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922
, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983
, 2022, Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations, http://arxiv.org/abs/2211.07769v1
, 2022, Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning, http://arxiv.org/abs/2210.10231v2