Select Publications

By Associate Professor Vidhyasaharan Sethu

Preprints

Meng H; Sethu V; Ambikairajah E; Zhang Q; Li H, 2025, Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing, http://dx.doi.org/10.48550/arxiv.2510.18206

Dang T; Jeyaseelan TM; Ambikairajah E; Sethu V, 2025, Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal, http://dx.doi.org/10.48550/arxiv.2509.01419

Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends, http://dx.doi.org/10.48550/arxiv.2502.03260

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v2

Hong X; Gong Y; Sethu V; Dang T, 2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1

Meng H; Sethu V; Ambikairajah E, 2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922

Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983

Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360

ORCID

https://orcid.org/0000-0001-8492-1787