Select Publications
Preprints
2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339
,2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357
,2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344
,2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1
,2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617
,2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922
,2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983
,2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993
,2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605
,2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360
,