Select Publications

Conference Papers

Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015

Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html

Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878

Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, pp. 330 - 335

Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1238 - 1242

Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 746 - 750

Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741

Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125

Sethu V; Epps J; Ambikairajah E, 2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013

Cummins N; Epps J; Sethu V; Breakspear M; Goecke R, 2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013

Cummins N; Joshi J; Dhall A; Sethu V; Goecke R; Epps J, 2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge, pp. 11 - 20, http://dx.doi.org/10.1145/2512530.2512535

Ambikairajah E; Kua JM; Sethu V; Li H, 2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012

Ding N; Sethu V; Epps JR; Ambikairajah E, 2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012, http://dx.doi.org/10.1109/ICASSP.2012.6289068

Le PN; Sethu V; Ambikairajah E; Kua JMK, 2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in ICICS 2011 - 8th International Conference on Information, Communications and Signal Processing, http://dx.doi.org/10.1109/ICICS.2011.6174268

Le NP; Epps JR; Ambikairajah E; Sethu V, 2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010

Ambikairajah E; Ibrahim RK; Sethu V, 2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010

Sethu V; Ambikairajah E; Epps JR, 2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009

Sethu V; Ambikairajah E; Epps JR, 2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009

Sethu V; Ambikairajah E; Epps JR, 2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008

Sethu V; Ambikairajah E; Epps JR, 2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008

Le NP; Ambikairajah E; Sethu V, 2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008

Wang Y; An J; Sethu V; Ambikairajah E, 2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information, Communications and Signal Processing, ICICS, http://dx.doi.org/10.1109/ICICS.2007.4449758

Sethu V; Ambikairajah E; Epps JR, 2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007

Sethu V; Ambikairajah E; Epps JR, 2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas

Ambikairajah E; Sethu V; Ge L, 2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006

Preprints

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v1

Hong X; Gong Y; Sethu V; Dang T, 2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1

Meng H; Sethu V; Ambikairajah E, 2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922

Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983

Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360


Back to profile page