Select Publications
Conference Papers
2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015
,2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html
,2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878
,2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, pp. 330 - 335
,2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1238 - 1242
,2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 746 - 750
,2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741
,2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125
,2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013
,2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013
,2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge, pp. 11 - 20, http://dx.doi.org/10.1145/2512530.2512535
,2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012
,2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012, http://dx.doi.org/10.1109/ICASSP.2012.6289068
,2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in ICICS 2011 - 8th International Conference on Information, Communications and Signal Processing, http://dx.doi.org/10.1109/ICICS.2011.6174268
,2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010
,2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010
,2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009
,2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009
,2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008
,2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008
,2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008
,2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information, Communications and Signal Processing, ICICS, http://dx.doi.org/10.1109/ICICS.2007.4449758
,2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007
,2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas
,2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006
,Preprints
2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v1
,2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339
,2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357
,2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344
,2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1
,2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617
,2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922
,2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983
,2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993
,2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605
,2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360
,