Select Publications
Conference Papers
, 2025, 'Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), IEEE, pp. 658 - 663, presented at 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 22 October 2025 - 24 October 2025, http://dx.doi.org/10.1109/apsipaasc65261.2025.11249066
, 2025, 'Tiered Assessment for DSP Education: Exploring Students' Motivation and Performance', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), IEEE, pp. 1847 - 1852, presented at 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 22 October 2025 - 24 October 2025, http://dx.doi.org/10.1109/apsipaasc65261.2025.11249035
, 2025, 'A Study of Speech Embedding Similarities Between Australian Aboriginal and High-Resource Languages', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1498 - 1502, http://dx.doi.org/10.21437/Interspeech.2025-911
, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10888198
, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887842
, 2025, 'Evidential Neural GPLDA: A Novel Approach to Quantify Prediction Uncertainty in Speaker Verification Systems', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887887
, 2025, 'Improved Out-of-domain Detection in VAE Latent Spaces with Boundary-driven Regularisation', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10890806
, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025
, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025
, 2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872
, 2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578840
, 2024, 'Aligning Tiered Assessments With Course Learning Outcomes', in 2024 IEEE International Conference on Teaching Assessment and Learning for Engineering Tale 2024 Proceedings, http://dx.doi.org/10.1109/TALE62452.2024.10834314
, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4323 - 4327, http://dx.doi.org/10.21437/Interspeech.2024-683
, 2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3714 - 3718, http://dx.doi.org/10.21437/Interspeech.2024-482
, 2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578884
, 2024, 'Emotion Recognition Systems Must Embrace Ambiguity', in Proceedings 2024 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos Aciiw 2024, pp. 166 - 170, http://dx.doi.org/10.1109/ACIIW63320.2024.00033
, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6495 - 6499, http://dx.doi.org/10.1109/ICASSP48485.2024.10447530
, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024
, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024
, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499, https://doi.org/10.1109/ICASSP48485.2024
, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction Acii 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210
, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778
, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213
, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533
, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617
, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902, https://doi.org/10.21437/Interspeech.2023
, 2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350
, 2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000
, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000
, 2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 3680 - 3684, https://doi.org/10.21437/Interspeech.2021
, 2021, 'Parametric Distributions to Model Numerical Emotion Labels.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 4498 - 4502, https://doi.org/10.21437/Interspeech.2021
, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322
, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297
, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149
, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925490
, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925450
, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693
, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411
, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386
, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369
, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in Avec 2018 Proceedings of the 2018 Audio Visual Emotion Challenge and Workshop Co Located with mm 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314
, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321
, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419
, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094
, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978
, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586
, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510
, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819
, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933
, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846