Select Publications
Conference Papers
2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Interspeech 2024, ISCA, pp. 3714 - 3718, presented at Interspeech 2024, http://dx.doi.org/10.21437/interspeech.2024-482
,2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872
,2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON, http://dx.doi.org/10.1109/EDUCON60312.2024.10578840
,2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON, http://dx.doi.org/10.1109/EDUCON60312.2024.10578884
,2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499, https://doi.org/10.1109/ICASSP48485.2024
,2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210
,2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778
,2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213
,2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533
,2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617
,2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902, https://doi.org/10.21437/Interspeech.2023
,2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350
,2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000
,2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000
,2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 3680 - 3684, https://doi.org/10.21437/Interspeech.2021
,2021, 'Parametric Distributions to Model Numerical Emotion Labels.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 4498 - 4502, https://doi.org/10.21437/Interspeech.2021
,2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322
,2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297
,2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149
,2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, http://dx.doi.org/10.1109/ACII.2019.8925490
,2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, http://dx.doi.org/10.1109/ACII.2019.8925450
,2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693
,2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411
,2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386
,2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369
,2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314
,2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321
,2018, 'End-to-End Hierarchical Language Identification System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419
,2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094
,2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978
,2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586
,2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510
,2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819
,2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933
,2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846
,2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805
,2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337
,2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in Ringeval F; Schuller BW; Valstar MF; Gratch J; Cowie R; Pantic M (eds.), AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Association for Computing Machinery (ACM), Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952
,2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-266
,2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-836
,2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-596
,2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-512
,2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-286
,2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-203
,2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211
,2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648
,2017, 'Salience based lexical features for emotion recognition', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5830 - 5834, http://dx.doi.org/10.1109/ICASSP.2017.7953274
,2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44, presented at Proceedings of the 1st IJCAI Workshop on Artificial Intelligence in Affective Computing (AffComp 2017), Melbourne, Australia, August 20, 2017., http://proceedings.mlr.press/v66/
,2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017, https://www.researchgate.net/publication/311615271_Eigenfeatures_An_alternative_to_Shifted_Delta_Coefficients_for_Language_Identification
,2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016, http://dx.doi.org/10.1145/2988257.2988265
,