Select Publications
Book Chapters
2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7
,2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018
,Journal articles
2024, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO
2024, 'Binaural Selective Attention Model for Target Speaker Extraction', Interspeech 2024, pp. 4323 - 4327, http://dx.doi.org/10.21437/interspeech.2024-683
,2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction', Interspeech 2024, pp. 3185 - 3189, http://dx.doi.org/10.21437/interspeech.2024-119
,2024, 'Full prediction of band potentials in semiconductor materials', Materials Today Physics, 46, pp. 101519, http://dx.doi.org/10.1016/j.mtphys.2024.101519
,2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6495 - 6499, http://dx.doi.org/10.1109/icassp48485.2024.10447530
,2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, 15, pp. 1684 - 1695, http://dx.doi.org/10.1109/TAFFC.2024.3367371
,2024, 'A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework.', CoRR, abs/2409.15357
,2024, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', CoRR, abs/2409.18339
,2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154, http://dx.doi.org/10.1016/j.specom.2023.102973
,2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4, http://dx.doi.org/10.1016/j.xcrp.2023.101555
,2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101, http://dx.doi.org/10.1109/TAFFC.2022.3159782
,2023, 'Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio.', CoRR, abs/2310.10922
,2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482, http://dx.doi.org/10.1016/j.jhydrol.2022.127482
,2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3, http://dx.doi.org/10.3389/fcomp.2021.767767
,2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004
,2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143, http://dx.doi.org/10.1109/MSP.2021.3057855
,2021, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information.', CoRR, abs/2108.04605
,2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145
,2020, 'An analysis of speaker dependent models in replay detection', APSIPA Transactions on Signal and Information Processing, 9, http://dx.doi.org/10.1017/ATSIP.2020.9
,2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419
,2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531
,2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003
,2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184
,2019, 'The Ambiguous World of Emotion Representation.', CoRR, abs/1909.00360
,2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814
,2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044
,2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004
,2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027
,2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202
,2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629
,2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003
,2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, http://dx.doi.org/10.1049/el.2015.3117
,2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio, Speech, and Music Processing, 2013, http://dx.doi.org/10.1186/1687-4722-2013-19
,2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005
,2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081
,2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099
,2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845
,Conference Papers
2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Interspeech 2024, ISCA, pp. 3714 - 3718, presented at Interspeech 2024, http://dx.doi.org/10.21437/interspeech.2024-482
,2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872
,2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON, http://dx.doi.org/10.1109/EDUCON60312.2024.10578840
,2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference, EDUCON, http://dx.doi.org/10.1109/EDUCON60312.2024.10578884
,2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499, https://doi.org/10.1109/ICASSP48485.2024
,2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210
,2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778
,2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213
,2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533
,2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617
,2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902, https://doi.org/10.21437/Interspeech.2023
,