Select Publications

Book Chapters

Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7

Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018

Journal articles

Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, vol. 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004

Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, vol. 38, pp. 133 - 143, http://dx.doi.org/10.1109/MSP.2021.3057855

Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, vol. 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145

Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', APSIPA Transactions on Signal and Information Processing, vol. 9, http://dx.doi.org/10.1017/ATSIP.2020.9

Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, vol. 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419

Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, vol. 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531

Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, vol. 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003

Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, vol. 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation.', CoRR, vol. abs/1909.00360

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, vol. 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814

Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, vol. 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004

Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, vol. 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027

Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, vol. 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, vol. 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629

Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, vol. 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003

Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, vol. 51, pp. 2149 - 2151, http://dx.doi.org/10.1049/el.2015.3117

Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio, Speech, and Music Processing, vol. 2013, http://dx.doi.org/10.1186/1687-4722-2013-19

Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, vol. 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005

Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, vol. 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081

Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, vol. 2008, pp. 78092 - 78099, http://dx.doi.org/10.1155/2008/378092

Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, vol. 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845

Conference Papers

Ahmed B; Ballard KJ; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children’s Speech', in Interspeech 2021, ISCA, presented at Interspeech 2021, http://dx.doi.org/10.21437/interspeech.2021-2000

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Interspeech 2021, ISCA, presented at Interspeech 2021, http://dx.doi.org/10.21437/interspeech.2021-1000

Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322

Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297

Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149

Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, IEEE, pp. 738 - 744, http://dx.doi.org/10.1109/ACII.2019.8925490

Atcheson M; Sethu V; Epps J, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, IEEE, pp. 718 - 724, http://dx.doi.org/10.1109/ACII.2019.8925450

Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411

Fernando S; Sethu V; Ambikairajah E; Li H, 2019, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2019, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510

Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369

Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314

Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419

Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978

Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2018, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211

Gamage KW; Sethu V; Ambikairajah E, 2018, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648

Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819

Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933

Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846

Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805

Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337

Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952


Back to profile page