Select Publications

By Dr Vidhyasaharan Sethu

Book Chapters

Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7

Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018

Journal Articles

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), http://dx.doi.org/10.1109/icassp48485.2024.10447530

Bose D; Sethu V; Ambikairajah E, 2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, http://dx.doi.org/10.1109/TAFFC.2024.3367371

Haghshenas Y; Wong WP; Sethu V; Amal R; Kumar PV; Teoh WY, 2024, 'Full prediction of band potentials in semiconductor materials', Materials Today Physics, pp. 101519, http://dx.doi.org/10.1016/j.mtphys.2024.101519

Haghshenas Y; Wong WP; Gunawan D; Khataee A; Keyikoğlu R; Razmjou A; Kumar PV; Toe CY; Masood H; Amal R; Sethu V; Teoh WY, 2023, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO2 using machine learning with active photon flux as a unifying feature', EES Catalysis, 2, pp. 612 - 623, http://dx.doi.org/10.1039/d3ey00246b

Wickramasinghe B; Ambikairajah E; Sethu V; Epps J; Li H; Dang T, 2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154, http://dx.doi.org/10.1016/j.specom.2023.102973

Masood H; Sirojan T; Toe CY; Kumar PV; Haghshenas Y; Sit PHL; Amal R; Sethu V; Teoh WY, 2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4, http://dx.doi.org/10.1016/j.xcrp.2023.101555

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101, http://dx.doi.org/10.1109/TAFFC.2022.3159782

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, 'Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio', CoRR, abs/2310.10922

Ramandi HL; Irtza S; Sirojan T; Naman A; Mathew R; Sethu V; Roshan H; Lamei Ramandi H, 2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482, http://dx.doi.org/10.1016/j.jhydrol.2022.127482

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3, http://dx.doi.org/10.3389/fcomp.2021.767767

Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004

Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143, http://dx.doi.org/10.1109/MSP.2021.3057855

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', CoRR, abs/2108.04605

Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145

Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', APSIPA Transactions on Signal and Information Processing, 9, http://dx.doi.org/10.1017/ATSIP.2020.9

Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419

Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531

Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003

Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation', CoRR, abs/1909.00360

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814

Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004

Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027

Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629

Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003

Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, http://dx.doi.org/10.1049/el.2015.3117

Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio, Speech, and Music Processing, 2013, http://dx.doi.org/10.1186/1687-4722-2013-19

Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005

Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', IEEE Circuits and Systems Magazine, 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081

Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099

Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845

Conference Papers

Jing M; Sethu V; Ahmed B, 2024, 'A Probability Gradient Based Approach for Sampling Boundaries of In-Domain Data', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210

Dang T; Dimitriadis A; Wu J; Sethu V; Ambikairajah E, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213

Shahin M; Nan Z; Sethu V; Ahmed B, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533

Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617

Wu J; Dang T; Sethu V; Ambikairajah E, 2022, 'A Novel Sequential Monte Carlo Framework for Predicting Ambiguous Emotion States', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350

Ahmed B; Ballard K; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An auditory-visual corpus of 3- to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000

Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322

Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297

Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149

Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019, http://dx.doi.org/10.1109/ACII.2019.8925490

ORCID: https://orcid.org/0000-0001-8492-1787