Select Publications

By Dr Vidhyasaharan Sethu

Conference Papers

Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411

Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability, ICIAfS 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369

Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314

Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419

Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978

Fernando S; Sethu V; Ambikairajah E; Li H, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510

Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819

Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933

Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846

Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805

Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337

Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in Ringeval F; Schuller BW; Valstar MF; Gratch J; Cowie R; Pantic M (eds.), AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Association for Computing Machinery (ACM), Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-266

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-836

Irtza S; Sethu V; Ambikairajah E; Li H, 2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-596

Dang T; Sethu V; Epps J; Ambikairajah E, 2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-512

Fernando S; Sethu V; Ambikairajah E; Epps J, 2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-286

Lee KA; Hautamäki V; Kinnunen T; Larcher A; Zhang C; Nautsch A; Stafylakis T; Liu G; Rouvier M; Rao W; Alegre F; Ma J; Mak MW; Sarkar AK; Delgado H; Saeidi R; Aronowitz H; Sizov A; Sun H; Nguyen TH; Wang G; Ma B; Vestman V; Sahidullah M; Halonen M; Kanervisto A; Le Lan G; Bahmaninezhad F; Isadskiy S; Rathgeb C; Busch C; Tzimiropoulos G; Qian Q; Wang Z; Zhao Q; Wang T; Li H; Xue J; Zhu S; Jin R; Zhao T; Bousquet PM; Ajili M; Kheder WB; Matrouf D; Lim ZH; Xu C; Xu H; Xiao X; Chng ES; Fauve B; Sriskandaraja K; Sethu V; Lin WW; Thomsen DAL; Tan ZH; Todisco M; Evans N; Li H; Hansen JHL; Bonastre JF; Ambikairajah E, 2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-203

Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings - 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction, ACII 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Salience based lexical features for emotion recognition', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 5830 - 5834, http://dx.doi.org/10.1109/ICASSP.2017.7953274

Atcheson M; Sethu V; Epps J, 2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44, presented at Proceedings of the 1st IJCAI Workshop on Artificial Intelligence in Affective Computing (AffComp 2017), Melbourne, Australia, August 20, 2017., http://proceedings.mlr.press/v66/

Sethu V; Fernando S; Ambikairajah E, 2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017, https://www.researchgate.net/publication/311615271_Eigenfeatures_An_alternative_to_Shifted_Delta_Coefficients_for_Language_Identification

Huang Z; Stasak B; Dang T; Gamage KW; Le P; Sethu V; Epps J, 2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016, http://dx.doi.org/10.1145/2988257.2988265

Fernando S; Sethu V; Ambikairajah E, 2016, 'A feature normalisation technique for PLLR based language identification systems', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, CA, USA, pp. 2925 - 2929, presented at Interspeech 2016, San Francisco, CA, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-560

Sriskandaraja K; Sethu V; Le PN; Ambikairajah E, 2016, 'Investigation of sub-band discriminative information between spoofed and genuine speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1710 - 1714, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-844

Ma J; Irtza S; Sriskandaraja K; Sethu V; Ambikairajah E, 2016, 'Parallel speaker and content modelling for text-dependent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 435 - 439, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-825

Ma J; Sethu V; Ambikairajah E; Lee KA, 2016, 'Twin model G-PLDA for duration mismatch compensation in text-independent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1853 - 1857, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-683

Irtza S; Sethu V; Bavattichalil H; Ambikairajah E; Li H, 2016, 'A hierarchical framework for language identification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Shanghai, China, pp. 5820 - 5824, presented at 2016 IEEE International Conference on, Shanghai, China, 20 March 2016 - 25 March 2016, http://dx.doi.org/10.1109/ICASSP.2016.7472793

Dang T; Sethu V; Ambikairajah E, 2016, 'Factor analysis based speaker normalisation for continuous emotion prediction', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 913 - 917, http://dx.doi.org/10.21437/interspeech.2016-880

Irtza S; Sethu V; Fernando S; Ambikairajah E; Li H, 2016, 'Out of set language modelling in Hierarchical language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3270 - 3274, http://dx.doi.org/10.21437/interspeech.2016-558

Gamage KW; Sethu V; Le P; Ambikairajah E, 2015, 'An i-vector GPLDA System for Speech based Emotion Recognition', in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://dx.doi.org/10.1109/APSIPA.2015.7415522

Irtza S; Bavattichalil H; Sethu V; Ambikairajah E, 2015, 'Scalable I-vector Concatenation for PLDA based Language Identification System', in The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7415458

Epps J; Sethu V; Eaton R; Ambikairajah E, 2015, 'High Definition Multi-View Video Guidance for Self-Directed Learning and More Effective Engineering Laboratories', Geelong,Australia, presented at Australasian Association for Engineering Education, Geelong,Australia, 06 December 2015 - 09 December 2015, https://aaee2015conference.sched.org/event/5aaZ/4b-high-definition-multi-view-video-guidance-for-self-directed-learning-and-more-effective-engineering-laboratories

Hines C; Sethu V; Epps J, 2015, 'Twitter: A new online source of automatically tagged data for conversational speech emotion recognition', in ASM 2015 - Proceedings of the 1st International Workshop on Affect and Sentiment in Multimedia, co-located with ACM MM 2015, pp. 9 - 14, http://dx.doi.org/10.1145/2813524.2813529

Huang Z; Dang T; Cummins N; Stasak B; Le P; Sethu V; Epps J, 2015, 'An investigation of annotation delay compensation and output-associative fusion for multimodal continuous emotion prediction', in AVEC 2015 - Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, co-Located with MM 2015, pp. 41 - 48, http://dx.doi.org/10.1145/2808196.2811640

Sriskandaraja K; Sethu V; Le P; Ambikairajah E, 2015, 'A Model Based Voice Activity Detector for Noisy Environments', Dresden, Germany, presented at Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_2297.html

Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015

Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html

Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878

Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015, pp. 330 - 335

Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 1238 - 1242

Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 746 - 750

Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741

Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125

Back to profile page

Filter by type

View all »

ORCID as entered in ROS

https://orcid.org/0000-0001-8492-1787