Select Publications
Conference Papers
1993, 'COMPARISON OF VARIOUS ADAPTATION MECHANISMS IN AN AUDITORY MODEL FOR THE PURPOSE OF SPEECH PROCESSING', in 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993, pp. 717 - 720
,1993, 'THE APPLICATION OF THE WAVELET TRANSFORM FOR SPEECH PROCESSING', in 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993, pp. 151 - 154
,1992, 'A speaker verification system based on a Neural Prediction Model', in Proceedings - Singapore ICCS/ISITA 1992: ''Communications on the Move'', pp. 419 - 422, http://dx.doi.org/10.1109/ICCS.1992.254917
,1992, 'A two-layer Kohonen neural network using a cochlear model as a front-end processor for a speech recognition system', in Neural Networks for Signal Processing - Proceedings of the IEEE Workshop, pp. 139 - 148, http://dx.doi.org/10.1109/NNSP.1992.253699
,1992, 'TRANSPUTER IMPLEMENTATION OF FRONT-END PROCESSORS FOR SPEECH RECOGNITION SYSTEMS', in 2nd International Conference on Spoken Language Processing, ICSLP 1992, pp. 1531 - 1534
,1991, 'A PERCEPTUALLY-BASED PITCH EXTRACTOR FOR BAND-LIMITED SPEECH', in 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, pp. 449 - 452
,1991, 'AN ADAPTIVE COCHLEAR MODEL FOR SPEECH RECOGNITION', in 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, pp. 1331 - 1334
,'Novel Features for Effective Speech and Music Discrimination', in 2006 IEEE International Conference on Engineering of Intelligent Systems, IEEE, pp. 1 - 5, presented at 2006 IEEE International Conference on Engineering of Intelligent Systems, http://dx.doi.org/10.1109/iceis.2006.1703190
,Preprints
2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v1
,2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344
,2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1
,2024, An Exploration of Length Generalization in Transformer-Based Speech Enhancement, http://dx.doi.org/10.48550/arxiv.2406.11401
,2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://dx.doi.org/10.48550/arxiv.2405.12609
,2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617
,2024, An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement, http://dx.doi.org/10.48550/arxiv.2401.09686
,2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993
,2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605
,2019, An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks, http://dx.doi.org/10.48550/arxiv.1909.01302
,