2024
Liusie, A., Manakul, P. and Gales, MJF., 2024. LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, v. 1
Fathullah, Y., Radmard, P., Liusie, A. and Gales, MJF., 2024. Who Needs Decoders? Efficient Estimation of Sequence-Level Attributes with Proxies EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, v. 1
2023
Liusie, A., Raina, V. and Gales, M., 2023. "World Knowledge" in Multiple Choice Reading Comprehension FEVER 2023 - 6th Fact Extraction and VERification Workshop, Proceedings,
Manakul, P., Fathullah, Y., Liusie, A., Raina, V., Raina, V. and Gales, M., 2023. CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models Proceedings of the Annual Meeting of the Association for Computational Linguistics,
Fathullah, Y., Xia, G. and Gales, MJF., 2023. Logit-Based Ensemble Distribution Distillation for Robust Autoregressive Sequence Uncertainties Proceedings of Machine Learning Research, v. 216
Liusie, A., Manakul, P. and Gales, MJF., 2023. Mitigating Word Bias in Zero-shot Prompt-based Classifiers P-AACL 2023 ...,
Manakul, P., Liusie, A. and Gales, MJF., 2023. SELFCHECKGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
Raina, V. and Gales, M., 2023. Sample Attackability in Natural Language Adversarial Attacks Proceedings of the Annual Meeting of the Association for Computational Linguistics,
Nicholls, D., Knill, K., Gales, MJF., Ragni, A. and Ricketts, P., 2023. Speak & Improve: L2 English Speaking Practice Tool Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2023-August
Ma, R., Qian, M., Gales, MJF. and Knill, KM., 2023. Adapting an Unadaptable ASR System Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2023-August
Doi: http://doi.org/10.21437/Interspeech.2023-1899
Teh, TH., Hu, V., Ram Mohan, DS., Hodari, Z., Wallis, CGR., Gomez Ibarrondo, T., Torresquintero, A., Leoni, J., Gales, M. and King, S., 2023. Ensemble Prosody Prediction for Expressive Speech Synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP49357.2023.10096962
Fathullah, Y., Wu, C., Shangguan, Y., Jia, J., Xiong, W., Mahadeokar, J., Liu, C., Shi, Y., Kalinli, O., Seltzer, M. and Gales, MJF., 2023. Multi-Head State Space Model for Speech Recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2023-August
Doi: http://doi.org/10.21437/Interspeech.2023-1036
Ma, R., Gales, MJF., Knill, KM. and Qian, M., 2023. N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2023-August
Doi: http://doi.org/10.21437/Interspeech.2023-1616
Molchanova, N., Raina, V., Malinin, A., La Rosa, F., Muller, H., Gales, M., Granziera, C., Graziani, M. and Cuadra, MB., 2023. Novel Structural-Scale Uncertainty Measures and Error Retention Curves: Application to Multiple Sclerosis Proceedings - International Symposium on Biomedical Imaging, v. 2023-April
Doi: http://doi.org/10.1109/ISBI53787.2023.10230563
Raina, V., Molchanova, N., Graziani, M., Malinin, A., Muller, H., Cuadra, MB. and Gales, M., 2023. Tackling Bias in the Dice Similarity Coefficient: Introducing NDSC for White Matter Lesion Segmentation Proceedings - International Symposium on Biomedical Imaging, v. 2023-April
Doi: http://doi.org/10.1109/ISBI53787.2023.10230755
Yang, Y., Li, Q., Tian, X., Ng, WWY., Wang, H., Kittler, J., Gales, M. and Cooper, R., 2023. Unsupervised Multi-Hashing for Image Retrieval in Non-stationary Environments 2023 15th International Conference on Advanced Computational Intelligence, ICACI 2023,
Doi: http://doi.org/10.1109/ICACI58115.2023.10146177
2022
McDonald, A., Gales, MJF. and Agarwal, A., 2022. Detection of Heart Murmurs in Phonocardiograms with Parallel Hidden Semi-Markov Models Computing in Cardiology, v. 2022-September
Doi: http://doi.org/10.22489/CinC.2022.020
Bannò, S., Balusu, B., Gales, M., Knill, K. and Kyriakopoulos, K., 2022. View-Specific Assessment of L2 Spoken English Proc. Interspeech 2022,
Doi: http://doi.org/10.21437/Interspeech.2022-10691
Raina, V. and Gales, M., 2022. Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension Proceedings of the Annual Meeting of the Association for Computational Linguistics,
Lu, Y., Gales, M. and Bannò, S., 2022. On Assessing and Developing Spoken 'Grammatical Error Correction' Systems BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
Raina, V. and Gales, M., 2022. Residue-Based Natural Language Adversarial Attack Detection NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference,
Fathullah, Y. and Gales, MJF., 2022. Self-Distribution Distillation: Efficient Uncertainty Estimation Proceedings of Machine Learning Research, v. 180
Fathullah, Y. and Gales, M., 2022. Self-Distribution Distillation: Efficient Uncertainty Estimation Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence, v. 180
2021
Wei, X., Gales, MJF. and Knill, KM., 2021. Analysing bias in spoken language assessment using concept activation vectors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9413988
Fathullah, Y., Gales, MJF. and Malinin, A., 2021. Ensemble distillation approaches for grammatical error correction ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9413385
Lu, Y., Wang, Y. and Gales, MJF., 2021. Efficient use of end-to-end data in spoken language processing ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9414510
Ryabinin, M., Malinin, A. and Gales, M., 2021. Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets Advances in Neural Information Processing Systems, v. 8
Dou, Q., Wu, X., Wan, M., Lu, Y. and Gales, MJF., 2021. Deliberation-based multi-pass speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 5
Doi: http://doi.org/10.21437/Interspeech.2021-1405
Malinin, A. and Gales, M., 2021. UNCERTAINTY ESTIMATION IN AUTOREGRESSIVE STRUCTURED PREDICTION ICLR 2021 - 9th International Conference on Learning Representations,
Manakul, P. and Gales, MJF., 2021. Long-span summarization via local attention and content selection ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
Manakul, P. and Gales, MJF., 2021. Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings,
2020
Malinin, A., Mlodozeniec, B. and Gales, M., 2020. ENSEMBLE DISTRIBUTION DISTILLATION 8th International Conference on Learning Representations, ICLR 2020,
Raina, V., Gales, MJF. and Knill, K., 2020. Complementary systems for Off-Topic spoken response detection Proceedings of the Annual Meeting of the Association for Computational Linguistics,
Manakul, P., Gales, MJF. and Wang, L., 2020. Abstractive spoken document summarization using hierarchical model with multi-stage attention diversity optimization Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-1683
Lu, Y., Gales, MJF. and Wang, Y., 2020. Spoken language 'grammatical error correction' Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-1852
Manakul, P. and Gales, M., 2020. CUED_SPEECH AT TREC 2020 PODCAST SUMMARISATION TRACK 29th Text REtrieval Conference, TREC 2020 - Proceedings,
Knill, KM., Wang, L., Wang, Y., Wu, X. and Gales, MJF., 2020. Non-native children's automatic speech recognition: The INTERSPEECH 2020 shared task ALTA systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2154
Wu, X., Knill, KM., Gales, MJF. and Malinin, A., 2020. Ensemble approaches for uncertainty in spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2238
Kastanos, A., Ragni, A. and Gales, MJF., 2020. Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2020-May
Doi: http://doi.org/10.1109/ICASSP40776.2020.9053264
Raina, V., Gales, MJF. and Knill, K., 2020. Universal adversarial attacks on spoken language assessment systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-1890
Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2020. Automatic detection of accent and lexical pronunciation errors in spontaneous non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2881
Dou, Q., Efiong, J. and Gales, MJF., 2020. Attention forcing for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2520
2019 (Accepted for publication)
Gales, M. and Malinin, A., 2019 (Accepted for publication). Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness Advances in Neural Information Processing Systems 32 (NeurIPS 2019),
Li, Q., Ness, P., Ragni, A. and Gales, M., 2019 (Accepted for publication). BI-DIRECTIONAL LATTICE RECURRENT NEURAL NETWORKS FOR CONFIDENCE ESTIMATION
Doi: http://doi.org/10.17863/CAM.36745
Lu, Y., Gales, M., Knill, K., Manakul, P. and Wang, Y., 2019 (Accepted for publication). Disfluency Detection for Spoken Learner English
Doi: http://doi.org/10.17863/CAM.42082
2019
Lu, Y., Gales, MJF., Knill, KM., Manakul, P. and Wang, Y., 2019. Disfluency Detection for Spoken Learner English 8th ISCA Workshop on Speech and Language Technology in Education, SLaTE 19,
Doi: http://doi.org/10.21437/SLaTE.2019-14
Lu, Y., Gales, MJF., Knill, KM., Manakul, P., Wang, L. and Wang, Y., 2019. Impact of ASR performance on spoken grammatical error detection Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
Doi: http://doi.org/10.21437/Interspeech.2019-1706
Knill, KM., Gales, MJF., Manakul, PP. and Caines, AP., 2019. Automatic Grammatical Error Detection of Non-native Spoken Learner English ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2019-May
Doi: http://doi.org/10.1109/ICASSP.2019.8683080
Knill, K., Gales, M., Manakul, P. and Caines, A., 2019. Automatic grammatical error detection of non-native spoken learner English ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/icassp.2019.8683755
Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2019. A deep learning approach to automatic characterisation of rhythm in non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
Doi: http://doi.org/10.21437/Interspeech.2019-3186
Wong, JHM., Gales, MJF. and Wang, Y., 2019. Learning between Different Teacher and Student Models in ASR 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings,
Doi: http://doi.org/10.1109/ASRU46091.2019.9003756
2018
Chen, O., Ragni, A., Gales, MJF. and Chen, X., 2018. Active memory networks for language modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-78
Dou, Q., Wan, M., Degottex, G., Ma, Z. and Gales, MJF., 2018. Hierarchical RNNs for Waveform-Level Speech Synthesis 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2018.8639588
Del Vecchio, M., Malinin, A. and Gales, MJF., 2018. Improved Auto-Marking Confidence for Spoken Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2018.8639634
Wang, Y., Chen, X., Gales, MJF., Ragni, A. and Wong, JHM., 2018. Phonetic and graphemic systems for multi-genre broadcast transcription ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2018-April
Doi: http://doi.org/10.1109/ICASSP.2018.8462353
Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2018. A deep learning approach to assessing non-native pronunciation of English using phone distances Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1087
Knill, KM., Gales, MJF., Kyriakopoulos, K., Malinin, A., Ragni, A., Wang, Y. and Caines, AP., 2018. Impact of ASR performance on free speaking language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1312
Wan, M., Degottex, G. and Gales, MJF., 2018. Waveform-based speaker representations for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1154
Degottex, G. and Gales, M., 2018. A Spectrally Weighted Mixture of Least Square Error and Wasserstein Discriminator Loss for Generative SPSS 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2018.8639609
Wang, Y., Zhang, C., Gales, MJF. and Woodland, PC., 2018. Speaker adaptation and adaptive training for jointly optimised tandem systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-2432
Wang, Y., Wong, JHM., Gales, MJF., Knill, KM. and Ragni, A., 2018. Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2018.8639557
Ragni, A., Li, Q., Gales, MJF. and Wang, Y., 2018. Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2018.8639678
Malinin, A. and Gales, M., 2018. Predictive Uncertainty Estimation via Prior Networks NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, v. 31
Ragni, A. and Gales, MJF., 2018. Automatic speech recognition system development in the “wild” Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1085
2017 (Accepted for publication)
Wong, JHM. and Gales, MJF., 2017 (Accepted for publication). Student-teacher training with diverse decision tree ensembles
Kyriakopoulos, K., Gales, M. and Knill, K., 2017 (Accepted for publication). Automatic characterisation of the pronunciation of non-native English speakers using phone distance features http://www.isca-speech.org/archive/SLaTE_2017/,
Doi: http://doi.org/10.21437/SLaTE.2017-11
Malinin, A., Knill, K., Ragni, A., Wang, Y. and Gales, M., 2017 (Accepted for publication). An attention based model for off-topic spontaneous spoken response detection: An Initial Study http://www.isca-speech.org/archive/SLaTE_2017/,
Doi: http://doi.org/10.21437/SLaTE.2017-25
2017
Ragni, A., Saunders, D., Zahemszky, P., Vasilakes, J., Gales, MJF. and Knill, KM., 2017. Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2017.7953262
Gales, MJF., Knill, KM. and Ragni, A., 2017. Low-resource speech recognition and keyword-spotting Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 10458 LNAI
Doi: http://doi.org/10.1007/978-3-319-66429-3_1
Knill, KM., Gales, MJF., Kyriakopoulos, K., Ragni, A. and Wang, Y., 2017. Use of graphemic lexicons for spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: http://doi.org/10.21437/Interspeech.2017-978
Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating bidirectional recurrent neural network language models for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: http://doi.org/10.21437/Interspeech.2017-513
Wu, C. and Gales, MJF., 2017. Deep activation mixture model for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: http://doi.org/10.21437/Interspeech.2017-1233
Malinin, A., Knill, K. and Gales, MJF., 2017. A hierarchical attention based model for off-topic spontaneous spoken response detection 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
Doi: http://doi.org/10.1109/ASRU.2017.8268963
Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6,
Doi: http://doi.org/10.21437/Interspeech.2017-513
Wan, M., Degottex, G. and Gales, MJF., 2017. Integrated speaker-adaptive speech synthesis 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
Doi: http://doi.org/10.1109/ASRU.2017.8269006
Wong, JHM. and Gales, MJF., 2017. Multi-task ensembles with teacher-student training 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
Doi: http://doi.org/10.1109/ASRU.2017.8268920
Malinin, A., Ragni, A., Knill, KM. and Gales, MJF., 2017. Incorporating uncertainty into deep learning for spoken language assessment ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 2
Doi: http://doi.org/10.18653/v1/P17-2008
Chen, X., Ragni, A., Vasilakes, J., Liu, X., Knill, K. and Gales, MJF., 2017. Recurrent neural network language models for keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2017.7953263
Ragni, A., Wu, C., Gales, MJF., Vasilakes, J. and Knill, KM., 2017. Stimulated training for automatic speech recognition and keyword search in limited resource conditions ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2017.7953074
2016
Chen, X., Liu, X., Qian, Y., Gales, MJF. and Woodland, PC., 2016. CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472829
Yang, J., Zhang, C., Ragni, A., Gales, MJF. and Woodland, PC., 2016. System combination with log-linear models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472764
Tan, S., Sim, KC. and Gales, M., 2016. Improving the interpretability of deep neural networks with stimulated learning 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404853
Wu, C., Karanasou, P. and Gales, MJF., 2016. Combining i-vector representation and structured neural networks for rapid adaptation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472629
Kundu, S., Sim, KC. and Gales, M., 2016. Incorporating a generative front-end layer to deep neural network for noise robust automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-760
Ragni, A., Dakin, E., Chen, X., Gales, MJF. and Knill, KM., 2016. Multi-language neural network language models Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-371
Yang, J., Ragni, A., Gales, MJF. and Knill, KM., 2016. Log-linear system combination using structured support vector machines Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-377
Malinin, A., Van Dalen, RC., Wang, Y., Knill, KM. and Gales, MJF., 2016. Off-topic response detection for spontaneous spoken English assessment 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 2
Doi: http://doi.org/10.18653/v1/p16-1102
Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Selection of multi-genre broadcast data for the training of automatic speech recognition systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-462
Wu, C., Karanasou, P., Gales, MJF. and Sim, KC., 2016. Stimulated deep neural network for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-580
Wong, JHM. and Gales, MJF., 2016. Sequence student-teacher training of deep neural networks Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-911
Degottex, G., Lanchantin, P. and Gales, M., 2016. A Pulse Model in Log-domain for a Uniform Synthesizer 9th ISCA Speech Synthesis Workshop, SSW 2016,
Litman, D., Young, S., Gales, M., Knill, K., Ottewell, K., van Dalen, R. and Vandyke, D., 2016. Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English SIGDIAL 2016 - 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
Bell, P., Gales, MJF., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., McParland, A., Renals, S., Saz, O., Wester, M. and Woodland, PC., 2016. The MGB challenge: Evaluating multi-genre broadcast media recognition 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404863
Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2016. Investigation of back-off based interpolation between recurrent neural network and n-gram language models 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404792
Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. The development of the Cambridge University alignment systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404857
Woodland, PC., Liu, X., Qian, Y., Zhang, C., Gales, MJF., Karanasou, P., Lanchantin, P. and Wang, L., 2016. Cambridge University transcription systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404856
Degottex, G., Lanchantin, P. and Gales, M., 2016. A Pulse Model in Log-domain for a Uniform Synthesizer Proceedings of the 9th ISCA Speech Synthesis Workshop,
Cui, J., Kingsbury, B., Ramabhadran, B., Sethy, A., Audhkhasi, K., Cui, X., Kislal, E., Mangu, L., Nussbaum-Thom, M., Picheny, M., Tüske, Z., Golik, P., Schluter, R., Ney, H., Gales, MJF., Knill, KM., Ragni, A., Wang, H. and Woodland, P., 2016. Multilingual representations for low resource speech recognition and keyword search 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404803
Karanasou, P., Gales, MJF., Lanchantin, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Speaker diarisation and longitudinal linking in multi-genre broadcast data 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404859
Van Dalen, RC., Yang, J., Wang, H., Ragni, A., Zhang, C. and Gales, MJF., 2016. Structured discriminative models using deep neural-network features 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404789
Wang, L., Zhang, C., Woodland, PC., Gales, MJF., Karanasou, P., Lanchantin, P., Liu, X. and Qian, Y., 2016. Improved DNN-based segmentation for multi-genre broadcast audio ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472769
2015
Wu, C. and Gales, MJF., 2015. Multi-basis adaptive neural network for rapid adaptation in speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178785
Van Dalen, RC. and Gales, MJF., 2015. Annotating large lattices with the exact word error Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
Van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. Improving multiple-crowd-sourced transcriptions using a speech recogniser ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178864
Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. Paraphrastic recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179004
Ragni, A., Gales, MJF. and Knill, KM., 2015. A language space representation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178849
Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Improving the training and evaluation efficiency of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179003
Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model training with noise contrastive estimation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179005
Gales, MJF., Knill, KM. and Ragni, A., 2015. Unicode-based graphemic systems for limited resource languages ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178960
Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. Robust excitation-based features for Automatic Speech Recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178855
van Dalen, RC., Knill, KM. and Gales, MJF., 2015. Automatically Grading Learners’ English Using a Gaussian Process Speech and Language Technology in Education, SLaTE 2015,
van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. IMPROVING MULTIPLE-CROWD-SOURCED TRANSCRIPTIONS USING A SPEECH RECOGNISER 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),
Wang, H., Ragni, A., Gales, MJF., Knill, KM., Woodland, PC. and Zhang, C., 2015. Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE MODELS 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),
Lanchantin, P., Veaux, C., Gales, MJF., King, S. and Yamagishi, J., 2015. Reconstructing voices within the multiple-average-voice-model framework Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. ROBUST EXCITATION-BASED FEATURES FOR AUTOMATIC SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),
Chen, X., Tan, T., Liu, X., Lanchantin, P., Wan, M., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model adaptation for multi-genre broadcast speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
Wu, C. and Gales, MJF., 2015. MULTI-BASIS ADAPTIVE NEURAL NETWORK FOR RAPID ADAPTATION IN SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),
Liu, X., Flego, F., Wang, L., Zhang, C., Gales, M. and Woodland, P., 2015. The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
Mendels, G., Cooper, E., Soto, V., Hirschberg, J., Gales, M., Knill, K., Ragni, A. and Wang, H., 2015. Improving speech recognition and keyword search for low resource languages using web data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
2014
Chen, L., Braunschweiler, N. and Gales, MJF., 2014. Speaker dependent expression predictor from text: Expressiveness and transplantation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854065
Yoshioka, T., Chen, X. and Gales, MJF., 2014. Impact of single-microphone dereverberation on DNN-based meeting transcription systems ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854660
Karanasou, P., Wang, Y., Gales, MJF. and Woodland, PC., 2014. Adaptation of deep neural network acoustic models using factorised i-vectors Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Ragni, A., Knill, KM., Rath, SP. and Gales, MJF., 2014. Data augmentation for low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Knill, KM., Gales, MJF., Ragni, A. and Rath, SP., 2014. Language independent and unsupervised acoustic models for speech recognition and keyword spotting Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Rath, SP., Knill, KM., Ragni, A. and Gales, MJF., 2014. Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Wan, V., Latorre, J., Yanagisawa, K., Gales, M. and Stylianou, Y., 2014. Cluster adaptive training of average voice models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6853602
Kolluru, BK., Wan, V., Latorre, J., Yanagisawa, K. and Gales, MJF., 2014. Generating multiple-accent pronunciations for TTS using joint sequence model interpolation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Chen, X., Wang, Y., Liu, X., Gales, MJF. and Woodland, PC., 2014. Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Yanagisawa, K., Chen, L. and Gales, MJF., 2014. Noise-robust TTS speaker adaptation with statistics smoothing Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Latorre, J., Yanagisawa, K., Wan, V., Kolluru, BK. and Gales, MJF., 2014. Speech intonation for TTS: Study on evaluation methodology Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Chen, X., Gales, MJF., Knill, K., Breslin, C., Chen, L., Chin, KK. and Wan, V., 2014. An initial investigation of long-term adaptation for meeting transcription Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Liu, X., Wang, Y., Chen, X., Gales, MJF. and Woodland, PC., 2014. Efficient lattice rescoring using recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854535
Liu, X., Gales, MJF. and Woodland, PC., 2014. Paraphrastic neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854534
Gales, MJF., Knill, KM., Ragni, A. and Rath, SP., 2014. Speech recognition and keyword spotting for low resource languages: Babel project research at CUED 4th Workshop on Spoken Language Technologies for Under-Resourced Languages, SLTU 2014,
Yang, J., Van Dalen, RC., Zhang, SX. and Gales, MJF., 2014. Infinite structured support vector machines for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854215
Yoshioka, T., Ragni, A. and Gales, MJF., 2014. Investigation of unsupervised adaptation of DNN acoustic models with filter bank input ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854825
2013
Lanchantin, P., Bell, PJ., Gales, MJF., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, MS., Swietojanski, P. and Woodland, PC., 2013. Automatic transcription of multi-genre media archives CEUR Workshop Proceedings, v. 1012
Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Chen, L., Gales, MJF., Braunschweiler, N., Akamine, M. and Knill, K., 2013. Integrated automatic expression prediction and speech synthesis from text ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639218
Latorre, J., Gales, MJF., Knill, K. and Akamine, M., 2013. Training a supra-segmental parametric F0 model without interpolating F0 ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6638995
Maia, R., Akamine, M. and Gales, MJF., 2013. Complex cepstrum analysis based on the minimum mean squared error ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639217
Van Dalen, RC., Ragni, A. and Gales, MJF., 2013. Efficient decoding with generative score-spaces using the expectation semiring ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639145
Zhang, SX. and Gales, MJF., 2013. Kernelized log linear models for continuous speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639009
Knill, KM., Gales, MJF., Rath, SP., Woodland, PC., Zhang, C. and Zhang, S-X., 2013. Investigation of multilingual deep neural networks for spoken term detection 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2013.6707719
Wang, YQ. and Gales, MJF., 2013. An explicit independence constraint for factorised adaptation in speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Long, Y., Gales, MJF., Lanchantin, P., Liu, X., Seigel, MS. and Woodland, PC., 2013. Improving Lightly Supervised Training for Broadcast Transcription 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,
Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain Paraphrasing For Improving Language Modelling Using Out-of-domain Data 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,
Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, B., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-Realistic Expressive Text to Talking Head Synthesis 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,
Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,
Yanagisawa, K., Latorre, J., Wan, V., Gales, MJF. and King, S., 2013. Noise Robustness in HMM-TTS Speaker Adaptation 8th ISCA Workshop on Speech Synthesis, SSW 2013,
Wang, Y-Q. and Gales, MJF., 2013. An Explicit Independence Constraint for Factorised Adaptation in Speech Recognition 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,
Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain paraphrasing for improving language modelling using out-of-domain data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, BK., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-realistic expressive text to talking head synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
2012 (No publication date)
Ragni, A. and Gales, MJF., 2012 (No publication date). Derivative Kernels for Noise Robust ASR
2012
Chen, L., Gales, MJF., Wan, V., Latorre, J. and Akamine, M., 2012. Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,
Latorre, J., Wan, V., Gales, MJF., Chen, L., Chin, KK., Knill, K. and Akamine, M., 2012. Speech factorization for HMM-TTS based on cluster adaptive training 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
Wang, Y-Q. and Gales, MJF., 2012. Model-based approaches to adaptive training in reverberant environments 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,
Wan, V., Latorre, J., Chin, KK., Chen, L., Gales, MJF., Zen, H., Knill, K. and Akamine, M., 2012. Combining multiple high quality corpora for improving HMM-TTS 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
Eyben, F., Buchholz, S., Braunschweiler, N., Latorre, J., Wan, V., Gales, MJF. and Knill, K., 2012. Unsupervised clustering of emotion and voice styles for expressive TTS ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2012.6288797
Maia, R., Akamine, M. and Gales, MJF., 2012. Complex cepstrum as phase information in statistical parametric speech synthesis 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
Roupakia, Z., Ragni, A. and Gales, M., 2012. Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
Ragni, A. and Gales, MJF., 2012. Inference algorithms for generative score-spaces 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2011
Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian Process Experts for Voice Conversion 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Breslin, C., Chin, KK., Gales, MJF. and Knill, K., 2011. Integrated Online Speaker Clustering and Adaptation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian process experts for voice conversion Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse sequences for residual signal modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Braunschweiler, N., Gales, MJF. and Buchholz, S., 2011. Lightly supervised recognition for automatic alignment of large coherent speech recordings Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Breslin, C., Chin, KK., Gales, MJF., Knill, K. and Xu, H., 2011. Prior information for rapid speaker adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Gales, MJF. and Yu, K., 2011. Canonical state models for automatic speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Latorre, J., Gales, MJF. and Zen, H., 2011. Training a parametric-based logF0 model with the minimum generation error criterion Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Liu, X., Gales, MJF. and Woodland, PC., 2011. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2011. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,
van Dalen, RC. and Gales, MJF., 2011. Asymptotically exact noise-corrupted speech likelihoods Proceedings of the 11th Annual Conference of the International Speech Communication Association,
Zhang, SX. and Gales, MJF., 2011. Structured support vector machines for noise robust continuous speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Diehl, F., Gales, MJF., Liu, X., Tomalin, M. and Woodland, PC., 2011. Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Liu, X., Gales, MJF. and Woodland, PC., 2011. Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Li, T., Woodland, PC., Diehl, F. and Gales, MJF., 2011. Graphone Model Interpolation and Arabic Pronunciation Generation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse Sequences for Residual Signal Modeling 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
Zhang, S-X. and Gales, MJF., 2011. Structured Support Vector Machines for Noise Robust Continuous Speech Recognition 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
2010
Liu, X., Gales, MJF. and Woodland, PC., 2010. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,
Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2010. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,
Maia, R., Zen, H. and Gales, MJF., 2010. Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters
Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2010. Language model combination and adaptation using weighted finite state transducers Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2010.5494941
Tomalin, M., Park, J., Diehl, F., Gales, MJF. and Woodland, PC., 2010. Recent improvements to the Cambridge Arabic speech-to-text systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2010.5495641
Zen, H., Gales, MJF., Nankaku, Y. and Tokuda, K., 2010. Statistical parametric synthesis based on products of experts Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2010.5495691
Flego, F. and Gales, MJF., 2010. Discriminative adaptive training with VTS and JUD Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5373266
Gales, MJF., Ragni, A., AlDamarki, H. and Gautier, C., 2010. Support vector machines for noise robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5372913
Xu, H., Gales, MJF. and Chin, KK., 2010. Improving joint uncertainty decoding performance by predictive methods for noise robust speech recognition Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5373317
2009
Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological analysis and decomposition for Arabic speech-to-text systems Proceedings of the 10th International Conference of the International Speech Communication Association,
Flego, F. and Gales, MJF., 2009. Incremental adaptation with VTS and joint adaptively trained systems Proceedings of the 10th International Conference of the International Speech Communication Association,
Gales, MJF. and Flego, F., 2009. Combining VTS model compensation and support vector machines Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2009.4960460
Flego, F. and Gales, MJF., 2009. Incremental Adaptation with VTS and Joint Adaptively Trained Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Hieronymus, JL., Liu, X., Gales, MJF. and Woodland, PC., 2009. Exploiting Chinese character models to improve speech recognition performance Proceedings of the 10th International Conference of the International Speech Communication Association,
Kim, D. and Gales, MJF., 2009. Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition 10th Annual Conference of the International Speech Communication Association, Interspeech 2009,
Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of contexts in language model interpolation and adaptation Proceedings of the 10th International Conference of the International Speech Communication Association,
Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational dynamic kernels for speaker verification Proceedings of the 10th International Conference of the International Speech Communication Association,
Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient generation and use of MLP features for Arabic speech recognition Proceedings of the 10th International Conference of the International Speech Communication Association,
van Dalen, RC. and Gales, MJF., 2009. Transforming features to compensate speech recogniser models for noise Proceedings of the 10th Annual Conference of the International Speech Communication Association,
van Dalen, RC., Flego, F. and Gales, MJF., 2009. Transforming Features to Compensate Speech Recogniser Models for Noise INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient Generation and Use of MLP Features for Arabic Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational Dynamic Kernels for Speaker Verification INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Kim, DK. and Gales, MJF., 2009. Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Gales, MJF., 2009. Acoustic Modelling for Speech Recognition: Hidden Markov Models and Beyond? 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009),
Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Training and adapting MLP features for Arabic speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960620
Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of Contexts in Language Model Interpolation and Adaptation INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
Raut, CK. and Gales, MJF., 2009. Bayesian discriminative adaptation for speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960595
van Dalen, RC. and Gales, MJF., 2009. Extended VTS for noise-robust speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960462
Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological Analysis and Decomposition for Arabic Speech-to-Text Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
2008
Gales, MJF. and Longworth, C., 2008. Discriminative Classifiers with Generative Kernels for Noise Robust ASR INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
Longworth, C. and Gales, MJF., 2008. A Generalised Derivative Kernel for Speaker Verification INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
van Dalen, RC. and Gales, MJF., 2008. Covariance Modelling for Noise-Robust Speech Recognition INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2008. Phonetic pronunciations for Arabic speech-to-text systems IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2008.4517924
Yu, K., Gales, MJF. and Woodland, PC., 2008. Unsupervised discriminative adaptation using discriminative mapping transforms International Conference on Acoustics, Speech and Signal Processing, 2008,
Doi: http://doi.org/10.1109/ICASSP.2008.4518599
Gales, MJF. and Longworth, C., 2008. Discriminative classifiers with generative kernels for noise-robust ASR ICSLP - International Conference - CD-ROM,
Liu, XA., Gales, MJF. and Woodland, PC., 2008. Context dependent language model adaptation Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008) incorporating the 12th Australasian International Conference on Speech Science and Technology, SST' 08,
Longworth, C. and Gales, MJF., 2008. A generalised derivative kernel for speaker verification ICSLP - International Conference - CD-ROM,
Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms ICSLP - International Conference - CD-ROM,
Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive Training using Discriminative Mapping Transforms INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
van Dalen, RC. and Gales, MJF., 2008. Covariance modelling for noise-robust speech recognition ICSLP - International Conference - CD-ROM,
Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
Liu, X., Gales, MJF. and Woodland, PC., 2008. Context Dependent Language Model Adaptation INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
2007
Breslin, C. and Gales, MJF., 2007. Building Multiple Complementary Systems using Directed Decision Trees INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
Wang, L., Gales, MJF. and Woodland, PC., 2007. Unsupervised training for Mandarin broadcast news and conversation transcription Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, ICASSP' 07,
Doi: http://doi.org/10.1109/ICASSP.2007.366922
Longworth, C. and Gales, MJF., 2007. Derivative and Parametric Kernels for Speaker Verification INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
Liu, XA., Byrne, WJ., Gales, MJF., De Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007, Proceedings,
Doi: http://doi.org/10.1109/asru.2007.4430101
Gales, MJF. and van Dalen, RC., 2007. Predictive linear transforms for noise robust speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,
Gales, MJF., 2007. Discriminative-models for speech recognition 2007 Information Theory and Applications Workshop,
Gales, MJF., Diehl, F., Raut, CK., Tomalin, M., Woodland, PC. and Yu, K., 2007. Development of a phonetic system for large vocabulary Arabic speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,
Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised Training with Directed Manual Transcription for Recognising Mandarin Broadcast Audio INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
Liu, XA., Byrne, WJ., Gales, MJF., de Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation IEEE Workshop on Automatic Speech Recognition & Understanding, 2007,
Doi: http://doi.org/10.1109/ASRU.2007.4430101
Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised training using directed manual transcription for recognising Mandarin broadcast audio Proceedings InterSpeech 2007,
Breslin, C. and Gales, MJF., 2007. Building multiple complementary systems using directed decision trees Proceedings InterSpeech 2007,
Longworth, C. and Gales, MJF., 2007. Parametric and derivative kernels for speaker verification Proceedings InterSpeech 2007,
Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, JL., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367310
Tomalin, M., Gales, MJF., Liu, XA., Sinha, KC., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367172
Sim, KC., Byrne, WJ., Gales, MJF., Sahbi, H. and Woodland, PC., 2007. Consensus network decoding for statistical machine translation system combination IEEE International Conference on Acoustics Speech and Signal Processing, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367174
Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, J-L., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation
Breslin, C. and Gales, MJF., 2007. Complementary system generation using directed decision trees Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07,
Doi: http://doi.org/10.1109/ICASSP.2007.366918
Liao, H. and Gales, MJF., 2007. Adaptive training with joint uncertainty decoding for robust recognition of noisy data Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.366931
2006
Liao, H. and Gales, MJF., 2006. Issues with uncertainty decoding for noise robust speech recognition
Breslin, C. and Gales, MJF., 2006. Generating complementary systems for speech recognition
Longworth, C. and Gales, MJF., 2006. Discriminative adaptation for speaker verification
Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference Proceedings of the 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
Doi: http://doi.org/10.1109/ICASSP.2006.1659996
Sinha, R., Gales, MJF., Kim, DY., Liu, X., Sim, KC. and Woodland, PC., 2006. The CU-HTK Mandarin broadcast news transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'06,
Liao, H. and Gales, MJF., 2006. Issues with Uncertainty Decoding for Noise Robust Speech Recognition INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,
Longworth, C. and Gales, MJF., 2006. Discriminative Adaptation for Speaker Verification INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,
Layton, MI. and Gales, MJF., 2006. Augmented statistical models for speech recognition 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,
Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,
2005
Layton, MI. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels 2005 IEEE Workshop on Machine Learning for Signal Processing (MLSP),
Layton, M. and Gales, MJF., 2005. Augmented statistical models: exploiting generative models in discriminative classifiers
Yu, K. and Gales, MJF., 2005. Bayesian adaptation and adaptively trained systems Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding,
Doi: http://doi.org/10.1109/ASRU.2005.1566532
Layton, M. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels Proceedings of Machine Learning for Signal Processing Workshop,
Liu, X., Gales, MJF., Sim, KC. and Yu, K., 2005. Investigation of acoustic modeling techniques for LVCSR systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2005, v. 1
Doi: http://doi.org/10.1109/ICASSP.2005.1415247
Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Development of the CU-HTK 2004 broadcast news transcription systems IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2005.1415250
Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Training LVCSR systems on thousands of hours of data IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05, v. 1
Doi: http://doi.org/10.1109/ICASSP.2005.1415087
Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05,
Doi: http://doi.org/10.1109/ICASSP.2005.1415250
Liao, H. and Gales, MJF., 2005. Joint uncertainty decoding for noise robust speech recognition Interspeech: 9th European Conference on Speech Communication and Technology,
Sim, KC. and Gales, MJF., 2005. Adaptation of precision matrix models on large vocabulary continuous speech recognition Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
Sim, KC. and Gales, MJF., 2005. Temporally varying model parameters for large vocabulary continuous speech recognition Interspeech: European Conference on Speech Communication and Technology,
Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system
2004
Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system
Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions
Sim, KC. and Gales, MJF., 2004. Basis superposition precision matrix modeling for large vocabulary continuous speech recognition Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1
Yu, K. and Gales, MJF., 2004. Adaptive training using structured transforms Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing, 2004, v. 1
Doi: http://doi.org/10.1109/ICASSP.2004.1325986
Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '04, v. 1
Doi: http://doi.org/10.1109/ICASSP.2004.1325969
Evermann, G., Chan, HY., Gales, MJF., Jia, B., Liu, X., Mrva, D., Sim, KC., Wang, L. and Woodland, PC., 2004. Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data
Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2004. Development of the CUHTK 2004 RT04F Mandarin conversational telephone speech transcription system
Kim, DY., Chan, HY., Evermann, G., Gales, MJF., Mrva, D., Sim, KC. and Woodland, PC., 2004. Recent developments at Cambridge in broadcast news transcription
Kim, DY., Gales, MJF., Hain, T. and Woodland, PC., 2004. Using VTLN for broadcast news transcription Interspeech 2004 ICSLP: 8th International Conference on Spoken Language Processing,
Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions Proceedings of the 29th IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP),
Rosti, AVI. and Gales, MJF., 2004. Rao-Blackwellised Gibbs sampling for switching linear dynamical systems Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Tranter, SE., Gales, MJF., Sinha, R., Umesh, S. and Woodland, PC., 2004. The development of the Cambridge University RT-04 diarisation system
2003
Airey, SS. and Gales, MJF., 2003. Product of Gaussians as a distributed representation for speech recognition Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech), v. 2
Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems Proceedings of the 28th IEEE International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
Gales, MJF., Dong, Y., Povey, D. and Woodland, PC., 2003. Porting: SwitchBoard to the VoiceMail task IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
Liu, X. and Gales, MJF., 2003. Automatic model complexity control using marginalized discriminative growth functions Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding,
Liu, X., Gales, MJF. and Woodland, PC., 2003. Automatic complexity control for HLDA systems IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
Povey, D., Woodland, PC. and Gales, MJF., 2003. Discriminative MAP for acoustic model adaptation IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol I, Proceedings,
2002
Smith, ND. and Gales, MJF., 2002. Using SVMs and discriminative models for speech recognition 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, vols I-IV, Proceedings,
Stuttle, MN. and Gales, MJF., 2002. Combining a Gaussian mixture model front end with MFCC parameters Proceedings of the 7th International Conference on Spoken Language Processing (Interspeech), v. 3
Rosti, AVI. and Gales, MJF., 2002. Factor analysed HMMs (Hidden Markov Models) Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1
Smith, ND. and Gales, MJF., 2002. SVMs for speech recognition Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. Volume 1: Speech Processing
Cordoba, R., Woodland, PC. and Gales, MJF., 2002. Improved cross-task recognition using MMIE training IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'02, v. 1
Doi: http://doi.org/10.1109/ICASSP.2002.1005682
Gales, MJF., 2002. The HMM error model Proceedings of the 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
2001
Gales, MJF., 2001. Acoustic factorisation Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),
Gales, MJF., 2001. Adaptive training for robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),
Smith, N. and Gales, MJF., 2001. Speech recognition using SVMs Proceedings of the 15th Conference on Neural Information Processing Systems, v. 2
Gales, MJF., 2001. Multiple-cluster adaptive training schemes Proceedings of 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
Stuttle, MN. and Gales, MJF., 2001. A mixture of Gaussians front end for speech recognition Proceedings of the 7th European Conference on Speech Communication and Technology, v. 1
2000
Aiyer, A., Gales, MJF. and Picheny, MA., 2000. Rapid likelihood calculation of subspace clustered Gaussian components Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), v. 3
Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L., Gales, MJF., Novak, M. and Gopinath, R., 2000. IBM's 10xReal-time broadcast news transcription used in the 1999 hub4 evaluation
Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L. and Gales, MJF., 2000. Transcription of broadcast news with time constraint: IBM's 10xRT hub4 system
1999
Chen, S., Eide, EM., Gales, MJF., Gopinath, RA. and Kavensky, RA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 1
Chen, S., Eide, EM., Gales, MJF., Gopinath, RA., Kavensky, D. and Olsen, PA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
Gales, MJF. and Olsen, PA., 1999. Tail distribution modelling using the Richter and power exponential distributions
1998
Chen, S., Gales, MJF., Gopalakrishnan, PS., Gopinath, RA., Kavensky, D., Olsen, P. and Polymenakos, L., 1998. IBM's LVCSR system for transcription of broadcast news used in the 1997 hub4 English evaluation
Gales, MJF., 1998. Semi-tied covariance matrices Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 2
Gales, MJF., 1998. Cluster adaptive training for speech recognition Proceedings of 5th International Conference on Spoken Language Processing,
1997
Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. Broadcast news transcription using HTK Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 2
Doi: http://doi.org/10.1109/ICASSP.1997.596005
Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. The development of the 1996 HTK broadcast news transcription system Proceedings of DARPA Speech Recognition Workshop,
Nock, H., Gales, MJF. and Young, SJ., 1997. A comparative study of methods for phonetic decision-tree state clustering
Gales, MJF., 1997. Transformation smoothing for speaker and environmental adaptation
1996
Woodland, PC., Gales, MJF., Pye, D. and Valtchev, V., 1996. The HTK large vocabulary recognition system for the 1995 ARPA H3 task Proceedings of the ARPA Continuous Speech Recognition Workshop,
Gales, MJF., Pye, D. and Woodland, PC., 1996. Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, v. 3
Knill, K., Gales, MJF. and Young, SJ., 1996. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP), v. 1
Woodland, PC., Gales, MJF. and Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 96, v. 1
Doi: http://doi.org/10.1109/ICASSP.1996.540291
Woodland, PC., Pye, D. and Gales, MJF., 1996. Iterative unsupervised adaptation using maximum likelihood linear regression 4th International Conference on Spoken Language Processing (ICSLP 1996), v. 2
1995
Knill, K., Gales, MJF. and Young, SJ., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),
Gales, MJF. and Young, SJ., 1995. A fast and flexible implementation of parallel model combination Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1: Speech
Gopinath, RA., Gales, MJF., Gopalakrishnan, PS., Balakrishnan Aiyer, S. and Picheny, MA., 1995. Robust speech recognition in noise --- performance of the IBM continuous speech recogniser on the ARPA noise spoke task Proceedings of the ARPA Spoken Language Systems Technology Workshop,
Gales, MJF. and Young, SJ., 1995. The application of parallel model combination to a large vocabulary dictation task Proceedings of the 4th European Conference on Speech Communication and Technology (EUROSPEECH '95), v. 3
1994
Gales, MJF. and Young, SJ., 1994. Parallel model combination on a noise corrupted Resource Management task 3rd International Conference on Spoken Language Processing, ICSLP 1994,
1993
Gales, MJF. and Young, SJ., 1993. HMM recognition in noise using parallel model combination EUROSPEECH 93 proceedings, v. 2
Gales, MJF. and Young, SJ., 1993. Segmental hidden Markov models EUROSPEECH 93 proceedings, v. 3
1992
Gales, MJF. and Young, S., 1992. An improved approach to the hidden Markov model decomposition of speech and noise ICASSP-92 - 1992 International Conference on Acoustics, Speech, and Signal Processing, vols 1-5,