Professor Mark Gales | Cambridge Language Sciences

Publications (from Symplectic)

Theses / dissertations

2022 (No publication date)

Kyriakopoulos, K., 2022 (No publication date). Deep Learning for Automatic Assessment and Feedback of Spoken English
Doi: http://doi.org/10.17863/CAM.82947

1995

Gales, MJF., 1995. Model-based techniques for noise robust speech recognition

Journal articles

2022

Fathullah, Y. and Gales, MJF., 2022. Self-Distribution Distillation: Efficient Uncertainty Estimation

Ragni, A., Gales, MJF., Rose, O., Knill, KM., Kastanos, A., Li, Q. and Ness, PM., 2022. Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 30
Doi: http://doi.org/10.1109/TASLP.2022.3161153

2021

Malinin, A., Band, N., Ganshin, , Alexander, , Chesnokov, G., Gal, Y., Gales, MJF., Noskov, A., Ploskonosov, A., Prokhorenkova, L., Provilkov, I., Raina, V., Raina, V., Roginskiy, , Denis, , Shmatova, M., Tigas, P. and Yangel, B., 2021. Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

Dou, Q., Lu, Y., Manakul, P., Wu, X. and Gales, MJF., 2021. Attention Forcing for Machine Translation

Raina, V. and Gales, MJF., 2021. An Initial Investigation of Non-Native Spoken Question-Answering

2019

Chen, X., Liu, X., Wang, Y., Ragni, A., Wong, JHM. and Gales, MJF., 2019. Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 27
Doi: http://doi.org/10.1109/TASLP.2019.2922048

Wong, JHM., Gales, MJF. and Wang, Y., 2019. General sequence teacher-student learning IEEE/ACM Transactions on Audio Speech and Language Processing, v. 27
Doi: http://doi.org/10.1109/TASLP.2019.2929859

Wang, L., Wang, Y. and Gales, MJF., 2019. Non-native Speaker Verification for Spoken Language Assessment

Dou, Q., Lu, Y., Efiong, J. and Gales, MJF., 2019. Attention Forcing for Sequence-to-sequence Model Training

2018

Wang, Y., Gales, MJF., Knill, KM., Kyriakopoulos, K., Malinin, A., van Dalen, RC. and Rashid, M., 2018. Towards automatic assessment of spontaneous spoken English Speech Communication, v. 104
Doi: http://doi.org/10.1016/j.specom.2018.09.002

Degottex, G., Lanchantin, P. and Gales, M., 2018. A Log Domain Pulse Model for Parametric Speech Synthesis IEEE/ACM Transactions on Audio, Speech, and Language Processing, v. 26
Doi: http://doi.org/10.1109/TASLP.2017.2761546

2017

Wu, C., Gales, M., Ragni, A., Karanasou, P. and Sim, KC., 2017. Improving Interpretability and Regularisation in Deep Learning IEEE/ACM Transactions on Audio Speech and Language Processing,
Doi: http://doi.org/10.1109/TASLP.2017.2774919

Karanasou, P., Wu, C., Gales, M. and Woodland, PC., 2017. I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models IEEE/ACM Transactions on Audio Speech and Language Processing, v. 25
Doi: http://doi.org/10.1109/TASLP.2017.2670141

Chen, X., Liu, X., Ragni, A., Wang, Y. and Gales, MJF., 2017. Future word contexts in neural network language models 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
Doi: http://doi.org/10.1109/ASRU.2017.8268922

2016

Chen, X., Liu, X., Wang, Y., Gales, MJF. and Woodland, PC., 2016. Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 24
Doi: 10.1109/TASLP.2016.2598304

Liu, X., Chen, X., Wang, Y., Gales, MJF. and Woodland, PC., 2016. Two efficient lattice rescoring methods using recurrent neural network language models IEEE/ACM Transactions on Audio Speech and Language Processing, v. 24
Doi: http://doi.org/10.1109/TASLP.2016.2558826

2015

Yoshioka, T. and Gales, MJF., 2015. Environmentally robust ASR front-end for deep neural network acoustic models Computer Speech and Language, v. 31
Doi: http://doi.org/10.1016/j.csl.2014.11.008

Chen, L., Braunschweiler, N. and Gales, MJF., 2015. Speaker and Expression Factorization for Audiobook Data: Expressiveness and Transplantation IEEE Transactions on Audio, Speech and Language Processing, v. 23
Doi: http://doi.org/10.1109/TASLP.2014.2385478

2014

Wan, V., Latorre, J., Yanagisawa, K., Braunschweiler, N., Chen, L., Gales, MJF. and Akamine, M., 2014. Building HMM-TTS voices on diverse data IEEE Journal on Selected Topics in Signal Processing, v. 8
Doi: http://doi.org/10.1109/JSTSP.2013.2295058

Chen, L., Gales, MJF., Braunschweiler, N., Akamine, M. and Knill, K., 2014. Integrated expression prediction and speech synthesis from text IEEE Journal on Selected Topics in Signal Processing, v. 8
Doi: http://doi.org/10.1109/JSTSP.2013.2294938

Liu, X., Gales, MJF. and Woodland, PC., 2014. Paraphrastic language models Computer Speech and Language, v. 28
Doi: http://doi.org/10.1016/j.csl.2014.04.004

Lanchantin, P., Gales, MJF., King, S. and Yamagishi, J., 2014. Multiple-average-voice-based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6853603

2013

Seigel, MS., Woodland, PC. and Gales, MJF., 2013. A confidence-based approach for improving keyword hypothesis scores ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639337

Liu, X., Gales, MJF. and Woodland, PC., 2013. Paraphrastic language models and combination with neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639308

van Dalen, RC. and Gales, MJF., 2013. Importance sampling to compute likelihoods of noise-corrupted speech COMPUTER SPEECH AND LANGUAGE, v. 27
Doi: http://doi.org/10.1016/j.csl.2012.06.007

Mamou, J., Cui, J., Cui, X., Gales, MJF., Kingsbury, B., Knill, K., Mangu, L., Nolden, D., Picheny, M., Ramabhadran, B., Schluter, R., Sethy, A. and Woodland, PC., 2013. System combination and score normalization for spoken term detection ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639278

Kingsbury, B., Cui, J., Cui, X., Gales, MJF., Knill, K., Mamou, J., Mangu, L., Nolden, D., Picheny, M., Ramabhadran, B., Schluter, R., Sethy, A. and Woodland, PC., 2013. A high-performance Cantonese keyword search system ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639279

Wang, YQ. and Gales, MJF., 2013. Tandem system adaptation using multiple linear feature transforms ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639209

Liu, X., Hieronymus, JL., Gales, MJF. and Woodland, PC., 2013. Syllable language models for Mandarin speech recognition: exploiting character language models. J Acoust Soc Am, v. 133
Doi: http://doi.org/10.1121/1.4768800

Zhang, S-X. and Gales, MJF., 2013. Structured SVMs for Automatic Speech Recognition IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v. 21
Doi: http://doi.org/10.1109/TASL.2012.2227734

Liu, X., Gales, MJF. and Woodland, PC., 2013. Language model cross adaptation for LVCSR system combination Computer Speech and Language, v. 27
Doi: http://doi.org/10.1016/j.csl.2012.07.010

Maia, R., Akamine, M. and Gales, MJF., 2013. Complex cepstrum for statistical parametric speech synthesis SPEECH COMMUNICATION, v. 55
Doi: http://doi.org/10.1016/j.specom.2012.12.008

Long, Y., Gales, MJF., Lanchantin, P., Liu, X., Seigel, MS. and Woodland, PC., 2013. Improving lightly supervised training for broadcast transcription Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Liu, X., Gales, MJF. and Woodland, PC., 2013. Use of contexts in language model interpolation and adaptation Computer Speech and Language, v. 27
Doi: http://doi.org/10.1016/j.csl.2012.06.004

Yang, J., Van Dalen, RC. and Gales, M., 2013. Infinite support vector machines in speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

2012

Flego, F. and Gales, MJF., 2012. Factor analysis based VTS discriminative adaptive training ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2012.6288960

Gales, MJF., Watanabe, S. and Fosler-Lussier, E., 2012. Structured discriminative models for speech recognition: An overview IEEE Signal Processing Magazine, v. 29
Doi: http://doi.org/10.1109/MSP.2012.2207140

Bell, PJ., Gales, MJF., Lanchantin, P., Liu, X., Long, Y., Renals, S., Swietojanski, P. and Woodland, PC., 2012. Transcription of multi-genre media archives using out-of-domain data 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2012.6424244

Zen, H., Gales, MJF., Nankaku, Y. and Tokuda, K., 2012. Product of Experts for Statistical Parametric Speech Synthesis IEEE Transactions on Audio, Speech and Language Processing, v. 20

Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2012. Morphological decomposition in Arabic ASR systems Computer Speech and Language,

Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2012. Morphological decomposition in Arabic ASR systems Computer Speech and Language, v. 26
Doi: http://doi.org/10.1016/j.csl.2011.12.001

Zen, H., Braunschweiler, N., Buchholz, S., Gales, MJF., Knill, K., Krstulović, S. and Latorre, J., 2012. Statistical parametric speech synthesis based on speaker and language factorization IEEE Transactions on Audio, Speech and Language Processing, v. 20
Doi: http://doi.org/10.1109/TASL.2012.2187195

Wang, Y. and Gales, MJF., 2012. Speaker and Noise Factorization for Robust Speech Recognition IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v. 20
Doi: http://doi.org/10.1109/TASL.2012.2198059

Gales, MJF. and Flego, F., 2012. Model-Based Approaches for Degraded Channel Modelling in Robust ASR 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,

Liu, X., Gales, MJF. and Woodland, PC., 2012. Paraphrastic Language Models 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,

2011

Liu, X., Gales, MJF. and Woodland, PC., 2011. Improving LVCSR system combination using neural network language model cross adaptation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Li, T., Woodland, PC., Diehl, F. and Gales, MJF., 2011. Graphone model interpolation and Arabic pronunciation generation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2011. The efficient incorporation of MLP features into automatic speech recognition systems Computer Speech and Language, v. 25
Doi: http://doi.org/10.1016/j.csl.2010.07.005

Kim, D. and Gales, MJF., 2011. Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition IEEE Transactions on Audio, Speech and Language Processing, v. 19
Doi: http://doi.org/10.1109/TASL.2010.2047756

Van Dalen, RC. and Gales, MJF., 2011. Extended VTS for noise-robust speech recognition IEEE Transactions on Audio, Speech and Language Processing, v. 19
Doi: http://doi.org/10.1109/TASL.2010.2061226

Flego, F. and Gales, MJF., 2011. Factor analysis based VTS and JUD noise estimation and compensation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947427

Wang, YQ. and Gales, MJF., 2011. Speaker and noise factorisation on the AURORA4 task ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947375

Latorre, J., Gales, MJF., Buchholz, S., Knill, K., Tamura, M., Ohtani, Y. and Akamine, M., 2011. CONTINUOUS F0 IN THE SOURCE-EXCITATION GENERATION FOR HMM-BASED TTS: DO WE NEED VOICEDIUNVOICED CLASSIFICATION? 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING,

Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2011. Investigation of acoustic units for LVCSR systems ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947447

Chin, KK., Xu, HT., Gales, MJF., Breslin, C. and Knill, K., 2011. RAPID JOINT SPEAKER AND NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING,

Chen, L., Gales, MJF. and Chin, KK., 2011. Constrained discriminative mapping transforms for unsupervised speaker adaptation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947565

Ragni, A. and Gales, MJF., 2011. Structured discriminative models for noise robust continuous speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947426

Zen, H. and Gales, MJF., 2011. Decision tree-based context clustering based on cross validation and hierarchical priors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947369

Gales, MJF. and Wang, YQ., 2011. Model-based approaches to handling additive noise in reverberant environments 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11,
Doi: http://doi.org/10.1109/HSCMA.2011.5942377

Xu, HT., Gales, MJF. and Chin, KK., 2011. Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition IEEE T AUDIO SPEECH, v. 19
Doi: http://doi.org/10.1109/TASL.2010.2096214

Van Dalen, RC. and Gales, MJF., 2011. A variational perspective on noise-robust speech recognition 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
Doi: http://doi.org/10.1109/ASRU.2011.6163917

Wang, YQ. and Gales, MJF., 2011. Improving reverberant VTS for hands-free robust speech recognition 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
Doi: http://doi.org/10.1109/ASRU.2011.6163915

Ragni, A. and Gales, MJF., 2011. Derivative kernels for noise robust ASR 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
Doi: http://doi.org/10.1109/ASRU.2011.6163916

Zhang, SX. and Gales, MJF., 2011. Extending noise robust structured support vector machines to larger vocabulary tasks 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
Doi: http://doi.org/10.1109/ASRU.2011.6163898

Dieh, F., Gales, MJF., Liu, X., Tomalin, M. and Woodland, PC., 2011. Word boundary modelling and full covariance gaussians for Arabic Speech-to-Text systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Breslin, C., Chin, KK., Gales, MJF. and Knill, K., 2011. Integrated online speaker clustering and adaptation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

2010

Zhang, SX., Ragni, A. and Gales, MJF., 2010. Structured Log Linear Models for Noise Robust Speech Recognition IEEE SIGNAL PROC LET, v. 17
Doi: http://doi.org/10.1109/LSP.2010.2077626

Gales, MJF. and Flego, F., 2010. Discriminative classifiers with adaptive kernels for noise robust speech recognition Computer Speech and Language, v. 24
Doi: http://doi.org/10.1016/j.csl.2009.09.002

Yu, K., Gales, MJF., Wang, L. and Woodland, PC., 2010. Unsupervised training and directed manual transcription for LVCSR Speech Communication, v. 52
Doi: http://doi.org/10.1016/j.specom.2010.02.014

Yu, K., Gales, M., Wang, L. and Woodland, PC., 2010. Unsupervised training and directed manual transcription for LVCSR SPEECH COMMUN, v. 52
Doi: http://doi.org/10.1016/j.specom.2010.02.014

Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2010. Language model combination and adaptation using weighted finite state transducers ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2010.5494941

2009

Longworth, C. and Gales, MJF., 2009. Combining derivative and parametric kernels for speaker verification IEEE Transactions on Audio Speech and Language Processing, v. 17
Doi: http://doi.org/10.1109/TASL.2008.2012193

Yu, K., Gales, MJF. and Woodland, PC., 2009. Unsupervised adaptation with discriminative mapping transforms IEEE Transactions on Audio Speech and Language Processing, v. 17
Doi: http://doi.org/10.1109/TASL.2008.2011535

Breslin, C. and Gales, MJF., 2009. Directed decision trees for generating complementary systems Speech Communication, v. 51
Doi: http://doi.org/10.1016/j.specom.2008.09.004

Hieronymus, JL., Liu, X., Gales, MJF. and Woodland, PC., 2009. Exploiting Chinese character models to improve speech recognition performance Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

2008

Liao, H. and Gales, MJF., 2008. Issues with uncertainty decoding for noise robust automatic speech recognition Speech Communication, v. 50
Doi: http://doi.org/10.1016/j.specom.2007.10.004

2007

Sim, KC. and Gales, MJF., 2007. Discriminative semi-parametric trajectory models for speech recognition Computer Speech and Language, v. 21
Doi: http://doi.org/10.1016/j.csl.2007.03.004

Layton, M. and Gales, MJF., 2007. Acoustic modelling using continuous rational kernels Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, v. 48
Doi: http://doi.org/10.1007/s11265-006-0027-4

Yu, K. and Gales, MJF., 2007. Bayesian adaptive inference and adaptive training IEEE Transactions on Audio Speech and Language Processing, v. 15
Doi: http://doi.org/10.1109/TASL.2007.901300

Gales, MJF. and Young, SJ., 2007. The application of hidden Markov models in speech recognition Foundations and Trends in Signal Processing, v. 1
Doi: http://doi.org/10.1561/20000000004

Liu, X. and Gales, MJF., 2007. Automatic model complexity control using marginalized discriminative growth functions IEEE Transactions on Audio Speech and Language Processing, v. 15
Doi: http://doi.org/10.1109/TASL.2006.889804

Liu, X. and Gales, M., 2007. Automatic model complexity control using marginalized discriminative growth functions IEEE Transactions on Audio, Speech and Language Processing, v. 15
Doi: http://doi.org/10.1109/TASL.2006.889804

Tomalin, M., Gales, MJF., Liu, XA., Sim, KC., Sinha, R., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-english translation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367172

2006

Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, GL., Povey, D. and Wang, L., 2006. Corrections to “Automatic transcription of conversational telephone speech” IEEE Transactions on Audio, Speech and Language Processing, v. 14
Doi: http://doi.org/10.1109/TASL.2006.871051

Yu, K. and Gales, MJF., 2006. Discriminative cluster adaptive training IEEE Transactions on Audio Speech and Language Processing, v. 14
Doi: http://doi.org/10.1109/TSA.2005.858555

Sim, KC. and Gales, MJF., 2006. Minimum phone error training of precision matrix models IEEE Transactions on Audio Speech and Language Processing, v. 14
Doi: http://doi.org/10.1109/TSA.2005.858062

Gales, MJF. and Layton, MI., 2006. Training augmented models using SVMs IEICE Transactions on Information and Systems, v. E89-D
Doi: http://doi.org/10.1093/ietisy/e89-d.3.892

Gales, MJF. and Airey, SS., 2006. Product of Gaussians for speech recognition Computer Speech and Language, v. 20
Doi: http://doi.org/10.1016/j.csl.2004.12.002

Gales, MJF., Kim, DY., Woodland, PC., Chan, HY., Mrva, D., Sinha, R. and Tranter, SE., 2006. Progress in the CU-HTK broadcast news transcription system IEEE Transactions on Speech and Audio Processing, v. 14
Doi: http://doi.org/10.1109/TASL.2006.878264

Sinha, R., Gales, MJF., Kim, DY., Liu, XA., Sim, KC. and Woodland, PC., 2006. The CU-HTK Mandarin broadcast news transcription system ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1

2005

Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, GL., Povey, D. and Wang, L., 2005. Automatic transcription of conversational telephone speech IEEE Transactions on Speech and Audio Processing, v. 13
Doi: http://doi.org/10.1109/TSA.2005.852999

Sinha, R., Tranter, SE., Gales, MJF. and Woodland, PC., 2005. The Cambridge University March 2005 Speaker Diarisation System Interspeech: 9th European Conference on Speech Communciation and Technology,

2004

Rosti, AVI. and Gales, MJF., 2004. Factor analysed hidden Markov models for speech recognition Computer Speech and Language, v. 18
Doi: http://doi.org/10.1016/j.csl.2003.09.004

2003

Povey, D., Gales, MJF., Kim, DY. and Woodland, PC., 2003. MMI-MAP and MPE-MAP for acoustic model adaptation Eurospeech Proceedings: 8th Speech Communication and Technology Conference, v. 8

2002

Gales, MJF., 2002. Transformation streams and the HMM error model COMPUT SPEECH LANG, v. 16
Doi: http://doi.org/10.1006/csla.2002.193

Chen, SS., Eide, EM., Gales, MJF., Gopinath, RA., Kanevsky, D. and Olsen, P., 2002. Automatic transcription of broadcast news Speech communication, v. 37
Doi: http://doi.org/10.1016/S0167-6393(01)00060-7

Gales, MJF., 2002. Transformation streams and the HMM error model Computer Speech and Language, v. 16
Doi: http://doi.org/10.1006/csla.2002.0193

Gales, MJF., 2002. Maximum likelihood multiple subspace projections for hidden markov models IEEE transactions on Speech and Audio Processing, v. 10
Doi: http://doi.org/10.1109/89.985541

2000

Gales, MJF., 2000. Factored semi-tied covariance matrices Advances In Neural Information Processing Systems,

Gales, MJF., 2000. Cluster adaptive training of hidden markov models IEEE Transactions on Speech and Audio Processing, v. 8
Doi: http://doi.org/10.1109/89.848223

1999

Gales, MJF., 1999. Semi-tied covariance matrices for hidden markov models IEEE Transactions on Speech and Audio Processing, v. 7
Doi: http://doi.org/10.1109/89.759034

Gales, MJF., Knill, K. and Young, SJ., 1999. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs IEEE Transactions on Speech and Audio Processing, v. 7
Doi: 10.1109/89.748120

1998

Gales, MJF., 1998. Predictive model-based compensation schemes for robust speech recognition Speech Communication, v. 25
Doi: http://doi.org/10.1016/S0167-6393(98)00029-6

Gales, MJF., 1998. Maximum likelihood linear transformations for HMM-based speech recognition Computer Speech and Language, v. 12
Doi: http://doi.org/10.1006/csla.1998.0043

1997

Gales, MJF., 1997. Predictive model-based compensation schemes for robust speech recognition Speech Communication, v. 25

1996

Gales, MJF. and Woodland, PC., 1996. Mean and variance adaptation within the MLLR framework Computer Speech and Language, v. 10
Doi: http://doi.org/10.1006/csla.1996.0013

Gales, MJF. and Young, SJ., 1996. Robust continuous speech recognition using parallel model combination IEEE Proceedings on Speech and Audio Processing, v. 4
Doi: http://doi.org/10.1109/89.536929

1995

Gales, MJF. and Young, SJ., 1995. Robust speech recognition in additive and convolutional noise using parallel model combination Computer Speech and Language, v. 9
Doi: http://doi.org/10.1006/csla.1995.0014

Woodland, PC., Gales, MJF., Pye, D. and Valtchev, V., 1995. Large vocabulary multilingual speech recognition using HTK Eurospeech Proceedings: 4th European Conference on Speech Communication and Technology, v. 1

1993

GALES, MJF. and YOUNG, SJ., 1993. CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE SPEECH COMMUN, v. 12

Conference proceedings

2021 (Accepted for publication)

Gales, M. and Malinin, A., 2021 (Accepted for publication). UNCERTAINTY ESTIMATION IN AUTOREGRESSIVE STRUCTURED PREDICTION
Doi: http://doi.org/10.17863/CAM.63497

Gales, M., Malinin, A. and Ryabinin, M., 2021 (Accepted for publication). Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets
Doi: http://doi.org/10.17863/CAM.78106

2021

Manakul, P. and Gales, MJF., 2021. Long-span summarization via local attention and content selection ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,

Manakul, P. and Gales, MJF., 2021. Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings,

Wei, X., Gales, MJF. and Knill, KM., 2021. Analysing bias in spoken language assessment using concept activation vectors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9413988

Fathullah, Y., Gales, MJF. and Malinin, A., 2021. Ensemble distillation approaches for grammatical error correction ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9413385

Lu, Y., Wang, Y. and Gales, MJF., 2021. Efficient use of end-to-end data in spoken language processing ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
Doi: http://doi.org/10.1109/ICASSP39728.2021.9414510

Dou, Q., Wu, X., Wan, M., Lu, Y. and Gales, MJF., 2021. Deliberation-based multi-pass speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 5
Doi: http://doi.org/10.21437/Interspeech.2021-1405

2020

Wu, X., Knill, KM., Gales, MJF. and Malinin, A., 2020. Ensemble approaches for uncertainty in spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2238

Kastanos, A., Ragni, A. and Gales, MJF., 2020. Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2020-May
Doi: http://doi.org/10.1109/ICASSP40776.2020.9053264

Raina, V., Gales, MJF. and Knill, K., 2020. Universal adversarial attacks on spoken language assessment systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: 10.21437/Interspeech.2020-1890

Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2020. Automatic detection of accent and lexical pronunciation errors in spontaneous non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: 10.21437/Interspeech.2020-2881

Dou, Q., Efiong, J. and Gales, MJF., 2020. Attention forcing for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2520

Raina, V., Gales, MJF. and Knill, K., 2020. Complementary systems for Off-Topic spoken response detection Proceedings of the Annual Meeting of the Association for Computational Linguistics,

Manakul, P., Gales, MJF. and Wang, L., 2020. Abstractive spoken document summarization using hierarchical model with multi-stage attention diversity optimization Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-1683

Lu, Y., Gales, MJF. and Wang, Y., 2020. Spoken language 'grammatical error correction' Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-1852

Knill, KM., Wang, L., Wang, Y., Wu, X. and Gales, MJF., 2020. Non-native children's automatic speech recognition: The INTERSPEECH 2020 shared task ALTA systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
Doi: http://doi.org/10.21437/Interspeech.2020-2154

2019 (Accepted for publication)

Gales, M. and Malinin, A., 2019 (Accepted for publication). Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness Advances in Neural Information Processing Systems 32 (NeurIPS 2019),

Li, Q., Ness, P., Ragni, A. and Gales, M., 2019 (Accepted for publication). BI-DIRECTIONAL LATTICE RECURRENT NEURAL NETWORKS FOR CONFIDENCE ESTIMATION
Doi: http://doi.org/10.17863/CAM.36745

Lu, Y., Gales, M., Knill, K., Manakul, P. and Wang, Y., 2019 (Accepted for publication). Disfluency Detection for Spoken Learner English
Doi: http://doi.org/10.17863/CAM.42082

Gales, M., Malinin, A. and Mlodozeniec, B., 2019 (Accepted for publication). Ensemble Distribution Distillation
Doi: http://doi.org/10.17863/CAM.49348

2019

Lu, Y., Gales, MJF., Knill, KM., Manakul, P., Wang, L. and Wang, Y., 2019. Impact of ASR performance on spoken grammatical error detection Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
Doi: http://doi.org/10.21437/Interspeech.2019-1706

Knill, KM., Gales, MJF., Manakul, PP. and Caines, AP., 2019. Automatic Grammatical Error Detection of Non-native Spoken Learner English ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2019-May
Doi: 10.1109/ICASSP.2019.8683080

Knill, K., Gales, M., Manakul, P. and Caines, A., 2019. Automatic grammatical error detection of non-native spoken learner English ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: 10.1109/icassp.2019.8683755

Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2019. A deep learning approach to automatic characterisation of rhythm in non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
Doi: http://doi.org/10.21437/Interspeech.2019-3186

Wong, JHM., Gales, MJF. and Wang, Y., 2019. Learning between Different Teacher and Student Models in ASR 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings,
Doi: http://doi.org/10.1109/ASRU46091.2019.9003756

2018

Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2018. A deep learning approach to assessing non-native pronunciation of English using phone distances Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1087

Knill, KM., Gales, MJF., Kyriakopoulos, K., Malinin, A., Ragni, A., Wang, Y. and Caines, AP., 2018. Impact of ASR performance on free speaking language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1312

Wan, M., Degottex, G. and Gales, MJF., 2018. Waveform-based speaker representations for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1154

Degottex, G. and Gales, M., 2018. A Spectrally Weighted Mixture of Least Square Error and Wasserstein Discriminator Loss for Generative SPSS 2018 IEEE Spoken Language Technology Workshop (SLT),
Doi: 10.1109/slt.2018.8639609

Wang, Y., Zhang, C., Gales, MJF. and Woodland, PC., 2018. Speaker adaptation and adaptive training for jointly optimised tandem systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-2432

Wang, Y., Wong, JHM., Gales, MJF., Knill, KM. and Ragni, A., 2018. Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: 10.1109/SLT.2018.8639557

Ragni, A., Li, Q., Gales, MJF. and Wang, Y., 2018. Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: 10.1109/SLT.2018.8639678

Malinin, A. and Gales, M., 2018. Predictive Uncertainty Estimation via Prior Networks NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, v. 31

Ragni, A. and Gales, MJF., 2018. Automatic speech recognition system development in the “wild“ Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-1085

Chen, O., Ragni, A., Gales, MJF. and Chen, X., 2018. Active memory networks for language modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
Doi: http://doi.org/10.21437/Interspeech.2018-78

Dou, Q., Wan, M., Degottex, G., Ma, Z. and Gales, MJF., 2018. Hierarchical RNNs for Waveform-Level Speech Synthesis 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: 10.1109/SLT.2018.8639588

Del Vecchio, M., Malinin, A. and Gales, MJF., 2018. Improved Auto-Marking Confidence for Spoken Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
Doi: 10.1109/SLT.2018.8639634

Wang, Y., Chen, X., Gales, MJF., Ragni, A. and Wong, JHM., 2018. Phonetic and graphemic systems for multi-genre broadcast transcription ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2018-April
Doi: http://doi.org/10.1109/ICASSP.2018.8462353

2017 (Accepted for publication)

Kyriakopoulos, K., Gales, M. and Knill, K., 2017 (Accepted for publication). Automatic characterisation of the pronunciation of non-native English speakers using phone distance features http://www.isca-speech.org/archive/SLaTE_2017/,
Doi: http://doi.org/10.21437/SLaTE.2017-11

Malinin, A., Knill, K., Ragni, A., Wang, Y. and Gales, M., 2017 (Accepted for publication). An attention based model for off-topic spontaneous spoken response detection: An Initial Study http://www.isca-speech.org/archive/SLaTE_2017/,
Doi: http://doi.org/10.21437/SLaTE.2017-25

Wong, JHM. and Gales, MJF., 2017 (Accepted for publication). Student-teacher training with diverse decision tree ensembles

2017

Knill, KM., Gales, MJF., Kyriakopoulos, K., Ragni, A. and Wang, Y., 2017. Use of graphemic lexicons for spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: 10.21437/Interspeech.2017-978

Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating bidirectional recurrent neural network language models for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: http://doi.org/10.21437/Interspeech.2017-513

Wu, C. and Gales, MJF., 2017. Deep activation mixture model for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
Doi: http://doi.org/10.21437/Interspeech.2017-1233

Malinin, A., Knill, K. and Gales, MJF., 2017. A hierarchical attention based model for off-topic spontaneous spoken response detection 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
Doi: 10.1109/ASRU.2017.8268963

Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6,
Doi: http://doi.org/10.21437/Interapeech.2017-513

Malinin, A., Ragni, A., Knill, KM. and Gales, MJF., 2017. Incorporating uncertainty into deep learning for spoken language assessment ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 2
Doi: http://doi.org/10.18653/v1/P17-2008

Chen, X., Ragni, A., Vasilakes, J., Liu, X., Knill, K. and Gales, MJF., 2017. Recurrent neural network language models for keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2017.7953263

Wan, M., Degottex, G., Gales, MJF. and IEEE, , 2017. Integrated speaker-adaptive speech synthesis 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
Doi: http://doi.org/10.1109/ASRU.2017.8269006

Ragni, A., Wu, C., Gales, MJF., Vasilakes, J. and Knill, KM., 2017. Stimulated training for automatic speech recognition and keyword search in limited resource conditions ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: 10.1109/ICASSP.2017.7953074

Wong, JHM. and Gales, MJF., 2017. Multi-task ensembles with teacher-student training 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
Doi: http://doi.org/10.1109/ASRU.2017.8268920

Ragni, A., Saunders, D., Zahemszky, P., Vasilakes, J., Gales, MJF. and Knill, KM., 2017. Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: 10.1109/ICASSP.2017.7953262

Gales, MJF., Knill, KM. and Ragni, A., 2017. Low-resource speech recognition and keyword-spotting Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 10458 LNAI
Doi: 10.1007/978-3-319-66429-3_1

2016

Ragni, A., Dakin, E., Chen, X., Gales, MJF. and Knill, KM., 2016. Multi-language neural network language models Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-371

Yang, J., Ragni, A., Gales, MJF. and Knill, KM., 2016. Log-linear system combination using structured support vector machines Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-377

Malinin, A., Van Dalen, RC., Wang, Y., Knill, KM. and Gales, MJF., 2016. Off-topic response detection for spontaneous spoken English assessment 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 2
Doi: http://doi.org/10.18653/v1/p16-1102

Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Selection of multi-genre broadcast data for the training of automatic speech recognition systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-462

Wu, C., Karanasou, P., Gales, MJF. and Sim, KC., 2016. Stimulated deep neural network for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-580

Wong, JHM. and Gales, MJF., 2016. Sequence student-teacher training of deep neural networks Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
Doi: http://doi.org/10.21437/Interspeech.2016-911

Bell, P., Gales, MJF., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., McParland, A., Renals, S., Saz, O., Wester, M. and Woodland, PC., 2016. The MGB challenge: Evaluating multi-genre broadcast media recognition 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404863

Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2016. Investigation of back-off based interpolation between recurrent neural network and n-gram language models 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404792

Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. The development of the Cambridge university alignment systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404857

Woodland, PC., Liu, X., Qian, Y., Zhang, C., Gales, MJF., Karanasou, P., Lanchantin, P. and Wang, L., 2016. Cambridge university transcription systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404856

Cui, J., Kingsbury, B., Ramabhadran, B., Sethy, A., Audhkhasi, K., Cui, X., Kislal, E., Mangu, L., Nussbaum-Thom, M., Picheny, M., Tüske, Z., Golik, P., Schluter, R., Ney, H., Gales, MJF., Knill, KM., Ragni, A., Wang, H. and Woodland, P., 2016. Multilingual representations for low resource speech recognition and keyword search 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: 10.1109/ASRU.2015.7404803

Degottex, G., Lanchantin, P. and Gales, M., 2016. A Pulse Model in Log-domain for a Uniform Synthesizer Proceedings of the 9th ISCA Speech Synthesis Workshop,

Karanasou, P., Gales, MJF., Lanchantin, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Speaker diarisation and longitudinal linking in multi-genre broadcast data 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404859

Van Dalen, RC., Yang, J., Wang, H., Ragni, A., Zhang, C. and Gales, MJF., 2016. Structured discriminative models using deep neural-network features 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404789

Wang, L., Zhang, C., Woodland, PC., Gales, MJF., Karanasou, P., Lanchantin, P., Liu, X. and Qian, Y., 2016. Improved DNN-based segmentation for multi-genre broadcast audio ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472769

Chen, X., Liu, X., Qian, Y., Gales, MJF. and Woodland, PC., 2016. CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472829

Yang, J., Zhang, C., Ragni, A., Gales, MJF. and Woodland, PC., 2016. System combination with log-linear models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472764

Wu, C., Karanasou, P. and Gales, MJF., 2016. Combining i-vector representation and structured neural networks for rapid adaptation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
Doi: http://doi.org/10.1109/ICASSP.2016.7472629

2015

Van Dalen, RC. and Gales, MJF., 2015. Annotating large lattices with the exact word error Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. Improving multiple-crowd-sourced transcriptions using a speech recogniser ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: 10.1109/ICASSP.2015.7178864

Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. Paraphrastic recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179004

Ragni, A., Gales, MJF. and Knill, KM., 2015. A language space representation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: 10.1109/ICASSP.2015.7178849

Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Improving the training and evaluation efficiency of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179003

Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model training with noise contrastive estimation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7179005

Gales, MJF., Knill, KM. and Ragni, A., 2015. Unicode-based graphemic systems for limited resource languages ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: 10.1109/ICASSP.2015.7178960

Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. Robust excitation-based features for Automatic Speech Recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178855

van Dalen, RC., Knill, KM. and Gales, MJF., 2015. Automatically Grading Learners’ English Using a Gaussian Process Speech and Language Technology in Education, SLaTE 2015,

Wang, H., Ragni, A., Gales, MJF., Knill, KM., Woodland, PC. and Zhang, C., 2015. Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Lanchantin, P., Veaux, C., Gales, MJF., King, S. and Yamagishi, J., 2015. Reconstructing voices within the multiple-average-voice-model framework Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Chen, X., Tan, T., Liu, X., Lanchantin, P., Wan, M., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model adaptation for multi-genre broadcast speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Liu, X., Flego, F., Wang, L., Zhang, C., Gales, M. and Woodland, P., 2015. The Cambridge university 2014 BOLT conversational telephone Mandarin Chinese lvcsr system for speech translation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Mendels, G., Cooper, E., Soto, V., Hirschberg, J., Gales, M., Knill, K., Ragni, A. and Wang, H., 2015. Improving speech recognition and keyword search for low resource languages using web data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. IMPROVING MULTIPLE-CROWD-SOURCED TRANSCRIPTIONS USING A SPEECH RECOGNISER 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),

Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),

Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. ROBUST EXCITATION-BASED FEATURES FOR AUTOMATIC SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),

Wu, C. and Gales, MJF., 2015. Multi-basis adaptive neural network for rapid adaptation in speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178785

Wu, C. and Gales, MJF., 2015. MULTI-BASIS ADAPTIVE NEURAL NETWORK FOR RAPID ADAPTATION IN SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP),

2014

Rath, SP., Knill, KM., Ragni, A. and Gales, MJF., 2014. Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Kolluru, BK., Wan, V., Latorre, J., Yanagisawa, K. and Gales, MJF., 2014. Generating multiple-accent pronunciations for TTS using joint sequence model interpolation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Chen, X., Wang, Y., Liu, X., Gales, MJF. and Woodland, PC., 2014. Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Yanagisawa, K., Chen, L. and Gales, MJF., 2014. Noise-robust TTS speaker adaptation with statistics smoothing Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Latorre, J., Yanagisawa, K., Wan, V., Kolluru, BK. and Gales, MJF., 2014. Speech intonation for TTS: Study on evaluation methodology Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Chen, X., Gales, MJF., Knill, K., Breslin, C., Chen, L., Chin, KK. and Wan, V., 2014. An initial investigation of long-term adaptation for meeting transcription Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Liu, X., Wang, Y., Chen, X., Gales, MJF. and Woodland, PC., 2014. Efficient lattice rescoring using recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854535

Liu, X., Gales, MJF. and Woodland, PC., 2014. Paraphrastic neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854534

Yang, J., Van Dalen, RC., Zhang, SX. and Gales, MJF., 2014. Infinite structured support vector machines for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854215

Yoshioka, T., Ragni, A. and Gales, MJF., 2014. Investigation of unsupervised adaptation of DNN acoustic models with filter bank input ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854825

Chen, L., Braunschweiler, N. and Gales, MJF., 2014. Speaker dependent expression predictor from text: Expressiveness and transplantation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854065

Yoshioka, T., Chen, X. and Gales, MJF., 2014. Impact of single-microphone dereverberation on DNN-based meeting transcription systems ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854660

Karanasou, P., Wang, Y., Gales, MJF. and Woodland, PC., 2014. Adaptation of deep neural network acoustic models using factorised i-vectors Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Ragni, A., Knill, KM., Rath, SP. and Gales, MJF., 2014. Data augmentation for low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Knill, KM., Gales, MJF., Ragni, A. and Rath, SP., 2014. Language independent and unsupervised acoustic models for speech recognition and keyword spotting Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

2013

Lanchantin, P., Bell, PJ., Gales, MJF., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, MS., Swietojanski, P. and Woodland, PC., 2013. Automatic transcription of multi-genre media archives CEUR Workshop Proceedings, v. 1012

Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Chen, L., Gales, MJF., Braunschweiler, N., Akamine, M. and Knill, K., 2013. Integrated automatic expression prediction and speech synthesis from text ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639218

Latorre, J., Gales, MJF., Knill, K. and Akamine, M., 2013. Training a supra-segmental parametric F0 model without interpolating F0 ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6638995

Maia, R., Akamine, M. and Gales, MJF., 2013. Complex cepstrum analysis based on the minimum mean squared error ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639217

Van Dalen, RC., Ragni, A. and Gales, MJF., 2013. Efficient decoding with generative score-spaces using the expectation semiring ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639145

Zhang, SX. and Gales, MJF., 2013. Kernelized log linear models for continuous speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639009

Wang, YQ. and Gales, MJF., 2013. An explicit independence constraint for factorised adaptation in speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Knill, KM., Gales, MJF., Rath, SP., Woodland, PC., Zhang, C. and Zhang, S-X., 2013. Investigation of multilingual deep neural networks for spoken term detection 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings,
Doi: 10.1109/ASRU.2013.6707719

Long, Y., Gales, MJF., Lanchantin, P., Liu, X., Seigel, MS. and Woodland, PC., 2013. Improving Lightly Supervised Training for Broadcast Transcription 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,

Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain Paraphrasing For Improving Language Modelling Using Out-of-domain Data 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,

Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, B., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-Realistic Expressive Text to Talking Head Synthesis 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,

Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,

Wang, Y-Q. and Gales, MJF., 2013. An Explicit Independence Constraint for Factorised Adaptation in Speech Recognition 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5,

Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain paraphrasing for improving language modelling using out-of-domain data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, BK., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-realistic expressive text to talking head synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

2012 (No publication date)

Ragni, A. and Gales, MJF., 2012 (No publication date). Derivative Kernels for Noise Robust ASR

2012

Latorre, J., Wan, V., Gales, MJF., Chen, L., Chin, KK., Knill, K. and Akamine, M., 2012. Speech factorization for HMM-TTS based on cluster adaptive training. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2

Wang, Y-Q. and Gales, MJF., 2012. Model-based approaches to adaptive training in reverberant environments 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,

Wan, V., Latorre, J., Chin, KK., Chen, L., Gales, MJF., Zen, H., Knill, K. and Akamine, M., 2012. Combining multiple high quality corpora for improving HMM-TTS 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2

Eyben, F., Buchholz, S., Braunschweiler, N., Latorre, J., Wan, V., Gales, MJF. and Knill, K., 2012. Unsupervised clustering of emotion and voice styles for expressive TTS ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: 10.1109/ICASSP.2012.6288797

Maia, R., Akamine, M. and Gales, MJF., 2012. COMPLEX CEPSTRUM AS PHASE INFORMATION IN STATISTICAL PARAMETRIC SPEECH SYNTHESIS 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),

Ragni, A. and Gales, MJF., 2012. INFERENCE ALGORITHMS FOR GENERATIVE SCORE-SPACES 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),

Roupakia, Z., Ragni, A. and Gales, M., 2012. Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2

Chen, L., Gales, MJF., Wan, V., Latorre, J. and Akamine, M., 2012. Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,

2011

Braunschweiler, N., Gales, MJF. and Buchholz, S., 2011. Lightly supervised recognition for automatic alignment of large coherent speech recordings Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Breslin, C., Chin, KK., Gales, MJF., Knill, K. and Xu, H., 2011. Prior information for rapid speaker adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Gales, MJF. and Yu, K., 2011. Canonical state models for automatic speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian process experts for voice conversion Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Latorre, J., Gales, MJF. and Zen, H., 2011. Training a parametric-based logF0 model with the minimum generation error criterion Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse sequences for residual signal modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Liu, X., Gales, MJF. and Woodland, PC., 2011. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2011. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,

van Dalen, RC. and Gales, MJF., 2011. Asymptotically exact noise-corrupted speech likelihoods Proceedings of the 11th Annual Conference of the International Speech Communication Association,

Zhang, SX. and Gales, MJF., 2011. Structured support vector machines for noise robust continuous speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Diehl, F., Gales, MJF., Liu, X., Tomalin, M. and Woodland, PC., 2011. Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Liu, X., Gales, MJF. and Woodland, PC., 2011. Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Li, T., Woodland, PC., Diehl, F. and Gales, MJF., 2011. Graphone Model Interpolation and Arabic Pronunciation Generation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse Sequences for Residual Signal Modeling 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Zhang, S-X. and Gales, MJF., 2011. Structured Support Vector Machines for Noise Robust Continuous Speech Recognition 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian Process Experts for Voice Conversion 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Breslin, C., Chin, KK., Gales, MJF. and Knill, K., 2011. Integrated Online Speaker Clustering and Adaptation 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

2010

Liu, X., Gales, MJF. and Woodland, PC., 2010. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,

Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2010. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,

Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2010. Language model combination and adaptation using weighted finite state transducers Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2010.5494941

Tomalin, M., Park, J., Diehl, F., Gales, MJF. and Woodland, PC., 2010. Recent improvements to the Cambridge Arabic speech-to-text systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: 10.1109/ICASSP.2010.5495641

Zen, H., Gales, MJF., Nankaku, Y. and Tokuda, K., 2010. Statistical parametric synthesis based on products of experts Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2010.5495691

Flego, F. and Gales, MJF., 2010. Discriminative adaptive training with VTS and JUD Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5373266

Gales, MJF., Ragni, A., AlDamarki, H. and Gautier, C., 2010. Support vector machines for noise robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5372913

Xu, H., Gales, MJF. and Chin, KK., 2010. Improving joint uncertainty decoding performance by predictive methods for noise robust speech recognition Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
Doi: http://doi.org/10.1109/ASRU.2009.5373317

Maia, R., Zen, H. and Gales, MJF., 2010. Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters

2009

Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational Dynamic Kernels for Speaker Verification INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Kim, DK. and Gales, MJF., 2009. Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Gales, MJF., 2009. Acoustic Modelling for Speech Recognition: Hidden Markov Models and Beyond? 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009),

Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological Analysis and Decomposition for Arabic Speech-to-Text Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Flego, F. and Gales, MJF., 2009. Incremental Adaptation with VTS and Joint Adaptively Trained Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Flego, F. and Gales, MJF., 2009. Incremental predictive and adaptive noise compensation IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2009.4960464

Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Training and adapting MLP features for Arabic speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: 10.1109/ICASSP.2009.4960620

Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of Contexts in Language Model Interpolation and Adaptation INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Raut, CK. and Gales, MJF., 2009. Bayesian discriminative adaptation for speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960595

van Dalen, RC. and Gales, MJF., 2009. Extended VTS for noise-robust speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960462

van Dalen, RC. and Gales, MJF., 2009. Extended VTS for noise-robust speech recognition IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2009.4960462

Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological analysis and decomposition for Arabic speech-to-text systems Proceedings of the 10th International Conference of the International Speech Communication Association,

van Dalen, RC., Flego, F. and Gales, MJF., 2009. Transforming Features to Compensate Speech Recogniser Models for Noise INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Flego, F. and Gales, MJF., 2009. Incremental adaptation with VTS and joint adaptively trained systems Proceedings of the 10th International Conference of the International Speech Communication Association,

Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient Generation and Use of MLP Features for Arabic Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Gales, MJF. and Flego, F., 2009. Combining VTS model compensation and support vector machines Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Doi: http://doi.org/10.1109/ICASSP.2009.4960460

Hieronymus, JL., Liu, X., Gales, MJF. and Woodland, PC., 2009. Exploiting Chinese character models to improve speech recognition performance Proceedings of the 10th International Conference of the International Speech Communication Association,

Kim, D. and Gales, MJF., 2009. Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition 10th Annual Conference of the International Speech Communication Association, Interspeech 2009,

Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of contexts in language model interpolation and adaptation Proceedings of the 10th International Conference of the International Speech Communication Association,

Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational dynamic kernels for speaker verification Proceedings of the 10th International Conference of the International Speech Communication Association,

Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient generation and use of MLP features for Arabic speech recognition Proceedings of the 10th International Conference of the International Speech Communication Association,

van Dalen, RC. and Gales, MJF., 2009. Transforming features to compensate speech recogniser models for noise Proceedings of the 10th Annual Conference of the International Speech Communication Associatio,

2008

Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive Training using Discriminative Mapping Transforms INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2008. Phonetic pronunciations for arabic speech-to-text systems IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2008.4517924

Longworth, C. and Gales, MJF., 2008. Multiple kernel learning for speaker verification IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2008.4517926

Yu, K., Gales, MJF. and Woodland, PC., 2008. Unsupervised discriminative adaptation using discriminative mapping transforms International Conference on Acoustics, Speech and Signal Processing, 2008,
Doi: http://doi.org/10.1109/ICASSP.2008.4518599

Gales, MJF. and Longworth, C., 2008. Discriminative classifiers with generative kernels for noise-robust ASR ICSLP - International Conference - CD-ROM,

Liu, XA., Gales, MJF. and Woodland, PC., 2008. Context dependent language model adaptation Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008) incorporating the 12th Australasian International Conference on Speech Science and Technology, SST' 08,

Longworth, C. and Gales, MJF., 2008. A generalised derivative kernel for speaker verification ICSLP - International Conference - CD-ROM,

Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms ICSLP - International Conference - CD-ROM,

van Dalen, RC. and Gales, MJF., 2008. Covariance modelling for noise-robust speech recognition ICSLP - International Conference - CD-ROM,

Gales, MJF. and Longworth, C., 2008. Discriminative Classifiers with Generative Kernels for Noise Robust ASR INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Liu, X., Gales, MJF. and Woodland, PC., 2008. Context Dependent Language Model Adaptation INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Longworth, C. and Gales, MJF., 2008. A Generalised Derivative Kernel for Speaker Verification INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

van Dalen, RC. and Gales, MJF., 2008. Covariance Modelling for Noise-Robust Speech Recognition INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

2007

Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised Training with Directed Manual Transcription for Recognising Mandarin Broadcast Audio INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,

Gales, MJF., Diehl, F., Raut, CK., Tomalin, M., Woodland, PC. and Yu, K., 2007. Development of a phonetic system for large vocabulary Arabic speech recognition

Gales, MJF. and van Dalen, RC., 2007. Predictive linear transforms for noise robust speech recognition

Liu, XA., Byrne, WJ., Gales, MJF., de Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation IEEE Workshop on Automatic Speech Recognition & Understanding, 2007,
Doi: http://doi.org/10.1109/ASRU.2007.4430101

Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised training using directed manual transcription for recognising Mandarin broadcast audio Proceedings InterSpeech 2007,

Breslin, C. and Gales, MJF., 2007. Building multiple complementary systems using directed decision trees Proceedings InterSpeech 2007,

Breslin, C. and Gales, MJF., 2007. Building Multiple Complementary Systems using Directed Decision Trees INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,

Longworth, C. and Gales, MJF., 2007. Parametric and derivative kernels for speaker verification Proceedings InterSpeech 2007,

Longworth, C. and Gales, MJF., 2007. Derivative and Parametric Kernels for Speaker Verification INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,

Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, JL., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367310

Tomalin, M., Gales, MJF., Liu, XA., Sinha, KC., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367172

Sim, KC., Byrne, WJ., Gales, MJF., Sahbi, H. and Woodland, PC., 2007. Consensus network decoding for statistical machine translation system combination IEEE International Conference on Acoustics Speech and Signal Processing, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367174

Gales, MJF., 2007. Discriminative models for speech recognition
Doi: http://doi.org/10.1109/ITA.2007.4357576

Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, J-L., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation

Breslin, C. and Gales, MJF., 2007. Complementary system generation using directed decision trees Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07,
Doi: http://doi.org/10.1109/ICASSP.2007.366918

Gales, MJF. and van Dalen, RC., 2007. Predictive linear transforms for noise robust speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,

Liao, H. and Gales, MJF., 2007. Adaptive training with joint uncertainty decoding for robust recognition of noisy data Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.366931

Wang, L., Gales, MJF. and Woodland, PC., 2007. Unsupervised training for Mandarin broadcast news and conversation transcription Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 200, ICASSP' 07,
Doi: http://doi.org/10.1109/ICASSP.2007.366922

Wang, L., Gales, MJF. and Woodland, PC., 2007. Unsupervised training for Mandarin broadcast news and conversation transcription

Tomalin, M., Gales, MJF., Liu, XA., Sinha, KC., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation

Liu, XA., Byrne, WJ., Gales, MJF., De Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007, Proceedings,
Doi: http://doi.org/10.1109/asru.2007.4430101

Gales, MJF., 2007. Discriminative-models for speech recognition 2007 Information Theory and Applications Workshop,

Gales, MJF., Diehl, F., Raut, CK., Tomalin, M., Woodland, PC. and Yu, K., 2007. Development of a phonetic system for large vocabulary Arabic speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,

2006

Liao, H. and Gales, MJF., 2006. Issues with Uncertainty Decoding for Noise Robust Speech Recognition INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,

Liao, H. and Gales, MJF., 2006. Issue with uncertainty decoding for noise robust speech recognition

Breslin, C. and Gales, MJF., 2006. Generating complementary systems for speech recognition

Longworth, C. and Gales, MJF., 2006. Discriminative adaptation for speaker verification

Longworth, C. and Gales, MJF., 2006. Discriminative Adaptation for Speaker Verification INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,

Layton, MI. and Gales, MJF., 2006. Augmented statistical models for speech recognition

Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference Proceedings of the 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
Doi: http://doi.org/10.1109/ICASSP.2006.1659996

Layton, MI. and Gales, MJF., 2006. Augmented statistical models for speech recognition 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,

Sinha, R., Gales, MJF., Kim, DY., Liu, X., Sim, KC. and Woodland, PC., 2006. The CU-HTK Mandarin broadcast news transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'06,

Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,

2005

Layton, M. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels Proceedings of Machine Learning for Signal Processing Workshop,

Liu, X., Gales, MJF., Sim, KC. and Yu, K., 2005. Investigation of acoustic modeling techniques for LVCSR systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2005, v. 1
Doi: http://doi.org/10.1109/ICASSP.2005.1415247

Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Development of the CU-HTK 2004 broadcast news transcription systems IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2005.1415250

Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Training LVCSR systems on thousands of hours of data IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05, v. 1
Doi: http://doi.org/10.1109/ICASSP.2005.1415087

Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05,
Doi: http://doi.org/10.1109/ICASSP.2005.1415250

Liao, H. and Gales, MJF., 2005. Joint uncertainty decoding for noise robust speech recognition Interspeech: 9th European Conference on Speech Communciation and Technology,

Sim, KC. and Gales, MJF., 2005. Adaptation of precision matrix models on large vocabulary continuous speech recognition Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1

Sim, KC. and Gales, MJF., 2005. Temporally varying model parameters for large vocabulary continuous speech recognition Interspeech: European Conference on Speech Communciation and Technology,

Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system

Layton, MI. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels 2005 IEEE Workshop on Machine Learning for Signal Processing (MLSP),

Liu, X., Gales, MJF., Sim, KC. and Yu, K., 2005. Investigation of acoustic modeling techniques for LVCSR systems

Layton, M. and Gales, MJF., 2005. Augmented statistical models: exploiting generative models in discriminative classifiers

Yu, K. and Gales, MJF., 2005. Bayesian adaptation and adaptively trained systems Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding,
Doi: http://doi.org/10.1109/ASRU.2005.1566532

2004

Sim, KC. and Gales, MJF., 2004. Basis superposition precision matrix modeling for large vocabulary continuous speech recognition Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1

Yu, K. and Gales, MJF., 2004. Adaptive training using structured transforms Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Proceedings, 2004, v. 1
Doi: http://doi.org/10.1109/ICASSP.2004.1325986

Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '04, v. 1
Doi: http://doi.org/10.1109/ICASSP.2004.1325969

Evermann, G., Chan, HY., Gales, MJF., Jia, B., Liu, X., Mrva, D., Sim, KC., Wang, L. and Woodland, PC., 2004. Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data

Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2004. Development of the CUHTK 2004 RT04F Mandarin conversational telephone speech transcription system

Kim, DY., Chan, HY., Evermann, G., Gales, MJF., Mrva, D., Sim, KC. and Woodland, PC., 2004. Recent developments at Cambridge in broadcast news transcription

Kim, DY., Gales, MJF., Hain, T. and Woodland, PC., 2004. Using VTLN for broadcast news transcription Interspeech 2004 ICSLP: 8th International Conference on Spoken Language Processing,

Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions Proceedings of the 29th IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP),

Rosti, AVI. and Gales, MJF., 2004. Rao-blackwellised gibbs sampling for switching linear dynamical systems Proceedings of the 29th IEEE International conference on Acoustics, Speech and Signal Processing (ICASSP),

Tranter, SE., Gales, MJF., Sinha, R., Umesh, S. and Woodland, PC., 2004. The development of the Cambridge University RT-04 diarisation system

Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system

Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions

2003

Airey, SS. and Gales, MJF., 2003. Product of Gaussians as a distributed representation for speech recognition Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech), v. 2

Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems Proceedings of the 28th IEEE International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing

Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS,

Gales, MJF., Dong, Y., Povey, D. and Woodland, PC., 2003. Porting: SwitchBoard to the VoiceMail task IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1

Liu, X. and Gales, MJF., 2003. Automatic model complexity control using marginalized discriminative growth functions Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding,

Liu, X., Gales, MJF. and Woodland, PC., 2003. Automatic complexity control for HLDA systems IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1

Povey, D., Woodland, PC. and Gales, MJF., 2003. Discriminative map for acoustic model adaptation IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1

2002

Stuttle, MN. and Gales, MJF., 2002. Combining a Gaussian mixture model front end with MFCC parameters Proceedings of the 7th International Conference on Spoken Language Processing (Interspeech), v. 3

Rosti, AVI. and Gales, MJF., 2002. Factor analysed HMMs (Hidden Markov Models) Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1

Smith, ND. and Gales, MJF., 2002. SVMs for speech recognition Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. Volume 1: Speech Processing

Cordoba, R., Woodland, PC. and Gales, MJF., 2002. Improved cross-task recognition using MMIE training IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'02, v. 1
Doi: http://doi.org/10.1109/ICASSP.2002.1005682

Gales, MJF., 2002. The HMM error model Proceedings of the 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing

Smith, ND. and Gales, MJF., 2002. Using SVMs and discriminative models for speech recognition 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS,

2001

Gales, MJF., 2001. Acoustic factorisation Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),

Gales, MJF., 2001. Adaptive training for robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),

Smith, N. and Gales, MJF., 2001. Speech recognition using SVMs Proceedings of the 15th Conference on Neural Information Processing Systems, v. 2

Stuttle, MN. and Gales, MJF., 2001. A mixture of gaussians front end for speech recognition Proceedings of the 7th European Conference on Speech Communication and Technology, v. 1

Gales, MJF., 2001. Multiple-cluster adaptive training schemes Proceedings of 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing

2000

Aiyer, A., Gales, MJF. and Picheny, MA., 2000. Rapid likelihood calculation of subspace clustered Gaussian components Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), v. 3

Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L., Gales, MJF., Novak, M. and Gopinath, R., 2000. IBM's 10xReal-time broadcast news transciption used in the 1999 hub4 evaluation

Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L. and Gales, MJF., 2000. Transcription of broadcast news with time constraint: IBM's 10xRT hub4 system

1999

Gales, MJF. and Olsen, PA., 1999. Tail distribution modelling using the richter and power exponential distributions

Chen, S., Eide, EM., Gales, MJF., Gopinath, RA. and Kavensky, RA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 1

Chen, S., Eide, EM., Gales, MJF., Gopinath, RA., Kavensky, D. and Olsen, PA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news

1998

Chen, S., Gales, MJF., Gopalakrishnan, PS., Gopinath, RA., Kavensky, D., Olsen, P. and Polymenakos, L., 1998. IBM's LVCSR system for transcription of broadcast news used in the 1997 hub4 english evaluation

Gales, MJF., 1998. Semi-tied covariance matrices Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 2

Gales, MJF., 1998. Cluster adaptive training for speech recognition Proceedings of 5th International Conference on Spoken Language Processing,

1997

Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. Broadcast news transcription using HTK Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 2
Doi: http://doi.org/10.1109/ICASSP.1997.596005

Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. The development of the 1996 HTK broadcast news transcription system Proceedings of DARPA Speech Recognition Workshop,

Nock, H., Gales, MJF. and Young, SJ., 1997. A comparative study of methods for phonetic decision-tree state clustering

Gales, MJF., 1997. Transformation smoothing for speaker and environmental adaptation

1996

Woodland, PC., Gales, MJF., Pye, D. and Valtchev, V., 1996. The HTK large vocabulary recognition system for the 1995 ARPA H3 task Proceedings of the ARPA Continuous Speech Recognition Workshop,

Gales, MJF., Pye, D. and Woodland, PC., 1996. Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, v. 3

Knill, K., Gales, MJF. and Young, SJ., 1996. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP), v. 1

Woodland, PC., Gales, MJF. and Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 96, v. 1
Doi: http://doi.org/10.1109/ICASSP.1996.540291

Woodland, PC., Pye, D. and Gales, MJF., 1996. Iterative unsupervised adaptation using maximum likelihood linear regression 4th International Conference on Spoken Language Processing (ICSLP 1996), v. 2

1995

Gales, MJF. and Young, SJ., 1995. The application of parallel model combination to a large vocabulary dictation task Proceedings of the 4th European Conference on Speech Communication and Technology (EUROSPEECH '95), v. 3

Knill, K., Gales, MJF. and Young, SJ., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),

Gales, MJF. and Young, SJ., 1995. A fast and flexible implementation of parallel model combination Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ISCSSP), v. 1: Speech

Gopinath, RA., Gales, MJF., Gopalakrishnan, PS., Balakrishnan Aiyer, S. and Picheny, MA., 1995. Robust speech recognition in noise --- performance of the IBM continuous speech recogniser on the ARPA noise spoke task Proceedings of the ARPA Spoken Language Systems Technology Workshop,

1993

Gales, MJF. and Young, SJ., 1993. Segmental hidden Markov models EUROSPEECH 93 proceedings, v. 3

Gales, MJF. and Young, SJ., 1993. HMM recognition in noise using parallel model combination EUROSPEECH 93 proceedings, v. 2

1992

GALES, MJF. and YOUNG, S., 1992. AN IMPROVED APPROACH TO THE HIDDEN MARKOV MODEL DECOMPOSITION OF SPEECH AND NOISE ICASSP-92 - 1992 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5,

Internet publications

2019

Li, Q., Ness, PM., Ragni, A. and Gales, MJF., 2019. Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
Doi: http://doi.org/10.1109/ICASSP.2019.8683488

Datasets

2018

Wu, C., Gales, M., Ragni, A., Karanasou, P. and Sim, KC., 2018. Improving Interpretability and Regularisation in Deep Learning
Doi: http://doi.org/10.17863/CAM.18408

2016 (No publication date)

Chen, X., Liu, X., Qian, Y., Gales, MJF. and Woodland, P., 2016 (No publication date). Research data supporting "CUED-RNNLM -- An Open-Source Toolkit for Efficient Training and Evaluation of Recurrent Neural Network Language Models"

Wang, L., Zhang, C., Woodland, PC., Gales, MJF., Karanasou, P., Lanchantin, P., Liu, X. and Qian, Y., 2016 (No publication date). Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"

2015 (No publication date)

Karanasou, P., Gales, MJ., Lanchantin, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2015 (No publication date). Supplementary data for "Speaker Diarisation and Linking in Multi-Genre Broadcast Data"

Lanchantin, P., Gales, MJ., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2015 (No publication date). Supplementary data for "The Development of the Cambridge University Alignment Systems for the Multi-Genre Broadcast Challenge".

Van, DRC., Yang, J., Wang, H., Ragni, A., Zhang, C. and Gales, MJF., 2015 (No publication date). Data underpinning "Structured Discriminative Models using Deep Neural-Network Features"

Woodland, PC., Liu, X., Qian, Y., Zhang, C., Gales, MJ., Karanasou, P., Lanchantin, P. and Wang, L., 2015 (No publication date). Research data supporting "Cambridge university transcription systems for the multi-genre broadcast challenge"

Chen, X., Liu, X., Gales, MJF. and Woodland, P., 2015 (No publication date). Data underpinning "Investigation of back-off based interpolation between Recurrent Neural Network and N-Gram Language Models”

Book chapters

2009

Gales, MJF., 2009. Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models
Doi: http://doi.org/10.1002/9780470742044.ch6

Reports

2009

Kim, DK. and Gales, MJF., 2009. Noisy CMLLR for noise-robust speech recognition

2008

Gales, MJF. and Flego, F., 2008. Discriminative classifiers and generative kernels for noise robust speech recognition

2006

Liao, H. and Gales, MJF., 2006. Joint uncertainty decoding for robust large vocabulary speech recognition

2004

Liao, H. and Gales, MJF., 2004. Uncertainty decoding for noise robust automatic speech recognition

Layton, MI. and Gales, MJF., 2004. Maximum margin training of generative kernels

Sim, KC. and Gales, MJF., 2004. Precision matrix modelling for large vocabulary continuous speech recognition

Yu, K. and Gales, MJF., 2004. Discriminative cluster adaptive training

2003

Rosti, AV. and Gales, MJF., 2003. Switching linear dynamical systems for speech recognition

Airey, SS. and Gales, MJF., 2003. Product of Gaussians for speech recognition

Rosti, AV. and Gales, MJF., 2003. Factor analysed hidden Markov models for speech recognition

Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, G., Povey, D. and Wang, L., 2003. Automatic transcription of conversational telephone speech: development of the CU-HTK 2002 system

2002

Smith, ND. and Gales, MJF., 2002. Using SVMs to classify variable length speech patterns

2001

Rosti, AV. and Gales, MJF., 2001. Generalised linear Gaussian models

Gales, MJF., 2001. Transformation streams and the HMM error model

Smith, ND., Gales, MJF. and Niranjan, M., 2001. Data-dependent Kernels in SVM classification of speech patterns

1999

Gales, MJF., 1999. Maximum likelihood multiple projection schemes for hidden Markov models

1997

Gales, MJF., 1997. Adapting semi-tied full-convariance matrix HMMs

Gales, MJF., 1997. Maximum likelihood linear transformations for HMM-based speech recognition

Gales, MJF., 1997. Semi-tied full-covariance matrices for hidden Markov models

Gales, MJF., Knill, KM. and Young, SJ., 1997. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs

1996

Gales, MJF., 1996. The generation and use of regression class trees for MLLR adaptation

Gales, MJF. and Woodland, PC., 1996. Variance compensation within the MLLR framework

1994

Gales, MJF. and Young, SJ., 1994. Robust continuous speech recognition using parallel model combination

1993

Gales, MJF. and Young, SJ., 1993. Parallel model combination for speech recognition in noise

Gales, MJF. and Young, SJ., 1993. PMC for speech recognition in additive and convolutional noise

Gales, MJF. and Young, SJ., 1993. The theory of segmental hidden Markov models

Other publications

2006

Young, SJ., Evermann, G., Gales, MJF., Kershaw, D., Moore, G., Odell, JJ., Ollason, DG., Povey, D., Valtchev, V. and Woodland, PC., 2006. The HTK book version 3.4