Cambridge Language Sciences

Interdisciplinary Research Centre
 

Publications (from Symplectic)

Theses / dissertations

2022 (No publication date)

  • Kyriakopoulos, K., 2022 (No publication date). Deep Learning for Automatic Assessment and Feedback of Spoken English
    Doi: http://doi.org/10.17863/CAM.82947
1995

  • Gales, MJF., 1995. Model-based techniques for noise robust speech recognition
Journal articles

2022

  • Fathullah, Y. and Gales, MJF., 2022. Self-Distribution Distillation: Efficient Uncertainty Estimation
  • Ragni, A., Gales, MJF., Rose, O., Knill, KM., Kastanos, A., Li, Q. and Ness, PM., 2022. Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 30
    Doi: http://doi.org/10.1109/TASLP.2022.3161153
2021

  • Malinin, A., Band, N., Ganshin, A., Chesnokov, G., Gal, Y., Gales, MJF., Noskov, A., Ploskonosov, A., Prokhorenkova, L., Provilkov, I., Raina, V., Raina, V., Roginskiy, D., Shmatova, M., Tigas, P. and Yangel, B., 2021. Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks
  • Dou, Q., Lu, Y., Manakul, P., Wu, X. and Gales, MJF., 2021. Attention Forcing for Machine Translation
  • Raina, V. and Gales, MJF., 2021. An Initial Investigation of Non-Native Spoken Question-Answering
2019

  • Wang, L., Wang, Y. and Gales, MJF., 2019. Non-native Speaker Verification for Spoken Language Assessment
  • Dou, Q., Lu, Y., Efiong, J. and Gales, MJF., 2019. Attention Forcing for Sequence-to-sequence Model Training
  • Chen, X., Liu, X., Wang, Y., Ragni, A., Wong, JHM. and Gales, MJF., 2019. Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 27
    Doi: http://doi.org/10.1109/TASLP.2019.2922048
  • Wong, JHM., Gales, MJF. and Wang, Y., 2019. General sequence teacher-student learning IEEE/ACM Transactions on Audio Speech and Language Processing, v. 27
    Doi: http://doi.org/10.1109/TASLP.2019.2929859
2018

  • Wang, Y., Gales, MJF., Knill, KM., Kyriakopoulos, K., Malinin, A., van Dalen, RC. and Rashid, M., 2018. Towards automatic assessment of spontaneous spoken English Speech Communication, v. 104
    Doi: http://doi.org/10.1016/j.specom.2018.09.002
  • Degottex, G., Lanchantin, P. and Gales, M., 2018. A Log Domain Pulse Model for Parametric Speech Synthesis IEEE/ACM Transactions on Audio, Speech, and Language Processing, v. 26
    Doi: http://doi.org/10.1109/TASLP.2017.2761546
2017

  • Karanasou, P., Wu, C., Gales, M. and Woodland, PC., 2017. I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models IEEE/ACM Transactions on Audio Speech and Language Processing, v. 25
    Doi: http://doi.org/10.1109/TASLP.2017.2670141
  • Chen, X., Liu, X., Ragni, A., Wang, Y. and Gales, MJF., 2017. Future word contexts in neural network language models 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
    Doi: http://doi.org/10.1109/ASRU.2017.8268922
  • Wu, C., Gales, M., Ragni, A., Karanasou, P. and Sim, KC., 2017. Improving Interpretability and Regularisation in Deep Learning IEEE/ACM Transactions on Audio Speech and Language Processing,
    Doi: http://doi.org/10.1109/TASLP.2017.2774919
2016

  • Liu, X., Chen, X., Wang, Y., Gales, MJF. and Woodland, PC., 2016. Two efficient lattice rescoring methods using recurrent neural network language models IEEE/ACM Transactions on Audio Speech and Language Processing, v. 24
    Doi: http://doi.org/10.1109/TASLP.2016.2558826
  • Chen, X., Liu, X., Wang, Y., Gales, MJF. and Woodland, PC., 2016. Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition IEEE/ACM Transactions on Audio Speech and Language Processing, v. 24
    Doi: http://doi.org/10.1109/TASLP.2016.2598304
2015

  • Yoshioka, T. and Gales, MJF., 2015. Environmentally robust ASR front-end for deep neural network acoustic models Computer Speech and Language, v. 31
    Doi: http://doi.org/10.1016/j.csl.2014.11.008
  • Chen, L., Braunschweiler, N. and Gales, MJF., 2015. Speaker and Expression Factorization for Audiobook Data: Expressiveness and Transplantation IEEE Transactions on Audio, Speech and Language Processing, v. 23
    Doi: http://doi.org/10.1109/TASLP.2014.2385478
2014

  • Wan, V., Latorre, J., Yanagisawa, K., Braunschweiler, N., Chen, L., Gales, MJF. and Akamine, M., 2014. Building HMM-TTS voices on diverse data IEEE Journal on Selected Topics in Signal Processing, v. 8
    Doi: http://doi.org/10.1109/JSTSP.2013.2295058
  • Chen, L., Gales, MJF., Braunschweiler, N., Akamine, M. and Knill, K., 2014. Integrated expression prediction and speech synthesis from text IEEE Journal on Selected Topics in Signal Processing, v. 8
    Doi: http://doi.org/10.1109/JSTSP.2013.2294938
  • Liu, X., Gales, MJF. and Woodland, PC., 2014. Paraphrastic language models Computer Speech and Language, v. 28
    Doi: http://doi.org/10.1016/j.csl.2014.04.004
  • Lanchantin, P., Gales, MJF., King, S. and Yamagishi, J., 2014. Multiple-average-voice-based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6853603
2013

  • Wang, YQ. and Gales, MJF., 2013. Tandem system adaptation using multiple linear feature transforms ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639209
  • Liu, X., Hieronymus, JL., Gales, MJF. and Woodland, PC., 2013. Syllable language models for Mandarin speech recognition: exploiting character language models. J Acoust Soc Am, v. 133
    Doi: http://doi.org/10.1121/1.4768800
  • van Dalen, RC. and Gales, MJF., 2013. Importance sampling to compute likelihoods of noise-corrupted speech Computer Speech and Language, v. 27
    Doi: http://doi.org/10.1016/j.csl.2012.06.007
  • Zhang, S-X. and Gales, MJF., 2013. Structured SVMs for Automatic Speech Recognition IEEE Transactions on Audio Speech and Language Processing, v. 21
    Doi: http://doi.org/10.1109/TASL.2012.2227734
  • Liu, X., Gales, MJF. and Woodland, PC., 2013. Language model cross adaptation for LVCSR system combination Computer Speech and Language, v. 27
    Doi: http://doi.org/10.1016/j.csl.2012.07.010
  • Maia, R., Akamine, M. and Gales, MJF., 2013. Complex cepstrum for statistical parametric speech synthesis Speech Communication, v. 55
    Doi: http://doi.org/10.1016/j.specom.2012.12.008
  • Long, Y., Gales, MJF., Lanchantin, P., Liu, X., Seigel, MS. and Woodland, PC., 2013. Improving lightly supervised training for broadcast transcription Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Yang, J., Van Dalen, RC. and Gales, M., 2013. Infinite support vector machines in speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Seigel, MS., Woodland, PC. and Gales, MJF., 2013. A confidence-based approach for improving keyword hypothesis scores ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639337
  • Liu, X., Gales, MJF. and Woodland, PC., 2013. Paraphrastic language models and combination with neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639308
  • Mamou, J., Cui, J., Cui, X., Gales, MJF., Kingsbury, B., Knill, K., Mangu, L., Nolden, D., Picheny, M., Ramabhadran, B., Schluter, R., Sethy, A. and Woodland, PC., 2013. System combination and score normalization for spoken term detection ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639278
  • Kingsbury, B., Cui, J., Cui, X., Gales, MJF., Knill, K., Mamou, J., Mangu, L., Nolden, D., Picheny, M., Ramabhadran, B., Schluter, R., Sethy, A. and Woodland, PC., 2013. A high-performance Cantonese keyword search system ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639279
  • Liu, X., Gales, MJF. and Woodland, PC., 2013. Use of contexts in language model interpolation and adaptation Computer Speech and Language, v. 27
    Doi: http://doi.org/10.1016/j.csl.2012.06.004
2012

  • Flego, F. and Gales, MJF., 2012. Factor analysis based VTS discriminative adaptive training ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2012.6288960
  • Gales, MJF., Watanabe, S. and Fosler-Lussier, E., 2012. Structured discriminative models for speech recognition: An overview IEEE Signal Processing Magazine, v. 29
    Doi: http://doi.org/10.1109/MSP.2012.2207140
  • Bell, PJ., Gales, MJF., Lanchantin, P., Liu, X., Long, Y., Renals, S., Swietojanski, P. and Woodland, PC., 2012. Transcription of multi-genre media archives using out-of-domain data 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2012.6424244
  • Gales, MJF. and Flego, F., 2012. Model-Based Approaches for Degraded Channel Modelling in Robust ASR 13th Annual Conference of the International Speech Communication Association 2012 (INTERSPEECH 2012), Vols 1-3,
  • Zen, H., Gales, MJF., Nankaku, Y. and Tokuda, K., 2012. Product of Experts for Statistical Parametric Speech Synthesis IEEE Transactions on Audio, Speech and Language Processing, v. 20
  • Liu, X., Gales, MJF. and Woodland, PC., 2012. Paraphrastic Language Models 13th Annual Conference of the International Speech Communication Association 2012 (INTERSPEECH 2012), Vols 1-3,
  • Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2012. Morphological decomposition in Arabic ASR systems Computer Speech and Language, v. 26
    Doi: http://doi.org/10.1016/j.csl.2011.12.001
  • Zen, H., Braunschweiler, N., Buchholz, S., Gales, MJF., Knill, K., Krstulović, S. and Latorre, J., 2012. Statistical parametric speech synthesis based on speaker and language factorization IEEE Transactions on Audio, Speech and Language Processing, v. 20
    Doi: http://doi.org/10.1109/TASL.2012.2187195
  • Wang, Y. and Gales, MJF., 2012. Speaker and Noise Factorization for Robust Speech Recognition IEEE Transactions on Audio Speech and Language Processing, v. 20
    Doi: http://doi.org/10.1109/TASL.2012.2198059
2011

  • Liu, X., Gales, MJF. and Woodland, PC., 2011. Improving LVCSR system combination using neural network language model cross adaptation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Li, T., Woodland, PC., Diehl, F. and Gales, MJF., 2011. Graphone model interpolation and Arabic pronunciation generation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2011. The efficient incorporation of MLP features into automatic speech recognition systems Computer Speech and Language, v. 25
    Doi: http://doi.org/10.1016/j.csl.2010.07.005
  • Kim, D. and Gales, MJF., 2011. Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition IEEE Transactions on Audio, Speech and Language Processing, v. 19
    Doi: http://doi.org/10.1109/TASL.2010.2047756
  • Van Dalen, RC. and Gales, MJF., 2011. Extended VTS for noise-robust speech recognition IEEE Transactions on Audio, Speech and Language Processing, v. 19
    Doi: http://doi.org/10.1109/TASL.2010.2061226
  • Flego, F. and Gales, MJF., 2011. Factor analysis based VTS and JUD noise estimation and compensation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947427
  • Wang, YQ. and Gales, MJF., 2011. Speaker and noise factorisation on the AURORA4 task ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947375
  • Latorre, J., Gales, MJF., Buchholz, S., Knill, K., Tamura, M., Ohtani, Y. and Akamine, M., 2011. Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification? 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing,
  • Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2011. Investigation of acoustic units for LVCSR systems ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947447
  • Chin, KK., Xu, HT., Gales, MJF., Breslin, C. and Knill, K., 2011. Rapid joint speaker and noise compensation for robust speech recognition 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing,
  • Chen, L., Gales, MJF. and Chin, KK., 2011. Constrained discriminative mapping transforms for unsupervised speaker adaptation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947565
  • Ragni, A. and Gales, MJF., 2011. Structured discriminative models for noise robust continuous speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947426
  • Zen, H. and Gales, MJF., 2011. Decision tree-based context clustering based on cross validation and hierarchical priors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947369
  • Gales, MJF. and Wang, YQ., 2011. Model-based approaches to handling additive noise in reverberant environments 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, HSCMA'11,
    Doi: http://doi.org/10.1109/HSCMA.2011.5942377
  • Xu, HT., Gales, MJF. and Chin, KK., 2011. Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition IEEE Transactions on Audio, Speech and Language Processing, v. 19
    Doi: http://doi.org/10.1109/TASL.2010.2096214
  • Van Dalen, RC. and Gales, MJF., 2011. A variational perspective on noise-robust speech recognition 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2011.6163917
  • Wang, YQ. and Gales, MJF., 2011. Improving reverberant VTS for hands-free robust speech recognition 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2011.6163915
  • Ragni, A. and Gales, MJF., 2011. Derivative kernels for noise robust ASR 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2011.6163916
  • Zhang, SX. and Gales, MJF., 2011. Extending noise robust structured support vector machines to larger vocabulary tasks 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2011.6163898
  • Diehl, F., Gales, MJF., Liu, X., Tomalin, M. and Woodland, PC., 2011. Word boundary modelling and full covariance Gaussians for Arabic Speech-to-Text systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Breslin, C., Chin, KK., Gales, MJF. and Knill, K., 2011. Integrated online speaker clustering and adaptation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
2010

  • Zhang, SX., Ragni, A. and Gales, MJF., 2010. Structured Log Linear Models for Noise Robust Speech Recognition IEEE Signal Processing Letters, v. 17
    Doi: http://doi.org/10.1109/LSP.2010.2077626
  • Gales, MJF. and Flego, F., 2010. Discriminative classifiers with adaptive kernels for noise robust speech recognition Computer Speech and Language, v. 24
    Doi: http://doi.org/10.1016/j.csl.2009.09.002
  • Yu, K., Gales, MJF., Wang, L. and Woodland, PC., 2010. Unsupervised training and directed manual transcription for LVCSR Speech Communication, v. 52
    Doi: http://doi.org/10.1016/j.specom.2010.02.014
  • Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2010. Language model combination and adaptation using weighted finite state transducers ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2010.5494941
2009

  • Longworth, C. and Gales, MJF., 2009. Combining derivative and parametric kernels for speaker verification IEEE Transactions on Audio Speech and Language Processing, v. 17
    Doi: http://doi.org/10.1109/TASL.2008.2012193
  • Yu, K., Gales, MJF. and Woodland, PC., 2009. Unsupervised adaptation with discriminative mapping transforms IEEE Transactions on Audio Speech and Language Processing, v. 17
    Doi: http://doi.org/10.1109/TASL.2008.2011535
  • Breslin, C. and Gales, MJF., 2009. Directed decision trees for generating complementary systems Speech Communication, v. 51
    Doi: http://doi.org/10.1016/j.specom.2008.09.004
  • Hieronymus, JL., Liu, X., Gales, MJF. and Woodland, PC., 2009. Exploiting Chinese character models to improve speech recognition performance Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
2008

  • Liao, H. and Gales, MJF., 2008. Issues with uncertainty decoding for noise robust automatic speech recognition Speech Communication, v. 50
    Doi: http://doi.org/10.1016/j.specom.2007.10.004
2007

  • Tomalin, M., Gales, MJF., Liu, XA., Sim, KC., Sinha, R., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.367172
  • Sim, KC. and Gales, MJF., 2007. Discriminative semi-parametric trajectory models for speech recognition Computer Speech and Language, v. 21
    Doi: http://doi.org/10.1016/j.csl.2007.03.004
  • Layton, M. and Gales, MJF., 2007. Acoustic modelling using continuous rational kernels Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, v. 48
    Doi: http://doi.org/10.1007/s11265-006-0027-4
  • Yu, K. and Gales, MJF., 2007. Bayesian adaptive inference and adaptive training IEEE Transactions on Audio Speech and Language Processing, v. 15
    Doi: http://doi.org/10.1109/TASL.2007.901300
  • Gales, MJF. and Young, SJ., 2007. The application of hidden Markov models in speech recognition Foundations and Trends in Signal Processing, v. 1
    Doi: http://doi.org/10.1561/2000000004
  • Liu, X. and Gales, MJF., 2007. Automatic model complexity control using marginalized discriminative growth functions IEEE Transactions on Audio Speech and Language Processing, v. 15
    Doi: http://doi.org/10.1109/TASL.2006.889804
2006

  • Sinha, R., Gales, MJF., Kim, DY., Liu, XA., Sim, KC. and Woodland, PC., 2006. The CU-HTK Mandarin broadcast news transcription system ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
  • Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, GL., Povey, D. and Wang, L., 2006. Corrections to “Automatic transcription of conversational telephone speech” IEEE Transactions on Audio, Speech and Language Processing, v. 14
    Doi: http://doi.org/10.1109/TASL.2006.871051
  • Yu, K. and Gales, MJF., 2006. Discriminative cluster adaptive training IEEE Transactions on Audio Speech and Language Processing, v. 14
    Doi: http://doi.org/10.1109/TSA.2005.858555
  • Sim, KC. and Gales, MJF., 2006. Minimum phone error training of precision matrix models IEEE Transactions on Audio Speech and Language Processing, v. 14
    Doi: http://doi.org/10.1109/TSA.2005.858062
  • Gales, MJF. and Layton, MI., 2006. Training augmented models using SVMs IEICE Transactions on Information and Systems, v. E89-D
    Doi: http://doi.org/10.1093/ietisy/e89-d.3.892
  • Gales, MJF. and Airey, SS., 2006. Product of Gaussians for speech recognition Computer Speech and Language, v. 20
    Doi: http://doi.org/10.1016/j.csl.2004.12.002
  • Gales, MJF., Kim, DY., Woodland, PC., Chan, HY., Mrva, D., Sinha, R. and Tranter, SE., 2006. Progress in the CU-HTK broadcast news transcription system IEEE Transactions on Speech and Audio Processing, v. 14
    Doi: http://doi.org/10.1109/TASL.2006.878264
2005

  • Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, GL., Povey, D. and Wang, L., 2005. Automatic transcription of conversational telephone speech IEEE Transactions on Speech and Audio Processing, v. 13
    Doi: http://doi.org/10.1109/TSA.2005.852999
  • Sinha, R., Tranter, SE., Gales, MJF. and Woodland, PC., 2005. The Cambridge University March 2005 Speaker Diarisation System Interspeech: 9th European Conference on Speech Communication and Technology,
2004

  • Rosti, AVI. and Gales, MJF., 2004. Factor analysed hidden Markov models for speech recognition Computer Speech and Language, v. 18
    Doi: http://doi.org/10.1016/j.csl.2003.09.004
2003

  • Povey, D., Gales, MJF., Kim, DY. and Woodland, PC., 2003. MMI-MAP and MPE-MAP for acoustic model adaptation Eurospeech Proceedings: 8th Speech Communication and Technology Conference, v. 8
2002

  • Chen, SS., Eide, EM., Gales, MJF., Gopinath, RA., Kanevsky, D. and Olsen, P., 2002. Automatic transcription of broadcast news Speech Communication, v. 37
    Doi: http://doi.org/10.1016/S0167-6393(01)00060-7
  • Gales, MJF., 2002. Transformation streams and the HMM error model Computer Speech and Language, v. 16
    Doi: http://doi.org/10.1006/csla.2002.0193
  • Gales, MJF., 2002. Maximum likelihood multiple subspace projections for hidden Markov models IEEE Transactions on Speech and Audio Processing, v. 10
    Doi: http://doi.org/10.1109/89.985541
2000

  • Gales, MJF., 2000. Factored semi-tied covariance matrices Advances In Neural Information Processing Systems,
  • Gales, MJF., 2000. Cluster adaptive training of hidden Markov models IEEE Transactions on Speech and Audio Processing, v. 8
    Doi: http://doi.org/10.1109/89.848223
1999

  • Gales, MJF., 1999. Semi-tied covariance matrices for hidden Markov models IEEE Transactions on Speech and Audio Processing, v. 7
    Doi: http://doi.org/10.1109/89.759034
  • Gales, MJF., Knill, K. and Young, SJ., 1999. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs IEEE Transactions on Speech and Audio Processing, v. 7
    Doi: http://doi.org/10.1109/89.748120
1998

  • Gales, MJF., 1998. Predictive model-based compensation schemes for robust speech recognition Speech Communication, v. 25
    Doi: http://doi.org/10.1016/S0167-6393(98)00029-6
  • Gales, MJF., 1998. Maximum likelihood linear transformations for HMM-based speech recognition Computer Speech and Language, v. 12
    Doi: http://doi.org/10.1006/csla.1998.0043
1997

  • Gales, MJF., 1997. Predictive model-based compensation schemes for robust speech recognition Speech Communication, v. 25
1996

  • Gales, MJF. and Woodland, PC., 1996. Mean and variance adaptation within the MLLR framework Computer Speech and Language, v. 10
    Doi: http://doi.org/10.1006/csla.1996.0013
  • Gales, MJF. and Young, SJ., 1996. Robust continuous speech recognition using parallel model combination IEEE Transactions on Speech and Audio Processing, v. 4
    Doi: http://doi.org/10.1109/89.536929
1995

  • Gales, MJF. and Young, SJ., 1995. Robust speech recognition in additive and convolutional noise using parallel model combination Computer Speech and Language, v. 9
    Doi: http://doi.org/10.1006/csla.1995.0014
  • Woodland, PC., Gales, MJF., Pye, D. and Valtchev, V., 1995. Large vocabulary multilingual speech recognition using HTK Eurospeech Proceedings: 4th European Conference on Speech Communication and Technology, v. 1
1993

  • Gales, MJF. and Young, SJ., 1993. Cepstral parameter compensation for HMM recognition in noise Speech Communication, v. 12
Conference proceedings

2021 (Accepted for publication)

  • Gales, M. and Malinin, A., 2021 (Accepted for publication). Uncertainty Estimation in Autoregressive Structured Prediction
    Doi: http://doi.org/10.17863/CAM.63497
  • Gales, M., Malinin, A. and Ryabinin, M., 2021 (Accepted for publication). Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets
    Doi: http://doi.org/10.17863/CAM.78106
2021

  • Manakul, P. and Gales, MJF., 2021. Long-span summarization via local attention and content selection ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Manakul, P. and Gales, MJF., 2021. Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Wei, X., Gales, MJF. and Knill, KM., 2021. Analysing bias in spoken language assessment using concept activation vectors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
    Doi: http://doi.org/10.1109/ICASSP39728.2021.9413988
  • Fathullah, Y., Gales, MJF. and Malinin, A., 2021. Ensemble distillation approaches for grammatical error correction ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
    Doi: http://doi.org/10.1109/ICASSP39728.2021.9413385
  • Lu, Y., Wang, Y. and Gales, MJF., 2021. Efficient use of end-to-end data in spoken language processing ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June
    Doi: http://doi.org/10.1109/ICASSP39728.2021.9414510
  • Dou, Q., Wu, X., Wan, M., Lu, Y. and Gales, MJF., 2021. Deliberation-based multi-pass speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 5
    Doi: http://doi.org/10.21437/Interspeech.2021-1405
2020

  • Raina, V., Gales, MJF. and Knill, K., 2020. Complementary systems for Off-Topic spoken response detection Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Manakul, P., Gales, MJF. and Wang, L., 2020. Abstractive spoken document summarization using hierarchical model with multi-stage attention diversity optimization Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-1683
  • Lu, Y., Gales, MJF. and Wang, Y., 2020. Spoken language 'grammatical error correction' Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-1852
  • Knill, KM., Wang, L., Wang, Y., Wu, X. and Gales, MJF., 2020. Non-native children's automatic speech recognition: The INTERSPEECH 2020 shared task ALTA systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-2154
  • Wu, X., Knill, KM., Gales, MJF. and Malinin, A., 2020. Ensemble approaches for uncertainty in spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-2238
  • Kastanos, A., Ragni, A. and Gales, MJF., 2020. Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2020-May
    Doi: http://doi.org/10.1109/ICASSP40776.2020.9053264
  • Raina, V., Gales, MJF. and Knill, K., 2020. Universal adversarial attacks on spoken language assessment systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-1890
  • Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2020. Automatic detection of accent and lexical pronunciation errors in spontaneous non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-2881
  • Dou, Q., Efiong, J. and Gales, MJF., 2020. Attention forcing for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2020-October
    Doi: http://doi.org/10.21437/Interspeech.2020-2520
2019 (Accepted for publication)

  • Li, Q., Ness, P., Ragni, A. and Gales, M., 2019 (Accepted for publication). Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation
    Doi: http://doi.org/10.17863/CAM.36745
  • Lu, Y., Gales, M., Knill, K., Manakul, P. and Wang, Y., 2019 (Accepted for publication). Disfluency Detection for Spoken Learner English
    Doi: http://doi.org/10.17863/CAM.42082
  • Gales, M., Malinin, A. and Mlodozeniec, B., 2019 (Accepted for publication). Ensemble Distribution Distillation
    Doi: http://doi.org/10.17863/CAM.49348
  • Gales, M. and Malinin, A., 2019 (Accepted for publication). Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness Advances in Neural Information Processing Systems 32 (NeurIPS 2019),
2019

  • Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2019. A deep learning approach to automatic characterisation of rhythm in non-native English speech Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
    Doi: http://doi.org/10.21437/Interspeech.2019-3186
  • Wong, JHM., Gales, MJF. and Wang, Y., 2019. Learning between Different Teacher and Student Models in ASR 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU46091.2019.9003756
  • Lu, Y., Gales, MJF., Knill, KM., Manakul, P., Wang, L. and Wang, Y., 2019. Impact of ASR performance on spoken grammatical error detection Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September
    Doi: http://doi.org/10.21437/Interspeech.2019-1706
  • Knill, KM., Gales, MJF., Manakul, PP. and Caines, AP., 2019. Automatic Grammatical Error Detection of Non-native Spoken Learner English ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2019-May
    Doi: http://doi.org/10.1109/ICASSP.2019.8683080
  • Knill, K., Gales, M., Manakul, P. and Caines, A., 2019. Automatic grammatical error detection of non-native spoken learner English ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    Doi: http://doi.org/10.1109/icassp.2019.8683755
  • 2018

  • Wang, Y., Chen, X., Gales, MJF., Ragni, A. and Wong, JHM., 2018. Phonetic and graphemic systems for multi-genre broadcast transcription ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2018-April
    Doi: http://doi.org/10.1109/ICASSP.2018.8462353
  • Kyriakopoulos, K., Knill, KM. and Gales, MJF., 2018. A deep learning approach to assessing non-native pronunciation of English using phone distances Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-1087
  • Knill, KM., Gales, MJF., Kyriakopoulos, K., Malinin, A., Ragni, A., Wang, Y. and Caines, AP., 2018. Impact of ASR performance on free speaking language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-1312
  • Wan, M., Degottex, G. and Gales, MJF., 2018. Waveform-based speaker representations for speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-1154
  • Degottex, G. and Gales, M., 2018. A Spectrally Weighted Mixture of Least Square Error and Wasserstein Discriminator Loss for Generative SPSS 2018 IEEE Spoken Language Technology Workshop (SLT),
    Doi: http://doi.org/10.1109/slt.2018.8639609
  • Wang, Y., Zhang, C., Gales, MJF. and Woodland, PC., 2018. Speaker adaptation and adaptive training for jointly optimised tandem systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-2432
  • Wang, Y., Wong, JHM., Gales, MJF., Knill, KM. and Ragni, A., 2018. Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2018.8639557
  • Ragni, A., Li, Q., Gales, MJF. and Wang, Y., 2018. Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2018.8639678
  • Malinin, A. and Gales, M., 2018. Predictive Uncertainty Estimation via Prior Networks NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, v. 31
  • Ragni, A. and Gales, MJF., 2018. Automatic speech recognition system development in the “wild” Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-1085
  • Chen, O., Ragni, A., Gales, MJF. and Chen, X., 2018. Active memory networks for language modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September
    Doi: http://doi.org/10.21437/Interspeech.2018-78
  • Dou, Q., Wan, M., Degottex, G., Ma, Z. and Gales, MJF., 2018. Hierarchical RNNs for Waveform-Level Speech Synthesis 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2018.8639588
  • Del Vecchio, M., Malinin, A. and Gales, MJF., 2018. Improved Auto-Marking Confidence for Spoken Language Assessment 2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2018.8639634
  • 2017 (Accepted for publication)

  • Wong, JHM. and Gales, MJF., 2017 (Accepted for publication). Student-teacher training with diverse decision tree ensembles
  • Kyriakopoulos, K., Gales, M. and Knill, K., 2017 (Accepted for publication). Automatic characterisation of the pronunciation of non-native English speakers using phone distance features http://www.isca-speech.org/archive/SLaTE_2017/,
    Doi: http://doi.org/10.21437/SLaTE.2017-11
  • Malinin, A., Knill, K., Ragni, A., Wang, Y. and Gales, M., 2017 (Accepted for publication). An attention based model for off-topic spontaneous spoken response detection: An Initial Study http://www.isca-speech.org/archive/SLaTE_2017/,
    Doi: http://doi.org/10.21437/SLaTE.2017-25
  • 2017

  • Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), vols 1-6,
    Doi: http://doi.org/10.21437/Interspeech.2017-513
  • Wan, M., Degottex, G. and Gales, MJF., 2017. Integrated speaker-adaptive speech synthesis 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU),
    Doi: http://doi.org/10.1109/ASRU.2017.8269006
  • Wong, JHM. and Gales, MJF., 2017. Multi-task ensembles with teacher-student training 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
    Doi: http://doi.org/10.1109/ASRU.2017.8268920
  • Malinin, A., Ragni, A., Knill, KM. and Gales, MJF., 2017. Incorporating uncertainty into deep learning for spoken language assessment ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 2
    Doi: http://doi.org/10.18653/v1/P17-2008
  • Chen, X., Ragni, A., Vasilakes, J., Liu, X., Knill, K. and Gales, MJF., 2017. Recurrent neural network language models for keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2017.7953263
  • Ragni, A., Wu, C., Gales, MJF., Vasilakes, J. and Knill, KM., 2017. Stimulated training for automatic speech recognition and keyword search in limited resource conditions ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2017.7953074
  • Ragni, A., Saunders, D., Zahemszky, P., Vasilakes, J., Gales, MJF. and Knill, KM., 2017. Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2017.7953262
  • Gales, MJF., Knill, KM. and Ragni, A., 2017. Low-resource speech recognition and keyword-spotting Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 10458 LNAI
    Doi: http://doi.org/10.1007/978-3-319-66429-3_1
  • Knill, KM., Gales, MJF., Kyriakopoulos, K., Ragni, A. and Wang, Y., 2017. Use of graphemic lexicons for spoken language assessment Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
    Doi: http://doi.org/10.21437/Interspeech.2017-978
  • Chen, X., Ragni, A., Liu, X. and Gales, MJF., 2017. Investigating bidirectional recurrent neural network language models for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
    Doi: http://doi.org/10.21437/Interspeech.2017-513
  • Wu, C. and Gales, MJF., 2017. Deep activation mixture model for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2017-August
    Doi: http://doi.org/10.21437/Interspeech.2017-1233
  • Malinin, A., Knill, K. and Gales, MJF., 2017. A hierarchical attention based model for off-topic spontaneous spoken response detection 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings, v. 2018-January
    Doi: http://doi.org/10.1109/ASRU.2017.8268963
  • 2016

  • Yang, J., Ragni, A., Gales, MJF. and Knill, KM., 2016. Log-linear system combination using structured support vector machines Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-377
  • Malinin, A., Van Dalen, RC., Wang, Y., Knill, KM. and Gales, MJF., 2016. Off-topic response detection for spontaneous spoken English assessment 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 2
    Doi: http://doi.org/10.18653/v1/p16-1102
  • Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Selection of multi-genre broadcast data for the training of automatic speech recognition systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-462
  • Wu, C., Karanasou, P., Gales, MJF. and Sim, KC., 2016. Stimulated deep neural network for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-580
  • Wong, JHM. and Gales, MJF., 2016. Sequence student-teacher training of deep neural networks Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-911
  • Bell, P., Gales, MJF., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., McParland, A., Renals, S., Saz, O., Wester, M. and Woodland, PC., 2016. The MGB challenge: Evaluating multi-genre broadcast media recognition 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404863
  • Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2016. Investigation of back-off based interpolation between recurrent neural network and n-gram language models 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404792
  • Lanchantin, P., Gales, MJF., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. The development of the Cambridge university alignment systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404857
  • Woodland, PC., Liu, X., Qian, Y., Zhang, C., Gales, MJF., Karanasou, P., Lanchantin, P. and Wang, L., 2016. Cambridge university transcription systems for the multi-genre broadcast challenge 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404856
  • Cui, J., Kingsbury, B., Ramabhadran, B., Sethy, A., Audhkhasi, K., Cui, X., Kislal, E., Mangu, L., Nussbaum-Thom, M., Picheny, M., Tüske, Z., Golik, P., Schluter, R., Ney, H., Gales, MJF., Knill, KM., Ragni, A., Wang, H. and Woodland, P., 2016. Multilingual representations for low resource speech recognition and keyword search 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404803
  • Degottex, G., Lanchantin, P. and Gales, M., 2016. A Pulse Model in Log-domain for a Uniform Synthesizer Proceedings of the 9th ISCA Speech Synthesis Workshop,
  • Karanasou, P., Gales, MJF., Lanchantin, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2016. Speaker diarisation and longitudinal linking in multi-genre broadcast data 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404859
  • Van Dalen, RC., Yang, J., Wang, H., Ragni, A., Zhang, C. and Gales, MJF., 2016. Structured discriminative models using deep neural-network features 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404789
  • Wang, L., Zhang, C., Woodland, PC., Gales, MJF., Karanasou, P., Lanchantin, P., Liu, X. and Qian, Y., 2016. Improved DNN-based segmentation for multi-genre broadcast audio ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
    Doi: http://doi.org/10.1109/ICASSP.2016.7472769
  • Chen, X., Liu, X., Qian, Y., Gales, MJF. and Woodland, PC., 2016. CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
    Doi: http://doi.org/10.1109/ICASSP.2016.7472829
  • Yang, J., Zhang, C., Ragni, A., Gales, MJF. and Woodland, PC., 2016. System combination with log-linear models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
    Doi: http://doi.org/10.1109/ICASSP.2016.7472764
  • Wu, C., Karanasou, P. and Gales, MJF., 2016. Combining i-vector representation and structured neural networks for rapid adaptation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2016-May
    Doi: http://doi.org/10.1109/ICASSP.2016.7472629
  • Ragni, A., Dakin, E., Chen, X., Gales, MJF. and Knill, KM., 2016. Multi-language neural network language models Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-371
  • 2015

  • Gales, MJF., Knill, KM. and Ragni, A., 2015. Unicode-based graphemic systems for limited resource languages ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178960
  • Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. Robust excitation-based features for Automatic Speech Recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178855
  • van Dalen, RC., Knill, KM. and Gales, MJF., 2015. Automatically Grading Learners’ English Using a Gaussian Process Speech and Language Technology in Education, SLaTE 2015,
  • Wang, H., Ragni, A., Gales, MJF., Knill, KM., Woodland, PC. and Zhang, C., 2015. Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • Lanchantin, P., Veaux, C., Gales, MJF., King, S. and Yamagishi, J., 2015. Reconstructing voices within the multiple-average-voice-model framework Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • Chen, X., Tan, T., Liu, X., Lanchantin, P., Wan, M., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model adaptation for multi-genre broadcast speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • Liu, X., Flego, F., Wang, L., Zhang, C., Gales, M. and Woodland, P., 2015. The Cambridge university 2014 BOLT conversational telephone Mandarin Chinese lvcsr system for speech translation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. Improving multiple-crowd-sourced transcriptions using a speech recogniser 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
  • Mendels, G., Cooper, E., Soto, V., Hirschberg, J., Gales, M., Knill, K., Ragni, A. and Wang, H., 2015. Improving speech recognition and keyword search for low resource languages using web data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. Paraphrastic recurrent neural network language models 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
  • Drugman, T., Stylianou, Y., Chen, L., Chen, X. and Gales, MJF., 2015. Robust excitation-based features for automatic speech recognition 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
  • Wu, C. and Gales, MJF., 2015. Multi-basis adaptive neural network for rapid adaptation in speech recognition 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
  • Wu, C. and Gales, MJF., 2015. Multi-basis adaptive neural network for rapid adaptation in speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178785
  • Van Dalen, RC. and Gales, MJF., 2015. Annotating large lattices with the exact word error Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • Van Dalen, RC., Knill, KM., Tsiakoulis, P. and Gales, MJF., 2015. Improving multiple-crowd-sourced transcriptions using a speech recogniser ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178864
  • Liu, X., Chen, X., Gales, MJF. and Woodland, PC., 2015. Paraphrastic recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7179004
  • Ragni, A., Gales, MJF. and Knill, KM., 2015. A language space representation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178849
  • Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Improving the training and evaluation efficiency of recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7179003
  • Chen, X., Liu, X., Gales, MJF. and Woodland, PC., 2015. Recurrent neural network language model training with noise contrastive estimation for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7179005
  • 2014

  • Chen, X., Gales, MJF., Knill, K., Breslin, C., Chen, L., Chin, KK. and Wan, V., 2014. An initial investigation of long-term adaptation for meeting transcription Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Liu, X., Wang, Y., Chen, X., Gales, MJF. and Woodland, PC., 2014. Efficient lattice rescoring using recurrent neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854535
  • Liu, X., Gales, MJF. and Woodland, PC., 2014. Paraphrastic neural network language models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854534
  • Yang, J., Van Dalen, RC., Zhang, SX. and Gales, MJF., 2014. Infinite structured support vector machines for speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854215
  • Yoshioka, T., Ragni, A. and Gales, MJF., 2014. Investigation of unsupervised adaptation of DNN acoustic models with filter bank input ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854825
  • Chen, L., Braunschweiler, N. and Gales, MJF., 2014. Speaker dependent expression predictor from text: Expressiveness and transplantation ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854065
  • Yoshioka, T., Chen, X. and Gales, MJF., 2014. Impact of single-microphone dereverberation on DNN-based meeting transcription systems ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854660
  • Karanasou, P., Wang, Y., Gales, MJF. and Woodland, PC., 2014. Adaptation of deep neural network acoustic models using factorised i-vectors Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Ragni, A., Knill, KM., Rath, SP. and Gales, MJF., 2014. Data augmentation for low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Knill, KM., Gales, MJF., Ragni, A. and Rath, SP., 2014. Language independent and unsupervised acoustic models for speech recognition and keyword spotting Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Rath, SP., Knill, KM., Ragni, A. and Gales, MJF., 2014. Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Chen, X., Wang, Y., Liu, X., Gales, MJF. and Woodland, PC., 2014. Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Kolluru, BK., Wan, V., Latorre, J., Yanagisawa, K. and Gales, MJF., 2014. Generating multiple-accent pronunciations for TTS using joint sequence model interpolation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Yanagisawa, K., Chen, L. and Gales, MJF., 2014. Noise-robust TTS speaker adaptation with statistics smoothing Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Latorre, J., Yanagisawa, K., Wan, V., Kolluru, BK. and Gales, MJF., 2014. Speech intonation for TTS: Study on evaluation methodology Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • 2013

  • Van Dalen, RC., Ragni, A. and Gales, MJF., 2013. Efficient decoding with generative score-spaces using the expectation semiring ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639145
  • Zhang, SX. and Gales, MJF., 2013. Kernelized log linear models for continuous speech recognition ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639009
  • Wang, YQ. and Gales, MJF., 2013. An explicit independence constraint for factorised adaptation in speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Knill, KM., Gales, MJF., Rath, SP., Woodland, PC., Zhang, C. and Zhang, S-X., 2013. Investigation of multilingual deep neural networks for spoken term detection 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2013.6707719
  • Long, Y., Gales, MJF., Lanchantin, P., Liu, X., Seigel, MS. and Woodland, PC., 2013. Improving Lightly Supervised Training for Broadcast Transcription 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), vols 1-5,
  • Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain Paraphrasing For Improving Language Modelling Using Out-of-domain Data 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), vols 1-5,
  • Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, B., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-Realistic Expressive Text to Talking Head Synthesis 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), vols 1-5,
  • Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), vols 1-5,
  • Wang, Y-Q. and Gales, MJF., 2013. An Explicit Independence Constraint for Factorised Adaptation in Speech Recognition 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), vols 1-5,
  • Liu, X., Gales, MJF. and Woodland, PC., 2013. Cross-domain paraphrasing for improving language modelling using out-of-domain data Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Wan, V., Anderson, R., Blokland, A., Braunschweiler, N., Chen, L., Kolluru, BK., Latorre, J., Maia, R., Stenger, B., Yanagisawa, K., Stylianou, Y., Akamine, M., Gales, MJF. and Cipolla, R., 2013. Photo-realistic expressive text to talking head synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Lanchantin, P., Bell, PJ., Gales, MJF., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, MS., Swietojanski, P. and Woodland, PC., 2013. Automatic transcription of multi-genre media archives CEUR Workshop Proceedings, v. 1012
  • Maia, R., Gales, MJF., Stylianou, Y. and Akamine, M., 2013. Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Chen, L., Gales, MJF., Braunschweiler, N., Akamine, M. and Knill, K., 2013. Integrated automatic expression prediction and speech synthesis from text ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639218
  • Latorre, J., Gales, MJF., Knill, K. and Akamine, M., 2013. Training a supra-segmental parametric F0 model without interpolating F0 ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6638995
  • Maia, R., Akamine, M. and Gales, MJF., 2013. Complex cepstrum analysis based on the minimum mean squared error ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639217
  • 2012 (No publication date)

  • Ragni, A. and Gales, MJF., 2012 (No publication date). Derivative Kernels for Noise Robust ASR
  • 2012

  • Eyben, F., Buchholz, S., Braunschweiler, N., Latorre, J., Wan, V., Gales, MJF. and Knill, K., 2012. Unsupervised clustering of emotion and voice styles for expressive TTS ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2012.6288797
  • Maia, R., Akamine, M. and Gales, MJF., 2012. Complex cepstrum as phase information in statistical parametric speech synthesis 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
  • Ragni, A. and Gales, MJF., 2012. Inference algorithms for generative score-spaces 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
  • Roupakia, Z., Ragni, A. and Gales, M., 2012. Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
  • Chen, L., Gales, MJF., Wan, V., Latorre, J. and Akamine, M., 2012. Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training 13th Annual Conference of the International Speech Communication Association 2012 (INTERSPEECH 2012), vols 1-3,
  • Latorre, J., Wan, V., Gales, MJF., Chen, L., Chin, KK., Knill, K. and Akamine, M., 2012. Speech factorization for HMM-TTS based on cluster adaptive training 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
  • Wang, Y-Q. and Gales, MJF., 2012. Model-based approaches to adaptive training in reverberant environments 13th Annual Conference of the International Speech Communication Association 2012 (INTERSPEECH 2012), vols 1-3,
  • Wan, V., Latorre, J., Chin, KK., Chen, L., Gales, MJF., Zen, H., Knill, K. and Akamine, M., 2012. Combining multiple high quality corpora for improving HMM-TTS 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, v. 2
  • 2011

  • Braunschweiler, N., Gales, MJF. and Buchholz, S., 2011. Lightly supervised recognition for automatic alignment of large coherent speech recordings Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Breslin, C., Chin, KK., Gales, MJF., Knill, K. and Xu, H., 2011. Prior information for rapid speaker adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Gales, MJF. and Yu, K., 2011. Canonical state models for automatic speech recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Zhang, SX. and Gales, MJF., 2011. Structured support vector machines for noise robust continuous speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Latorre, J., Gales, MJF. and Zen, H., 2011. Training a parametric-based logF0 model with the minimum generation error criterion Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Liu, X., Gales, MJF. and Woodland, PC., 2011. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2011. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Diehl, F., Gales, MJF., Liu, X., Tomalin, M. and Woodland, PC., 2011. Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • van Dalen, RC. and Gales, MJF., 2011. Asymptotically exact noise-corrupted speech likelihoods Proceedings of the 11th Annual Conference of the International Speech Communication Association,
  • Liu, X., Gales, MJF. and Woodland, PC., 2011. Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Li, T., Woodland, PC., Diehl, F. and Gales, MJF., 2011. Graphone Model Interpolation and Arabic Pronunciation Generation 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse Sequences for Residual Signal Modeling 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Zhang, S-X. and Gales, MJF., 2011. Structured Support Vector Machines for Noise Robust Continuous Speech Recognition 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian Process Experts for Voice Conversion 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Breslin, C., Chin, KK., Gales, MJF. and Knill, K., 2011. Integrated Online Speaker Clustering and Adaptation 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), vols 1-5,
  • Pilkington, NCV., Zen, H. and Gales, MJF., 2011. Gaussian process experts for voice conversion Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Maia, R., Zen, H., Knill, K., Gales, MJF. and Buchholz, S., 2011. Multipulse sequences for residual signal modeling Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
2010

  • Liu, X., Gales, MJF. and Woodland, PC., 2010. Language model cross adaptation for LVCSR system combination Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,
  • Park, J., Liu, X., Gales, MJF. and Woodland, PC., 2010. Improved neural network based language modelling and adaptation Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,
  • Maia, R., Zen, H. and Gales, MJF., 2010. Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters
  • Liu, X., Gales, MJF., Hieronymus, JL. and Woodland, PC., 2010. Language model combination and adaptation using weighted finite state transducers Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    Doi: http://doi.org/10.1109/ICASSP.2010.5494941
  • Tomalin, M., Park, J., Diehl, F., Gales, MJF. and Woodland, PC., 2010. Recent improvements to the Cambridge Arabic speech-to-text systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    Doi: http://doi.org/10.1109/ICASSP.2010.5495641
  • Zen, H., Gales, MJF., Nankaku, Y. and Tokuda, K., 2010. Statistical parametric synthesis based on products of experts Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    Doi: http://doi.org/10.1109/ICASSP.2010.5495691
  • Flego, F. and Gales, MJF., 2010. Discriminative adaptive training with VTS and JUD Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
    Doi: http://doi.org/10.1109/ASRU.2009.5373266
  • Gales, MJF., Ragni, A., AlDamarki, H. and Gautier, C., 2010. Support vector machines for noise robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
    Doi: http://doi.org/10.1109/ASRU.2009.5372913
  • Xu, H., Gales, MJF. and Chin, KK., 2010. Improving joint uncertainty decoding performance by predictive methods for noise robust speech recognition Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding,
    Doi: http://doi.org/10.1109/ASRU.2009.5373317
2009

  • Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient generation and use of MLP features for Arabic speech recognition Proceedings of the 10th International Conference of the International Speech Communication Association,
  • van Dalen, RC. and Gales, MJF., 2009. Transforming features to compensate speech recogniser models for noise Proceedings of the 10th Annual Conference of the International Speech Communication Association,
  • Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational Dynamic Kernels for Speaker Verification INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Kim, DK. and Gales, MJF., 2009. Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Gales, MJF., 2009. Acoustic Modelling for Speech Recognition: Hidden Markov Models and Beyond? 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009),
  • Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological Analysis and Decomposition for Arabic Speech-to-Text Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Flego, F. and Gales, MJF., 2009. Incremental Adaptation with VTS and Joint Adaptively Trained Systems INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Flego, F. and Gales, MJF., 2009. Incremental predictive and adaptive noise compensation IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960464
  • Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Training and adapting MLP features for Arabic speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960620
  • Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of Contexts in Language Model Interpolation and Adaptation INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Raut, CK. and Gales, MJF., 2009. Bayesian discriminative adaptation for speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960595
  • van Dalen, RC. and Gales, MJF., 2009. Extended VTS for noise-robust speech recognition Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960462
  • van Dalen, RC. and Gales, MJF., 2009. Extended VTS for noise-robust speech recognition IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960462
  • van Dalen, RC., Flego, F. and Gales, MJF., 2009. Transforming Features to Compensate Speech Recogniser Models for Noise INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Morphological analysis and decomposition for Arabic speech-to-text systems Proceedings of the 10th International Conference of the International Speech Communication Association,
  • Park, J., Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2009. Efficient Generation and Use of MLP Features for Arabic Speech Recognition INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Flego, F. and Gales, MJF., 2009. Incremental adaptation with VTS and joint adaptively trained systems Proceedings of the 10th International Conference of the International Speech Communication Association,
  • Gales, MJF. and Flego, F., 2009. Combining VTS model compensation and support vector machines Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    Doi: http://doi.org/10.1109/ICASSP.2009.4960460
  • Hieronymus, JL., Liu, X., Gales, MJF. and Woodland, PC., 2009. Exploiting Chinese character models to improve speech recognition performance Proceedings of the 10th International Conference of the International Speech Communication Association,
  • Kim, D. and Gales, MJF., 2009. Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition 10th Annual Conference of the International Speech Communication Association, Interspeech 2009,
  • Liu, X., Gales, MJF. and Woodland, PC., 2009. Use of contexts in language model interpolation and adaptation Proceedings of the 10th International Conference of the International Speech Communication Association,
  • Longworth, C., van Dalen, RC. and Gales, MJF., 2009. Variational dynamic kernels for speaker verification Proceedings of the 10th International Conference of the International Speech Communication Association,
2008

  • Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive Training using Discriminative Mapping Transforms INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Diehl, F., Gales, MJF., Tomalin, M. and Woodland, PC., 2008. Phonetic pronunciations for Arabic speech-to-text systems IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2008.4517924
  • Longworth, C. and Gales, MJF., 2008. Multiple kernel learning for speaker verification IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2008.4517926
  • Yu, K., Gales, MJF. and Woodland, PC., 2008. Unsupervised discriminative adaptation using discriminative mapping transforms International Conference on Acoustics, Speech and Signal Processing, 2008,
    Doi: http://doi.org/10.1109/ICASSP.2008.4518599
  • Gales, MJF. and Longworth, C., 2008. Discriminative classifiers with generative kernels for noise-robust ASR ICSLP - International Conference - CD-ROM,
  • Liu, XA., Gales, MJF. and Woodland, PC., 2008. Context dependent language model adaptation Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008) incorporating the 12th Australasian International Conference on Speech Science and Technology, SST' 08,
  • Longworth, C. and Gales, MJF., 2008. A generalised derivative kernel for speaker verification ICSLP - International Conference - CD-ROM,
  • Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms ICSLP - International Conference - CD-ROM,
  • Raut, CK., Yu, K. and Gales, MJF., 2008. Adaptive training using discriminative mapping transforms Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • van Dalen, RC. and Gales, MJF., 2008. Covariance modelling for noise-robust speech recognition ICSLP - International Conference - CD-ROM,
  • Gales, MJF. and Longworth, C., 2008. Discriminative Classifiers with Generative Kernels for Noise Robust ASR INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Longworth, C. and Gales, MJF., 2008. A Generalised Derivative Kernel for Speaker Verification INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Liu, X., Gales, MJF. and Woodland, PC., 2008. Context Dependent Language Model Adaptation INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • van Dalen, RC. and Gales, MJF., 2008. Covariance Modelling for Noise-Robust Speech Recognition INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
2007

  • Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised Training with Directed Manual Transcription for Recognising Mandarin Broadcast Audio INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
  • Gales, MJF., Diehl, F., Raut, CK., Tomalin, M., Woodland, PC. and Yu, K., 2007. Development of a phonetic system for large vocabulary Arabic speech recognition
  • Gales, MJF. and van Dalen, RC., 2007. Predictive linear transforms for noise robust speech recognition
  • Breslin, C. and Gales, MJF., 2007. Building Multiple Complementary Systems using Directed Decision Trees INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
  • Liu, XA., Byrne, WJ., Gales, MJF., de Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation IEEE Workshop on Automatic Speech Recognition & Understanding, 2007,
    Doi: http://doi.org/10.1109/ASRU.2007.4430101
  • Yu, K., Gales, MJF. and Woodland, PC., 2007. Unsupervised training using directed manual transcription for recognising Mandarin broadcast audio Proceedings InterSpeech 2007,
  • Longworth, C. and Gales, MJF., 2007. Derivative and Parametric Kernels for Speaker Verification INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
  • Breslin, C. and Gales, MJF., 2007. Building multiple complementary systems using directed decision trees Proceedings InterSpeech 2007,
  • Longworth, C. and Gales, MJF., 2007. Parametric and derivative kernels for speaker verification Proceedings InterSpeech 2007,
  • Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, JL., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.367310
  • Tomalin, M., Gales, MJF., Liu, XA., Sinha, KC., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.367172
  • Sim, KC., Byrne, WJ., Gales, MJF., Sahbi, H. and Woodland, PC., 2007. Consensus network decoding for statistical machine translation system combination IEEE International Conference on Acoustics Speech and Signal Processing, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.367174
  • Gales, MJF. and van Dalen, RC., 2007. Predictive linear transforms for noise robust speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,
  • Gales, MJF., 2007. Discriminative models for speech recognition
    Doi: http://doi.org/10.1109/ITA.2007.4357576
  • Gales, MJF., Liu, X., Sinha, R., Woodland, PC., Yu, K., Matsoukas, S., Ng, T., Nguyen, K., Nguyen, L., Gauvain, J-L., Lamel, L. and Messaoudi, A., 2007. Speech recognition system combination for machine translation
  • Breslin, C. and Gales, MJF., 2007. Complementary system generation using directed decision trees Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07,
    Doi: http://doi.org/10.1109/ICASSP.2007.366918
  • Liao, H. and Gales, MJF., 2007. Adaptive training with joint uncertainty decoding for robust recognition of noisy data Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP' 07, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.366931
  • Wang, L., Gales, MJF. and Woodland, PC., 2007. Unsupervised training for Mandarin broadcast news and conversation transcription Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing 2007, ICASSP' 07,
    Doi: http://doi.org/10.1109/ICASSP.2007.366922
  • Wang, L., Gales, MJF. and Woodland, PC., 2007. Unsupervised training for Mandarin broadcast news and conversation transcription
  • Tomalin, M., Gales, MJF., Liu, XA., Sinha, KC., Wang, L., Woodland, PC. and Yu, K., 2007. Improving speech transcription for Mandarin-English translation
  • Gales, MJF., 2007. Discriminative-models for speech recognition 2007 Information Theory and Applications Workshop,
  • Liu, XA., Byrne, WJ., Gales, MJF., De Gispert, A., Tomalin, M., Woodland, PC. and Yu, K., 2007. Discriminative language model adaptation for Mandarin broadcast speech transcription and translation 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007, Proceedings,
    Doi: http://doi.org/10.1109/asru.2007.4430101
  • Gales, MJF., Diehl, F., Raut, CK., Tomalin, M., Woodland, PC. and Yu, K., 2007. Development of a phonetic system for large vocabulary Arabic speech recognition 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2,
2006

  • Liao, H. and Gales, MJF., 2006. Issues with Uncertainty Decoding for Noise Robust Speech Recognition INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,
  • Liao, H. and Gales, MJF., 2006. Issue with uncertainty decoding for noise robust speech recognition
  • Longworth, C. and Gales, MJF., 2006. Discriminative Adaptation for Speaker Verification INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5,
  • Breslin, C. and Gales, MJF., 2006. Generating complementary systems for speech recognition
  • Longworth, C. and Gales, MJF., 2006. Discriminative adaptation for speaker verification
  • Layton, MI. and Gales, MJF., 2006. Augmented statistical models for speech recognition 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,
  • Layton, MI. and Gales, MJF., 2006. Augmented statistical models for speech recognition
  • Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13,
  • Yu, K. and Gales, MJF., 2006. Incremental adaptation using Bayesian inference Proceedings of the 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2006.1659996
  • Sinha, R., Gales, MJF., Kim, DY., Liu, X., Sim, KC. and Woodland, PC., 2006. The CU-HTK Mandarin broadcast news transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'06,
2005

  • Layton, M. and Gales, MJF., 2005. Augmented statistical models: exploiting generative models in discriminative classifiers
  • Yu, K. and Gales, MJF., 2005. Bayesian adaptation and adaptively trained systems Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding,
    Doi: http://doi.org/10.1109/ASRU.2005.1566532
  • Layton, M. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels Proceedings of Machine Learning for Signal Processing Workshop,
  • Liu, X., Gales, MJF., Sim, KC. and Yu, K., 2005. Investigation of acoustic modeling techniques for LVCSR systems Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2005, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2005.1415247
  • Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Development of the CU-HTK 2004 broadcast news transcription systems IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2005.1415250
  • Evermann, G., Chan, HY., Gales, MJF., Jia, B., Mrva, D., Woodland, PC. and Yu, K., 2005. Training LVCSR systems on thousands of hours of data IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2005.1415087
  • Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '05,
    Doi: http://doi.org/10.1109/ICASSP.2005.1415250
  • Liao, H. and Gales, MJF., 2005. Joint uncertainty decoding for noise robust speech recognition Interspeech: 9th European Conference on Speech Communication and Technology,
  • Sim, KC. and Gales, MJF., 2005. Adaptation of precision matrix models on large vocabulary continuous speech recognition Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
  • Sim, KC. and Gales, MJF., 2005. Temporally varying model parameters for large vocabulary continuous speech recognition Interspeech: European Conference on Speech Communication and Technology,
  • Layton, MI. and Gales, MJF., 2005. Acoustic modelling using continuous rational kernels 2005 IEEE Workshop on Machine Learning for Signal Processing (MLSP),
  • Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2005. Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system
  • Liu, X., Gales, MJF., Sim, KC. and Yu, K., 2005. Investigation of acoustic modeling techniques for LVCSR systems
2004

  • Sim, KC. and Gales, MJF., 2004. Basis superposition precision matrix modeling for large vocabulary continuous speech recognition Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1
  • Yu, K. and Gales, MJF., 2004. Adaptive training using structured transforms Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing, 2004, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2004.1325986
  • Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP '04, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2004.1325969
  • Evermann, G., Chan, HY., Gales, MJF., Jia, B., Liu, X., Mrva, D., Sim, KC., Wang, L. and Woodland, PC., 2004. Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data
  • Gales, MJF., Jia, B., Liu, X., Sim, KC., Woodland, PC. and Yu, K., 2004. Development of the CUHTK 2004 RT04F Mandarin conversational telephone speech transcription system
  • Kim, DY., Chan, HY., Evermann, G., Gales, MJF., Mrva, D., Sim, KC. and Woodland, PC., 2004. Recent developments at Cambridge in broadcast news transcription
  • Kim, DY., Gales, MJF., Hain, T. and Woodland, PC., 2004. Using VTLN for broadcast news transcription Interspeech 2004 ICSLP: 8th International Conference on Spoken Language Processing,
  • Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions Proceedings of the 29th IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP),
  • Rosti, AVI. and Gales, MJF., 2004. Rao-Blackwellised Gibbs sampling for switching linear dynamical systems Proceedings of the 29th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
  • Tranter, SE., Gales, MJF., Sinha, R., Umesh, S. and Woodland, PC., 2004. The development of the Cambridge University RT-04 diarisation system
  • Evermann, G., Chan, HY., Gales, MJF., Hain, T., Liu, X., Mrva, D., Wang, L. and Woodland, PC., 2004. Development of the 2003 CU-HTK conversational telephone speech transcription system
  • Liu, X. and Gales, MJF., 2004. Automatic model complexity control and compression using discriminative growth functions
2003

  • Liu, X., Gales, MJF. and Woodland, PC., 2003. Automatic complexity control for HLDA systems IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
  • Povey, D., Woodland, PC. and Gales, MJF., 2003. Discriminative map for acoustic model adaptation IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
  • Airey, SS. and Gales, MJF., 2003. Product of Gaussians as a distributed representation for speech recognition Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech), v. 2
  • Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems Proceedings of the 28th IEEE International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
  • Airey, SS. and Gales, MJF., 2003. Product of Gaussians and multiple stream systems 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS,
  • Gales, MJF., Dong, Y., Povey, D. and Woodland, PC., 2003. Porting: SwitchBoard to the VoiceMail task IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'03, v. 1
  • Liu, X. and Gales, MJF., 2003. Automatic model complexity control using marginalized discriminative growth functions Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding,
2002

  • Stuttle, MN. and Gales, MJF., 2002. Combining a Gaussian mixture model front end with MFCC parameters Proceedings of the 7th International Conference on Spoken Language Processing (Interspeech), v. 3
  • Rosti, AVI. and Gales, MJF., 2002. Factor analysed HMMs (Hidden Markov Models) Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 1
  • Smith, ND. and Gales, MJF., 2002. SVMs for speech recognition Proceedings of the 26th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. Volume 1: Speech Processing
  • Cordoba, R., Woodland, PC. and Gales, MJF., 2002. Improved cross-task recognition using MMIE training IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP'02, v. 1
    Doi: http://doi.org/10.1109/ICASSP.2002.1005682
  • Gales, MJF., 2002. The HMM error model Proceedings of the 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
  • Smith, ND. and Gales, MJF., 2002. Using SVMs and discriminative models for speech recognition 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS,
2001

  • Gales, MJF., 2001. Acoustic factorisation Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),
  • Stuttle, MN. and Gales, MJF., 2001. A mixture of Gaussians front end for speech recognition Proceedings of the 7th European Conference on Speech Communication and Technology, v. 1
  • Gales, MJF., 2001. Adaptive training for robust ASR Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2001),
  • Smith, N. and Gales, MJF., 2001. Speech recognition using SVMs Proceedings of the 15th Conference on Neural Information Processing Systems, v. 2
  • Gales, MJF., 2001. Multiple-cluster adaptive training schemes Proceedings of 26th International Conference on Acoustics, Speech, and Signal Processing, v. Volume 1: Speech Processing
2000

  • Aiyer, A., Gales, MJF. and Picheny, MA., 2000. Rapid likelihood calculation of subspace clustered Gaussian components Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), v. 3
  • Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L., Gales, MJF., Novak, M. and Gopinath, R., 2000. IBM's 10xReal-time broadcast news transcription used in the 1999 hub4 evaluation
  • Eide, E., Maison, B., Kavensky, D., Olsen, P., Chen, S., Mangu, L. and Gales, MJF., 2000. Transcription of broadcast news with time constraint: IBM's 10xRT hub4 system
1999

  • Gales, MJF. and Olsen, PA., 1999. Tail distribution modelling using the Richter and power exponential distributions
  • Chen, S., Eide, EM., Gales, MJF., Gopinath, RA. and Kavensky, RA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 1
  • Chen, S., Eide, EM., Gales, MJF., Gopinath, RA., Kavensky, D. and Olsen, PA., 1999. Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
1998

  • Gales, MJF., 1998. Cluster adaptive training for speech recognition Proceedings of 5th International Conference on Spoken Language Processing,
  • Chen, S., Gales, MJF., Gopalakrishnan, PS., Gopinath, RA., Kavensky, D., Olsen, P. and Polymenakos, L., 1998. IBM's LVCSR system for transcription of broadcast news used in the 1997 hub4 English evaluation
  • Gales, MJF., 1998. Semi-tied covariance matrices Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), v. 2
1997

  • Nock, H., Gales, MJF. and Young, SJ., 1997. A comparative study of methods for phonetic decision-tree state clustering
  • Gales, MJF., 1997. Transformation smoothing for speaker and environmental adaptation
  • Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. Broadcast news transcription using HTK Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 2
    Doi: http://doi.org/10.1109/ICASSP.1997.596005
  • Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. The development of the 1996 HTK broadcast news transcription system Proceedings of DARPA Speech Recognition Workshop,
1996

  • Woodland, PC., Gales, MJF., Pye, D. and Valtchev, V., 1996. The HTK large vocabulary recognition system for the 1995 ARPA H3 task Proceedings of the ARPA Continuous Speech Recognition Workshop,
  • Gales, MJF., Pye, D. and Woodland, PC., 1996. Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, v. 3
  • Knill, K., Gales, MJF. and Young, SJ., 1996. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP), v. 1
  • Woodland, PC., Gales, MJF. and Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 96, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1996.540291
  • Woodland, PC., Pye, D. and Gales, MJF., 1996. Iterative unsupervised adaptation using maximum likelihood linear regression 4th International Conference on Spoken Language Processing (ICSLP 1996), v. 2
1995

  • Knill, K., Gales, MJF. and Young, SJ., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),
  • Gales, MJF. and Young, SJ., 1995. A fast and flexible implementation of parallel model combination Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1: Speech
  • Gopinath, RA., Gales, MJF., Gopalakrishnan, PS., Balakrishnan Aiyer, S. and Picheny, MA., 1995. Robust speech recognition in noise --- performance of the IBM continuous speech recogniser on the ARPA noise spoke task Proceedings of the ARPA Spoken Language Systems Technology Workshop,
  • Gales, MJF. and Young, SJ., 1995. The application of parallel model combination to a large vocabulary dictation task Proceedings of the 4th European Conference on Speech Communication and Technology (EUROSPEECH '95), v. 3
1993

  • Gales, MJF. and Young, SJ., 1993. HMM recognition in noise using parallel model combination EUROSPEECH 93 proceedings, v. 2
  • Gales, MJF. and Young, SJ., 1993. Segmental hidden Markov models EUROSPEECH 93 proceedings, v. 3
1992

  • Gales, MJF. and Young, S., 1992. An improved approach to the hidden Markov model decomposition of speech and noise ICASSP-92 - 1992 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5,
Internet publications

2019

  • Li, Q., Ness, PM., Ragni, A. and Gales, MJF., 2019. Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
    Doi: http://doi.org/10.1109/ICASSP.2019.8683488
  • Datasets

    2018

  • Wu, C., Gales, M., Ragni, A., Karanasou, P. and Sim, KC., 2018. Improving Interpretability and Regularisation in Deep Learning
    Doi: http://doi.org/10.17863/CAM.18408
  • 2016 (No publication date)

  • Wang, L., Zhang, C., Woodland, PC., Gales, MJF., Karanasou, P., Lanchantin, P., Liu, X. and Qian, Y., 2016 (No publication date). Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"
  • Chen, X., Liu, X., Qian, Y., Gales, MJF. and Woodland, P., 2016 (No publication date). Research data supporting "CUED-RNNLM -- An Open-Source Toolkit for Efficient Training and Evaluation of Recurrent Neural Network Language Models"
  • 2015 (No publication date)

  • Lanchantin, P., Gales, MJ., Karanasou, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2015 (No publication date). Supplementary data for "The Development of the Cambridge University Alignment Systems for the Multi-Genre Broadcast Challenge"
  • van Dalen, RC., Yang, J., Wang, H., Ragni, A., Zhang, C. and Gales, MJF., 2015 (No publication date). Data underpinning "Structured Discriminative Models using Deep Neural-Network Features"
  • Woodland, PC., Liu, X., Qian, Y., Zhang, C., Gales, MJ., Karanasou, P., Lanchantin, P. and Wang, L., 2015 (No publication date). Research data supporting "Cambridge university transcription systems for the multi-genre broadcast challenge"
  • Chen, X., Liu, X., Gales, MJF. and Woodland, P., 2015 (No publication date). Data underpinning "Investigation of back-off based interpolation between Recurrent Neural Network and N-Gram Language Models"
  • Karanasou, P., Gales, MJ., Lanchantin, P., Liu, X., Qian, Y., Wang, L., Woodland, PC. and Zhang, C., 2015 (No publication date). Supplementary data for "Speaker Diarisation and Linking in Multi-Genre Broadcast Data"
  • Reports

    2009

  • Kim, DK. and Gales, MJF., 2009. Noisy CMLLR for noise-robust speech recognition
  • 2008

  • Gales, MJF. and Flego, F., 2008. Discriminative classifiers and generative kernels for noise robust speech recognition
  • 2006

  • Liao, H. and Gales, MJF., 2006. Joint uncertainty decoding for robust large vocabulary speech recognition
  • 2004

  • Liao, H. and Gales, MJF., 2004. Uncertainty decoding for noise robust automatic speech recognition
  • Layton, MI. and Gales, MJF., 2004. Maximum margin training of generative kernels
  • Sim, KC. and Gales, MJF., 2004. Precision matrix modelling for large vocabulary continuous speech recognition
  • Yu, K. and Gales, MJF., 2004. Discriminative cluster adaptive training
  • 2003

  • Rosti, AV. and Gales, MJF., 2003. Switching linear dynamical systems for speech recognition
  • Airey, SS. and Gales, MJF., 2003. Product of Gaussians for speech recognition
  • Rosti, AV. and Gales, MJF., 2003. Factor analysed hidden Markov models for speech recognition
  • Hain, T., Woodland, PC., Evermann, G., Gales, MJF., Liu, X., Moore, G., Povey, D. and Wang, L., 2003. Automatic transcription of conversational telephone speech: development of the CU-HTK 2002 system
  • 2002

  • Smith, ND. and Gales, MJF., 2002. Using SVMs to classify variable length speech patterns
  • 2001

  • Rosti, AV. and Gales, MJF., 2001. Generalised linear Gaussian models
  • Gales, MJF., 2001. Transformation streams and the HMM error model
  • Smith, ND., Gales, MJF. and Niranjan, M., 2001. Data-dependent Kernels in SVM classification of speech patterns
  • 1999

  • Gales, MJF., 1999. Maximum likelihood multiple projection schemes for hidden Markov models
  • 1997

  • Gales, MJF., 1997. Adapting semi-tied full-covariance matrix HMMs
  • Gales, MJF., 1997. Maximum likelihood linear transformations for HMM-based speech recognition
  • Gales, MJF., 1997. Semi-tied full-covariance matrices for hidden Markov models
  • Gales, MJF., Knill, KM. and Young, SJ., 1997. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs
  • 1996

  • Gales, MJF., 1996. The generation and use of regression class trees for MLLR adaptation
  • Gales, MJF. and Woodland, PC., 1996. Variance compensation within the MLLR framework
  • 1994

  • Gales, MJF. and Young, SJ., 1994. Robust continuous speech recognition using parallel model combination
  • 1993

  • Gales, MJF. and Young, SJ., 1993. Parallel model combination for speech recognition in noise
  • Gales, MJF. and Young, SJ., 1993. PMC for speech recognition in additive and convolutional noise
  • Gales, MJF. and Young, SJ., 1993. The theory of segmental hidden Markov models
  • Book chapters

    2009

  • Gales, MJF., 2009. Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models
    Doi: http://doi.org/10.1002/9780470742044.ch6
  • Other publications

    2006

  • Young, SJ., Evermann, G., Gales, MJF., Kershaw, D., Moore, G., Odell, JJ., Ollason, DG., Povey, D., Valtchev, V. and Woodland, PC., 2006. The HTK book version 3.4
  • Professor Mark Gales
    Professor of Information Engineering, Machine Intelligence Laboratory