Professor Steve Young | Cambridge Language Sciences

Research

My research interests include speech recognition, synthesis and spoken dialogue systems. I am the inventor and original author of the HTK Toolkit for building hidden Markov model-based recognition systems (see http://htk.eng.cam.ac.uk), and I co-developed the original HTK large vocabulary speech recognition system which has figured strongly in DARPA/NIST evaluations since it was first introduced in the early nineties. More recently I have worked on statistical dialogue systems and pioneered the use of Partially Observable Markov Decision Processes for modelling them. I also have industrial experience including four years working in the Apple Siri development team.

Publications

Key publications:

J. Williams and S. Young (2007). Partially Observable Markov Decision Processes for Spoken Dialog Systems. Computer Speech and Language 21(2):231-422.

S. Young, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson and K. Yu (2010). The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management. Computer Speech and Language, 24(2): 150-174.

F. Jurcicek, B. Thomson and S. Young (2012). Reinforcement learning for parameter estimation in statistical spoken dialogue systems. Computer Speech and Language, 26(3):127-228

S. Young (2010). Cognitive User Interfaces. IEEE Signal Processing Magazine,27(3): 128-140.

K. Yu and S. Young (2011). Continuous F0 Modelling for HMM based Statistical Parametric Speech Synthesis. IEEE Trans. Audio, Speech and Language Processing, to appear, 19(5):1071-1079.

Publications (from Symplectic)

Conference proceedings

2017

Vulic, I., Mrkšic, N., Reichart, R., Séaghdha, D., Young, S. and Korhonen, A., 2017. Morph-fitting: Fine-tuning word vector spaces with simple language-specific rules ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 1
Doi: http://doi.org/10.18653/v1/P17-1006

Wen, TH., Vandyke, D., Mrkšíc, N., Gašíc, M., Rojas-Barahona, LM., Su, PH., Ultes, S. and Young, S., 2017. A network-based end-to-end trainable task-oriented dialogue system 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, v. 1
Doi: http://doi.org/10.18653/v1/e17-1042

2016

Mrkšić, N., Séaghdha, D., Thomson, B., Gašić, M., Rojas-Barahona, L., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2016. Counter-fitting word vectors to linguistic constraints 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,

Wen, TH., Gašić, M., Mrkšić, N., Rojas-Barahona, LM., Su, PH., Vandyke, D. and Young, S., 2016. Multi-domain neural network language generation for spoken dialogue systems 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
Doi: http://doi.org/10.18653/v1/n16-1015

Young, S., 2016. Towards open domain spoken dialogue systems Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9577

Gasic, M., Mrksic, N., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2016. Policy committee for adaptation in multi-domain spoken dialogue systems 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404871

Vandyke, D., Su, PH., Gasic, M., Mrksic, N., Wen, TH. and Young, S., 2016. Multi-domain dialogue success classifiers for policy training 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
Doi: http://doi.org/10.1109/ASRU.2015.7404865

2015

Wen, TH., Gašić, M., Kim, D., Mrkšić, N., Su, PH., Vandyke, D. and Young, S., 2015. Stochastic language generation in dialogue using recurrent neural networks with convolutional sentence reranking SIGDIAL 2015 - 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
Doi: http://doi.org/10.18653/v1/w15-4639

Mrkšić, N., Séaghdha, DO., Thomson, B., Gašić, M., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2015. Multi-domain dialog state tracking using recurrent neural networks ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference, v. 2
Doi: http://doi.org/10.3115/v1/p15-2130

Gasic, M., Kim, D., Tsiakoulis, P. and Young, S., 2015. Distributed dialogue policies for multi-domain statistical dialogue management ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
Doi: http://doi.org/10.1109/ICASSP.2015.7178997

Wen, TH., Gašić, M., Mrkšić, N., Su, PH., Vandyke, D. and Young, S., 2015. Semantically conditioned lstm-based Natural language generation for spoken dialogue systems Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing,
Doi: http://doi.org/10.18653/v1/d15-1199

Su, PH., Vandyke, D., Gašíc, M., Kim, D., Mrkšíc, N., Wen, TH. and Young, S., 2015. Learning from real users: Rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January

Su, PH., Vandyke, D., Gašić, M., Mrkšić, N., Wen, TH. and Young, S., 2015. Reward shaping with recurrent neural networks for speeding up on-line policy learning in spoken dialogue systems SIGDIAL 2015 - 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
Doi: http://doi.org/10.18653/v1/w15-4655

2014

Tsiakoulis, P., Breslin, C., Gəsić, M., Henderson, M., Kim, D. and Young, S., 2014. Dialogue context sensitive speech synthesis using factorized decision trees Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Kim, D., Breslin, C., Tsiakoulis, P., Gašić, M., Henderson, M. and Young, S., 2014. Inverse reinforcement learning for micro-turn management Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Gašić, M., Kim, D., Tsiakoulis, P., Breslin, C., Henderson, M., Szummer, M., Thomson, B. and Young, S., 2014. Incremental on-line adaptation of POMDP-based dialogue managers to extended domains Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Kim, D., Henderson, M., Gasic, M., Tsiakoulis, P. and Young, S., 2014. The use of discriminative belief tracking in POMDP-based dialogue systems 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2014.7078600

Henderson, M., Thomson, B. and Young, S., 2014. Robust dialog state tracking using delexicalised recurrent neural networks and unsupervised adaptation 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2014.7078601

Henderson, M., Thomson, B. and Young, S., 2014. Word-based dialog state tracking with recurrent neural networks SIGDIAL 2014 - 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
Doi: http://doi.org/10.3115/v1/w14-4340

2013

Gašić, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. POMDP-based dialogue manager adaptation to extended domains SIGDIAL 2013 - 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,

Henderson, M., Thomson, B. and Young, S., 2013. Deep neural network approach for the dialog state tracking challenge SIGDIAL 2013 - 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,

2012

Tsiakoulis, P., Gasic, M., Henderson, M., Planells-Lerma, J., Prombonas, J., Thomson, B., Yu, K., Young, S. and Tzirkel, E., 2012. Statistical methods for building robust spoken dialogue systems in an automobile
Doi: http://doi.org/10.1201/b12320

Gašić, M., Tsiakoulis, P., Henderson, M., Thomson, B., Yu, K., Tzirkel, E. and Young, S., 2012. The effect of cognitive load on a statistical dialogue system SIGDIAL 2012 - 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,

2011

Daubigney, L., Gasic, M., Chandramohan, S., Geist, M., Pietquin, O. and Young, S., 2011. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

Jurcicek, F., Keizer, S., Gasic, M., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2011. Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,

2010

Yu, K., Zen, H., Mairesse, F. and Young, S., 2010. Context Adaptive Training with Factorized Decision Trees for HMM-Based Speech Synthesis 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,

Yu, K., Mairesse, F. and Young, S., 2010. WORD-LEVEL EMPHASIS MODELLING IN HMM-BASED SPEECH SYNTHESIS 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING,

Young, S., 2010. Still Talking to Machines (Cognitively Speaking) 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,

Jurcicek, F., Thomson, B., Keizer, S., Mairesse, F., Gasic, M., Yu, K. and Young, S., 2010. Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,

2009

Jurcicek, F., Gasic, M., Keizer, S., Mairesse, E., Thomson, B., Yu, K. and Young, S., 2009. Transformation-based Learning for Semantic parsing INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,

Mairesse, F., Gasic, M., Jurcicek, F., Keizer, S., Thomson, B., Yu, K. and Young, SJ., 2009. Spoken language understanding from unaligned data using discriminative classification models Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960692

Yu, K., Toda, T., Gasic, M., Keizer, S., Mairesse, F., Thomson, B. and Young, SJ., 2009. Probabilistic modelling of F0 in unvoiced regions in HMM-based speech synthesis Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960448

Keizer, S., Gasic, M., Mairesse, F., Thomson, B., Yu, K. and Young, SJ., 2009. Modelling user behaviour in the HIS-POMDP dialogue manager Proceedings of the IEEE Workshop on Spoken Language Technology, SLT' 08,
Doi: http://doi.org/10.1109/SLT.2008.4777855

Gasic, M., Lefevre, F., Jurcicek, F., Keizer, S., Mairesse, F., Thomson, BRM., Yu, K. and Young, SJ., 2009. Back-off action selection in summary space-based POMDP-based dialogue systems IEEE Worskhop on Automatic Speech Recognition and Understanding, ASRU 2009,
Doi: http://doi.org/10.1109/ASRU.2009.5373416

Jurcicek, F., Gasic, M., Keizer, S., Mairesse, F., Thomson, B. and Young, SJ., 2009. Transformation-based learning for semantic parsing Proceedings of the 10th Annual Conference of the International Speech Communication Associatio,

Lefevre, F., Gasic, M., Jurcicek, F., Keizer, S., Mairesse, F., Thomson, B., Yu, K. and Young, SJ., 2009. k-nearest neighbour Monte Carlo control algorithm for POMDP-based dialogue systems Proceedings of the 2009 SIGDIAL Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue,

Toda, T. and Young, SJ., 2009. Trajectory training considering global variance for HMM-based speech synthesis Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
Doi: http://doi.org/10.1109/ICASSP.2009.4960511

2008

del Pozo, A. and Young, S., 2008. The Linear Transformation of LF Glottal Waveforms for Voice Conversion INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Thomson, B., Yu, K., Gasic, M., Keizer, S., Mairesse, E., Schatzmann, J. and Young, S., 2008. Evaluating semantic-level confidence scores with multiple hypotheses INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Del Pozo, A. and Young, SJ., 2008. The linear transformation of LF glottal waveforms for voice conversion

Thomson, B., Gasic, M., Keizer, S., Mairesse, E., Schatzmann, J., Yu, K. and Young, S., 2008. User study of the Bayesian Update of Dialogue State approach to dialogue management INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

Inanoglu, Z. and Young, SJ., 2008. Emotion conversion using Fo segment selection

Thomson, B., Yu, K., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J. and Young, SJ., 2008. Evaluating semantic-level confidence scores with multiple hypotheses

Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Yu, K. and Young, SJ., 2008. Training and evaluation of the HIS POMDP Dialogue System in noise Proceedings of the 9th SIGDial Workshop on Discourse and Dialogue,

Thomson, BRM., Schatzmann, J. and Young, SJ., 2008. Bayesian update of dialogue state for robust dialogue systems IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.2008.4518765

Schatzmann, J., Thomson, B. and Young, SJ., 2008. Error simulation for training statistical dialogue systems Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU' 07,
Doi: http://doi.org/10.1109/ASRU.2007.4430167

Del Pozo, A. and Young, SJ., 2008. Repairing tracheoesophegeal speech duration

Thomson, B., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Yu, K. and Young, SJ., 2008. User study of the Bayesian Update of Dialogue State approach to dialogue management

Inanoglu, Z. and Young, S., 2008. Emotion Conversion using F0 Segment Selection INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,

2007

Young, SJ., Schatzmann, J., Weilhammer, K. and Ye, H., 2007. The hidden information state approach to dialog management

Inanoglu, Z. and Young, S., 2007. A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,

Inanoglu, Z. and Young, SJ., 2007. A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality Interspeech 2007,

Schatzmann, J., Thomson, B. and Young, SJ., 2007. Statistical user simulation with a hidden agenda

Schatzmann, J., Thomson, BRM., Weilhammer, K., Ye, H. and Young, SJ., 2007. Agenda-based user simulation for bootstrapping a POMDP dialogue system

Thomson, BRM., Schatzmann, J., Weilhammer, K., Ye, H. and Young, SJ., 2007. Training a real-world POMDP-based dialog system

Young, SJ., Schatzmann, J., Weilhammer, K. and Ye, H., 2007. The hidden information state approach to dialog management 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 4
Doi: http://doi.org/10.1109/ICASSP.2007.367185

2006

Ye, H. and Young, SJ., 2006. A clustering approach to semantic decoding

Del Pozo, A. and Young, SJ., 2006. Continuous tracheoesophageal speech repair Proceedings of the 14th European Signal Processing Conference,

Weilhammer, H., Stuttle, M. and Young, SJ., 2006. Bootstrapping language models for dialogue systems ICSLP - International Conference - CD-ROM, v. CONF 9

Williams, J. and Young, SJ., 2006. Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI) Statistical and Empirical Approaches for Spoken Dialogue Systems: Papers from the 2006 AAAI Workshop,

Ye, H. and Young, SJ., 2006. A clustering approach to semantic decoding ICSLP - International Conference - CD-ROM, v. CONF 9

Young, SJ., 2006. Using POMDPs for dialog management IEEE Spoken Language Technology Workshop 2006,
Doi: http://doi.org/10.1109/SLT.2006.326785

2005

Schatzmann, J., Georgila, K. and Young, SJ., 2005. Quantitative evaluation of user simulation techniques for spoken dialogue systems Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,

Schatzmann, K., Weilhammer, K., Stuttle, M. and Young, SJ., 2005. Effects of the user model on simulation-based learning of dialogue strategies Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),
Doi: http://doi.org/10.1109/ASRU.2005.1566539

Seneviratne, V. and Young, SJ., 2005. The hidden vector state language model Proceedings of the 9th European Conference on Speech Communication and Technology, v. 9

Williams, J., Poupart, P. and Young, SJ., 2005. Factored partially observable Markov decision processes for dialogue management Proceedings of the 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems,

Williams, J., Poupart, P. and Young, SJ., 2005. Partially observable Markov decision processes with continuous observations for dialogue management Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,

Williams, J. and Young, SJ., 2005. Scaling up POMDPs for dialog management: the summary POMDP method Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),
Doi: http://doi.org/10.1109/ASRU.2005.1566498

Inanoglu, Z. and Young, S., 2005. Intonation modelling and adaptation for emotional prosody generation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 3784 LNCS
Doi: http://doi.org/10.1007/11573548_37

Ye, H. and Young, SJ., 2005. Improving the speech recognition performance of beginners in spoken conversational interaction for language learning Proceedings of the 9th European Conference on Speech Communciation and Technology (Interspeech 2005),

Williams, JD. and Young, S., 2005. Scaling up POMDPs for dialog management: The "Summary POMDP" method 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),

2004

Ye, H. and Young, SJ., 2004. High quality voice morphing

He, Y. and Young, SJ., 2004. Robustness issues in a data-driven spoken language understanding system

Stuttle, M., Williams, J. and Young, SJ., 2004. A framework for dialogue data collection with a simulated ASR channel Proceedings of the 8th International Conference on Spoken Language Processing (ICSLP),

Williams, J. and Young, SJ., 2004. Characterizing task-oriented dialog using a simulated ASR chanel Proceedings of the 8th International Conference on Spoken Language Processing (Interspeech 2004),

Ye, H. and Young, SJ., 2004. High quality voice morphing Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
Doi: http://doi.org/10.1109/ICASSP.2004.1325909

Ye, H. and Young, SJ., 2004. High quality voice morphing IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1

Ye, H. and Young, SJ., 2004. Voice conversion for unknown speakers Proceedings of the 8th International Conference on Spoken Language Processing (ICSLP),

2003

He, Y. and Young, SJ., 2003. A data-driven spoken language understanding system Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'03),
Doi: http://doi.org/10.1109/ASRU.2003.1318505

He, Y. and Young, SJ., 2003. Hidden vector state model for hierarchical semantic parsing Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1

Williams, J. and Young, SJ., 2003. Using Wizard-of-Oz simulations to bootstrap reinforcement learning-based dialog management systems 4th SIGdial Workshop on Discourse and Dialogue,

Ye, H. and Young, SJ., 2003. Perceptually weighted linear transformations for voice conversion Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH),

2002

Scheffler, K. and Young, SJ., 2002. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning

Young, SJ., 2002. Talking to machines (statistically speaking) Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP),

2001

Nock, H. and Young, SJ., 2001. A comparison of exact and approximate algorithms for decoding and training loosely coupled HMMs Proceedings of the Institute of Acoustics Workshop on Innovation in Speech Processing (WISP 2001), v. 23

Scheffler, K. and Young, SJ., 2001. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning Proceedings of the Conference on Empirical Methods in Natural Language Processing (NAACL),

Tuerk, A. and Young, SJ., 2001. Indicator variable dependent output probability modelling via continuous posterior functions Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
Doi: http://doi.org/10.1109/ICASSP.2001.940870

Young, SJ., 2001. Statistical modelling in continuous speech recognition (CSR) Proceedings of the 17th International Conference on Uncertainty in Artificial Intelligence (UAI 2001),

2000

Moore, G. and Young, S., 2000. Class-based language model adaptation using mixtures of word-class weights Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), v. 4

Nock, H. and Young, S., 2000. Loosely coupled HMMs for ASR Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), v. 3

Scheffler, K. and Young, SJ., 2000. Probabilistic simulation of human-machine dialogues Proceedings of the International Conference on International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. Volume 2: Signal Processing Theory and Methods II

1999

Tuerk, A. and Young, S., 1999. Modelling speaking rate using a between frame distance metric Proceedings of the 6th European Conference on Speech Communication and Technology (EUROSPEECH'99),

Witt, S. and Young, S., 1999. Off-line acoustic modelling of non-native accents Proceedings of the 6th European Conference on Speech Communication and Technology (EUROSPEECH'99),

Young, SJ., 1999. Overview of spoken dialogue systems for telephony applications IEE Colloquium Digest, v. 209

1998

Young, SJ., 1998. Speech understanding and spoken dialogue systems IEE Colloquium Digest, v. Issue 499

Hain, T., Johnson, SE., Tuerk, A., Woodland, PC. and Young, SJ., 1998. Segment generation and clustering in the HTK broadcast news transcription system Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop,

Nock, H. and Young, SJ., 1998. Detecting and correcting poor pronunciations for multiword units Proceedings of the Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition,

Witt, S. and Young, SJ., 1998. Bilingual model combination for non-native speech recognition Proceedings of the Institute of Acousticss Conference on Speech and Hearing,

Woodland, PC., Hain, T., Johnson, SE., Niesler, TR., Tuerk, A. and Young, SJ., 1998. Experiments in broadcast news transcription Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing v.1 (ICASSP '98), v. 2
Doi: http://doi.org/10.1109/ICASSP.1998.675413

1997

Brown, M., Foote, J., Jones, G., Sparck Jones, K. and Young, SJ., 1997. Open-vocabulary speech indexing for voice and video mail retrieval Proceedings of the 4th ACM International Conference on Multimedia,

Nock, H., Gales, MJF. and Young, SJ., 1997. A comparative study of methods for phonetic decision-tree state clustering

Shih, HH. and Young, SJ., 1997. A study on the portability of a grammatical inference system

Witt, S. and Young, SJ., 1997. Computer-assisted pronunciation teaching based on automatic speech recognition Proceedings of the Conference on Language Teaching and Language Technology,

Witt, S. and Young, SJ., 1997. Language learning based on non-native speech recognition

Witt, S. and Young, SJ., 1997. Pronunciation teaching based on automatic speech recognition Proceedings of the Conference on Language Teaching and Language Technology,

Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. Broadcast news transcription using HTK Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 2
Doi: http://doi.org/10.1109/ICASSP.1997.596005

Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. The development of the 1996 HTK broadcast news transcription system Proceedings of DARPA Speech Recognition Workshop,

Young, SJ., 1997. Acoustic modelling for large vocabulary continuous speech recognition Proceedings of the NATO Advanced Study Institute Conference on Computational Models of Speech Pattern Processing, v. 169

Young, SJ., 1997. Speech recognition evaluation: a review of the ARPA CSR programme

Young, SJ., Brown, M., Foote, J., Jones, G. and Sparck Jones, K., 1997. Acoustic indexing for multimedia retrieval and browsing IEEE International Conference on Acoustics Speech and Signal Processing, v. 1
Doi: http://doi.org/10.1109/ICASSP.1997.599600

Knill, K. and Young, S., 1997. Hidden Markov models in speech and language processing CORPUS-BASED METHODS IN LANGUAGE AND SPEECH PROCESSING, v. 2

1996

Blackburn, C. and Young, SJ., 1996. A self-learning speech synthesis system

Blackburn, C. and Young, SJ., 1996. Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data Proceedings of the 4th International Conference on Spoken Language Processing, v. 2

Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Retrieving spoken documents by combining multiple index sources Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,

Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Robust talker-independent audio document retrieval Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
Doi: http://doi.org/10.1109/ICASSP.1996.541094

Knill, K., Gales, MJF. and Young, SJ., 1996. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP), v. 1

Knill, K. and Young, SJ., 1996. Fast implementation methods for Viterbi-based word-spotting Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
Doi: 10.1109/ICASSP.1996.541148

Niesler, TR., Woodland, PC. and Young, SJ., 1996. A variable-length category-based n-gram language model IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 96, v. 1
Doi: http://doi.org/10.1109/ICASSP.1996.540316

Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1996. Lattice-based discriminative training for large vocabulary speech recognition 1996 IEEE International Conference on Acoustics Speech and Signal Processing conference proceedings, v. 2
Doi: http://doi.org/10.1109/ICASSP.1996.543193

Valtchev, V., Woodland, PC. and Young, SJ., 1996. Discriminative optimisation of large vocabulary recognition systems Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP 1996), v. 1

1995

Knill, K., Gales, MJF. and Young, SJ., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),

Blackburn, C. and Young, SJ., 1995. A novel self-organising speech production system using pseudo-articulators Proceedings of the 13th International Congress of Phonetic Sciences (ICPhS 95), v. 2

Blackburn, C. and Young, SJ., 1995. Learning new articulator trajectories for a speech production model using artificial neural networks Proceedings of the IEEE International Conference on Neural Networks (ICNN-95), v. 4

Blackburn, CS. and Young, SJ., 1995. Towards improved speech recognition using a speech production model EUROSPEECH 95 proceedings, v. 3

Brown, M., Foote, J., Sparck Jones, K. and Young, SJ., 1995. Automatic content-based retrieval of broadcast news Proceedings of the 3rd ACM International Multimedia Conference and Exhibition (Multimedia-95),

Foote, J., Brown, M., Jones, G., Sparck Jones, K. and Young, S., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),

Foote, JT., Jones, GJF., Sparck Jones, K. and Young, SJ., 1995. Talker-independent keyword spotting for information retrieval EUROSPEECH 95 Proceedings, v. 3

Gales, MJF. and Young, SJ., 1995. A fast and flexible implementation of parallel model combination Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ISCSSP), v. 1: Speech

Gales, MJF. and Young, SJ., 1995. The application of parallel model combination to a large vocabulary dictation task Proceedings of the 4th European Conference on Speech Communication and Technology (EUROSPEECH '95), v. 3

Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1995. Video mail retrieval: the effect of word spotting accuracy on precision Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ISCSSP), v. 1: Speech

Sparck Jones, K., Foote, J., Jones, G. and Young, SJ., 1995. Spoken document retrieval - a multimedia tool Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval,

Woodland, PC., Leggetter, CJ., Odell, JJ., Valtchev, V. and Young, SJ., 1995. Spoken language systems technology workshop Proceedings of the ARPA Spoken Language Systems Technology Workshop,

Woodland, PC., Leggetter, CJ., Odell, JJ., Valtchev, V. and Young, SJ., 1995. The 1994 HTK large vocabulary speech recognition system ICASSP-95: International Conference on Acoustics, Speech, and Signal Processing, v. 1
Doi: http://doi.org/10.1109/ICASSP.1995.479276

1994

Young, SJ. and Shih, HH., 1994. Computer assisted grammar construction Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 862 LNAI
Doi: http://doi.org/10.1007/3-540-58473-0_156

James, DA. and Young, SJ., 1994. A fast lattice-based approach to vocabulary independent wordspotting Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 1

Nolazco Flores, J. and Young, SJ., 1994. Continuous speech recognition in noise using spectral subtraction and HMM adaptation Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 1

Odell, JJ., Valtchev, V., Woodland, PC. and Young, SJ., 1994. A one-pass decoder design for large vocabulary recognition Proceedings of the ARPA Human language Technology Workshop,

Odell, JJ., Woodland, PC. and Young, SJ., 1994. Tree-based state clustering for large vocabulary speech recognition Proceedings, ISSIPNN '94: International Symposium on Speech Image Processing and Neural Networks,

Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1994. A dynamic network decoder design for large vocabulary speech recognition ICSLP 94: International Conference on Spoken Language Processing, v. 3

Woodland, PC., Odell, JJ., Valtchev, V. and Young, SJ., 1994. Large vocabulary continuous speech recognition using HTK ICASSP-94: IEEE International Conference on Acoustics Speech and Signal Processing,
Doi: http://doi.org/10.1109/ICASSP.1994.389562

Woodland, PC., Odell, JJ., Valtchev, V. and Young, SJ., 1994. The HTK large vocabulary continuous speech recognition system: an overview Proceedings of the ARPA Human language Technology Workshop,

Young, SJ., 1994. Detecting misrecognitions and out-of-vocabulary words Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 2: Speech Processing, Audio, Underwater Acoustics, VLSI and Neural Networks

Young, SJ., Odell, JJ. and Woodland, PC., 1994. Tree-based state tying for high accuracy acoustic modelling Proceedings of the ARPA Human language Technology Workshop,

1993

Gales, MJF. and Young, SJ., 1993. HMM recognition in noise using parallel model combination EUROSPEECH 93 proceedings, v. 2

Gales, MJF. and Young, SJ., 1993. Segmental hidden Markov models EUROSPEECH 93 proceedings, v. 3

Nolazco Flores, JA. and Young, SJ., 1993. Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction

Woodland, PC. and Young, SJ., 1993. The HTK tied-state continuous speech recogniser EUROSPEECH 93 proceedings, v. 3

Young, SJ. and Woodland, PC., 1993. The use of state tying in continuous speech recognition EUROSPEECH 93 proceedings, v. 3

1992

Young, SJ., 1992. The general use of tying in phoneme-based HMM speech recognisers ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
Doi: http://doi.org/10.1109/ICASSP.1992.225844

Wang, MQ. and Young, SJ., 1992. Speech recognition using hidden Markov model decomposition and a general background speech model ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
Doi: http://doi.org/10.1109/ICASSP.1992.225924

GALES, MJF. and YOUNG, S., 1992. AN IMPROVED APPROACH TO THE HIDDEN MARKOV MODEL DECOMPOSITION OF SPEECH AND NOISE ICASSP-92 - 1992 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5,

Woodland, PC. and Young, SJ., 1992. Benchmark DARPA RM results using the HTK portable HMM toolkit

1984

Willis, AR., Bruce, IPC. and Young, SJ., 1984. An experimental database query system using automatic Proceedings of the 7th International Conference on Computer Communication: The New World of the Information Society,

Journal articles

2017

Eldar, YC., Hero, AOIII., Deng, L., Fessler, J., Kovacevic, J., Poor, HV. and Young, S., 2017. Challenges and Open Problems in Signal Processing: Panel Discussion Summary from ICASSP 2017 IEEE SIGNAL PROCESSING MAGAZINE, v. 34
Doi: http://doi.org/10.1109/MSP.2017.2743842

Gašić, M., Mrkšić, N., Rojas-Barahona, LM., Su, PH., Ultes, S., Vandyke, D., Wen, TH. and Young, S., 2017. Dialogue manager domain adaptation using Gaussian process reinforcement learning Computer Speech and Language, v. 45
Doi: http://doi.org/10.1016/j.csl.2016.09.003

2016

Su, PH., Gašić, M., Mrkšić, N., Rojas-Barahona, L., Ultes, S., Vandyke, D., Wen, TH. and Young, S., 2016. On-line active reward learning for policy optimisation in spoken dialogue systems 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 4
Doi: http://doi.org/10.18653/v1/p16-1230

2014

Gašić, M. and Young, S., 2014. Gaussian processes for POMDP-based dialogue manager optimization IEEE Transactions on Audio, Speech and Language Processing, v. 22
Doi: http://doi.org/10.1109/TASL.2013.2282190

Tsiakoulis, P., Breslin, C., Gasic, M., Henderson, M., Kim, D., Szummer, M., Thomson, B. and Young, S., 2014. Dialogue context sensitive HMM-based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2014.6854061

Mairesse, F. and Young, S., 2014. Stochastic language generation in dialogue using factored language models Computational Linguistics, v. 40
Doi: http://doi.org/10.1162/COLI_a_00199

2013

Young, S., Gašić, M., Thomson, B. and Williams, JD., 2013. POMDP-based statistical spoken dialog systems: A review Proceedings of the IEEE, v. 101
Doi: http://doi.org/10.1109/JPROC.2012.2225812

Breslin, C., Gasic, M., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. Continuous asr for flexible incremental dialogue ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639296

Gasic, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. On-line policy optimisation of Bayesian spoken dialogue systems via human interaction ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2013.6639297

2012

Williams, JD., Yu, K., Chaib-Draa, B., Lemon, O., Pieraccini, R., Pietquin, O., Poupart, P. and Young, S., 2012. Introduction to the issue on advances in spoken dialogue systems and mobile interface IEEE Journal on Selected Topics in Signal Processing, v. 6
Doi: http://doi.org/10.1109/JSTSP.2012.2234401

Thomson, B., Gasic, M., Henderson, M., Tsiakoulis, P. and Young, S., 2012. N-best error simulation for training spoken dialogue systems 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2012.6424194

Henderson, M., Gasic, M., Thomson, B., Tsiakoulis, P., Yu, K. and Young, S., 2012. Discriminative spoken language understanding using word confusion networks 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2012.6424218

Gašić, M., Henderson, M., Thomson, B., Tsiakoulis, P. and Young, S., 2012. Policy optimisation of POMDP-based dialogue systems without state space compression 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2012.6424165

Jurčíček, F., Thomson, B. and Young, S., 2012. Reinforcement learning for parameter estimation in statistical spoken dialogue systems Computer Speech and Language, v. 26
Doi: http://doi.org/10.1016/j.csl.2011.09.004

2011

Jurčíček, F., Keizer, S., Gašić, M., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2011. Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Daubigney, L., Gašić, M., Chandramohan, S., Geist, M., Pietquin, O. and Young, S., 2011. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,

Yu, K. and Young, S., 2011. Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis IEEE T AUDIO SPEECH, v. 19
Doi: http://doi.org/10.1109/TASL.2010.2076805

Yu, K. and Young, S., 2011. Joint modelling of voicing label and continuous F0 for HMM based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
Doi: http://doi.org/10.1109/ICASSP.2011.5947372

Gašić, M. and Young, S., 2011. Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager ACM Transactions on Speech and Language Processing, v. 7
Doi: http://doi.org/10.1145/1966407.1966409

Jurčíček, F., Thomson, B. and Young, S., 2011. Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs ACM Transactions on Speech and Language Processing, v. 7
Doi: http://doi.org/10.1145/1966407.1966411

Jurčíček, F., Thomson, B. and Young, S., 2011. Reinforcement learning for parameter estimation in statistical spoken dialogue systems Computer Speech and Language,

Yu, K., Zen, H., Mairesse, F. and Young, S., 2011. Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis SPEECH COMMUN, v. 53
Doi: http://doi.org/10.1016/j.specom.2011.03.003

Black, AW., Burger, S., Conkie, A., Hastie, H., Keizer, S., Lemon, O., Merigaud, N., Parent, G., Schubiner, G., Thomson, B., Williams, JD., Yu, K., Young, S. and Eskenazi, M., 2011. Spoken Dialog Challenge 2010: Comparison of live and control test results Proceedings of the SIGDIAL 2011 Conference: 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue,

Gašić, M., Jurčiček, F., Thomson, B., Yu, K. and Young, S., 2011. On-line policy optimisation of spoken dialogue systems via live interaction with human subjects 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
Doi: http://doi.org/10.1109/ASRU.2011.6163950

2010

Thomson, B. and Young, SJ., 2010. Bayesian update of dialogue state: a POMDP framework for spoken dialogue systems Computer Speech and Language, v. 24
Doi: http://doi.org/10.1016/j.csl.2009.07.003

Young, SJ., 2010. Cognitive user interfaces IEEE Signal Processing Magazine, v. 27
Doi: http://doi.org/10.1109/MSP.2010.935874

Thomson, B., Yu, K., Keizer, S., Gašić, M., Jurčíček, F., Mairesse, F. and Young, S., 2010. Bayesian dialogue system for the let's go spoken dialogue challenge 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2010.5700896

Thomson, B., Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Yu, K. and Young, S., 2010. Parameter learning for POMDP spoken dialogue models 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings,
Doi: http://doi.org/10.1109/SLT.2010.5700863

Lefèvre, F., Mairesse, F. and Young, S., 2010. Cross-Lingual spoken language understanding from unaligned data using discriminative classification models and machine translation Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,

Keizer, S., Gašić, M., Jurčíček, F., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2010. Parameter estimation for agenda-based user simulation Proceedings of the SIGDIAL 2010 Conference: 11th Annual Meeting of the Special Interest Group onDiscourse and Dialogue,

Gasic, M., Jurčíček, F., Keizer, S., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2010. Gaussian processes for fast policy optimisation of POMDP-based dialogue managers Proceedings of the SIGDIAL 2010 Conference: 11th Annual Meeting of the Special Interest Group onDiscourse and Dialogue,

Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K. and Young, S., 2010. Phrase-based statistical language generation using graphical models and active learning ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference,

2009

Inanoglu, Z. and Young, S., 2009. Data-driven emotion conversation in spoken English Speech Communication, v. 51
Doi: http://doi.org/10.1016/j.specom.2008.09.006

Schatzmann, J. and Young, SJ., 2009. The hidden agenda user simulation model IEEE Transactions on Audio, Speech and Language Processing, v. 17
Doi: http://doi.org/10.1109/TASL.2008.2012071

Young, SJ., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B. and Yu, K., 2009. The hidden information state model: a practical framework for POMDP-based spoken dialogue management Computer Speech and Language, v. 24
Doi: http://doi.org/10.1016/j.csl.2009.04.001

2007

Williams, J. and Young, SJ., 2007. Scaling POMDPs for spoken dialog management IEEE Audio, Speech and Language Processing, v. 15
Doi: http://doi.org/10.1109/TASL.2007.902050

Williams, J. and Young, SJ., 2007. Partially observable Markov decision processes for spoken dialog systems Computer Speech and Language, v. 21
Doi: http://doi.org/10.1016/j.csl.2006.06.008

Gales, MJF. and Young, SJ., 2007. The application of hidden Markov models in speech recognition Foundations and Trends in Signal Processing, v. 1
Doi: http://doi.org/10.1561/20000000004

Schatzmann, J., Thomson, B. and Young, S., 2007. Statistical user simulation with a hidden agenda Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue,

2006

Ye, H. and Young, S., 2006. Quality-enhanced voice morphing using maximum likelihood transformations IEEE T AUDIO SPEECH, v. 14
Doi: http://doi.org/10.1109/TSA.2005.860839

Ye, H. and Young, SJ., 2006. Quality-enhanced voice morphing using maximum likelihood transformations IEEE Transactions on Audio, Speech and Language Processing, v. 14
Doi: http://doi.org/10.1109/TSA.2005.860839

Williams, JD. and Young, S., 2006. Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI) AAAI Workshop - Technical Report, v. WS-06-14

Schatzmann, J., Weilhammer, K., Stuttle, M. and Young, SJ., 2006. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies Knowledge Engineering Review, v. 21
Doi: http://doi.org/10.1017/S0269888906000944

He, Y. and Young, SJ., 2006. Spoken language understanding using the hidden vector state model Speech Communication, v. 48
Doi: http://doi.org/10.1016/j.specom.2005.06.002

2005

Williams, JD., Poupart, P. and Young, S., 2005. Partially Observable Markov Decision Processes with continuous observations for dialogue management Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,

Schatzmann, J., Stuttle, MN., Weilhammer, K. and Young, S., 2005. Effects of the user model on simulation-based learning of dialogue strategies Proceedings of ASRU 2005: 2005 IEEE Automatic Speech Recognition and Understanding Workshop, v. 2005
Doi: http://doi.org/10.1109/ASRU.2005.1566539

He, Y. and Young, SJ., 2005. Semantic processing using the hidden vector state model Computer Speech and Language, v. 19
Doi: http://doi.org/10.1016/j.csl.2004.03.001

2002

Nock, H. and Young, SJ., 2002. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models Cognitive Science, v. 26
Doi: http://doi.org/10.1016/S0364-0213(02)00071-X

2001

Blackburn, C. and Young, SJ., 2001. Enhanced speech recognition using an articulatory production model trained on X-ray data Computer Speech and Language, v. 15
Doi: http://doi.org/10.1006/csla.2001.0165

2000

Witt, SM. and Young, SJ., 2000. Phone-level pronunciation scoring and assessment for interactive language learning Speech Communication, v. 30
Doi: http://doi.org/10.1016/S0167-6393(99)00044-8

Wilks, YA., Sampson, G., Ostler, N., Cunningham, H., Young, SJ. and Hajicova, E., 2000. The role of taxonomy in language engineering - Discussion PHILOS T ROY SOC A, v. 358

Young, SJ., Carson-Berndsen, J., Kazakov, D., Alshawi, H. and Pereira, F., 2000. Finite-state models, event logics and statistics in speech recognition - Discussion PHILOS T ROY SOC A, v. 358

Young, SJ., 2000. Probabilistic methods in spoken-dialogue systems Philosophical Transactions of the Royal Society of London Series A: Mathematical, Physical and Engineering Sciences, v. 358
Doi: http://doi.org/10.1098/rsta.2000.0593

Blackburn, CS. and Young, SJ., 2000. A self-learning predictive model of articulator movements during speech production Journal of the Acoustical Society of America, v. 107
Doi: http://doi.org/10.1121/1.428450

Witt, S. and Young, SJ., 2000. Phone-level pronunciation scoring and assessment for interactive language learning Speech Communication, v. 30
Doi: http://doi.org/10.1016/S0167-6393(99)00044-8

1999

Knill, KM. and Young, SJ., 1999. Low-cost implementation of open set keyword spotting Computer Speech and Language, v. 13
Doi: 10.1006/csla.1999.0122

Gales, MJF., Knill, K. and Young, SJ., 1999. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs IEEE Transactions on Speech and Audio Processing, v. 7
Doi: 10.1109/89.748120

1998

Young, SJ. and Chase, LL., 1998. Speech recognition evaluation: a review of the U.S. CSR and LVCSR programmes Computer Speech and Language, v. 12
Doi: http://doi.org/10.1006/csla.1998.0101

1997

Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1997. MMIE training of large vocabulary recognition systems SPEECH COMMUN, v. 22

Foote, JT., Young, SJ., Jones, GJF. and Sparck Jones, K., 1997. Unconstrained keyword spotting using phone lattices with application to spoken document retrieval Computer Speech and Language, v. 11
Doi: http://doi.org/10.1006/csla.1997.0027

Young, SJ., Adda Decker, M., Aubert, X., Dugast, C., Gauvain, JL., Kershaw, DJ., Lamel, L., Van Leeuwen, D., Pye, D., Robinson, AJ., Steeneken, HJM. and Woodland, PC., 1997. Multilingual large vocabulary speech recognition: the European SQALE project Computer Speech and Language, v. 11
Doi: http://doi.org/10.1006/csla.1996.0023

1996

Young, S., 1996. A review of large-vocabulary continuous-speech recognition IEEE SIGNAL PROC MAG, v. 13

Gales, MJF. and Young, SJ., 1996. Robust continuous speech recognition using parallel model combination IEEE Proceedings on Speech and Audio Processing, v. 4
Doi: http://doi.org/10.1109/89.536929

Sparck Jones, K., Jones, GJF., Foote, JT. and Young, SJ., 1996. Experiments in spoken document retrieval Information Processing and Management, v. 32
Doi: http://doi.org/10.1016/0306-4573(95)00077-1

Young, SJ., 1996. A review of large-vocabulary continuous-speech IEEE Signal Processing Magazine, v. 13
Doi: http://doi.org/10.1109/79.536824

1995

Shih, HH., Young, SJ. and Waegner, NP., 1995. An inference approach to grammar construction Computer Speech and Language, v. 9
Doi: http://doi.org/10.1006/csla.1995.0012

Young, SJ., 1995. Large vocabulary speech recognition Acoustics Bulletin, v. 20

Shih, HH., Young, SJ. and Waegner, NP., 1995. Inference approach to grammar construction Computer Speech and Language, v. 9
Doi: http://doi.org/10.1006/csla.1995.0012

Gales, MJF. and Young, SJ., 1995. Robust speech recognition in additive and convolutional noise using parallel model combination Computer Speech and Language, v. 9
Doi: http://doi.org/10.1006/csla.1995.0014

1994

Young, SJ. and Woodland, PC., 1994. State clustering in hidden Markov model-based continuous speech recognition Computer Speech and Language, v. 8
Doi: http://doi.org/10.1006/csla.1994.1019

Young, SJ., Woodland, PC. and Byrne, WJ., 1994. Spontaneous speech recognition for the credit card corpus using the HTK toolkit IEEE Transactions on Speech and Audio Processing, v. 2
Doi: http://doi.org/10.1109/89.326619

Odell, JJ., Valtchev, V., Woodland, PC. and Young, SJ., 1994. Recent developments in the HTK continuous speech recognition system Proceedings of the Institute of Acoustics, v. 16

SAMARIA, F. and YOUNG, S., 1994. HMM-BASED ARCHITECTURE FOR FACE IDENTIFICATION IMAGE VISION COMPUT, v. 12

1993

Valtchev, V., Kapadia, S. and Young, SJ., 1993. Recurrent input transformations for Hidden Markov models Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, v. 2

Kapadia, S., Valtchev, V. and Young, SJ., 1993. MMI training for continuous phoneme recognition on the TIMIT database Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, v. 2

GALES, MJF. and YOUNG, SJ., 1993. CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE SPEECH COMMUN, v. 12

1992

Rainton, D. and Young, SJ., 1992. Time-frequency spectral estimation of speech Computer Speech and Language, v. 6
Doi: http://doi.org/10.1016/0885-2308(92)90042-3

1991

Young, SJ., Russell, NH. and Thornton, JHS., 1991. The use of syntax and multiple alternatives in the VODIS voice operated database inquiry system Computer Speech and Language, v. 5
Doi: http://doi.org/10.1016/0885-2308(91)90018-L

Lari, K. and Young, SJ., 1991. Applications of stochastic context-free grammars using the Inside-Outside algorithm Computer Speech and Language, v. 5
Doi: http://doi.org/10.1016/0885-2308(91)90009-F

YOUNG, SJ., 1991. COMPETITIVE TRAINING - A CONNECTIONIST APPROACH TO THE DISCRIMINATIVE TRAINING OF HIDDEN MARKOV-MODELS IEE PROC-I, v. 138

1990

Lari, K. and Young, SJ., 1990. The estimation of stochastic context-free grammars using the Inside-Outside algorithm Computer Speech and Language, v. 4
Doi: http://doi.org/10.1016/0885-2308(90)90022-X

Young, SJ., 1990. Competitive training in hidden Markov models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2

1989

Young, SJ. and Proctor, CE., 1989. The design and implementation of dialogue control in voice operated database inquiry systems Computer Speech and Language, v. 3
Doi: http://doi.org/10.1016/0885-2308(89)90002-8

1988

Young, SJ., Russell, NH. and Thornton, JHS., 1988. SPEECH RECOGNITION IN VODIS II. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,

1986

YOUNG, SJ. and PROCTOR, C., 1986. UFL - AN EXPERIMENTAL FRAME LANGUAGE BASED ON ABSTRACT-DATA-TYPES COMPUT J, v. 29

YOUNG, SJ., 1986. DESIGNING A CONVERSATIONAL SPEECH INTERFACE IEE PROC-E, v. 133

1980

Young, SJ., 1980. P-notation: High level description language for software design Microprocessors and Microsystems, v. 4
Doi: http://doi.org/10.1016/0141-9331(80)90325-7

Young, SJ., 1980. Low-level-device programming with a high-level language IEE Proceedings Part E: Computers and Digital Techniques, v. 127

YOUNG, SJ. and FALLSIDE, F., 1980. SYNTHESIS BY RULE OF PROSODIC FEATURES IN WORD CONCATENATION SYNTHESIS INT J MAN MACH STUD, v. 12

1979

YOUNG, SJ. and FALLSIDE, F., 1979. SPEECH SYNTHESIS FROM CONCEPT - METHOD FOR SPEECH OUTPUT FROM INFORMATION-SYSTEMS J ACOUST SOC AM, v. 66

1978

FALLSIDE, F. and YOUNG, SJ., 1978. SPEECH OUTPUT FROM A COMPUTER-CONTROLLED WATER-SUPPLY NETWORK P I ELECTR ENG, v. 125

FALLSIDE, F. and YOUNG, SJ., 1978. SPEECH OUTPUT SYSTEMS AND CAPTAIN-KIRK PROBLEM ELECTRON POWER, v. 24

Book chapters

2008

Williams, J., Poupart, P. and Young, SJ., 2008. Partially observable Markov decision processes with continuous observations for dialogue management
Doi: http://doi.org/10.1007/978-1-4020-6821-8_8

Young, SJ., 2008. HMMs and related speech recognition technologies

1997

Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1997. The video mail retrieval project

Other publications

2006

Young, SJ., Evermann, G., Gales, MJF., Kershaw, D., Moore, G., Odell, JJ., Ollason, DG., Povey, D., Valtchev, V. and Woodland, PC., 2006. The HTK book version 3.4

1995

Young, SJ., Jansen, J., Odell, JJ., Ollason, DG. and Woodland, PC., 1995. The HTK book

1993

Young, SJ., Woodland, PC. and Byrne, WJ., 1993. HTK V1.5: User, Reference and Programmer Manuals

Reports

2005

Williams, JD. and Young, SJ., 2005. The SACTI-1 corpus: guide for research users

2003

Young, SJ., 2003. The hidden vector state language model

Young, SJ., 2003. The statistical approach to the design of spoken dialogue systems

2001

Tuerk, A. and Young, SJ., 2001. A system for computer assisted grammar construction

1999

Scheffler, KH. and Young, SJ., 1999. Simulation of human-machine dialogues

1997

Gales, MJF., Knill, KM. and Young, SJ., 1997. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs

1996

Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Video mail retrieval using voice: report on collection of naturalistic requests and relevance assessments

1995

Knill, KM. and Young, SJ., 1995. Techniques for automatically transcribing unknown keywords for open keyword set HMM-based word-spotting

1994

Fransen, JFJ., Pye, D., Robinson, AJ., Woodland, PC. and Young, SJ., 1994. WSJCAM0 corpus and recording description

Gales, MJF. and Young, SJ., 1994. Robust continuous speech recognition using parallel model combination

Knill, KM. and Young, SJ., 1994. Speaker dependent keyword spotting for accessing stored speech

Shih, HH. and Young, SJ., 1994. A system for computer assisted grammar construction

1993

Gales, MJF. and Young, SJ., 1993. Parallel model combination for speech recognition in noise

Gales, MJF. and Young, SJ., 1993. PMC for speech recognition in additive and convolutional noise

Gales, MJF. and Young, SJ., 1993. The theory of segmental hidden Markov models

James, DA. and Young, SJ., 1993. On the application of information retrieval techniques and keyword spotting to video document retrieval

Nalazco Flores, JA. and Young, SJ., 1993. CSS-PMC: a combined enhancement/compensation scheme for continuous speech recognition in noise

Young, SJ., 1993. The HTK hidden Markov model toolkit: design and philosophy

Nolazco Flores, JA. and Young, SJ., 1993. Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction

1992

Beattie, VL. and Young, SJ., 1992. Hidden Markov model state-based noise cancellation

Wong, G. and Young, SJ., 1992. Vector quantization bigram hidden Markov modelling for improved phoneme recognition

1990

Beattie, VL. and Young, SJ., 1990. Hidden Markov model performance in noise

Young, SJ., 1990. Competitive training: a connectionist approach to the discriminative training of hidden Markov models

1989

Young, SJ., Russell, NH. and Thornton, JHS., 1989. Token passing: a simple conceptual model for connected speech recognition systems

Books

1997

1997. Corpus-based methods in language and speech processing

1982

Young, SJ., 1982. Real time languages: design and development

Theses / dissertations

1978

Young, SJ., 1978. Speech synthesis from concept with applications to speech output from systems