skip to content

Cambridge Language Sciences

Interdisciplinary Research Centre
 

Research

My research interests include speech recognition, synthesis and spoken dialogue systems. I am the inventor and original author of the HTK Toolkit for building hidden Markov model-based recognition systems (see http://htk.eng.cam.ac.uk), and I co-developed the original HTK large vocabulary speech recognition system which has figured strongly in DARPA/NIST evaluations since it was first introduced in the early nineties. More recently I have worked on statistical dialogue systems and pioneered the use of Partially Observable Markov Decision Processes for modelling them.  I also have industrial experience including four years working in the Apple Siri development team.

Publications

Key publications: 

J. Williams and S. Young (2007). Partially Observable Markov Decision Processes for Spoken Dialog Systems. Computer Speech and Language 21(2):231-422.

S. Young, M. Gasic, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson and K. Yu (2010). The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management. Computer Speech and Language, 24(2): 150-174.

F. Jurcicek, B. Thomson and S. Young (2012). Reinforcement learning for parameter estimation in statistical spoken dialogue systems. Computer Speech and Language, 26(3):127-228

S. Young (2010). Cognitive User Interfaces. IEEE Signal Processing Magazine,27(3): 128-140.

K. Yu and S. Young (2011). Continuous F0 Modelling for HMM based Statistical Parametric Speech Synthesis. IEEE Trans. Audio, Speech and Language Processing, to appear, 19(5):1071-1079.

Publications (from Symplectic)

Journal articles

2017

  • Gašić, M., Mrkšić, N., Rojas-Barahona, LM., Su, PH., Ultes, S., Vandyke, D., Wen, TH. and Young, S., 2017. Dialogue manager domain adaptation using Gaussian process reinforcement learning Computer Speech and Language, v. 45
    Doi: http://doi.org/10.1016/j.csl.2016.09.003
  • Eldar, YC., Hero, AOIII., Deng, L., Fessler, J., Kovacevic, J., Poor, HV. and Young, S., 2017. Challenges and Open Problems in Signal Processing: Panel Discussion Summary from ICASSP 2017 IEEE SIGNAL PROCESSING MAGAZINE, v. 34
    Doi: http://doi.org/10.1109/MSP.2017.2743842
  • 2016

  • Su, PH., Gašić, M., Mrkšić, N., Rojas-Barahona, L., Ultes, S., Vandyke, D., Wen, TH. and Young, S., 2016. On-line active reward learning for policy optimisation in spoken dialogue systems 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 4
    Doi: http://doi.org/10.18653/v1/p16-1230
  • Wen, TH., Gašić, M., Mrkšić, N., Rojas-Barahona, LM., Su, PH., Vandyke, D. and Young, S., 2016. Multi-domain neural network language generation for spoken dialogue systems 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
  • 2014

  • Gašić, M. and Young, S., 2014. Gaussian processes for POMDP-based dialogue manager optimization IEEE Transactions on Audio, Speech and Language Processing, v. 22
    Doi: http://doi.org/10.1109/TASL.2013.2282190
  • Tsiakoulis, P., Breslin, C., Gasic, M., Henderson, M., Kim, D., Szummer, M., Thomson, B. and Young, S., 2014. Dialogue context sensitive HMM-based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2014.6854061
  • Mairesse, F. and Young, S., 2014. Stochastic language generation in dialogue using factored language models Computational Linguistics, v. 40
    Doi: http://doi.org/10.1162/COLI_a_00199
  • 2013

  • Young, S., Gašić, M., Thomson, B. and Williams, JD., 2013. POMDP-based statistical spoken dialog systems: A review Proceedings of the IEEE, v. 101
    Doi: http://doi.org/10.1109/JPROC.2012.2225812
  • Breslin, C., Gasic, M., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. Continuous asr for flexible incremental dialogue ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639296
  • Gasic, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. On-line policy optimisation of Bayesian spoken dialogue systems via human interaction ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2013.6639297
  • 2012

  • Williams, JD., Yu, K., Chaib-Draa, B., Lemon, O., Pieraccini, R., Pietquin, O., Poupart, P. and Young, S., 2012. Introduction to the issue on advances in spoken dialogue systems and mobile interface IEEE Journal on Selected Topics in Signal Processing, v. 6
    Doi: http://doi.org/10.1109/JSTSP.2012.2234401
  • Thomson, B., Gasic, M., Henderson, M., Tsiakoulis, P. and Young, S., 2012. N-best error simulation for training spoken dialogue systems 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2012.6424194
  • Henderson, M., Gasic, M., Thomson, B., Tsiakoulis, P., Yu, K. and Young, S., 2012. Discriminative spoken language understanding using word confusion networks 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2012.6424218
  • Gašić, M., Henderson, M., Thomson, B., Tsiakoulis, P. and Young, S., 2012. Policy optimisation of POMDP-based dialogue systems without state space compression 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2012.6424165
  • Jurčíček, F., Thomson, B. and Young, S., 2012. Reinforcement learning for parameter estimation in statistical spoken dialogue systems Computer Speech and Language, v. 26
    Doi: http://doi.org/10.1016/j.csl.2011.09.004
  • 2011

  • Yu, K., Zen, H., Mairesse, F. and Young, S., 2011. Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis SPEECH COMMUN, v. 53
    Doi: http://doi.org/10.1016/j.specom.2011.03.003
  • Black, AW., Burger, S., Conkie, A., Hastie, H., Keizer, S., Lemon, O., Merigaud, N., Parent, G., Schubiner, G., Thomson, B., Williams, JD., Yu, K., Young, S. and Eskenazi, M., 2011. Spoken Dialog Challenge 2010: Comparison of live and control test results Proceedings of the SIGDIAL 2011 Conference: 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue,
  • Gašić, M., Jurčiček, F., Thomson, B., Yu, K. and Young, S., 2011. On-line policy optimisation of spoken dialogue systems via live interaction with human subjects 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2011.6163950
  • Black, AW., Burger, S., Conkie, A., Hastie, H., Keizer, S., Lemon, O., Merigaud, N., Parent, G., Schubiner, G., Thomson, B., Williams, JD., Yu, K., Young, S. and Eskenazi, M., 2011. Spoken Dialog Challenge 2010: Comparison of live and control test results Proceedings of the SIGDIAL 2011 Conference: 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue,
  • Jurčíček, F., Keizer, S., Gašić, M., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2011. Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Daubigney, L., Gašić, M., Chandramohan, S., Geist, M., Pietquin, O. and Young, S., 2011. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Yu, K. and Young, S., 2011. Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis IEEE T AUDIO SPEECH, v. 19
    Doi: http://doi.org/10.1109/TASL.2010.2076805
  • Yu, K. and Young, S., 2011. Joint modelling of voicing label and continuous F0 for HMM based speech synthesis ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
    Doi: http://doi.org/10.1109/ICASSP.2011.5947372
  • Gašić, M. and Young, S., 2011. Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager ACM Transactions on Speech and Language Processing, v. 7
    Doi: http://doi.org/10.1145/1966407.1966409
  • Jurčíček, F., Thomson, B. and Young, S., 2011. Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs ACM Transactions on Speech and Language Processing, v. 7
    Doi: http://doi.org/10.1145/1966407.1966411
  • Jurčíček, F., Thomson, B. and Young, S., 2011. Reinforcement learning for parameter estimation in statistical spoken dialogue systems Computer Speech and Language,
  • 2010

  • Keizer, S., Gašić, M., Jurčíček, F., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2010. Parameter estimation for agenda-based user simulation Proceedings of the SIGDIAL 2010 Conference: 11th Annual Meeting of the Special Interest Group onDiscourse and Dialogue,
  • Gasic, M., Jurčíček, F., Keizer, S., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2010. Gaussian processes for fast policy optimisation of POMDP-based dialogue managers Proceedings of the SIGDIAL 2010 Conference: 11th Annual Meeting of the Special Interest Group onDiscourse and Dialogue,
  • Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K. and Young, S., 2010. Phrase-based statistical language generation using graphical models and active learning ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference,
  • Thomson, B. and Young, SJ., 2010. Bayesian update of dialogue state: a POMDP framework for spoken dialogue systems Computer Speech and Language, v. 24
    Doi: http://doi.org/10.1016/j.csl.2009.07.003
  • Young, SJ., 2010. Cognitive user interfaces IEEE Signal Processing Magazine, v. 27
    Doi: http://doi.org/10.1109/MSP.2010.935874
  • Thomson, B., Yu, K., Keizer, S., Gašić, M., Jurčíček, F., Mairesse, F. and Young, S., 2010. Bayesian dialogue system for the let's go spoken dialogue challenge 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2010.5700896
  • Thomson, B., Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Yu, K. and Young, S., 2010. Parameter learning for POMDP spoken dialogue models 2010 IEEE Workshop on Spoken Language Technology, SLT 2010 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2010.5700863
  • Lefèvre, F., Mairesse, F. and Young, S., 2010. Cross-Lingual spoken language understanding from unaligned data using discriminative classification models and machine translation Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010,
  • 2009

  • Inanoglu, Z. and Young, S., 2009. Data-driven emotion conversation in spoken English Speech Communication, v. 51
    Doi: http://doi.org/10.1016/j.specom.2008.09.006
  • Schatzmann, J. and Young, SJ., 2009. The hidden agenda user simulation model IEEE Transactions on Audio, Speech and Language Processing, v. 17
    Doi: http://doi.org/10.1109/TASL.2008.2012071
  • Young, SJ., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B. and Yu, K., 2009. The hidden information state model: a practical framework for POMDP-based spoken dialogue management Computer Speech and Language, v. 24
    Doi: http://doi.org/10.1016/j.csl.2009.04.001
  • 2007

  • Schatzmann, J., Thomson, B. and Young, S., 2007. Statistical user simulation with a hidden agenda Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue,
  • Williams, J. and Young, SJ., 2007. Scaling POMDPs for spoken dialog management IEEE Audio, Speech and Language Processing, v. 15
    Doi: http://doi.org/10.1109/TASL.2007.902050
  • Williams, J. and Young, SJ., 2007. Partially observable Markov decision processes for spoken dialog systems Computer Speech and Language, v. 21
    Doi: http://doi.org/10.1016/j.csl.2006.06.008
  • Gales, MJF. and Young, SJ., 2007. The application of hidden Markov models in speech recognition Foundations and Trends in Signal Processing, v. 1
    Doi: http://doi.org/10.1561/20000000004
  • 2006

  • Ye, H. and Young, S., 2006. Quality-enhanced voice morphing using maximum likelihood transformations IEEE T AUDIO SPEECH, v. 14
    Doi: http://doi.org/10.1109/TSA.2005.860839
  • Ye, H. and Young, SJ., 2006. Quality-enhanced voice morphing using maximum likelihood transformations IEEE Transactions on Audio, Speech and Language Processing, v. 14
    Doi: http://doi.org/10.1109/TSA.2005.860839
  • Williams, JD. and Young, S., 2006. Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI) AAAI Workshop - Technical Report, v. WS-06-14
  • Schatzmann, J., Weilhammer, K., Stuttle, M. and Young, SJ., 2006. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies Knowledge Engineering Review, v. 21
    Doi: http://doi.org/10.1017/S0269888906000944
  • He, Y. and Young, SJ., 2006. Spoken language understanding using the hidden vector state model Speech Communication, v. 48
    Doi: http://doi.org/10.1016/j.specom.2005.06.002
  • 2005

  • Williams, JD., Poupart, P. and Young, S., 2005. Partially Observable Markov Decision Processes with continuous observations for dialogue management Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,
  • Schatzmann, J., Stuttle, MN., Weilhammer, K. and Young, S., 2005. Effects of the user model on simulation-based learning of dialogue strategies Proceedings of ASRU 2005: 2005 IEEE Automatic Speech Recognition and Understanding Workshop, v. 2005
    Doi: http://doi.org/10.1109/ASRU.2005.1566539
  • He, Y. and Young, SJ., 2005. Semantic processing using the hidden vector state model Computer Speech and Language, v. 19
    Doi: http://doi.org/10.1016/j.csl.2004.03.001
  • 2002

  • Nock, H. and Young, SJ., 2002. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models Cognitive Science, v. 26
    Doi: http://doi.org/10.1016/S0364-0213(02)00071-X
  • 2001

  • Blackburn, C. and Young, SJ., 2001. Enhanced speech recognition using an articulatory production model trained on X-ray data Computer Speech and Language, v. 15
    Doi: http://doi.org/10.1006/csla.2001.0165
  • 2000

  • Witt, SM. and Young, SJ., 2000. Phone-level pronunciation scoring and assessment for interactive language learning Speech Communication, v. 30
    Doi: http://doi.org/10.1016/S0167-6393(99)00044-8
  • Wilks, YA., Sampson, G., Ostler, N., Cunningham, H., Young, SJ. and Hajicova, E., 2000. The role of taxonomy in language engineering - Discussion PHILOS T ROY SOC A, v. 358
  • Young, SJ., Carson-Berndsen, J., Kazakov, D., Alshawi, H. and Pereira, F., 2000. Finite-state models, event logics and statistics in speech recognition - Discussion PHILOS T ROY SOC A, v. 358
  • Young, SJ., 2000. Probabilistic methods in spoken-dialogue systems Philosophical Transactions of the Royal Society of London Series A: Mathematical, Physical and Engineering Sciences, v. 358
    Doi: http://doi.org/10.1098/rsta.2000.0593
  • Blackburn, CS. and Young, SJ., 2000. A self-learning predictive model of articulator movements during speech production Journal of the Acoustical Society of America, v. 107
    Doi: http://doi.org/10.1121/1.428450
  • Witt, S. and Young, SJ., 2000. Phone-level pronunciation scoring and assessment for interactive language learning Speech Communication, v. 30
    Doi: http://doi.org/10.1016/S0167-6393(99)00044-8
  • 1999

  • Gales, MJF., Knill, KM. and Young, SJ., 1999. State-based gaussian selection in large vocabulary continuous speech recognition using HMM's IEEE Transactions on Speech and Audio Processing, v. 7
    Doi: 10.1109/89.748120
  • Knill, KM. and Young, SJ., 1999. Low-cost implementation of open set keyword spotting Computer Speech and Language, v. 13
    Doi: 10.1006/csla.1999.0122
  • 1998

  • Young, SJ. and Chase, LL., 1998. Speech recognition evaluation: a review of the U.S. CSR and LVCSR programmes Computer Speech and Language, v. 12
    Doi: http://doi.org/10.1006/csla.1998.0101
  • 1997

  • Foote, JT., Young, SJ., Jones, GJF. and Sparck Jones, K., 1997. Unconstrained keyword spotting using phone lattices with application to spoken document retrieval Computer Speech and Language, v. 11
    Doi: http://doi.org/10.1006/csla.1997.0027
  • Young, SJ., Adda Decker, M., Aubert, X., Dugast, C., Gauvain, JL., Kershaw, DJ., Lamel, L., Van Leeuwen, D., Pye, D., Robinson, AJ., Steeneken, HJM. and Woodland, PC., 1997. Multilingual large vocabulary speech recognition: the European SQALE project Computer Speech and Language, v. 11
    Doi: http://doi.org/10.1006/csla.1996.0023
  • Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1997. MMIE training of large vocabulary recognition systems SPEECH COMMUN, v. 22
  • 1996

  • Gales, MJF. and Young, SJ., 1996. Robust continuous speech recognition using parallel model combination IEEE Proceedings on Speech and Audio Processing, v. 4
    Doi: http://doi.org/10.1109/89.536929
  • Sparck Jones, K., Jones, GJF., Foote, JT. and Young, SJ., 1996. Experiments in spoken document retrieval Information Processing and Management, v. 32
    Doi: http://doi.org/10.1016/0306-4573(95)00077-1
  • Young, SJ., 1996. A review of large-vocabulary continuous-speech IEEE Signal Processing Magazine, v. 13
    Doi: http://doi.org/10.1109/79.536824
  • Young, S., 1996. A review of large-vocabulary continuous-speech recognition IEEE SIGNAL PROC MAG, v. 13
  • 1995

  • Gales, MJF. and Young, SJ., 1995. Robust speech recognition in additive and convolutional noise using parallel model combination Computer Speech and Language, v. 9
    Doi: http://doi.org/10.1006/csla.1995.0014
  • Shih, HH., Young, SJ. and Waegner, NP., 1995. An inference approach to grammar construction Computer Speech and Language, v. 9
    Doi: http://doi.org/10.1006/csla.1995.0012
  • Young, SJ., 1995. Large vocabulary speech recognition Acoustics Bulletin, v. 20
  • Shih, HH., Young, SJ. and Waegner, NP., 1995. Inference approach to grammar construction Computer Speech and Language, v. 9
    Doi: http://doi.org/10.1006/csla.1995.0012
  • 1994

  • SAMARIA, F. and YOUNG, S., 1994. HMM-BASED ARCHITECTURE FOR FACE IDENTIFICATION IMAGE VISION COMPUT, v. 12
  • Young, SJ. and Woodland, PC., 1994. State clustering in hidden Markov model-based continuous speech recognition Computer Speech and Language, v. 8
    Doi: http://doi.org/10.1006/csla.1994.1019
  • Young, SJ., Woodland, PC. and Byrne, WJ., 1994. Spontaneous speech recognition for the credit card corpus using the HTK toolkit IEEE Transactions on Speech and Audio Processing, v. 2
    Doi: http://doi.org/10.1109/89.326619
  • Odell, JJ., Valtchev, V., Woodland, PC. and Young, SJ., 1994. Recent developments in the HTK continuous speech recognition system Proceedings of the Institute of Acoustics, v. 16
  • 1993

  • GALES, MJF. and YOUNG, SJ., 1993. CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE SPEECH COMMUN, v. 12
  • Valtchev, V., Kapadia, S. and Young, SJ., 1993. Recurrent input transformations for Hidden Markov models Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, v. 2
  • Kapadia, S., Valtchev, V. and Young, SJ., 1993. MMI training for continuous phoneme recognition on the TIMIT database Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, v. 2
  • 1992

  • Rainton, D. and Young, SJ., 1992. Time-frequency spectral estimation of speech Computer Speech and Language, v. 6
    Doi: http://doi.org/10.1016/0885-2308(92)90042-3
  • 1991

  • YOUNG, SJ., 1991. COMPETITIVE TRAINING - A CONNECTIONIST APPROACH TO THE DISCRIMINATIVE TRAINING OF HIDDEN MARKOV-MODELS IEE PROC-I, v. 138
  • Young, SJ., Russell, NH. and Thornton, JHS., 1991. The use of syntax and multiple alternatives in the VODIS voice operated database inquiry system Computer Speech and Language, v. 5
    Doi: http://doi.org/10.1016/0885-2308(91)90018-L
  • Lari, K. and Young, SJ., 1991. Applications of stochastic context-free grammars using the Inside-Outside algorithm Computer Speech and Language, v. 5
    Doi: http://doi.org/10.1016/0885-2308(91)90009-F
  • 1990

  • Lari, K. and Young, SJ., 1990. The estimation of stochastic context-free grammars using the Inside-Outside algorithm Computer Speech and Language, v. 4
    Doi: http://doi.org/10.1016/0885-2308(90)90022-X
  • Young, SJ., 1990. Competitive training in hidden Markov models ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2
  • 1989

  • Young, SJ. and Proctor, CE., 1989. The design and implementation of dialogue control in voice operated database inquiry systems Computer Speech and Language, v. 3
    Doi: http://doi.org/10.1016/0885-2308(89)90002-8
  • 1988

  • Young, SJ., Russell, NH. and Thornton, JHS., 1988. SPEECH RECOGNITION IN VODIS II. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings,
  • 1986

  • YOUNG, SJ., 1986. DESIGNING A CONVERSATIONAL SPEECH INTERFACE IEE PROC-E, v. 133
  • YOUNG, SJ. and PROCTOR, C., 1986. UFL - AN EXPERIMENTAL FRAME LANGUAGE BASED ON ABSTRACT-DATA-TYPES COMPUT J, v. 29
  • 1980

  • Young, SJ., 1980. P-notation: High level description language for software design Microprocessors and Microsystems, v. 4
    Doi: http://doi.org/10.1016/0141-9331(80)90325-7
  • Young, SJ., 1980. Low-level-device programming with a high-level language IEE Proceedings Part E: Computers and Digital Techniques, v. 127
  • YOUNG, SJ. and FALLSIDE, F., 1980. SYNTHESIS BY RULE OF PROSODIC FEATURES IN WORD CONCATENATION SYNTHESIS INT J MAN MACH STUD, v. 12
  • 1979

  • YOUNG, SJ. and FALLSIDE, F., 1979. SPEECH SYNTHESIS FROM CONCEPT - METHOD FOR SPEECH OUTPUT FROM INFORMATION-SYSTEMS J ACOUST SOC AM, v. 66
  • 1978

  • FALLSIDE, F. and YOUNG, SJ., 1978. SPEECH OUTPUT FROM A COMPUTER-CONTROLLED WATER-SUPPLY NETWORK P I ELECTR ENG, v. 125
  • FALLSIDE, F. and YOUNG, SJ., 1978. SPEECH OUTPUT SYSTEMS AND CAPTAIN-KIRK PROBLEM ELECTRON POWER, v. 24
  • Conference proceedings

    2017

  • Vulic, I., Mrkšic, N., Reichart, R., Séaghdha, D., Young, S. and Korhonen, A., 2017. Morph-fitting: Fine-tuning word vector spaces with simple language-specific rules ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 1
    Doi: http://doi.org/10.18653/v1/P17-1006
  • Wen, TH., Vandyke, D., Mrkšíc, N., Gašíc, M., Rojas-Barahona, LM., Su, PH., Ultes, S. and Young, S., 2017. A network-based end-to-end trainable task-oriented dialogue system 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, v. 1
    Doi: http://doi.org/10.18653/v1/e17-1042
  • 2016

  • Gasic, M., Mrksic, N., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2016. Policy committee for adaptation in multi-domain spoken dialogue systems 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404871
  • Vandyke, D., Su, PH., Gasic, M., Mrksic, N., Wen, TH. and Young, S., 2016. Multi-domain dialogue success classifiers for policy training 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings,
    Doi: http://doi.org/10.1109/ASRU.2015.7404865
  • Mrkšić, N., Séaghdha, D., Thomson, B., Gašić, M., Rojas-Barahona, L., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2016. Counter-fitting word vectors to linguistic constraints 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
  • Wen, TH., Gašić, M., Mrkšić, N., Rojas-Barahona, LM., Su, PH., Vandyke, D. and Young, S., 2016. Multi-domain neural network language generation for spoken dialogue systems 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/n16-1015
  • Mrkšić, N., Séaghdha, D., Thomson, B., Gašić, M., Rojas-Barahona, L., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2016. Counter-fitting word vectors to linguistic constraints 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/n16-1018
  • Young, S., 2016. Towards open domain spoken dialogue systems Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9577
  • 2015

  • Su, PH., Vandyke, D., Gašić, M., Mrkšić, N., Wen, TH. and Young, S., 2015. Reward shaping with recurrent neural networks for speeding up on-line policy learning in spoken dialogue systems SIGDIAL 2015 - 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/w15-4655
  • Wen, TH., Gašić, M., Kim, D., Mrkšić, N., Su, PH., Vandyke, D. and Young, S., 2015. Stochastic language generation in dialogue using recurrent neural networks with convolutional sentence reranking SIGDIAL 2015 - 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/w15-4639
  • Mrkšić, N., Séaghdha, DO., Thomson, B., Gašić, M., Su, PH., Vandyke, D., Wen, TH. and Young, S., 2015. Multi-domain dialog state tracking using recurrent neural networks ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference, v. 2
    Doi: http://doi.org/10.3115/v1/p15-2130
  • Gasic, M., Kim, D., Tsiakoulis, P. and Young, S., 2015. Distributed dialogue policies for multi-domain statistical dialogue management ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2015-August
    Doi: http://doi.org/10.1109/ICASSP.2015.7178997
  • Wen, TH., Gašić, M., Mrkšić, N., Su, PH., Vandyke, D. and Young, S., 2015. Semantically conditioned lstm-based Natural language generation for spoken dialogue systems Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing,
    Doi: http://doi.org/10.18653/v1/d15-1199
  • Su, PH., Vandyke, D., Gašíc, M., Kim, D., Mrkšíc, N., Wen, TH. and Young, S., 2015. Learning from real users: Rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2015-January
  • 2014

  • Henderson, M., Thomson, B. and Young, S., 2014. Word-based dialog state tracking with recurrent neural networks SIGDIAL 2014 - 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
    Doi: http://doi.org/10.3115/v1/w14-4340
  • Tsiakoulis, P., Breslin, C., Gəsić, M., Henderson, M., Kim, D. and Young, S., 2014. Dialogue context sensitive speech synthesis using factorized decision trees Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Kim, D., Breslin, C., Tsiakoulis, P., Gašić, M., Henderson, M. and Young, S., 2014. Inverse reinforcement learning for micro-turn management Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Gašić, M., Kim, D., Tsiakoulis, P., Breslin, C., Henderson, M., Szummer, M., Thomson, B. and Young, S., 2014. Incremental on-line adaptation of POMDP-based dialogue managers to extended domains Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH,
  • Kim, D., Henderson, M., Gasic, M., Tsiakoulis, P. and Young, S., 2014. The use of discriminative belief tracking in POMDP-based dialogue systems 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2014.7078600
  • Henderson, M., Thomson, B. and Young, S., 2014. Robust dialog state tracking using delexicalised recurrent neural networks and unsupervised adaptation 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings,
    Doi: http://doi.org/10.1109/SLT.2014.7078601
  • 2013

  • Gašić, M., Breslin, C., Henderson, M., Kim, D., Szummer, M., Thomson, B., Tsiakoulis, P. and Young, S., 2013. POMDP-based dialogue manager adaptation to extended domains SIGDIAL 2013 - 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
  • Henderson, M., Thomson, B. and Young, S., 2013. Deep neural network approach for the dialog state tracking challenge SIGDIAL 2013 - 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
  • 2012

  • Gašić, M., Tsiakoulis, P., Henderson, M., Thomson, B., Yu, K., Tzirkel, E. and Young, S., 2012. The effect of cognitive load on a statistical dialogue system SIGDIAL 2012 - 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference,
  • Tsiakoulis, P., Gasic, M., Henderson, M., Planells-Lerma, J., Prombonas, J., Thomson, B., Yu, K., Young, S. and Tzirkel, E., 2012. Statistical methods for building robust spoken dialogue systems in an automobile
    Doi: http://doi.org/10.1201/b12320
  • 2011

  • Daubigney, L., Gasic, M., Chandramohan, S., Geist, M., Pietquin, O. and Young, S., 2011. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
  • Jurcicek, F., Keizer, S., Gasic, M., Mairesse, F., Thomson, B., Yu, K. and Young, S., 2011. Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5,
  • 2010

  • Jurcicek, F., Thomson, B., Keizer, S., Mairesse, F., Gasic, M., Yu, K. and Young, S., 2010. Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,
  • Yu, K., Zen, H., Mairesse, F. and Young, S., 2010. Context Adaptive Training with Factorized Decision Trees for HMM-Based Speech Synthesis 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,
  • Yu, K., Mairesse, F. and Young, S., 2010. WORD-LEVEL EMPHASIS MODELLING IN HMM-BASED SPEECH SYNTHESIS 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING,
  • Young, S., 2010. Still Talking to Machines (Cognitively Speaking) 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4,
  • 2009

  • Jurcicek, F., Gasic, M., Keizer, S., Mairesse, E., Thomson, B., Yu, K. and Young, S., 2009. Transformation-based Learning for Semantic parsing INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
  • Mairesse, F., Gasic, M., Jurcicek, F., Keizer, S., Thomson, B., Yu, K. and Young, SJ., 2009. Spoken language understanding from unaligned data using discriminative classification models Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960692
  • Yu, K., Toda, T., Gasic, M., Keizer, S., Mairesse, F., Thomson, B. and Young, SJ., 2009. Probabilistic modelling of F0 in unvoiced regions in HMM-based speech synthesis Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960448
  • Keizer, S., Gasic, M., Mairesse, F., Thomson, B., Yu, K. and Young, SJ., 2009. Modelling user behaviour in the HIS-POMDP dialogue manager Proceedings of the IEEE Workshop on Spoken Language Technology, SLT' 08,
    Doi: http://doi.org/10.1109/SLT.2008.4777855
  • Gasic, M., Lefevre, F., Jurcicek, F., Keizer, S., Mairesse, F., Thomson, BRM., Yu, K. and Young, SJ., 2009. Back-off action selection in summary space-based POMDP-based dialogue systems IEEE Worskhop on Automatic Speech Recognition and Understanding, ASRU 2009,
    Doi: http://doi.org/10.1109/ASRU.2009.5373416
  • Jurcicek, F., Gasic, M., Keizer, S., Mairesse, F., Thomson, B. and Young, SJ., 2009. Transformation-based learning for semantic parsing Proceedings of the 10th Annual Conference of the International Speech Communication Associatio,
  • Lefevre, F., Gasic, M., Jurcicek, F., Keizer, S., Mairesse, F., Thomson, B., Yu, K. and Young, SJ., 2009. k-nearest neighbour Monte Carlo control algorithm for POMDP-based dialogue systems Proceedings of the 2009 SIGDIAL Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue,
  • Toda, T. and Young, SJ., 2009. Trajectory training considering global variance for HMM-based speech synthesis Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 2009,
    Doi: http://doi.org/10.1109/ICASSP.2009.4960511
  • 2008

  • Thomson, B., Yu, K., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J. and Young, SJ., 2008. Evaluating semantic-level confidence scores with multiple hypotheses
  • Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Yu, K. and Young, SJ., 2008. Training and evaluation of the HIS POMDP Dialogue System in noise Proceedings of the 9th SIGDial Workshop on Discourse and Dialogue,
  • Thomson, BRM., Schatzmann, J. and Young, SJ., 2008. Bayesian update of dialogue state for robust dialogue systems IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.2008.4518765
  • Schatzmann, J., Thomson, B. and Young, SJ., 2008. Error simulation for training statistical dialogue systems Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU' 07,
    Doi: http://doi.org/10.1109/ASRU.2007.4430167
  • Del Pozo, A. and Young, SJ., 2008. Repairing tracheoesophegeal speech duration
  • Thomson, B., Gasic, M., Keizer, S., Mairesse, F., Schatzmann, J., Yu, K. and Young, SJ., 2008. User study of the Bayesian Update of Dialogue State approach to dialogue management
  • del Pozo, A. and Young, S., 2008. The Linear Transformation of LF Glottal Waveforms for Voice Conversion INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Thomson, B., Yu, K., Gasic, M., Keizer, S., Mairesse, E., Schatzmann, J. and Young, S., 2008. Evaluating semantic-level confidence scores with multiple hypotheses INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Thomson, B., Gasic, M., Keizer, S., Mairesse, E., Schatzmann, J., Yu, K. and Young, S., 2008. User study of the Bayesian Update of Dialogue State approach to dialogue management INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Inanoglu, Z. and Young, S., 2008. Emotion Conversion using F0 Segment Selection INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5,
  • Del Pozo, A. and Young, SJ., 2008. The linear transformation of LF glottal waveforms for voice conversion
  • Inanoglu, Z. and Young, SJ., 2008. Emotion conversion using Fo segment selection
  • 2007

  • Inanoglu, Z. and Young, SJ., 2007. A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality Interspeech 2007,
  • Schatzmann, J., Thomson, B. and Young, SJ., 2007. Statistical user simulation with a hidden agenda
  • Schatzmann, J., Thomson, BRM., Weilhammer, K., Ye, H. and Young, SJ., 2007. Agenda-based user simulation for bootstrapping a POMDP dialogue system
  • Thomson, BRM., Schatzmann, J., Weilhammer, K., Ye, H. and Young, SJ., 2007. Training a real-world POMDP-based dialog system
  • Young, SJ., Schatzmann, J., Weilhammer, K. and Ye, H., 2007. The hidden information state approach to dialog management 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 4
    Doi: http://doi.org/10.1109/ICASSP.2007.367185
  • Young, SJ., Schatzmann, J., Weilhammer, K. and Ye, H., 2007. The hidden information state approach to dialog management
  • Inanoglu, Z. and Young, S., 2007. A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4,
  • 2006

  • Del Pozo, A. and Young, SJ., 2006. Continuous tracheoesophageal speech repair Proceedings of the 14th European Signal Processing Conference,
  • Weilhammer, H., Stuttle, M. and Young, SJ., 2006. Bootstrapping language models for dialogue systems ICSLP - International Conference - CD-ROM, v. CONF 9
  • Williams, J. and Young, SJ., 2006. Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI) Statistical and Empirical Approaches for Spoken Dialogue Systems: Papers from the 2006 AAAI Workshop,
  • Ye, H. and Young, SJ., 2006. A clustering approach to semantic decoding ICSLP - International Conference - CD-ROM, v. CONF 9
  • Young, SJ., 2006. Using POMDPs for dialog management IEEE Spoken Language Technology Workshop 2006,
    Doi: http://doi.org/10.1109/SLT.2006.326785
  • Ye, H. and Young, SJ., 2006. A clustering approach to semantic decoding
  • 2005

  • Inanoglu, Z. and Young, S., 2005. Intonation modelling and adaptation for emotional prosody generation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 3784 LNCS
    Doi: http://doi.org/10.1007/11573548_37
  • Schatzmann, J., Georgila, K. and Young, SJ., 2005. Quantitative evaluation of user simulation techniques for spoken dialogue systems Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,
  • Schatzmann, K., Weilhammer, K., Stuttle, M. and Young, SJ., 2005. Effects of the user model on simulation-based learning of dialogue strategies Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),
    Doi: http://doi.org/10.1109/ASRU.2005.1566539
  • Seneviratne, V. and Young, SJ., 2005. The hidden vector state language model Proceedings of the 9th European Conference on Speech Communication and Technology, v. 9
  • Williams, J., Poupart, P. and Young, SJ., 2005. Factored partially observable Markov decision processes for dialogue management Proceedings of the 4th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems,
  • Williams, J., Poupart, P. and Young, SJ., 2005. Partially observable Markov decision processes with continuous observations for dialogue management Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue,
  • Williams, J. and Young, SJ., 2005. Scaling up POMDPs for dialog management: the summary POMDP method Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),
    Doi: http://doi.org/10.1109/ASRU.2005.1566498
  • Ye, H. and Young, SJ., 2005. Improving the speech recognition performance of beginners in spoken conversational interaction for language learning Proceedings of the 9th European Conference on Speech Communciation and Technology (Interspeech 2005),
  • Williams, JD. and Young, S., 2005. Scaling up POMDPs for dialog management: The "Summary POMDP" method 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU),
  • 2004

  • Ye, H. and Young, SJ., 2004. Voice conversion for unknown speakers Proceedings of the 8th International Conference on Spoken Language Processing (ICSLP),
  • Ye, H. and Young, SJ., 2004. High quality voice morphing
  • He, Y. and Young, SJ., 2004. Robustness issues in a data-driven spoken language understanding system
  • Stuttle, M., Williams, J. and Young, SJ., 2004. A framework for dialogue data collection with a simulated ASR channel Proceedings of the 8th International Conference on Spoken Language Processing (ICSLP),
  • Williams, J. and Young, SJ., 2004. Characterizing task-oriented dialog using a simulated ASR chanel Proceedings of the 8th International Conference on Spoken Language Processing (Interspeech 2004),
  • Ye, H. and Young, SJ., 2004. High quality voice morphing Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
    Doi: http://doi.org/10.1109/ICASSP.2004.1325909
  • Ye, H. and Young, SJ., 2004. High quality voice morphing IEEE International Conference on Acoustics, Speech and Signal Processing, v. 1
  • 2003

  • He, Y. and Young, SJ., 2003. A data-driven spoken language understanding system Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'03),
    Doi: http://doi.org/10.1109/ASRU.2003.1318505
  • He, Y. and Young, SJ., 2003. Hidden vector state model for hierarchical semantic parsing Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
  • Williams, J. and Young, SJ., 2003. Using Wizard-of-Oz simulations to bootstrap reinforcement learning-based dialog management systems 4th SIGdial Workshop on Discourse and Dialogue,
  • Ye, H. and Young, SJ., 2003. Perceptually weighted linear transformations for voice conversion Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH),
  • 2002

  • Scheffler, K. and Young, SJ., 2002. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning
  • Young, SJ., 2002. Talking to machines (statistically speaking) Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP),
  • 2001

  • Nock, H. and Young, SJ., 2001. A comparison of exact and approximate algorithms for decoding and training loosely coupled HMMs Proceedings of the Institute of Acoustics Workshop on Innovation in Speech Processing (WISP 2001), v. 23
  • Scheffler, K. and Young, SJ., 2001. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning Proceedings of the Conference on Empirical Methods in Natural Language Processing (NAACL),
  • Tuerk, A. and Young, SJ., 2001. Indicator variable dependent output probability modelling via continuous posterior functions Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
    Doi: http://doi.org/10.1109/ICASSP.2001.940870
  • Young, SJ., 2001. Statistical modelling in continuous speech recognition (CSR) Proceedings of the 17th International Conference on Uncertainty in Artificial Intelligence (UAI 2001),
  • 2000

  • Moore, G. and Young, S., 2000. Class-based language model adaptation using mixtures of word-class weights Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), v. 4
  • Nock, H. and Young, S., 2000. Loosely coupled HMMs for ASR Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), v. 3
  • Scheffler, K. and Young, SJ., 2000. Probabilistic simulation of human-machine dialogues Proceedings of the International Conference on International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. Volume 2: Signal Processing Theory and Methods II
  • 1999

  • Tuerk, A. and Young, S., 1999. Modelling speaking rate using a between frame distance metric Proceedings of the 6th European Conference on Speech Communication and Technology (EUROSPEECH'99),
  • Witt, S. and Young, S., 1999. Off-line acoustic modelling of non-native accents Proceedings of the 6th European Conference on Speech Communication and Technology (EUROSPEECH'99),
  • Young, SJ., 1999. Overview of spoken dialogue systems for telephony applications IEE Colloquium Digest, v. 209
  • 1998

  • Young, SJ., 1998. Speech understanding and spoken dialogue systems IEE Colloquium Digest, v. Issue 499
  • Hain, T., Johnson, SE., Tuerk, A., Woodland, PC. and Young, SJ., 1998. Segment generation and clustering in the HTK broadcast news transcription system Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop,
  • Nock, H. and Young, SJ., 1998. Detecting and correcting poor pronunciations for multiword units Proceedings of the Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition,
  • Witt, S. and Young, SJ., 1998. Bilingual model combination for non-native speech recognition Proceedings of the Institute of Acousticss Conference on Speech and Hearing,
  • Woodland, PC., Hain, T., Johnson, SE., Niesler, TR., Tuerk, A. and Young, SJ., 1998. Experiments in broadcast news transcription Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing v.1 (ICASSP '98), v. 2
    Doi: http://doi.org/10.1109/ICASSP.1998.675413
  • 1997

  • Witt, S. and Young, SJ., 1997. Computer-assisted pronunciation teaching based on automatic speech recognition Proceedings of the Conference on Language Teaching and Language Technology,
  • Witt, S. and Young, SJ., 1997. Language learning based on non-native speech recognition
  • Witt, S. and Young, SJ., 1997. Pronunciation teaching based on automatic speech recognition Proceedings of the Conference on Language Teaching and Language Technology,
  • Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. Broadcast news transcription using HTK Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 2
    Doi: http://doi.org/10.1109/ICASSP.1997.596005
  • Woodland, PC., Gales, MJF., Pye, D. and Young, SJ., 1997. The development of the 1996 HTK broadcast news transcription system Proceedings of DARPA Speech Recognition Workshop,
  • Young, SJ., 1997. Acoustic modelling for large vocabulary continuous speech recognition Proceedings of the NATO Advanced Study Institute Conference on Computational Models of Speech Pattern Processing, v. 169
  • Knill, K. and Young, S., 1997. Hidden Markov models in speech and language processing CORPUS-BASED METHODS IN LANGUAGE AND SPEECH PROCESSING, v. 2
  • Young, SJ., 1997. Speech recognition evaluation: a review of the ARPA CSR programme
  • Young, SJ., Brown, M., Foote, J., Jones, G. and Sparck Jones, K., 1997. Acoustic indexing for multimedia retrieval and browsing IEEE International Conference on Acoustics Speech and Signal Processing, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1997.599600
  • Brown, M., Foote, J., Jones, G., Sparck Jones, K. and Young, SJ., 1997. Open-vocabulary speech indexing for voice and video mail retrieval Proceedings of the 4th ACM International Conference on Multimedia,
  • Nock, H., Gales, MJF. and Young, SJ., 1997. A comparative study of methods for phonetic decision-tree state clustering
  • Shih, HH. and Young, SJ., 1997. A study on the portability of a grammatical inference system
  • 1996

  • Blackburn, C. and Young, SJ., 1996. A self-learning speech synthesis system
  • Blackburn, C. and Young, SJ., 1996. Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data Proceedings of the 4th International Conference on Spoken Language Processing, v. 2
  • Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Retrieving spoken documents by combining multiple index sources Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,
  • Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Robust talker-independent audio document retrieval Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), v. 1
    Doi: http://doi.org/10.1109/ICASSP.1996.541094
  • Knill, KM., Gales, MJF. and Young, SJ., 1996. Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs International Conference on Spoken Language Processing, ICSLP, Proceedings, v. 1
  • Knill, KM. and Young, SJ., 1996. Fast implementation methods for Viterbi-based word-spotting ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
  • Niesler, TR., Woodland, PC. and Young, SJ., 1996. A variable-length category-based n-gram language model IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP 96, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1996.540316
  • Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1996. Lattice-based discriminative training for large vocabulary speech recognition 1996 IEEE International Conference on Acoustics Speech and Signal Processing conference proceedings, v. 2
    Doi: http://doi.org/10.1109/ICASSP.1996.543193
  • Valtchev, V., Woodland, PC. and Young, SJ., 1996. Discriminative optimisation of large vocabulary recognition systems Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP 1996), v. 1
  • 1995

  • Knill, K., Gales, MJF. and Young, SJ., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),
  • Blackburn, C. and Young, SJ., 1995. A novel self-organising speech production system using pseudo-articulators Proceedings of the 13th International Congress of Phonetic Sciences (ICPhS 95), v. 2
  • Blackburn, C. and Young, SJ., 1995. Learning new articulator trajectories for a speech production model using artificial neural networks Proceedings of the IEEE International Conference on Neural Networks (ICNN-95), v. 4
  • Blackburn, CS. and Young, SJ., 1995. Towards improved speech recognition using a speech production model EUROSPEECH 95 proceedings, v. 3
  • Brown, M., Foote, J., Sparck Jones, K. and Young, SJ., 1995. Automatic content-based retrieval of broadcast news Proceedings of the 3rd ACM International Multimedia Conference and Exhibition (Multimedia-95),
  • Foote, J., Brown, M., Jones, G., Sparck Jones, K. and Young, S., 1995. Video mail retrieval using voice: an overview of the stage 2 system Proceedings of the Final Workshop on Multimedia Information Retrieval (Miro '95),
  • Foote, JT., Jones, GJF., Sparck Jones, K. and Young, SJ., 1995. Talker-independent keyword spotting for information retrieval EUROSPEECH 95 Proceedings, v. 3
  • Gales, MJF. and Young, SJ., 1995. A fast and flexible implementation of parallel model combination Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ISCSSP), v. 1: Speech
  • Gales, MJF. and Young, SJ., 1995. The application of parallel model combination to a large vocabulary dictation task Proceedings of the 4th European Conference on Speech Communication and Technology (EUROSPEECH '95), v. 3
  • Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1995. Video mail retrieval: the effect of word spotting accuracy on precision Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ISCSSP), v. 1: Speech
  • Sparck Jones, K., Foote, J., Jones, G. and Young, SJ., 1995. Spoken document retrieval - a multimedia tool Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval,
  • Woodland, PC., Leggetter, CJ., Odell, JJ., Valtchev, V. and Young, SJ., 1995. Spoken language systems technology workshop Proceedings of the ARPA Spoken Language Systems Technology Workshop,
  • Woodland, PC., Leggetter, CJ., Odell, JJ., Valtchev, V. and Young, SJ., 1995. The 1994 HTK large vocabulary speech recognition system ICASSP-95: International Conference on Acoustics, Speech, and Signal Processing, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1995.479276
  • 1994

  • James, DA. and Young, SJ., 1994. A fast lattice-based approach to vocabulary independent wordspotting Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 1
  • Nolazco Flores, J. and Young, SJ., 1994. Continuous speech recognition in noise using spectral subtraction and HMM adaptation Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 1
  • Odell, JJ., Valtchev, V., Woodland, PC. and Young, SJ., 1994. A one-pass decoder design for large vocabulary recognition Proceedings of the ARPA Human language Technology Workshop,
  • Odell, JJ., Woodland, PC. and Young, SJ., 1994. Tree-based state clustering for large vocabulary speech recognition Proceedings, ISSIPNN '94: International Symposium on Speech Image Processing and Neural Networks,
  • Valtchev, V., Odell, JJ., Woodland, PC. and Young, SJ., 1994. A dynamic network decoder design for large vocabulary speech recognition ICSLP 94: International Conference on Spoken Language Processing, v. 3
  • Woodland, PC., Odell, JJ., Valtchev, V. and Young, SJ., 1994. Large vocabulary continuous speech recognition using HTK ICASSP-94: IEEE International Conference on Acoustics Speech and Signal Processing,
    Doi: http://doi.org/10.1109/ICASSP.1994.389562
  • Woodland, PC., Odell, JJ., Valtchev, V. and Young, SJ., 1994. The HTK large vocabulary continuous speech recognition system: an overview Proceedings of the ARPA Human language Technology Workshop,
  • Young, SJ., 1994. Detecting misrecognitions and out-of-vocabulary words Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-94), v. 2: Speech Processing, Audio, Underwater Acoustics, VLSI and Neural Networks
  • Young, SJ., Odell, JJ. and Woodland, PC., 1994. Tree-based state tying for high accuracy acoustic modelling Proceedings of the ARPA Human language Technology Workshop,
  • Young, SJ. and Shih, HH., 1994. Computer assisted grammar construction Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 862 LNAI
    Doi: http://doi.org/10.1007/3-540-58473-0_156
  • 1993

  • Nolazco Flores, JA. and Young, SJ., 1993. Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction
  • Woodland, PC. and Young, SJ., 1993. The HTK tied-state continuous speech recogniser EUROSPEECH 93 proceedings, v. 3
  • Young, SJ. and Woodland, PC., 1993. The use of state tying in continuous speech recognition EUROSPEECH 93 proceedings, v. 3
  • Gales, MJF. and Young, SJ., 1993. HMM recognition in noise using parallel model combination EUROSPEECH 93 proceedings, v. 2
  • Gales, MJF. and Young, SJ., 1993. Segmental hidden Markov models EUROSPEECH 93 proceedings, v. 3
  • 1992

  • Woodland, PC. and Young, SJ., 1992. Benchmark DARPA RM results using the HTK portable HMM toolkit
  • Young, SJ., 1992. The general use of tying in phoneme-based HMM speech recognisers ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1992.225844
  • Wang, MQ. and Young, SJ., 1992. Speech recognition using hidden Markov model decomposition and a general background speech model ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1
    Doi: http://doi.org/10.1109/ICASSP.1992.225924
  • GALES, MJF. and YOUNG, S., 1992. AN IMPROVED APPROACH TO THE HIDDEN MARKOV MODEL DECOMPOSITION OF SPEECH AND NOISE ICASSP-92 - 1992 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5,
  • 1984

  • Willis, AR., Bruce, IPC. and Young, SJ., 1984. An experimental database query system using automatic Proceedings of the 7th International Conference on Computer Communication: The New World of the Information Society,
  • Book chapters

    2008

  • Williams, J., Poupart, P. and Young, SJ., 2008. Partially observable Markov decision processes with continuous observations for dialogue management
    Doi: http://doi.org/10.1007/978-1-4020-6821-8_8
  • Young, SJ., 2008. HMMs and related speech recognition technologies
  • 1997

  • Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1997. The video mail retrieval project
  • Other publications

    2006

  • Young, SJ., Evermann, G., Gales, MJF., Kershaw, D., Moore, G., Odell, JJ., Ollason, DG., Povey, D., Valtchev, V. and Woodland, PC., 2006. The HTK book version 3.4
  • 1995

  • Young, SJ., Jansen, J., Odell, JJ., Ollason, DG. and Woodland, PC., 1995. The HTK book
  • 1993

  • Young, SJ., Woodland, PC. and Byrne, WJ., 1993. HTK V1.5: User, Reference and Programmer Manuals
  • Reports

    2005

  • Williams, JD. and Young, SJ., 2005. The SACTI-1 corpus: guide for research users
  • 2003

  • Young, SJ., 2003. The hidden vector state language model
  • Young, SJ., 2003. The statistical approach to the design of spoken dialogue systems
  • 2001

  • Tuerk, A. and Young, SJ., 2001. A system for computer assisted grammar construction
  • 1999

  • Scheffler, KH. and Young, SJ., 1999. Simulation of human-machine dialogues
  • 1997

  • Gales, MJF., Knill, KM. and Young, SJ., 1997. State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs
  • 1996

  • Jones, G., Foote, J., Sparck Jones, K. and Young, SJ., 1996. Video mail retrieval using voice: report on collection of naturalistic requests and relevance assessments
  • 1995

  • Knill, KM. and Young, SJ., 1995. Techniques for automatically transcribing unknown keywords for open keyword set HMM-based word-spotting
  • 1994

  • Fransen, JFJ., Pye, D., Robinson, AJ., Woodland, PC. and Young, SJ., 1994. WSJCAM0 corpus and recording description
  • Gales, MJF. and Young, SJ., 1994. Robust continuous speech recognition using parallel model combination
  • Knill, KM. and Young, SJ., 1994. Speaker dependent keyword spotting for accessing stored speech
  • Shih, HH. and Young, SJ., 1994. A system for computer assisted grammar construction
  • 1993

  • Young, SJ., 1993. The HTK hidden Markov model toolkit: design and philosophy
  • Nolazco Flores, JA. and Young, SJ., 1993. Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction
  • Gales, MJF. and Young, SJ., 1993. Parallel model combination for speech recognition in noise
  • Gales, MJF. and Young, SJ., 1993. PMC for speech recognition in additive and convolutional noise
  • Gales, MJF. and Young, SJ., 1993. The theory of segmental hidden Markov models
  • James, DA. and Young, SJ., 1993. On the application of information retrieval techniques and keyword spotting to video document retrieval
  • Nalazco Flores, JA. and Young, SJ., 1993. CSS-PMC: a combined enhancement/compensation scheme for continuous speech recognition in noise
  • 1992

  • Beattie, VL. and Young, SJ., 1992. Hidden Markov model state-based noise cancellation
  • Wong, G. and Young, SJ., 1992. Vector quantization bigram hidden Markov modelling for improved phoneme recognition
  • 1990

  • Beattie, VL. and Young, SJ., 1990. Hidden Markov model performance in noise
  • Young, SJ., 1990. Competitive training: a connectionist approach to the discriminative training of hidden Markov models
  • 1989

  • Young, SJ., Russell, NH. and Thornton, JHS., 1989. Token passing: a simple conceptual model for connected speech recognition systems
  • Books

    1997

  • 1997. Corpus-based methods in language and speech processing
  • 1982

  • Young, SJ., 1982. Real time languages: design and development
  • Theses / dissertations

    1978

  • Young, SJ., 1978. Speech synthesis from concept with applications to speech output from systems
  • Emeritus Professor of Information Engineering, Emmanuel College
    Professor Steve  Young

    Affiliations

    Classifications: