skip to content

Cambridge Language Sciences

Interdisciplinary Research Centre
 
Read more at: Chris Bryant

Chris Bryant

Grammatical error detection and correction, CALL, NLP

Theses / dissertations

2019

  • Bryant, CJ., 2019. Automatic annotation of error types for grammatical error correction
    Doi: http://doi.org/10.17863/CAM.40832
  • Conference proceedings

    2017

  • Bryant, CJ., Felice, M. and Briscoe, E., 2017. Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, v. 1
  • Journal articles

    2016

  • Felice, M., Bryant, C. and Briscoe, T., 2016. Automatic extraction of learner errors in ESL sentences using linguistically enhanced alignments COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,

  • Read more at: Natalia Budohoska

    Natalia Budohoska

    English; world Englishes; English in post-colonial context; multilingualism; English as a world language; language change and diversity


    Read more at: Dr Simon Baker

    Dr Simon Baker

    Natural language processing; biomedical text; lexical acquisition

    Journal articles

    2023 (Accepted for publication)

  • Collins, C., Baker, S., Brown, J., Zeng, H., Chan, A., Stenius, U., Narita, M. and Korhonen, A., 2023 (Accepted for publication). Text Mining for Contexts and Relationships in Cancer Genomics Literature Bioinformatics,
    Doi: http://doi.org/10.1093/bioinformatics/btae021
  • 2021

  • Ali, I., Dreij, K., Baker, S., Högberg, J., Korhonen, A. and Stenius, U., 2021. Application of Text Mining in Risk Assessment of Chemical Mixtures: A Case Study of Polycyclic Aromatic Hydrocarbons (PAHs). Environ Health Perspect, v. 129
    Doi: http://doi.org/10.1289/EHP6702
  • Su, Y., Wang, Y., Cai, D., Baker, S., Korhonen, A. and Collier, N., 2021. PROTOTYPE-TO-STYLE: Dialogue Generation with Style-Aware Editing on Retrieval Memory IEEE/ACM Transactions on Audio Speech and Language Processing, v. 29
    Doi: http://doi.org/10.1109/TASLP.2021.3087948
  • Majewska, O., Collins, C., Baker, S., Björne, J., Brown, SW., Korhonen, A. and Palmer, M., 2021. BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine. J Biomed Semantics, v. 12
    Doi: http://doi.org/10.1186/s13326-021-00247-z
  • 2020 (Accepted for publication)

  • Vulic, I., Baker, S., Ponti, E., Petti, U., Leviant, I., Wing, K., Majewska, O., Bar, E., Malone, M., Poibeau, T., Reichart, R. and Korhonen, A., 2020 (Accepted for publication). Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity Computational Linguistics,
    Doi: http://doi.org/10.1162/coli_a_00391
  • 2020

  • Wichmann, P., Brintrup, A., Baker, S., Woodall, P. and McFarlane, D., 2020. Extracting supply chain maps from news articles using deep neural networks International Journal of Production Research, v. 58
    Doi: http://doi.org/10.1080/00207543.2020.1720925
  • Crichton, G., Baker, S., Guo, Y. and Korhonen, A., 2020. Neural networks for open and closed Literature-based Discovery. PLoS One, v. 15
    Doi: http://doi.org/10.1371/journal.pone.0232891
  • Petti, U., Baker, S. and Korhonen, A., 2020. A systematic literature review of automatic Alzheimer's disease detection from speech and language. J Am Med Inform Assoc, v. 27
    Doi: http://doi.org/10.1093/jamia/ocaa174
  • Chiu, B. and Baker, S., 2020. Word embeddings for biomedical natural language processing: A survey Language and Linguistics Compass, v. 14
    Doi: http://doi.org/10.1111/lnc3.12402
  • 2019

  • Pyysalo, S., Baker, S., Ali, I., Haselwimmer, S., Shah, T., Young, A., Guo, Y., Högberg, J., Stenius, U., Narita, M. and Korhonen, A., 2019. LION LBD: a literature-based discovery system for cancer biology. Bioinformatics, v. 35
    Doi: http://doi.org/10.1093/bioinformatics/bty845
  • 2018

  • Wichmann, P., Brintrup, A., Baker, S., Woodall, P. and McFarlane, D., 2018. Towards automatically generating supply chain maps from natural language text
    Doi: http://doi.org/10.1016/j.ifacol.2018.08.207
  • 2017 (Accepted for publication)

  • Baker, S., Ali, I., Silins, I., Pyysalo, S., Guo, Y., Högberg, J., Stenius, U. and Korhonen, A., 2017 (Accepted for publication). Cancer Hallmarks Analytics Tool (CHAT): A text mining approach to organise and evaluate scientific literature on cancer Bioinformatics, v. 33
    Doi: http://doi.org/10.1093/bioinformatics/btx454
  • 2017

  • Larsson, K., Baker, S., Silins, I., Guo, Y., Stenius, U., Korhonen, A. and Berglund, M., 2017. Text mining for improved exposure assessment PLOS One, v. 12
    Doi: http://doi.org/10.1371/journal.pone.0173132
  • 2016

  • Baker, S., Silins, I., Guo, Y., Ali, I., Högberg, J., Stenius, U. and Korhonen, A., 2016. Automatic semantic classification of scientific literature according to the hallmarks of cancer. Bioinformatics, v. 32
    Doi: http://doi.org/10.1093/bioinformatics/btv585
  • 2015

  • Korhonen, A., Baker, S., Silins, I., Guo, Y., Ali, I., Hogberg, J. and Stenius, U., 2015. Automatic Semantic Classification of Scientific Literature According to the Hallmarks of Cancer Bioinformatics,
  • Conference proceedings

    2021

  • Su, Y., Cai, D., Wang, Y., Vandyke, D., Baker, S., Li, P. and Collier, N., 2021. Non-Autoregressive Text Generation with Pre-trained Language Models Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics,
  • 2019

  • Chiu, B., Baker, S., Palmer, M. and Korhonen, A., 2019. Enhancing biomedical word embeddings by retrofitting to verb clusters BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task,
  • 2018

  • Stathopoulos, YA., Baker, S., Rei, M. and Teufel, S., 2018. Variable typing: Assigning meaning to variables in mathematical text NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, v. 1
  • Mendes, E., Rodriguez, P., Freitas, V., Baker, S. and Atoui, MA., 2018. Towards improving decision making and estimating the value of decisions in value-based software engineering: the VALUE framework Software Quality Journal, v. 26
    Doi: http://doi.org/10.1007/s11219-017-9360-z
  • 2017 (No publication date)

  • Baker, S., Korhonen, A. and Pyysalo, S., 2017 (No publication date). Cancer Hallmark Text Classification Using Convolutional Neural Networks
    Doi: http://doi.org/10.17863/CAM.12420
  • 2017

  • Baker, S. and Korhonen, A., 2017. Initializing neural networks for hierarchical multi-label text classification BioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop,
    Doi: 10.18653/v1/w17-2339
  • 2016

  • Baker, S., Kiela, D. and Korhonen, A., 2016. Robust text classification for sparsely labelled data using multi-level embeddings COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • 2015

  • Korhonen, A., Guo, Y., Baker, S., Yetisgen-Yildiz, M., Stenius, U., Narita, M. and Liò, P., 2015. Improving literature-based discovery with advanced text mining Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 8623
    Doi: http://doi.org/10.1007/978-3-319-24462-4_8
  • 2014

  • Baker, S., Reichart, R. and Korhonen, A., 2014. An unsupervised model for instance level subcategorization acquisition EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • 2010

  • Baker, S. and Mendes, E., 2010. Aggregating Expert-Driven Causal Maps for Web Effort Estimation ADVANCES IN SOFTWARE ENGINEERING, v. 117
  • Baker, S. and Mendes, E., 2010. Evaluating the Weighted Sum Algorithm for Estimating Conditional Probabilities in Bayesian Networks 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING & KNOWLEDGE ENGINEERING (SEKE 2010),
  • 2008

  • Baker, S., Au, F., Dobbie, G. and Warren, I., 2008. Automated usability testing using HUI Analyzer ASWEC 2008: 19TH AUSTRALIAN SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS,
    Doi: http://doi.org/10.1109/ASWEC.2008.40
  • Baker, S., Au, F., Dobbie, G. and Warren, I., 2008. Automated usability testing using HUI analyzer Proceedings of the Australian Software Engineering Conference, ASWEC,
    Doi: http://doi.org/10.1109/ASWEC.2008.4483248

  • Read more at: Dr Theresa Biberauer

    Dr Theresa Biberauer

    Theoretical, comparative and historical syntax

    Journal articles

    2019

  • Biberauer, T., 2019. Some thoughts on the complexity of syntactic complexity Theoretical Linguistics, v. 45
    Doi: http://doi.org/10.1515/tl-2019-0017
  • Biberauer, T., 2019. Children always go beyond the input: The Maximise Minimal Means perspective Theoretical Linguistics, v. 45
    Doi: http://doi.org/10.1515/tl-2019-0013
  • Biberauer, T., 2019. Factors 2 and 3: Towards a principled approach Catalan Journal of Linguistics, v. 2019
    Doi: http://doi.org/10.5565/REV/CATJL.219
  • 2018

  • Biberauer, T., 2018. Less is more: On the tolerance principle as a manifestation of maximize minimal means Linguistic Approaches to Bilingualism, v. 8
    Doi: http://doi.org/10.1075/lab.18080.bib
  • 2016 (No publication date)

  • Biberauer, T., 2016 (No publication date). ’Nie sommer nie’: Sociohistorical and formal comparative considerations in the rise and maintenance of the modern Afrikaans negation system Stellenbosch Papers in Linguistics Plus, v. 47
  • 2016

  • Deuchar, M. and Biberauer, T., 2016. Doubling: An error or an illusion? Bilingualism, v. 19
    Doi: http://doi.org/10.1017/S1366728916000043
  • 2015

  • Biberauer, T. and Roberts, I., 2015. Rethinking Formal Hierarchies: A Proposed Unification Cambridge Occasional Papers in Linguistics, v. 7
  • 2014

  • Biberauer, T., Haegeman, L. and Van Kemenade, A., 2014. Putting our heads together: Towards a syntax of particles Studia Linguistica, v. 68
    Doi: http://doi.org/10.1111/stul.12021
  • Biberauer, T., Holmberg, A. and Roberts, I., 2014. A Syntactic Universal and Its Consequences Linguistic Inquiry, v. 45
    Doi: 10.1162/ling_a_00153
  • 2012 (No publication date)

  • Biberauer, MT. and Zeijlstra, HH., 2012 (No publication date). Negative Concord in Afrikaans: filling a typological gap Journal of Semantics,
    Doi: http://doi.org/10.1093/jos/ffr010
  • 2011

  • Biberauer, T. and Sheehan, M., 2011. Introduction: Particles through a modern syntactic lens Linguistic Review, v. 28
    Doi: http://doi.org/10.1515/tlir.2011.011
  • Biberauer, T. and Van Kemenade, A., 2011. Subject positions and information-structural diversification in the history of english Catalan Journal of Linguistics, v. 10
    Doi: http://doi.org/10.5565/rev/catjl.32
  • 2010

  • Biberauer, T. and Roberts, I., 2010. Comments on Jager "Anything is nothing is something": on the diachrony of polarity types of indefinites NAT LANG LINGUIST TH, v. 28
    Doi: 10.1007/s11049-010-9111-3
  • Biberauer, T. and D'Alessandro, R., 2010. On the role of gemination in passives: the case of Abruzzese. Snippets, v. 21
  • 2009

  • Biberauer, MT., Newton, G. and Sheehan, M., 2009. Limiting synchronic and diachronic variation and change: the Final-Over-Final Constraint. Language and Linguistics, v. 10
  • Biberauer, T., Newton, G. and Sheehan, M., 2009. Limiting Synchronic and Diachronic Variation and Change: The Final-over-Final Constraint LANG LINGUIST, v. 10
  • 2007

  • Biberauer, T., 2007. A closer look at Negative Concord in Afrikaans Stellenbosch Papers in Linguistics Plus, v. 35
  • 2001

  • Biberauer, T., 2001. Language Change in Afrikaans and the Perennial V2 Puzzle: Considering New Data Durham Working Papers in Linguistics, v. 7
  • 1996

  • Biberauer, T., 1996. Frightening or enlightening? An appraisal of the functions of the military metaphor in the aids context Stellenbosch Papers in Linguistics Plus, v. 29
  • Book chapters

    2018

  • Biberauer, T., 2018. Pro-drop and emergent parameter hierarchies
    Doi: http://doi.org/10.1093/oso/9780198815853.003.0005
  • Biberauer, MT., 2018. Probing the nature of the Final-over-Final Condition: the perspective from adpositions
    Doi: http://doi.org/10.5281/zenodo.1117686
  • 2017 (Published online)

  • Biberauer, T. and Roberts, I., 2017 (Published online). Chapter 4. Conditional inversion and types of parametric change
    Doi: 10.1075/la.243.04rob
  • Biberauer, T., 2017 (Published online). Chapter 5. Optional V2 in modern Afrikaans
    Doi: 10.1075/la.243.05bib
  • 2017

  • Biberauer, T. and Roberts, I., 2017. Parameter setting
    Doi: http://doi.org/10.1017/9781107279070.008
  • Biberauer, MT., 2017. Particles and the Final-over-Final Condition
  • 2016

  • Biberauer, T. and Roberts, I., 2016. Parameter typology from a diachronic perspective
    Doi: 10.1075/la.234.10bib
  • 2015

  • Biberauer, M. and Roberts, I., 2015. The clausal hierarchy, features and parameters
  • 2014

  • Biberauer, MT., Holmberg, JA., Roberts, I. and Sheehan, ML., 2014. Complexity in comparative syntax: the view from modern parametric theory
  • 2013 (No publication date)

  • Sheehan, ML., Biberauer, T., Holmberg, AH. and Roberts, IGR., 2013 (No publication date). Complexity in comparative syntax: the view from modern parametric theory
  • 2013

  • Biberauer, T. and Zeijlstra, H., 2013. Negative changes: Three factors and the diachrony of Afrikaans negation
    Doi: http://doi.org/10.1093/acprof:oso/9780199659203.003.0013
  • 2012

  • Biberauer, T., 2012. Competing reinforcements: When languages opt out of Jespersen’s Cycle
  • Biberauer, T. and Sheehan, M., 2012. Disharmony, Antisymmetry, and the Final-over-Final Constraint
    Doi: http://doi.org/10.1093/acprof:oso/9780199644933.003.0009
  • Biberauer, T., 2012. Competing Reinforcements: When Languages Opt Out of Jespersen's Cycle
  • 2011

  • Roberts, IG. and Biberauer, MT., 2011. Negative words and related expressions: a new perspective on some familiar puzzles
  • 2010

  • Biberauer, MT., Sheehan, M. and Newton, G., 2010. Impossible Changes and Impossible Borrowings: the Final-over-Final Constraint.
  • Biberauer, T., Sheehan, M. and Newton, G., 2010. Impossible changes and impossible borrowings
  • 2009

  • Biberauer, T., 2009. Chapter 5. Jespersen off course? The case of contemporary Afrikaans negation
  • Biberauer, MT., 2009. Jespersen off course? The case of contemporary Afrikaans negation.
  • Biberauer, T. and Roberts, I., 2009. The return of the Subset Principle
    Doi: 10.1093/acprof:oso/9780199560547.003.0004
  • Biberauer, MT. and Roberts, I., 2009. Subjects, Tense and Verb-movement in Germanic and Romance.
  • Biberauer, MT., 2009. How "well-behaved" is Afrikaans? V2 in Modern Spoken Afrikaans
  • Biberauer, MT., 2009. Semi pro-drop languages, expletives and expletive pro reconsidered.
  • Biberauer, T. and Roberts, I., 2009. Subjects, tense and verb-movement
    Doi: 10.1017/CBO9780511770784.008
  • Biberauer, T., 2009. Competing reinforcements
  • 2008

  • Biberauer, T. and Roberts, I., 2008. Cascading Parameter Changes: Internally-Driven Change in Middle and Early Modern English
  • Biberauer, MT., Holmberg, A. and Roberts, I., 2008. Structure and linearization in disharmonic word orders.
  • Biberauer, T., 2008. Doubling vs. omission: Insights from afrikaans negation
    Doi: http://doi.org/10.1163/9781848550216_005
  • Biberauer, MT., 2008. Introduction
  • Biberauer, MT., 2008. Doubling and omission: insights from Afrikaans negation.
  • 2007

  • Biberauer, T. and Roberts, I., 2007. Loss of residual “head final” orders and remnant fronting in Late Middle English
  • 2006

  • Biberauer, T. and Richards, M., 2006. True optionality
  • Biberauer, T., 2006. Afrikaans
    Doi: http://doi.org/10.1016/B0-08-044854-2/02190-8
  • Biberauer, T. and Richards, M., 2006. True optionality
    Doi: 10.1075/la.91.08bib
  • Biberauer, T. and Roberts, I., 2006. Loss of residual “head final” orders and remnant fronting in Late Middle English
    Doi: 10.1075/la.97.13bib
  • 2005

  • Richards, M. and Biberauer, T., 2005. Explaining Expl
    Doi: 10.1075/la.78.06ric
  • 2002

  • Biberauer, T., 2002. Verb Second in Afrikaans: Is This a Unitary Phenomenon?
  • Books

    2017

  • Sheehan, ML., Biberauer, TB, , Holmberg, A, and Roberts, IG, , 2017. The Final over Final Condition: A word-order universal and its implications for linguistic theory
  • 2015

  • 2015. Introduction: Changing views of syntactic change
  • 2013

  • 2013. Challenges to Linearization
    Doi: 10.1515/9781614512431
  • 2012 (No publication date)

  • Sheehan, ML. and Biberauer, T., 2012 (No publication date). Theoretical Approaches to Disharmonic Word Orders
  • 2009

  • Biberauer, T., Holmberg, A. and Roberts, I., 2009. Parametric variation
  • Biberauer, T., Holmberg, A., Roberts, I. and Sheehan, M., 2009. Parametric variation: Null subjects in minimalist theory
    Doi: http://doi.org/10.1017/CBO9780511770784
  • Biberauer, T., Holmberg, A. and Roberts, I., 2009. Parametric variation
  • 2008

  • Biberauer, T., 2008. The limits of syntactic variation
  • Conference proceedings

    2016 (No publication date)

  • Biberauer, T. and Cyrino, SML., 2016 (No publication date). Negative developments in Afrikaans and Brazilian Portuguese
  • 2011 (No publication date)

  • Biberauer, MT., Holmberg, A. and Roberts, I., 2011 (No publication date). Linearising disharmonic word orders: the Final-over-Final Constraint.
  • 2008

  • Biberauer, MT., Holmberg, A. and Roberts, I., 2008. Disharmonic word-order systems and the Final-over-Final-Constraint (FOFC)In Proceedings of XXXIII Incontro di Grammatica Generativa. http://amsacta.cib.unibo.it/archive/00002397/01/PROCEEDINGS_IGG33.pdf,
  • 2005

  • Biberauer, T. and Roberts, I., 2005. Changing EPP parameters in the history of English: accounting for variation and change ENGLISH LANGUAGE & LINGUISTICS, v. 9
    Doi: http://doi.org/10.1017/S1360674305001528
  • Other publications

    2015

  • 2015. Syntax over Time
    Doi: 10.1093/acprof:oso/9780199687923.001.0001

  • Read more at: Professor Paula Buttery

    Professor Paula Buttery

    Co-Director of Cambridge Language Sciences; Lead Scientific Adviser for Cambridge University Press & Assessment; Director of the Cambridge Institute for Automated Language Teaching and Assessment; Machine Learning for Natural Language Processing; Cognitive Models of Language; Low Resource Language.

    Conference proceedings

    2024

  • Gherardi, E., Benedetto, L., Matera, M. and Buttery, P., 2024. Using Knowledge Graphs to Improve Question Difficulty Estimation from Text Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 14830 LNAI
    Doi: http://doi.org/10.1007/978-3-031-64299-9_24
  • 2023

  • Diehl Martinez, R., Goriely, Z., McGovern, H., Davis, C., Caines, A., Buttery, P. and Beinborn, L., 2023. CLIMB – Curriculum Learning for Infant-inspired Model Building CoNLL 2023 - BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, Proceedings,
  • 2022

  • Felice, M., Taslimipoor, S., Andersen, ØE. and Buttery, P., 2022. CEPOC: The Cambridge Exams Publishing Open Cloze dataset 2022 Language Resources and Evaluation Conference, LREC 2022,
  • Davis, C., Bryant, C., Caines, A., Rei, M. and Buttery, P., 2022. Probing for targeted syntactic knowledge through grammatical error detection CoNLL 2022 - 26th Conference on Computational Natural Language Learning, Proceedings of the Conference,
  • Tyen, G., Brenchley, M., Caines, A. and Buttery, P., 2022. Towards an open-domain chatbot for language practice BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
    Doi: 10.18653/v1/2022.bea-1.28
  • Pete, I., Hughes, J., Caines, A., Vu, AV., Gupta, H., Hutchings, A., Anderson, R. and Buttery, P., 2022. PostCog: A tool for interdisciplinary research into underground forums at scale Proceedings - 7th IEEE European Symposium on Security and Privacy Workshops, Euro S and PW 2022,
    Doi: http://doi.org/10.1109/EuroSPW55150.2022.00016
  • Wambsganss, T., Caines, A. and Buttery, P., 2022. ALEN App: Persuasive Writing Support To Foster English Language Learning BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Rietsche, R., Caines, A., Schramm, C., Pfütze, D. and Buttery, P., 2022. The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Felice, M., Taslimipoor, S. and Buttery, P., 2022. Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2022.findings-acl.100
  • 2020

  • Caines, A., Bentz, C., Knill, K., Rei, M. and Buttery, P., 2020. Grammatical error detection in transcriptions of spoken English Proceedings of the 28th International Conference on Computational Linguistics,
    Doi: 10.18653/v1/2020.coling-main.195
  • Zaidi, A., Caines, A., Moore, R., Buttery, P. and Rice, A., 2020. Adaptive Forgetting Curves for Spaced Repetition Language Learning Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 12164 LNAI
    Doi: http://doi.org/10.1007/978-3-030-52240-7_65
  • Craighead, H., Caines, A., Buttery, P. and Yannakoudakis, H., 2020. Investigating the effect of auxiliary objectives for the automated grading of learner english speech transcriptions Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2020.acl-main.206
  • Hughes, J., Aycock, S., Caines, A., Buttery, P. and Hutchings, A., 2020. Detecting Trending Terms in Cybersecurity Forum Discussions Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020),
    Doi: 10.18653/v1/2020.wnut-1.15
  • Caines, A., Bentz, C., Knill, K., Rei, M. and Buttery, P., 2020. Grammatical error detection in transcriptions of spoken English COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference,
  • Caines, A. and Buttery, P., 2020. REPROLANG 2020: Automatic proficiency scoring of Czech, English, German, Italian, and Spanish learner essays LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings,
  • 2019

  • Aglionby, G., Davis, C., Mishra, P., Caines, A., Yannakoudakis, H., Rei, M., Shutova, E. and Buttery, P., 2019. CAMsterdam at SemEval-2019 task 6: Neural and graph-based feature extraction for the identification of offensive tweets NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop,
  • Moore, R., Caines, A., Rice, A. and Buttery, P., 2019. Behavioural cloning of teachers for automatic homework selection Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11625 LNAI
    Doi: http://doi.org/10.1007/978-3-030-23204-7_28
  • Moore, R., Caines, A., Elliott, M., Zaidi, A., Rice, A. and Buttery, P., 2019. Skills embeddings: A neural approach to multicomponent representations of students and tasks EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Zaidi, AH., Caines, A., Davis, C., Moore, R., Buttery, P. and Rice, A., 2019. Accurate modelling of language learning tasks and students using representations of grammatical proficiency EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Felice, M. and Buttery, P., 2019. Entropy as a proxy for gap complexity in open cloze tests International Conference Recent Advances in Natural Language Processing, RANLP, v. 2019-September
    Doi: http://doi.org/10.26615/978-954-452-056-4_037
  • 2018

  • Pastrana, S., Hutchings, A., Caines, A. and Buttery, P., 2018. Characterizing eve: Analysing cybercrime actors in a large underground forum Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11050 LNCS
    Doi: http://doi.org/10.1007/978-3-030-00470-5_10
  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, P., 2018. Aggressive language in an online hacking forum 2nd Workshop on Abusive Language Online - Proceedings of the Workshop, co-located with EMNLP 2018,
  • 2017

  • Flint, E., Ford, E., Thomas, O., Caines, A. and Buttery, P., 2017. A Text Normalisation System for Non-Standard English Words 3rd Workshop on Noisy User-Generated Text, W-NUT 2017 - Proceedings of the Workshop,
  • Caines, A., Flint, E. and Buttery, P., 2017. Collecting fluency corrections for spoken learner english EMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop,
  • Caines, A., McCarthy, M. and Buttery, P., 2017. Parsing transcripts of speech EMNLP 2017 - 1st Workshop on Speech-Centric Natural Language Processing, SCNLP 2017 - Proceedings of the Workshop,
  • Graham, C., Buttery, P. and Nolan, F., 2017. Vowel characteristics in the assessment of L2 English pronunciation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-1630
  • 2016

  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2016. Automated speech-unit delimitation in spoken learner English COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • Zhang, W., Caines, A., Alikaniotis, D. and Buttery, P., 2016. Predicting author age from Weibo microblog posts Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • Caines, A., Bentz, C., Graham, C., Polzehl, T. and Buttery, P., 2016. Crowdsourcing a multilingual speech corpus: Recording, transcription and annotation of the CROWDED corpus Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • 2015

  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2015. Incremental dependency parsing and disfluency detection in spoken learner English Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9302
    Doi: http://doi.org/10.1007/978-3-319-24033-6_53
  • 2012

  • Caines, A. and Buttery, P., 2012. Annotating progressive aspect constructions in the spoken section of the british national Corpus Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
  • Buttery, P. and Caines, A., 2012. Reclassifying subcategorization frames for experimental analysis and stimulus generation Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
  • 2010

  • Caines, A. and Buttery, P., 2010. ‘You talking to me?’ A predictive model for zero auxiliary constructions Proceedings of the Workshop on Natural Language Processing and Linguistics, Finding the Common Ground, Annual Meeting of the Association for Computational Linguistics,
  • Thwaites, A., Geertzen, J., Marslen-Wilson, WD. and Buttery, P., 2010. LIPS: A tool for predicting the lexical isolation point of a word Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010,
  • Thwaites, A., Geertzen, J., Marslen-Wilson, WD. and Buttery, P., 2010. LIPS: A Tool for Predicting the Lexical Isolation Point of a Word Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10),
  • Williams, C., Thwaites, A., Buttery, P., Geertzen, J., Randall, B., Shafto, M., Devereux, B. and Tyler, L., 2010. The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10),
  • 2009

  • Vlachos, A., Buttery, P., Séaghdha, DO. and Briscoe, T., 2009. Biomedical event extraction without training data Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task,
  • 2007

  • Buttery, P. and Korhonen, A., 2007. I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition,
  • Journal articles

    2023

  • Goriely, Z., Caines, A. and Buttery, P., 2023. Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues. J Child Lang,
    Doi: http://doi.org/10.1017/S0305000923000491
  • Benedetto, L., Cremonesi, P., Caines, A., Buttery, P., Cappelli, A., Giussani, A. and Turrin, R., 2023. A Survey on Recent Approaches to Question Difficulty Estimation from Text ACM Computing Surveys, v. 55
    Doi: 10.1145/3556538
  • 2022

  • Elliott, M. and Buttery, P., 2022. Non-iterative Conditional Pairwise Estimation for the Rating Scale Model. Educ Psychol Meas, v. 82
    Doi: http://doi.org/10.1177/00131644211046253
  • 2021

  • Katushemererwe, F., Caines, A. and Buttery, P., 2021. Building natural language processing tools for Runyakitara Applied Linguistics Review, v. 12
    Doi: http://doi.org/10.1515/applirev-2020-2004
  • 2019

  • Caines, A., Altmann-Richer, E. and Buttery, P., 2019. The cross-linguistic performance of word segmentation models over time. J Child Lang, v. 46
    Doi: http://doi.org/10.1017/S0305000919000485
  • 2018

  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, PJ., 2018. Automatically identifying the function and intent of posts in underground forums Crime Science, v. 7
    Doi: http://doi.org/10.1186/s40163-018-0094-4
  • 2017

  • Bentz, C., Alikaniotis, D., Samardžić, T. and Buttery, P., 2017. Variation in Word Frequency Distributions: Definitions, Measures and Implications for a Corpus-Based Language Typology Journal of Quantitative Linguistics, v. 24
    Doi: http://doi.org/10.1080/09296174.2016.1265792
  • 2015

  • Thwaites, A., Nimmo-Smith, I., Fonteneau, E., Patterson, RD., Buttery, P. and Marslen-Wilson, WD., 2015. Tracking cortical entrainment in neural activity: auditory processes in human temporal cortex. Front Comput Neurosci, v. 9
    Doi: http://doi.org/10.3389/fncom.2015.00005
  • Bentz, C., Verkerk, A., Kiela, D., Hill, F. and Buttery, P., 2015. Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms. PLoS One, v. 10
    Doi: http://doi.org/10.1371/journal.pone.0128254
  • 2014

  • Bentz, C., Kiela, D., Hill, F. and Buttery, P., 2014. Zipf's law and the grammar of languages: A quantitative study of old and modern English parallel texts Corpus Linguistics and Linguistic Theory, v. 10
    Doi: http://doi.org/10.1515/cllt-2014-0009
  • 2012 (No publication date)

  • Rice, A., Buttery, P., Rai, IA. and Beresford, A., 2012 (No publication date). Language learning on a next-generation service platform for Africa
  • 2011

  • McEntyre, JR., Ananiadou, S., Andrews, S., Black, WJ., Boulderstone, R., Buttery, P., Chaplin, D., Chevuru, S., Cobley, N., Coleman, LA., Davey, P., Gupta, B., Haji-Gholam, L., Hawkins, C., Horne, A., Hubbard, SJ., Kim, JH., Lewin, I., Lyte, V., MacIntyre, R., Mansoor, S., Mason, L., McNaught, J., Newbold, E., Nobata, C., Ong, E., Pillai, S., Rebholz-Schuhmann, D., Rosie, H., Rowbotham, R., Rupp, CJ., Stoehr, P. and Vaughan, P., 2011. UKPMC: a full text article resource for the life sciences NUCLEIC ACIDS RES, v. 39
    Doi: http://doi.org/10.1093/nar/gkq1063
  • Andersen, Ø., Briscoe, T., Buttery, P., Carroll, J., Medlock, B., Parish, T. and Watson, R., 2011. Text Processing Tools and Services from iLexIR Ltd
  • 2010

  • Hawkins, JA. and Buttery, P., 2010. Criterial features in learner corpora: Theory and illustrations English Profile Journal, v. 1
    Doi: http://doi.org/10.1017/S2041536210000103
  • Poornima, S., Good, J., Su, Q., Huang, CR., Chen, K., Sharma, DM., Dimitriadis, A., Plank, B., van Noord, G., Caines, A. and others, , 2010. Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground$$ Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground$$,
  • Briscoe, T., Buttery, P., Carroll, J., Medlock, B. and Watson, R., 2010. Text Processing Tools and Services from iLexIR Ltd
  • 2009

  • Hawkins, JA. and Buttery, P., 2009. Using learner language from corpora to profile levels of proficiency: Insights from the english profile programme Language Testing Matters: Investigating the wider social and educational impact of assessment,
  • 2008

  • Briscoe, T. and Buttery, P., 2008. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY The evolution of language: proceedings of the 7th International Conference (EVOLANG7), Barcelona, Spain, 12-15 March 2008,
  • 2007

  • 2007. Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
  • 2006

  • Buttery, P., 2006. Computational models for first language acquisition
  • 2005

  • Buttery, P. and Korhonen, A., 2005. Large-scale analysis of verb subcategorization differences between child directed speech and adult speech Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes,
  • Buttery, P., 2005. Charles D. Yang. Knowledge and Learning in Natural Language. Oxford University Press, 2002. ISBN 0 19 925414 1 (hardback), Price $60. ISBN 0 19 925415 X (paperback), Price $21.95, 220 pages. Nat. Lang. Eng., v. 11
    Doi: http://doi.org/10.1017/S1351324905213724
  • 2004

  • Buttery, P. and Briscoe, T., 2004. The significance of errors to parametric models of language acquisition AAAI Spring Symposium - Technical Report, v. 5
  • Buttery, P., 2004. A quantitative evaluation of naturalistic models of language acquisition; the efficiency of the Triggering Learning Algorithm compared to a Categorial Grammar Learner Coling 2004,
  • Datasets

    2023

  • Tyen, WHG., Brenchley, M., Caines, A. and Buttery, P., 2023. Research data supporting "Towards an open-domain chatbot for language practice"
    Doi: http://doi.org/10.17863/CAM.90764
  • Theses / dissertations

    2022 (No publication date)

  • Moore, R., 2022 (No publication date). Skill embeddings: artificial neural network representations for pedagogical policy development.
    Doi: http://doi.org/10.17863/CAM.90433
  • Book chapters

    2018

  • Caines, A., McCarthy, M. and Buttery, P., 2018. 'You still talking to me?': The zero auxiliary progressive in spoken British english twenty years on
  • 2017

  • Caines, A. and Buttery, P., 2017. The Effect of Task and Topic on Opportunity of Use in Learner Corpora
  • 2012 (No publication date)

  • Buttery, PJ., McCarthy, M. and Carter, R., 2012 (No publication date). Chatting in the academy: informality in spoken academic discourse
  • 2012

  • Caines, A. and Buttery, P., 2012. Normalising frequency counts to account for ‘opportunity of use’ in learner corpora
  • 2011

  • Buttery, PJ. and McCarthy, M., 2011. Lexis in Spoken Discourse.
  • 2008

  • Briscoe, E. and Buttery PJ, , 2008. The evolution of language. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY
  • Reports

    2017

  • Caines, AP., Nicholls, D. and Buttery, P., 2017. Annotating errors and disfluencies in transcriptions of speech

  • Read more at: Dr Marie-Françoise Besnier

    Dr Marie-Françoise Besnier

    The transliteration and translation of texts of the omen corpus; Akkadian


    What we do

    Cambridge Language Sciences is an Interdisciplinary Research Centre at the University of Cambridge. Our virtual network connects researchers from five schools across the university as well as other world-leading research institutions. Our aim is to strengthen research collaborations and knowledge transfer across disciplines in order to address large-scale multi-disciplinary research challenges relating to language research.

    JOIN OUR NETWORK

    JOIN OUR MAILING LIST

    CONTACT US