skip to content

Cambridge Language Sciences

Interdisciplinary Research Centre
 

Biography

See my other page for more details.

Publications (from Symplectic)

Datasets

2023

  • Tyen, WHG., Brenchley, M., Caines, A. and Buttery, P., 2023. Research data supporting "Towards an open-domain chatbot for language practice"
    Doi: http://doi.org/10.17863/CAM.90764
  • Journal articles

    2023

  • Benedetto, L., Cremonesi, P., Caines, A., Buttery, P., Cappelli, A., Giussani, A. and Turrin, R., 2023. A Survey on Recent Approaches to Question Difficulty Estimation from Text ACM Computing Surveys, v. 55
    Doi: 10.1145/3556538
  • Goriely, Z., Caines, A. and Buttery, P., 2023. Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues. J Child Lang,
    Doi: http://doi.org/10.1017/S0305000923000491
  • 2022

  • Elliott, M. and Buttery, P., 2022. Non-iterative Conditional Pairwise Estimation for the Rating Scale Model. Educ Psychol Meas, v. 82
    Doi: http://doi.org/10.1177/00131644211046253
  • 2021

  • Katushemererwe, F., Caines, A. and Buttery, P., 2021. Building natural language processing tools for Runyakitara Applied Linguistics Review, v. 12
    Doi: http://doi.org/10.1515/applirev-2020-2004
  • 2019

  • Caines, A., Altmann-Richer, E. and Buttery, P., 2019. The cross-linguistic performance of word segmentation models over time. J Child Lang, v. 46
    Doi: http://doi.org/10.1017/S0305000919000485
  • 2018

  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, PJ., 2018. Automatically identifying the function and intent of posts in underground forums Crime Science, v. 7
    Doi: http://doi.org/10.1186/s40163-018-0094-4
  • 2017

  • Bentz, C., Alikaniotis, D., Samardžić, T. and Buttery, P., 2017. Variation in Word Frequency Distributions: Definitions, Measures and Implications for a Corpus-Based Language Typology Journal of Quantitative Linguistics, v. 24
    Doi: http://doi.org/10.1080/09296174.2016.1265792
  • 2015

  • Thwaites, A., Nimmo-Smith, I., Fonteneau, E., Patterson, RD., Buttery, P. and Marslen-Wilson, WD., 2015. Tracking cortical entrainment in neural activity: auditory processes in human temporal cortex. Front Comput Neurosci, v. 9
    Doi: http://doi.org/10.3389/fncom.2015.00005
  • Bentz, C., Verkerk, A., Kiela, D., Hill, F. and Buttery, P., 2015. Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms. PLoS One, v. 10
    Doi: http://doi.org/10.1371/journal.pone.0128254
  • 2014

  • Bentz, C., Kiela, D., Hill, F. and Buttery, P., 2014. Zipf's law and the grammar of languages: A quantitative study of old and modern English parallel texts Corpus Linguistics and Linguistic Theory, v. 10
    Doi: http://doi.org/10.1515/cllt-2014-0009
  • 2012 (No publication date)

  • Rice, A., Buttery, P., Rai, IA. and Beresford, A., 2012 (No publication date). Language learning on a next-generation service platform for Africa
  • 2011

  • Andersen, Ø., Briscoe, T., Buttery, P., Carroll, J., Medlock, B., Parish, T. and Watson, R., 2011. Text Processing Tools and Services from iLexIR Ltd
  • McEntyre, JR., Ananiadou, S., Andrews, S., Black, WJ., Boulderstone, R., Buttery, P., Chaplin, D., Chevuru, S., Cobley, N., Coleman, LA., Davey, P., Gupta, B., Haji-Gholam, L., Hawkins, C., Horne, A., Hubbard, SJ., Kim, JH., Lewin, I., Lyte, V., MacIntyre, R., Mansoor, S., Mason, L., McNaught, J., Newbold, E., Nobata, C., Ong, E., Pillai, S., Rebholz-Schuhmann, D., Rosie, H., Rowbotham, R., Rupp, CJ., Stoehr, P. and Vaughan, P., 2011. UKPMC: a full text article resource for the life sciences NUCLEIC ACIDS RES, v. 39
    Doi: http://doi.org/10.1093/nar/gkq1063
  • 2010

  • Hawkins, JA. and Buttery, P., 2010. Criterial features in learner corpora: Theory and illustrations English Profile Journal, v. 1
    Doi: http://doi.org/10.1017/S2041536210000103
  • Poornima, S., Good, J., Su, Q., Huang, CR., Chen, K., Sharma, DM., Dimitriadis, A., Plank, B., van Noord, G., Caines, A. and others, , 2010. Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground$$ Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground$$,
  • Briscoe, T., Buttery, P., Carroll, J., Medlock, B. and Watson, R., 2010. Text Processing Tools and Services from iLexIR Ltd
  • 2009

  • Hawkins, JA. and Buttery, P., 2009. Using learner language from corpora to profile levels of proficiency: Insights from the english profile programme Language Testing Matters: Investigating the wider social and educational impact of assessment,
  • 2008

  • Briscoe, T. and Buttery, P., 2008. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY The evolution of language: proceedings of the 7th International Conference (EVOLANG7), Barcelona, Spain, 12-15 March 2008,
  • 2007

  • 2007. Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
  • 2006

  • Buttery, P., 2006. Computational models for first language acquisition
  • 2005

  • Buttery, P., 2005. Charles D. Yang. Knowledge and Learning in Natural Language. Oxford University Press, 2002. ISBN 0 19 925414 1 (hardback), Price $60. ISBN 0 19 925415 X (paperback), Price $21.95, 220 pages. Nat. Lang. Eng., v. 11
    Doi: http://doi.org/10.1017/S1351324905213724
  • Buttery, P. and Korhonen, A., 2005. Large-scale analysis of verb subcategorization differences between child directed speech and adult speech Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes,
  • 2004

  • Buttery, P., 2004. A quantitative evaluation of naturalistic models of language acquisition; the efficiency of the Triggering Learning Algorithm compared to a Categorial Grammar Learner Coling 2004,
  • Buttery, P. and Briscoe, T., 2004. The significance of errors to parametric models of language acquisition AAAI Spring Symposium - Technical Report, v. 5
  • Theses / dissertations

    2022 (No publication date)

  • Moore, R., 2022 (No publication date). Skill embeddings: artificial neural network representations for pedagogical policy development.
    Doi: http://doi.org/10.17863/CAM.90433
  • Conference proceedings

    2022

  • Tyen, G., Brenchley, M., Caines, A. and Buttery, P., 2022. Towards an open-domain chatbot for language practice BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Pete, I., Hughes, J., Caines, A., Vu, AV., Gupta, H., Hutchings, A., Anderson, R. and Buttery, P., 2022. PostCog: A tool for interdisciplinary research into underground forums at scale Proceedings - 7th IEEE European Symposium on Security and Privacy Workshops, Euro S and PW 2022,
    Doi: http://doi.org/10.1109/EuroSPW55150.2022.00016
  • Wambsganss, T., Caines, A. and Buttery, P., 2022. ALEN App: Persuasive Writing Support To Foster English Language Learning BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Rietsche, R., Caines, A., Schramm, C., Pfütze, D. and Buttery, P., 2022. The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Felice, M., Taslimipoor, S. and Buttery, P., 2022. Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Felice, M., Taslimipoor, S., Andersen, ØE. and Buttery, P., 2022. CEPOC: The Cambridge Exams Publishing Open Cloze dataset 2022 Language Resources and Evaluation Conference, LREC 2022,
  • Davis, C., Bryant, C., Caines, A., Rei, M. and Buttery, P., 2022. Probing for targeted syntactic knowledge through grammatical error detection CoNLL 2022 - 26th Conference on Computational Natural Language Learning, Proceedings of the Conference,
  • 2020

  • Zaidi, A., Caines, A., Moore, R., Buttery, P. and Rice, A., 2020. Adaptive Forgetting Curves for Spaced Repetition Language Learning Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 12164 LNAI
    Doi: http://doi.org/10.1007/978-3-030-52240-7_65
  • Craighead, H., Caines, A., Buttery, P. and Yannakoudakis, H., 2020. Investigating the effect of auxiliary objectives for the automated grading of learner english speech transcriptions Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Hughes, J., Aycock, S., Caines, A., Buttery, P. and Hutchings, A., 2020. Detecting Trending Terms in Cybersecurity Forum Discussions Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020),
    Doi: 10.18653/v1/2020.wnut-1.15
  • Caines, A., Bentz, C., Knill, K., Rei, M. and Buttery, P., 2020. Grammatical error detection in transcriptions of spoken English COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference,
  • Caines, A. and Buttery, P., 2020. REPROLANG 2020: Automatic proficiency scoring of Czech, English, German, Italian, and Spanish learner essays LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings,
  • Caines, A., Bentz, C., Knill, K., Rei, M. and Buttery, P., 2020. Grammatical error detection in transcriptions of spoken English Proceedings of the 28th International Conference on Computational Linguistics,
    Doi: http://doi.org/10.18653/v1/2020.coling-main.195
  • 2019

  • Aglionby, G., Davis, C., Mishra, P., Caines, A., Yannakoudakis, H., Rei, M., Shutova, E. and Buttery, P., 2019. CAMsterdam at SemEval-2019 task 6: Neural and graph-based feature extraction for the identification of offensive tweets NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop,
  • Moore, R., Caines, A., Rice, A. and Buttery, P., 2019. Behavioural cloning of teachers for automatic homework selection Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11625 LNAI
    Doi: http://doi.org/10.1007/978-3-030-23204-7_28
  • Moore, R., Caines, A., Elliott, M., Zaidi, A., Rice, A. and Buttery, P., 2019. Skills embeddings: A neural approach to multicomponent representations of students and tasks EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Zaidi, AH., Caines, A., Davis, C., Moore, R., Buttery, P. and Rice, A., 2019. Accurate modelling of language learning tasks and students using representations of grammatical proficiency EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Felice, M. and Buttery, P., 2019. Entropy as a proxy for gap complexity in open cloze tests International Conference Recent Advances in Natural Language Processing, RANLP, v. 2019-September
    Doi: http://doi.org/10.26615/978-954-452-056-4_037
  • 2018

  • Pastrana, S., Hutchings, A., Caines, A. and Buttery, P., 2018. Characterizing eve: Analysing cybercrime actors in a large underground forum Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11050 LNCS
    Doi: http://doi.org/10.1007/978-3-030-00470-5_10
  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, P., 2018. Aggressive language in an online hacking forum 2nd Workshop on Abusive Language Online - Proceedings of the Workshop, co-located with EMNLP 2018,
  • 2017

  • Graham, C., Buttery, P. and Nolan, F., 2017. Vowel characteristics in the assessment of L2 English pronunciation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-1630
  • Flint, E., Ford, E., Thomas, O., Caines, A. and Buttery, P., 2017. A Text Normalisation System for Non-Standard English Words 3rd Workshop on Noisy User-Generated Text, W-NUT 2017 - Proceedings of the Workshop,
  • Caines, A., Flint, E. and Buttery, P., 2017. Collecting fluency corrections for spoken learner english EMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop,
  • Caines, A., McCarthy, M. and Buttery, P., 2017. Parsing transcripts of speech EMNLP 2017 - 1st Workshop on Speech-Centric Natural Language Processing, SCNLP 2017 - Proceedings of the Workshop,
  • 2016

  • Zhang, W., Caines, A., Alikaniotis, D. and Buttery, P., 2016. Predicting author age from Weibo microblog posts Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • Caines, A., Bentz, C., Graham, C., Polzehl, T. and Buttery, P., 2016. Crowdsourcing a multilingual speech corpus: Recording, transcription and annotation of the CROWDED corpus Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2016. Automated speech-unit delimitation in spoken learner English COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • 2015

  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2015. Incremental dependency parsing and disfluency detection in spoken learner English Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9302
    Doi: http://doi.org/10.1007/978-3-319-24033-6_53
  • 2012

  • Caines, A. and Buttery, P., 2012. Annotating progressive aspect constructions in the spoken section of the british national Corpus Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
  • Buttery, P. and Caines, A., 2012. Reclassifying subcategorization frames for experimental analysis and stimulus generation Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
  • 2010

  • Caines, A. and Buttery, P., 2010. ‘You talking to me?’ A predictive model for zero auxiliary constructions Proceedings of the Workshop on Natural Language Processing and Linguistics, Finding the Common Ground, Annual Meeting of the Association for Computational Linguistics,
  • Thwaites, A., Geertzen, J., Marslen-Wilson, WD. and Buttery, P., 2010. LIPS: a tool for predicting the lexical isolation point of a word
  • Thwaites, A., Geertzen, J., Marslen-Wilson, WD. and Buttery, P., 2010. LIPS: A Tool for Predicting the Lexical Isolation Point of a Word Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10),
  • Williams, C., Thwaites, A., Buttery, P., Geertzen, J., Randall, B., Shafto, M., Devereux, B. and Tyler, L., 2010. The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10),
  • 2009

  • Vlachos, A., Buttery, P., Séaghdha, DO. and Briscoe, T., 2009. Biomedical event extraction without training data Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task,
  • 2007

  • Buttery, P. and Korhonen, A., 2007. I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition,
  • Book chapters

    2018

  • Caines, A., McCarthy, M. and Buttery, P., 2018. 'You still talking to me?': The zero auxiliary progressive in spoken British english twenty years on
  • Caines, A. and Buttery, P., 2018. The Effect of Task and Topic on Opportunity of Use in Learner Corpora
  • 2012 (No publication date)

  • Buttery, PJ., McCarthy, M. and Carter, R., 2012 (No publication date). Chatting in the academy: informality in spoken academic discourse
  • 2012

  • Caines, A. and Buttery, P., 2012. Normalising frequency counts to account for ‘opportunity of use’ in learner corpora
  • 2011

  • Buttery, PJ. and McCarthy, M., 2011. Lexis in Spoken Discourse.
  • 2008

  • Briscoe, E. and Buttery PJ, , 2008. The evolution of language. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY
  • Reports

    2017

  • Caines, AP., Nicholls, D. and Buttery, P., 2017. Annotating errors and disfluencies in transcriptions of speech
  • Professor of Language and Machine Learning
    Dr Paula  Buttery

    Contact Details

    Email address: 
    (+44) (0)1223 763832

    Affiliations

    Classifications: