
Cambridge Language Sciences

Interdisciplinary Research Centre
 

Professor Paula Buttery

Co-Director of Cambridge Language Sciences; Lead Scientific Adviser for Cambridge University Press & Assessment; Director of the Cambridge Institute for Automated Language Teaching and Assessment; Machine Learning for Natural Language Processing; Cognitive Models of Language; Low Resource Language.

Conference proceedings

2024

  • Gherardi, E., Benedetto, L., Matera, M. and Buttery, P., 2024. Using Knowledge Graphs to Improve Question Difficulty Estimation from Text Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 14830 LNAI
    Doi: http://doi.org/10.1007/978-3-031-64299-9_24
2023

  • Diehl Martinez, R., Goriely, Z., McGovern, H., Davis, C., Caines, A., Buttery, P. and Beinborn, L., 2023. CLIMB – Curriculum Learning for Infant-inspired Model Building CoNLL 2023 - BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, Proceedings,
2022

  • Tyen, G., Brenchley, M., Caines, A. and Buttery, P., 2022. Towards an open-domain chatbot for language practice BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
    Doi: 10.18653/v1/2022.bea-1.28
  • Pete, I., Hughes, J., Caines, A., Vu, AV., Gupta, H., Hutchings, A., Anderson, R. and Buttery, P., 2022. PostCog: A tool for interdisciplinary research into underground forums at scale Proceedings - 7th IEEE European Symposium on Security and Privacy Workshops, Euro S and PW 2022,
    Doi: http://doi.org/10.1109/EuroSPW55150.2022.00016
  • Wambsganss, T., Caines, A. and Buttery, P., 2022. ALEN App: Persuasive Writing Support To Foster English Language Learning BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Rietsche, R., Caines, A., Schramm, C., Pfütze, D. and Buttery, P., 2022. The Specificity and Helpfulness of Peer-to-Peer Feedback in Higher Education BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings,
  • Felice, M., Taslimipoor, S. and Buttery, P., 2022. Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2022.findings-acl.100
  • Felice, M., Taslimipoor, S., Andersen, ØE. and Buttery, P., 2022. CEPOC: The Cambridge Exams Publishing Open Cloze dataset 2022 Language Resources and Evaluation Conference, LREC 2022,
  • Davis, C., Bryant, C., Caines, A., Rei, M. and Buttery, P., 2022. Probing for targeted syntactic knowledge through grammatical error detection CoNLL 2022 - 26th Conference on Computational Natural Language Learning, Proceedings of the Conference,
2020

  • Zaidi, A., Caines, A., Moore, R., Buttery, P. and Rice, A., 2020. Adaptive Forgetting Curves for Spaced Repetition Language Learning Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 12164 LNAI
    Doi: http://doi.org/10.1007/978-3-030-52240-7_65
  • Craighead, H., Caines, A., Buttery, P. and Yannakoudakis, H., 2020. Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2020.acl-main.206
  • Hughes, J., Aycock, S., Caines, A., Buttery, P. and Hutchings, A., 2020. Detecting Trending Terms in Cybersecurity Forum Discussions Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020),
    Doi: 10.18653/v1/2020.wnut-1.15
  • Caines, A. and Buttery, P., 2020. REPROLANG 2020: Automatic proficiency scoring of Czech, English, German, Italian, and Spanish learner essays LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings,
  • Caines, A., Bentz, C., Knill, K., Rei, M. and Buttery, P., 2020. Grammatical error detection in transcriptions of spoken English Proceedings of the 28th International Conference on Computational Linguistics,
    Doi: 10.18653/v1/2020.coling-main.195
2019

  • Aglionby, G., Davis, C., Mishra, P., Caines, A., Yannakoudakis, H., Rei, M., Shutova, E. and Buttery, P., 2019. CAMsterdam at SemEval-2019 task 6: Neural and graph-based feature extraction for the identification of offensive tweets NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop,
  • Moore, R., Caines, A., Rice, A. and Buttery, P., 2019. Behavioural cloning of teachers for automatic homework selection Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11625 LNAI
    Doi: http://doi.org/10.1007/978-3-030-23204-7_28
  • Moore, R., Caines, A., Elliott, M., Zaidi, A., Rice, A. and Buttery, P., 2019. Skills embeddings: A neural approach to multicomponent representations of students and tasks EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Zaidi, AH., Caines, A., Davis, C., Moore, R., Buttery, P. and Rice, A., 2019. Accurate modelling of language learning tasks and students using representations of grammatical proficiency EDM 2019 - Proceedings of the 12th International Conference on Educational Data Mining,
  • Felice, M. and Buttery, P., 2019. Entropy as a proxy for gap complexity in open cloze tests International Conference Recent Advances in Natural Language Processing, RANLP, v. 2019-September
    Doi: http://doi.org/10.26615/978-954-452-056-4_037
2018

  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, P., 2018. Aggressive language in an online hacking forum 2nd Workshop on Abusive Language Online - Proceedings of the Workshop, co-located with EMNLP 2018,
  • Pastrana, S., Hutchings, A., Caines, A. and Buttery, P., 2018. Characterizing Eve: Analysing cybercrime actors in a large underground forum Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 11050 LNCS
    Doi: http://doi.org/10.1007/978-3-030-00470-5_10
2017

  • Graham, C., Buttery, P. and Nolan, F., 2017. Vowel characteristics in the assessment of L2 English pronunciation Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 08-12-September-2016
    Doi: http://doi.org/10.21437/Interspeech.2016-1630
  • Flint, E., Ford, E., Thomas, O., Caines, A. and Buttery, P., 2017. A Text Normalisation System for Non-Standard English Words 3rd Workshop on Noisy User-Generated Text, W-NUT 2017 - Proceedings of the Workshop,
  • Caines, A., Flint, E. and Buttery, P., 2017. Collecting fluency corrections for spoken learner English EMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop,
  • Caines, A., McCarthy, M. and Buttery, P., 2017. Parsing transcripts of speech EMNLP 2017 - 1st Workshop on Speech-Centric Natural Language Processing, SCNLP 2017 - Proceedings of the Workshop,
2016

  • Zhang, W., Caines, A., Alikaniotis, D. and Buttery, P., 2016. Predicting author age from Weibo microblog posts Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • Caines, A., Bentz, C., Graham, C., Polzehl, T. and Buttery, P., 2016. Crowdsourcing a multilingual speech corpus: Recording, transcription and annotation of the CROWDED corpus Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016,
  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2016. Automated speech-unit delimitation in spoken learner English COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
2015

  • Moore, R., Caines, A., Graham, C. and Buttery, P., 2015. Incremental dependency parsing and disfluency detection in spoken learner English Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9302
    Doi: http://doi.org/10.1007/978-3-319-24033-6_53
2012

  • Caines, A. and Buttery, P., 2012. Annotating progressive aspect constructions in the spoken section of the British National Corpus Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
  • Buttery, P. and Caines, A., 2012. Reclassifying subcategorization frames for experimental analysis and stimulus generation Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012,
2010

  • Caines, A. and Buttery, P., 2010. ‘You talking to me?’ A predictive model for zero auxiliary constructions Proceedings of the Workshop on Natural Language Processing and Linguistics, Finding the Common Ground, Annual Meeting of the Association for Computational Linguistics,
  • Thwaites, A., Geertzen, J., Marslen-Wilson, WD. and Buttery, P., 2010. LIPS: A tool for predicting the lexical isolation point of a word Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010,
  • Williams, C., Thwaites, A., Buttery, P., Geertzen, J., Randall, B., Shafto, M., Devereux, B. and Tyler, L., 2010. The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10),
2009

  • Vlachos, A., Buttery, P., Séaghdha, DO. and Briscoe, T., 2009. Biomedical event extraction without training data Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task,
2007

  • Buttery, P. and Korhonen, A., 2007. I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition,
Datasets

2023

  • Tyen, WHG., Brenchley, M., Caines, A. and Buttery, P., 2023. Research data supporting "Towards an open-domain chatbot for language practice"
    Doi: http://doi.org/10.17863/CAM.90764
Journal articles

2023

  • Benedetto, L., Cremonesi, P., Caines, A., Buttery, P., Cappelli, A., Giussani, A. and Turrin, R., 2023. A Survey on Recent Approaches to Question Difficulty Estimation from Text ACM Computing Surveys, v. 55
    Doi: 10.1145/3556538
  • Goriely, Z., Caines, A. and Buttery, P., 2023. Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues. J Child Lang,
    Doi: http://doi.org/10.1017/S0305000923000491
2022

  • Elliott, M. and Buttery, P., 2022. Non-iterative Conditional Pairwise Estimation for the Rating Scale Model. Educ Psychol Meas, v. 82
    Doi: http://doi.org/10.1177/00131644211046253
2021

  • Katushemererwe, F., Caines, A. and Buttery, P., 2021. Building natural language processing tools for Runyakitara Applied Linguistics Review, v. 12
    Doi: http://doi.org/10.1515/applirev-2020-2004
2019

  • Caines, A., Altmann-Richer, E. and Buttery, P., 2019. The cross-linguistic performance of word segmentation models over time. J Child Lang, v. 46
    Doi: http://doi.org/10.1017/S0305000919000485
2018

  • Caines, A., Pastrana, S., Hutchings, A. and Buttery, PJ., 2018. Automatically identifying the function and intent of posts in underground forums Crime Science, v. 7
    Doi: http://doi.org/10.1186/s40163-018-0094-4
2017

  • Bentz, C., Alikaniotis, D., Samardžić, T. and Buttery, P., 2017. Variation in Word Frequency Distributions: Definitions, Measures and Implications for a Corpus-Based Language Typology Journal of Quantitative Linguistics, v. 24
    Doi: http://doi.org/10.1080/09296174.2016.1265792
2015

  • Thwaites, A., Nimmo-Smith, I., Fonteneau, E., Patterson, RD., Buttery, P. and Marslen-Wilson, WD., 2015. Tracking cortical entrainment in neural activity: auditory processes in human temporal cortex. Front Comput Neurosci, v. 9
    Doi: http://doi.org/10.3389/fncom.2015.00005
  • Bentz, C., Verkerk, A., Kiela, D., Hill, F. and Buttery, P., 2015. Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms. PLoS One, v. 10
    Doi: http://doi.org/10.1371/journal.pone.0128254
2014

  • Bentz, C., Kiela, D., Hill, F. and Buttery, P., 2014. Zipf's law and the grammar of languages: A quantitative study of Old and Modern English parallel texts Corpus Linguistics and Linguistic Theory, v. 10
    Doi: http://doi.org/10.1515/cllt-2014-0009
2012 (No publication date)

  • Rice, A., Buttery, P., Rai, IA. and Beresford, A., 2012 (No publication date). Language learning on a next-generation service platform for Africa
2011

  • Andersen, Ø., Briscoe, T., Buttery, P., Carroll, J., Medlock, B., Parish, T. and Watson, R., 2011. Text Processing Tools and Services from iLexIR Ltd
  • McEntyre, JR., Ananiadou, S., Andrews, S., Black, WJ., Boulderstone, R., Buttery, P., Chaplin, D., Chevuru, S., Cobley, N., Coleman, LA., Davey, P., Gupta, B., Haji-Gholam, L., Hawkins, C., Horne, A., Hubbard, SJ., Kim, JH., Lewin, I., Lyte, V., MacIntyre, R., Mansoor, S., Mason, L., McNaught, J., Newbold, E., Nobata, C., Ong, E., Pillai, S., Rebholz-Schuhmann, D., Rosie, H., Rowbotham, R., Rupp, CJ., Stoehr, P. and Vaughan, P., 2011. UKPMC: a full text article resource for the life sciences NUCLEIC ACIDS RES, v. 39
    Doi: http://doi.org/10.1093/nar/gkq1063
2010

  • Hawkins, JA. and Buttery, P., 2010. Criterial features in learner corpora: Theory and illustrations English Profile Journal, v. 1
    Doi: http://doi.org/10.1017/S2041536210000103
  • Poornima, S., Good, J., Su, Q., Huang, CR., Chen, K., Sharma, DM., Dimitriadis, A., Plank, B., van Noord, G., Caines, A. and others, 2010. Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground
  • Briscoe, T., Buttery, P., Carroll, J., Medlock, B. and Watson, R., 2010. Text Processing Tools and Services from iLexIR Ltd
2009

  • Hawkins, JA. and Buttery, P., 2009. Using learner language from corpora to profile levels of proficiency: Insights from the English Profile Programme Language Testing Matters: Investigating the wider social and educational impact of assessment,
2008

  • Briscoe, T. and Buttery, P., 2008. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY The evolution of language: proceedings of the 7th International Conference (EVOLANG7), Barcelona, Spain, 12-15 March 2008,
2007

  • 2007. Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
2006

  • Buttery, P., 2006. Computational models for first language acquisition
2005

  • Buttery, P., 2005. Charles D. Yang. Knowledge and Learning in Natural Language. Oxford University Press, 2002. ISBN 0 19 925414 1 (hardback), Price $60. ISBN 0 19 925415 X (paperback), Price $21.95, 220 pages. Nat. Lang. Eng., v. 11
    Doi: http://doi.org/10.1017/S1351324905213724
  • Buttery, P. and Korhonen, A., 2005. Large-scale analysis of verb subcategorization differences between child directed speech and adult speech Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes,
2004

  • Buttery, P., 2004. A quantitative evaluation of naturalistic models of language acquisition; the efficiency of the Triggering Learning Algorithm compared to a Categorial Grammar Learner Coling 2004,
  • Buttery, P. and Briscoe, T., 2004. The significance of errors to parametric models of language acquisition AAAI Spring Symposium - Technical Report, v. 5
Theses / dissertations

2022 (No publication date)

  • Moore, R., 2022 (No publication date). Skill embeddings: artificial neural network representations for pedagogical policy development.
    Doi: http://doi.org/10.17863/CAM.90433
Book chapters

2018

  • Caines, A., McCarthy, M. and Buttery, P., 2018. 'You still talking to me?': The zero auxiliary progressive in spoken British English twenty years on
2017

  • Caines, A. and Buttery, P., 2017. The Effect of Task and Topic on Opportunity of Use in Learner Corpora
2012 (No publication date)

  • Buttery, PJ., McCarthy, M. and Carter, R., 2012 (No publication date). Chatting in the academy: informality in spoken academic discourse
2012

  • Caines, A. and Buttery, P., 2012. Normalising frequency counts to account for ‘opportunity of use’ in learner corpora
2011

  • Buttery, PJ. and McCarthy, M., 2011. Lexis in Spoken Discourse.
2008

  • Briscoe, E. and Buttery, PJ., 2008. The evolution of language. LINGUISTIC ADAPTATIONS FOR RESOLVING AMBIGUITY
Reports

2017

  • Caines, AP., Nicholls, D. and Buttery, P., 2017. Annotating errors and disfluencies in transcriptions of speech

Professor Ted Briscoe

Computational linguistics; speech and language processing; textual information management; evolutionary linguistics

Conference proceedings

2021

  • Dahiya, R., Nathan, A. and Occhipinti, LG., 2021. Front Matter FLEPS 2021 - IEEE International Conference on Flexible and Printable Sensors and Systems,
    Doi: http://doi.org/10.1109/FLEPS51544.2021.9469865
2019 (No publication date)

  • Farag, Y., Yannakoudakis, H. and Briscoe, T., 2019 (No publication date). Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input Proceedings of NAACL-HLT 2018, New Orleans, Louisiana, pages 263–271, v. Volume 1
    Doi: http://doi.org/10.18653/v1/N18-1024
2019

  • Bryant, C., Felice, M., Andersen, ØE. and Briscoe, T., 2019. The BEA-2019 shared task on grammatical error correction ACL 2019 - Innovative Use of NLP for Building Educational Applications, BEA 2019 - Proceedings of the 14th Workshop,
  • Xia, M., Kochmar, E. and Briscoe, T., 2019. Automatic learner summary assessment for reading comprehension. NAACL-HLT (1),
2018

  • Bryant, C. and Briscoe, T., 2018. Language model based grammatical error correction without annotated training data Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018,
    Doi: 10.18653/v1/w18-0529
  • Zhang, M., Chen, X., Cummins, R., Andersen, Ø. and Briscoe, T., 2018. The effect of adding authorship knowledge in automated text scoring Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018,
    Doi: 10.18653/v1/w18-0536
2017

  • Rei, M., Felice, M., Yuan, Z. and Briscoe, T., 2017. Artificial Error Generation with Machine Translation and Syntactic Patterns Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications,
  • Farag, Y., Rei, M. and Briscoe, T., 2017. An Error-Oriented Approach to Word Embedding Pre-Training Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications,
  • Bryant, CJ., Felice, M. and Briscoe, E., 2017. Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, v. 1
  • Zaidi, AH., Moore, R. and Briscoe, T., 2017. Curriculum Q-Learning for Visual Vocabulary Acquisition
2016

  • Yuan, Z. and Briscoe, T., 2016. Grammatical error correction using neural machine translation 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/n16-1042
  • Cummins, R., Zhang, M. and Briscoe, T., 2016. Constrained multi-task learning for automated essay scoring 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 2
    Doi: http://doi.org/10.18653/v1/p16-1075
  • Xia, M., Kochmar, E. and Briscoe, E., 2016. Text Readability Assessment for Second Language Learners Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications,
  • Yuan, Z., Briscoe, T. and Felice, M., 2016. Candidate re-ranking for SMT-based grammatical error correction Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016,
    Doi: 10.18653/v1/w16-0530
  • Cummins, R., Yannakoudakis, H. and Briscoe, T., 2016. Unsupervised modeling of topical relevance in L2 learner text Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016,
    Doi: 10.18653/v1/w16-0510
2015

  • Kochmar, E. and Briscoe, T., 2015. Using learner data to improve error correction in adjective-noun combinations 10th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2015 at the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015,
    Doi: 10.3115/v1/w15-0627
  • Felice, M. and Briscoe, T., 2015. Towards a standard evaluation method for grammatical error detection and correction NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference,
    Doi: http://doi.org/10.3115/v1/n15-1060
2014

  • Ng, HT., Wu, SM., Briscoe, T., Hadiwinoto, C., Susanto, RH. and Bryant, C., 2014. The CoNLL-2014 shared task on grammatical error correction CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings of the Shared Task,
    Doi: http://doi.org/10.3115/v1/w14-1701
  • Wang, Y., Wang, L., Zeng, X., Wong, DF., Chao, LS. and Lu, Y., 2014. Factored Statistical Machine Translation for Grammatical Error Correction. CoNLL Shared Task,
  • Rei, M. and Briscoe, T., 2014. Parser lexicalisation through self-learning NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference,
  • Kochmar, E. and Briscoe, T., 2014. Detecting learner errors in the choice of content words using compositional distributional semantics COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers,
  • Felice, M., Yuan, Z., Andersen, ØE., Yannakoudakis, H. and Kochmar, E., 2014. Grammatical error correction using hybrid systems and type filtering CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings of the Shared Task,
    Doi: http://doi.org/10.3115/v1/w14-1702
  • Rei, M. and Briscoe, T., 2014. Looking for Hyponyms in Vector Space. CoNLL,
  • 2014. Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task, CoNLL 2014, Baltimore, Maryland, USA, June 26-27, 2014 CoNLL Shared Task,
2013

  • Rei, M. and Briscoe, T., 2013. Parser lexicalisation through self-learning Proceedings of the North American Conference of the Association for Computational Linguistics,
  • Kochmar, E. and Briscoe, T., 2013. Capturing anomalies in the choice of content words in compositional distributional semantic space International Conference Recent Advances in Natural Language Processing, RANLP,
  • Rei, M. and Briscoe, T., 2013. Parser lexicalisation through self-learning Proceedings of the 2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013,
2012 (No publication date)

  • Yannakoudakis, H., Briscoe, E. and Alexopoulou, T., 2012 (No publication date). Automating Second Language Acquisition Research: Integrating Information Visualisation and Machine Learning EACL,
  • Kochmar, E., Andersen, O. and Briscoe, T., 2012 (No publication date). HOO 2012 Error Recognition and Correction Shared Task: Cambridge University Submission Report
  • Yannakoudakis, H. and Briscoe, T., 2012 (No publication date). Modeling coherence in ESOL learner texts
2012

  • Kochmar, E., Andersen, O. and Briscoe, E., 2012. HOO 2012 Error Recognition and Correction Shared Task: Cambridge University Submission Report Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, http://aclweb.org/anthology/W12-2028
2011

  • Rei, M. and Briscoe, T., 2011. Unsupervised Entailment Detection between Dependency Graph Fragments
    Doi: http://doi.org/10.17863/CAM.21358
  • Yannakoudakis, H., Briscoe, T. and Medlock, B., 2011. A new dataset and method for automatically grading ESOL texts Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1,
2010

  • Rei, M. and Briscoe, T., 2010. Combining Manual Rules and Supervised Learning for Hedge Cue and Scope Detection CoNLL 2010 - 14th Conference on Computational Natural Language Learning: Shared Task, Proceedings,
  • Vlachos, A., Ghahramani, Z. and Briscoe, T., 2010. Active learning for constrained Dirichlet process mixture models Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics,
  • Briscoe, T., Harrison, K., Naish-Guzman, A., Parker, A., Siddharthan, A., Sinclair, D., Slater, M. and Watson, R., 2010. Camtology: intelligent information access for science Proceedings of the NAACL HLT 2010 Demonstration Session,
2009

  • Briscoe, T., 2009. What Can Formal or Computational Models Tell Us about How (Much) Language Shaped the Brain? BIOLOGICAL FOUNDATIONS AND ORIGIN OF SYNTAX,
  • Vlachos, A., Buttery, P., Séaghdha, DO. and Briscoe, T., 2009. Biomedical event extraction without training data Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task,
  • Jaeger, H., Steels, L., Baronchelli, A., Briscoe, T., Christiansen, MH., Griffiths, T., Jäger, G., Kirby, S., Komarova, NL., Richerson, PJ. and Triesch, J., 2009. What Can Mathematical, Computational, and Robotic Models Tell Us about the Origins of Syntax? BIOLOGICAL FOUNDATIONS AND ORIGIN OF SYNTAX,
2008

  • Andersen, ØE., Nioche, J., Briscoe, T. and Carroll, JA., 2008. The BNC Parsed with RASP4UIMA. LREC,
  • Briscoe, T., Gasperin, C., Lewin, I. and Vlachos, A., 2008. Bootstrapping an interactive information extraction system for FlyBase curation. Ontologies and Text Mining for Life Sciences, v. 08131
2007

  • Medlock, B. and Briscoe, T., 2007. Weakly Supervised Learning for Hedge Classification in Scientific Literature. ACL,
  • Preiss, J., Briscoe, T. and Korhonen, A., 2007. A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. ACL,
2006

  • Briscoe, T. and Carroll, JA., 2006. Evaluating the Accuracy of an Unlexicalized Statistical Parser on the PARC DepBank. ACL,
  • Briscoe, T., Carroll, JA. and Watson, R., 2006. The Second Release of the RASP System. ACL,
2005

  • Yallop, J., Korhonen, A. and Briscoe, T., 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. ACL,
2004

  • Carroll, J. and Briscoe, T., 2004. High precision extraction of grammatical relations NEW DEVELOPMENTS IN PARSING TECHNOLOGY,
  • Preiss, J., Gasperin, C. and Briscoe, T., 2004. Can Anaphoric Definite Descriptions be Replaced by Pronouns? LREC,
2002

  • Preiss, J., Korhonen, A. and Briscoe, T., 2002. Subcategorization Acquisition as an Evaluation Method for WSD. LREC,
  • Briscoe, T. and Carroll, JA., 2002. Robust Accurate Statistical Annotation of General Text. LREC,
2001

  • Carroll, JA. and Briscoe, T., 2001. High Precision Extraction of Grammatical Relations. IWPT,
1997

  • Briscoe, T., 1997. Co-evolution of language and of the language acquisition device 35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE,
  • Briscoe, T. and Carroll, JA., 1997. Automatic Extraction of Subcategorization from Corpora. ANLP,
1992

  • COPESTAKE, A. and BRISCOE, T., 1992. LEXICAL OPERATIONS IN A UNIFICATION-BASED FRAMEWORK LEXICAL SEMANTICS AND KNOWLEDGE REPRESENTATION, v. 627
1991

  • McQueen, JM. and Briscoe, EJ., 1991. A computational tool for examining lexical segmentation in continuous speech. EUROSPEECH,
1990

  • Briscoe, T., Copestake, AA. and Boguraev, B., 1990. Enjoy the Paper: Lexicology. COLING,
  • BRISCOE, T., 1990. LEXICAL ACCESS IN CONNECTED SPEECH RECOGNITION 27TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS,
1989

  • TAYLOR, L., GROVER, C. and BRISCOE, T., 1989. THE SYNTACTIC REGULARITY OF ENGLISH NOUN PHRASES FOURTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS,
1988

  • Boguraev, B., Carroll, JA., Briscoe, T. and Grover, C., 1988. Software support for practical grammar development. COLING,
1987

  • Boguraev, B., Carter, DM. and Briscoe, T., 1987. A Multi-Purpose Interface to an On-line Dictionary. EACL,
  • Briscoe, T., 1987. Deterministic Parsing And Unbounded Dependencies. EACL,
  • Briscoe, T., Grover, C., Boguraev, B. and Carroll, JA., 1987. A Formalism and Environment for the Development of a Large Grammar of English. IJCAI,
  • Carter, DM., Boguraev, B. and Briscoe, T., 1987. Lexical stress and phonetic information: which segments are most informative? ECST,
  • Boguraev, B., Briscoe, T., Carroll, JA., Carter, DM. and Grover, C., 1987. The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English. ACL,
1985

  • Alshawi, H., Boguraev, B. and Briscoe, T., 1985. Towards A Dictionary Support Environment For Realtime Parsing. EACL,
1984

  • Briscoe, EJ. and Boguraev, B., 1984. Control Structures And Theories Of Interaction In Speech Understanding Systems. COLING,
Journal articles

2020

  • Farag, Y., Valvoda, J., Yannakoudakis, H. and Briscoe, T., 2020. Analyzing Neural Discourse Coherence Models CODI workshop in EMNLP2020,
2019

  • Xia, M., Kochmar, E. and Briscoe, T., 2019. Text Readability Assessment for Second Language Learners. CoRR, v. abs/1906.07580
  • Xia, M., Kochmar, E. and Briscoe, T., 2019. Automatic learner summary assessment for reading comprehension NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, v. 1
    Doi: 10.18653/v1/n19-1261
  • 2018

  • Yannakoudakis, H., Andersen, ØE., Geranpayeh, A., Briscoe, T. and Nicholls, D., 2018. Developing an automated writing placement system for ESL learners Applied Measurement in Education, v. 31
    Doi: http://doi.org/10.1080/08957347.2018.1464447
  • 2017

  • Farag, Y., Rei, M. and Briscoe, T., 2017. An Error-Oriented Approach to Word Embedding Pre-Training. CoRR, v. abs/1707.06841
  • Rei, M., Felice, M., Yuan, Z. and Briscoe, T., 2017. Artificial Error Generation with Machine Translation and Syntactic Patterns. CoRR, v. abs/1707.05236
  • 2016

  • Felice, M., Bryant, C. and Briscoe, T., 2016. Automatic extraction of learner errors in ESL sentences using linguistically enhanced alignments COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • 2011

  • Briscoe, T., Harrison, K., Naish, A., Parker, A., Rei, M., Siddharthan, A., Sinclair, D., Slater, M. and Watson, R., 2011. Intelligent Information Access from Scientific Papers Current Challenges in Patent Information Retrieval,
  • Rei, M. and Briscoe, T., 2011. Unsupervised entailment detection between dependency graph fragments ACL HLT 2011,
  • 2010

  • Briscoe, T., Medlock, B. and Andersen, Ø., 2010. Automated assessment of ESOL free text examinations CUCL TR,
  • 2009

  • Briscoe, T., 2009. Herbert Jaeger, Luc Steels, Andrea Baronchelli, Ted Briscoe, Morten H. Christiansen, Thomas Griffiths, Gerhard Jäger, Simon Kirby, Natalia L. Komarova, Peter J. Richerson, and Jochen Triesch Biological foundations and origin of syntax,
  • 2008

  • Briscoe, T., 2008. Language learning, power laws, and sexual selection Mind and Society, v. 7
    Doi: http://doi.org/10.1007/s11299-007-0040-8
  • Gasperin, C. and Briscoe, T., 2008. Statistical anaphora resolution in biomedical texts Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference, v. 1
    Doi: http://doi.org/10.3115/1599081.1599114
  • Karamanis, N., Seal, R., Lewin, I., McQuilton, P., Vlachos, A., Gasperin, C., Drysdale, R. and Briscoe, T., 2008. Natural language processing in aid of FlyBase curators. BMC Bioinformatics, v. 9
    Doi: http://doi.org/10.1186/1471-2105-9-193
  • 2007

  • Karamanis, N., Lewin, I., Seal, R., Drysdale, R. and Briscoe, E., 2007. Integrating natural language processing with FlyBase curation. Pac Symp Biocomput,
  • Watson, R. and Briscoe, T., 2007. Adapting the RASP system for the CoNLL07 domain-adaptation task EMNLP-CoNLL 2007 - Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning,
  • 2006

  • Vlachos, A., Gasperin, C., Lewin, I. and Briscoe, T., 2006. Bootstrapping the recognition and anaphoric linking of named entities in Drosophila articles. Pac Symp Biocomput,
  • 2004

  • Buttery, P. and Briscoe, T., 2004. The significance of errors to parametric models of language acquisition AAAI Spring Symposium - Technical Report, v. 5
  • 2001

  • Briscoe, T., 2001. The symbolic species: The co-evolution of language and the human brain NOTES REC ROY SOC, v. 55
  • 2000

  • Briscoe, T., 2000. Grammatical acquisition: Inductive bias and coevolution of language and the language acquisition device LANGUAGE, v. 76
  • 1999

  • Briscoe, EJ., 1999. The Acquisition of Grammar in an Evolving Population of Language Agents. Electron. Trans. Artif. Intell., v. 3
  • Carroll, J., Minnen, G. and Briscoe, T., 1999. Corpus Annotation for Parser Evaluation Proceedings of the EACL99 workshop on Linguistically Interpreted Corpora (LINC), Bergen, Norway, June 12,
  • Briscoe, T. and Copestake, A., 1999. Lexical rules in constraint-based grammars COMPUT LINGUIST, v. 25
  • 1998

  • Carroll, J., Minnen, G. and Briscoe, T., 1998. Can Subcategorisation Probabilities Help a Statistical Parser? 6th Workshop on Very Large Corpora, Montreal, Canada, 1998,
  • 1997

  • Briscoe, T., 1997. Co-evolution of Language and of the Language Acquisition Device CoRR, v. cmp-lg/9705001
  • Briscoe, T. and Carroll, J., 1997. Automatic Extraction of Subcategorization from Corpora
  • 1996

  • Carroll, J. and Briscoe, T., 1996. Apportioning Development Effort in a Probabilistic LR Parsing System through Evaluation Conference on Empirical Methods in Natural Language Processing (EMNLP-96), 92-100,
  • LASCARIDES, A., BRISCOE, T., ASHER, N. and COPESTAKE, A., 1996. ORDER INDEPENDENT AND PERSISTENT TYPED DEFAULT UNIFICATION LINGUIST PHILOS, v. 19
  • 1995

  • MCQUEEN, JM., CUTLER, A., BRISCOE, T. and NORRIS, D., 1995. MODELS OF CONTINUOUS SPEECH RECOGNITION AND THE CONTENTS OF THE VOCABULARY LANG COGNITIVE PROC, v. 10
  • Copestake, A. and Briscoe, T., 1995. Semi-productive polysemy and sense extension Journal of Semantics, v. 12
    Doi: http://doi.org/10.1093/jos/12.1.15
  • Briscoe, T. and Carroll, J., 1995. Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels 4th International Workshop on Parsing Technologies (IWPT-95), 48-58,
  • 1994

  • Copestake, A., Briscoe, T., Vossen, P., Ageno, A., Castellon, I., Ribas, F., Rigau, G., Rodríguez, H. and Samiotou, A., 1994. Acquisition of lexical translation relations from MRDS Machine Translation, v. 9
    Doi: http://doi.org/10.1007/BF00980578
  • 1993

  • Briscoe, T. and Carroll, J., 1993. Generalized Probabilistic LR Parsing of Natural Language (Corpora) with Unification-Based Grammars. Comput. Linguistics, v. 19
  • 1992

  • COPESTAKE, A. and BRISCOE, T., 1992. LEXICAL OPERATIONS IN A UNIFICATION-BASED FRAMEWORK LECT NOTES ARTIF INT, v. 627
  • 1984

  • BRISCOE, EJ., 1984. PARSING NATURAL-LANGUAGE - KING,M J LINGUIST, v. 20
  • Other publications

    2018

  • Farag, Y., Yannakoudakis, H. and Briscoe, T., 2018. Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input. CoRR, v. abs/1804.06898
  • Book chapters

    2010

  • Watson, R., Briscoe, T. and Carroll, J., 2010. Semi-supervised Training of a Statistical Parser from Unlabeled Partially-Bracketed Data
    Doi: http://doi.org/10.1007/978-90-481-9352-3_16
  • Briscoe, T., 2010. Grammatical Assimilation
    Doi: http://doi.org/10.1093/acprof:oso/9780199244843.003.0016

    Professor Anna Korhonen

    Computational approaches to lexicon, syntax, semantics and discourse;
    scientific text processing and text mining;
    NLP for biomedicine;
    NLP for real-world applications;
    computational models of human language learning

    Conference proceedings

    2024

  • Parović, M., Vulić, I. and Korhonen, A., 2024. Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, v. 2
  • Hu, S., Vulić, I., Liu, F. and Korhonen, A., 2024. Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings,
  • Petti, U. and Korhonen, A., 2024. LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings,
  • Hu, S., Wang, X., Yuan, Z., Korhonen, A. and Vulić, I., 2024. DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024, v. 3
  • Qiu, Y., Zhao, Z., Ziser, Y., Korhonen, A., Ponti, EM. and Cohen, SB., 2024. Are Large Language Models Temporally Grounded? Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024, v. 1
  • Razumovskaia, E., Glavaš, G., Korhonen, A. and Vulić, I., 2024. SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024, v. 1
  • Li, Y., Korhonen, A. and Vulić, I., 2024. Self-Augmented In-Context Learning for Unsupervised Word Translation Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 2
  • Fytas, P., Breger, A., Selby, I., Baker, S., Shahipasand, S. and Korhonen, A., 2024. Can Rule-Based Insights Enhance LLMs for Radiology Report Classification? Introducing the RadPrompt Methodology. BioNLP 2024 - 23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, Proceedings of the Workshop and Shared Tasks,
  • Li, Y., Zhai, X., Alzantot, M., Yu, K., Vulić, I., Korhonen, A. and Hammad, M., 2024. CALRec: Contrastive Alignment of Generative LLMs for Sequential Recommendation RecSys 2024 - Proceedings of the 18th ACM Conference on Recommender Systems,
    Doi: http://doi.org/10.1145/3640457.3688121
  • 2023

  • Vulić, I., Glavaš, G., Liu, F., Collier, N., Ponti, EM. and Korhonen, A., 2023. Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference,
  • Yuan, Z., Hu, S., Vulić, I., Korhonen, A. and Meng, Z., 2023. Can Pretrained Language Models (Yet) Reason Deductively? EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference,
  • Liu, CC., Pfeiffer, J., Korhonen, A., Vulić, I. and Gurevych, I., 2023. Delving Deeper into Cross-lingual Visual Question Answering EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023,
  • Petti, U., Nyrup, R., Skopek, JM. and Korhonen, A., 2023. Ethical considerations in the early detection of Alzheimer's disease using speech and AI ACM International Conference Proceeding Series,
    Doi: http://doi.org/10.1145/3593013.3594063
  • Parović, M., Ansell, A., Vulić, I. and Korhonen, A., 2023. Cross-Lingual Transfer with Target Language-Ready Task Adapters Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Ansell, A., Ponti, EM., Korhonen, A. and Vulić, I., 2023. Distilling Efficient Language-Specific Models for Cross-Lingual Transfer Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Moghe, N., Razumovskaia, E., Guillou, L., Vulić, I., Korhonen, A. and Birch, A., 2023. MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Li, Y., Chang, CY., Rawls, S., Vulić, I. and Korhonen, A., 2023. Translation-Enhanced Multilingual Text-to-Image Generation Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 1
  • Köksal, A., Yalcin, OF., Akbiyik, A., Kılavuz, MT., Korhonen, A. and Schütze, H., 2023. Language-Agnostic Bias Detection in Language Models with Bias Probing Findings of the Association for Computational Linguistics: EMNLP 2023,
  • Kantharuban, A., Vulić, I. and Korhonen, A., 2023. Quantifying the Dialect Gap and its Correlates Across Languages Findings of the Association for Computational Linguistics: EMNLP 2023,
    Doi: 10.18653/v1/2023.findings-emnlp.481
  • Zhou, H., Wan, X., Vulić, I. and Korhonen, A., 2023. Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning Findings of the Association for Computational Linguistics: EMNLP 2023,
  • Qiu, Y., Ziser, Y., Korhonen, A., Ponti, EM. and Cohen, SB., 2023. Detecting and Mitigating Hallucinations in Multilingual Summarisation EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Li, Y., Korhonen, A. and Vulić, I., 2023. On Bilingual Lexicon Induction with Large Language Models EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Razumovskaia, E., Vulić, I. and Korhonen, A., 2023. Transfer-Free Data-Efficient Multilingual Slot Labeling EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Ansell, A., Parović, M., Vulić, I., Korhonen, A. and Ponti, EM., 2023. Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Hu, S., Zhou, H., Yuan, Z., Gritta, M., Zhang, G., Iacobacci, I., Korhonen, A. and Vulić, I., 2023. A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • 2022

  • Li, Y., Liu, F., Vulić, I. and Korhonen, A., 2022. Improving Bilingual Lexicon Induction with Cross-Encoder Reranking Findings of the Association for Computational Linguistics: EMNLP 2022,
  • Liu, Q., McCarthy, D. and Korhonen, A., 2022. Measuring Context-Word Biases in Lexical Semantic Datasets Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022,
  • Parović, M., Glavaš, G., Vulić, I. and Korhonen, A., 2022. BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference,
  • Li, Y., Liu, F., Collier, N., Korhonen, A. and Vulic, I., 2022. Improving Word Translation via Two-Stage Contrastive Learning Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 1
  • Ansell, A., Ponti, EM., Korhonen, A. and Vulic, I., 2022. Composable Sparse Fine-Tuning for Cross-Lingual Transfer Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 1
  • Razumovskaia, E., Vulić, I. and Korhonen, A., 2022. Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • 2021

  • Majewska, O., Vulic, I., Glavaš, G., Ponti, EM. and Korhonen, A., 2021. Verb knowledge injection for multilingual event processing ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Vulic, I., Ponti, EM., Korhonen, A. and Glavaš, G., 2021. LEXFIT: Lexical fine-tuning of pretrained language models ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Zhao, M., Zhu, Y., Shareghi, E., Vulic, I., Reichart, R., Korhonen, A. and Schütze, H., 2021. A closer look at few-shot crosslingual transfer: The choice of shots matters ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Liu, F., Vulić, I., Korhonen, A. and Collier, N., 2021. Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference, v. 2
  • Hangya, V., Liu, Q., Stojanovski, D., Fraser, A. and Korhonen, A., 2021. Improving Machine Translation of Rare and Unseen Word Senses WMT 2021 - 6th Conference on Machine Translation, Proceedings,
  • Liu, F., Vulić, I., Korhonen, A. and Collier, N., 2021. Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Liu, Q., Ponti, EM., McCarthy, D., Vulić, I. and Korhonen, A., 2021. AM2ICO: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings,
  • Liu, Q., Liu, F., Collier, N., Korhonen, A. and Vulić, I., 2021. MIRRORWIC: On Eliciting Word-in-Context Representations from Pretrained Language Models CoNLL 2021 - 25th Conference on Computational Natural Language Learning, Proceedings,
  • Ansell, A., Ponti, EM., Pfeiffer, J., Ruder, S., Glavaš, G., Vulic, I. and Korhonen, A., 2021. MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021,
  • Zhu, Y., Shareghi, E., Li, Y., Reichart, R. and Korhonen, A., 2021. Combining deep generative models and multi-lingual pretraining for semi-supervised document classification EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference,
  • 2020 (Accepted for publication)

  • Vulic, I., Ponti, E., Litschko, R., Glavas, G. and Korhonen, A., 2020 (Accepted for publication). Probing Pretrained Language Models for Lexical Semantics Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020),
    Doi: http://doi.org/10.18653/v1/2020.emnlp-main.586
  • 2020

  • Dubossarsky, H., Vulic, I., Reichart, R. and Korhonen, A., 2020. The Secret is in the Spectra: Predicting Cross-Lingual Task Performance with Spectral Similarity Measures Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020),
    Doi: http://doi.org/10.18653/v1/2020.emnlp-main.186
  • Gerz, D., Vulić, I., Rei, M., Reichart, R. and Korhonen, A., 2020. Multidirectional associative optimization of function-specific word representations Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Ponti, E., Glavaš, G., Majewska, O., Liu, Q., Vulic, I. and Korhonen, A., 2020. XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020),
    Doi: http://doi.org/10.18653/v1/2020.emnlp-main.185
  • Sasano, R. and Korhonen, A., 2020. Investigating word-class distributions in word vector spaces Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2020.acl-main.337
  • Vulić, I., Korhonen, A. and Glavaš, G., 2020. Improving bilingual lexicon induction with unsupervised post-processing of monolingual word vector spaces Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/2020.repl4nlp-1.7
  • Karan, M., Vulić, I., Korhonen, A. and Glavaš, G., 2020. Classification-based self-learning for weakly supervised bilingual lexicon induction Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Majewska, O., Vulic, I., McCarthy, D. and Korhonen, A., 2020. Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020),
  • Li, Y., Ponti, E., Vulic, I. and Korhonen, A., 2020. Emergent Communication Pretraining for Few-Shot Machine Translation Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020),
  • Liu, Q., McCarthy, D. and Korhonen, A., 2020. Towards better context-aware lexical semantics: Adjusting contextualized representations through static anchors EMNLP 2020 - 2020 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • Lauscher, A., Vulic, I., Ponti, E., Korhonen, A. and Glavas, G., 2020. Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020),
  • Glavas, G., Vulic, I., Korhonen, A. and Ponzetto, SP., 2020. SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020),
  • Korhonen, A. and Traum, D., 2020. Message from the program chairs ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference,
  • Majewska, O., McCarthy, D., van den Bosch, J., Kriegeskorte, N., Vulic, I. and Korhonen, A., 2020. Spatial multi-arrangement for clustering and multi-way similarity dataset construction LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings,
  • 2019 (Accepted for publication)

  • Shareghi, E., Gerz, D., Vulic, I. and Korhonen, A., 2019 (Accepted for publication). Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines
    Doi: http://doi.org/10.17863/CAM.39778
  • 2019

  • Liu, Q., McCarthy, D., Vulić, I. and Korhonen, A., 2019. Investigating cross-lingual alignment methods for contextualized embeddings with Token-level evaluation CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference,
  • Ponti, EM., Vulić, I., Cotterell, R., Reichart, R. and Korhonen, A., 2019. Towards zero-shot language modeling EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Ponti, EM., Vulić, I., Glavaš, G., Reichart, R. and Korhonen, A., 2019. Cross-lingual semantic specialization via lexical relation induction EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Vulić, I., Glavaš, G., Reichart, R. and Korhonen, A., 2019. Do we really need fully unsupervised cross-lingual embeddings? EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Zhu, Y., Heinzerling, B., Vulić, I., Strube, M., Reichart, R. and Korhonen, A., 2019. On the importance of subword information for morphological tasks in truly low-resource languages CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference,
  • Tseng, BH., Rei, M., Budzianowski, P., Turner, RE., Byrne, B. and Korhonen, A., 2019. Semi-supervised bootstrapping of dialogue state trackers for task-oriented modelling EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference,
  • Liu, Q., McCarthy, D. and Korhonen, A., 2019. Second-order contexts from lexical substitutes for few-shot learning of word representations *SEM@NAACL-HLT 2019 - 8th Joint Conference on Lexical and Computational Semantics,
  • Shareghi, E., Li, Y., Zhu, Y., Reichart, R. and Korhonen, A., 2019. Bayesian learning for neural dependency parsing NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, v. 1
  • Chiu, B., Baker, S., Palmer, M. and Korhonen, A., 2019. Enhancing biomedical word embeddings by retrofitting to verb clusters BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task,
  • Zhu, Y., Vulić, I. and Korhonen, A., 2019. A systematic study of leveraging subword information for learning word representations NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, v. 1
  • Shareghi, E., Gerz, D., Vulić, I. and Korhonen, A., 2019. Show some love to your n-grams: A bit of progress and stronger n-gram language modeling baselines NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, v. 1
  • Majewska, O., McCarthy, D., Vulić, I. and Korhonen, A., 2019. Acquiring verb classes through bottom-up semantic verb clustering LREC 2018 - 11th International Conference on Language Resources and Evaluation,
  • 2018

  • Gerz, D., Vulić, I., Ponti, EM., Reichart, R. and Korhonen, A., 2018. On the relation between linguistic typology and (limitations of) multilingual language modeling Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018,
  • Vulic, I., Glavaš, G., Mrkšić, N. and Korhonen, A., 2018. Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources
  • Ponti, E., Reichart, R., Korhonen, A. and Vulic, I., 2018. Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018),
    Doi: http://doi.org/10.18653/v1/P18-1142
  • Vulić, I. and Korhonen, A., 2018. Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation Proceedings of the Annual Meeting of the Association for Computational Linguistics,
    Doi: 10.18653/v1/w18-3018
  • Ponti, EM., Vulić, I., Glavaš, G., Mrkšić, N. and Korhonen, A., 2018. Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018,
  • 2017 (No publication date)

  • Baker, S., Korhonen, A. and Pyysalo, S., 2017 (No publication date). Cancer Hallmark Text Classification Using Convolutional Neural Networks
    Doi: http://doi.org/10.17863/CAM.12420
  • 2017

  • Vulić, I., Mrkšić, N. and Korhonen, A., 2017. Cross-lingual induction and transfer of verb classes based on word vector space specialisation EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings,
    Doi: http://doi.org/10.18653/v1/d17-1270
  • Vulić, I., Schwartz, R., Rappoport, A., Reichart, R. and Korhonen, A., 2017. Automatic Selection of Context Configurations for Improved Class-Specific Word Representations Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017),
    Doi: http://doi.org/10.18653/v1/K17-1013
  • Vulić, I., Kiela, D. and Korhonen, A., 2017. Evaluation by association: A systematic study of quantitative word association evaluation 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, v. 1
    Doi: http://doi.org/10.18653/v1/e17-1016
  • Ponti, EM. and Korhonen, A., 2017. Event-Related Features in Feedforward Neural Networks Contribute to Identifying Causal Relations in Discourse LSDSem 2017 - 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-Level Semantics, Proceedings of the Workshop,
    Doi: 10.18653/v1/w17-0903
  • Vulic, I., Mrkšić, N., Reichart, R., Séaghdha, D., Young, S. and Korhonen, A., 2017. Morph-fitting: Fine-tuning word vector spaces with simple language-specific rules ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), v. 1
    Doi: http://doi.org/10.18653/v1/P17-1006
  • Ponti, EM., Vulić, I. and Korhonen, A., 2017. Decoding sentiment from distributed representations of sentences *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings,
    Doi: http://doi.org/10.18653/v1/s17-1003
  • Baker, S. and Korhonen, A., 2017. Initializing neural networks for hierarchical multi-label text classification BioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop,
    Doi: 10.18653/v1/w17-2339
  • 2016

  • Vulić, I. and Korhonen, A., 2016. Is "universal syntax" universally useful for learning distributed word representations? 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Short Papers,
    Doi: http://doi.org/10.18653/v1/p16-2084
  • Vulić, I. and Korhonen, A., 2016. On the role of seed lexicons in learning bilingual word embeddings 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers, v. 1
    Doi: http://doi.org/10.18653/v1/p16-1024
  • Chiu, B., Korhonen, A. and Pyysalo, S., 2016. Intrinsic evaluation of word vectors fails to predict extrinsic performance Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • Chiu, B., Crichton, G., Korhonen, A. and Pyysalo, S., 2016. How to train good word embeddings for biomedical NLP BioNLP 2016 - Proceedings of the 15th Workshop on Biomedical Natural Language Processing,
  • Hill, F., Cho, K. and Korhonen, A., 2016. Learning distributed representations of sentences from unlabelled data 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference,
    Doi: http://doi.org/10.18653/v1/n16-1162
  • O'Horan, H., Berzak, Y., Vulić, I., Reichart, R. and Korhonen, A., 2016. Survey on the use of typological information in natural language processing COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • Baker, S., Kiela, D. and Korhonen, A., 2016. Robust text classification for sparsely labelled data using multi-level embeddings COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers,
  • Gerz, D., Vulic, I., Hill, F., Reichart, R. and Korhonen, A., 2016. Simverb-3500: A large-scale evaluation set of verb similarity EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings,
    Doi: 10.18653/v1/d16-1235
  • Berzak, Y., Huang, Y., Barbu, A., Korhonen, A. and Katz, B., 2016. Anchoring and agreement in syntactic annotations EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings,
    Doi: http://doi.org/10.18653/v1/d16-1239
  • 2015

  • Geertzen, J., Alexopoulou, T., Post, B. and Korhonen, A., 2015. Native language effects on pronunciation accuracy L2 English International Symposium on Monolingual and Bilingual Speech (ISMBS),
  • Alexopoulou, T., Geertzen, J., Meurers, D. and Korhonen, A., 2015. Relativisors and animacy in L2 English Second Language Research Forum (SLRF) 2015,
  • Karlgren, J., Callin, J., Collins-Thompson, K., Gyllensten, AC., Ekgren, A., Jurgens, D., Korhonen, A., Olsson, F., Sahlgren, M. and Schütze, H., 2015. Evaluating learning language representations Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 9283
    Doi: http://doi.org/10.1007/978-3-319-24027-5_25
  • Korhonen, A., Guo, Y., Baker, S., Yetisgen-Yildiz, M., Stenius, U., Narita, M. and Liò, P., 2015. Improving literature-based discovery with advanced text mining Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 8623
    Doi: http://doi.org/10.1007/978-3-319-24462-4_8
  • 2014

  • Hill, F. and Korhonen, A., 2014. Concreteness and subjectivity as dimensions of lexical meaning 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference, v. 2
    Doi: http://doi.org/10.3115/v1/p14-2118
  • Silins, I., Korhonen, A., Guo, Y. and Stenius, U., 2014. A text-mining approach for chemical risk assessment and cancer research TOXICOLOGY LETTERS, v. 229
    Doi: http://doi.org/10.1016/j.toxlet.2014.06.565
  • Larsson, K., Silins, I., Guo, Y., Korhonen, A., Stenius, U. and Berglund, M., 2014. Text mining for improved human exposure assessment TOXICOLOGY LETTERS, v. 229
    Doi: http://doi.org/10.1016/j.toxlet.2014.06.427
  • Guo, Y., Séaghdha, D., Silins, I., Sun, L., Högberg, J., Stenius, U. and Korhonen, A., 2014. CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of the Conference System Demonstrations,
  • Jiang, X., Guo, Y., Geertzen, J., Alexopoulou, D., Sun, L. and Korhonen, A., 2014. Native Language Identification using large, longitudinal data Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014,
  • Scarton, C., Sun, L., Kipper-Schuler, K., Duran, MS., Palmer, M. and Korhonen, A., 2014. Verb clustering for Brazilian Portuguese Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 8403 LNCS
    Doi: http://doi.org/10.1007/978-3-642-54906-9_3
  • Hill, F. and Korhonen, A., 2014. Learning abstract concept embeddings from multi-modal data: Since you probably can't see what I mean EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
    Doi: http://doi.org/10.3115/v1/d14-1032
  • Baker, S., Reichart, R. and Korhonen, A., 2014. An unsupervised model for instance level subcategorization acquisition EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • 2013 (No publication date)

  • Geertzen, J., Alexopoulou, T. and Korhonen, A., 2013 (No publication date). Automatic linguistic annotation of large scale L2 databases: the EF-Cambridge Open Language Database Selected papers from the Second Language Research Forum,
  • Korhonen, A., Guo, Y. and Reichart, R., 2013 (No publication date). Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints
  • Korhonen, A. and O'Seaghdha, D., 2013 (No publication date). Probabilistic models of similarity in syntactic context EMNLP 2011,
  • 2013

  • Baldwin, T. and Korhonen, A., 2013. Preface EMNLP 2013 - 2013 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • Sun, L., McCarthy, D. and Korhonen, A., 2013. Diathesis alternation approximation for verb clustering ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, v. 2
  • Guo, Y., Reichart, R. and Korhonen, A., 2013. Improved Information Structure Analysis of Scientific Documents through Discourse and Lexical Constraints Proceedings of the 2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013,
  • van de Cruys, T., Poibeau, T. and Korhonen, A., 2013. A Tensor-based Factorization Model of Semantic Compositionality Proceedings of the 2nd Workshop on Computational Linguistics for Literature, CLfL 2013 at the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2013,
  • Hill, F., Kiela, D. and Korhonen, A., 2013. Concreteness and Corpora: A Theoretical and Practical Analysis CMCL 2013 - Cognitive Modeling and Computational Linguistics, Proceedings of the Workshop,
  • Hill, F., Korhonen, A. and Bentz, C., 2013. Large-scale empirical analyses of concreteness Proceedings of the Annual Meeting of the Cognitive Science Society,
  • Kelly, C., Korhonen, A. and Devereux, B., 2013. Minimally Supervised Learning for Unconstrained Conceptual Property Extraction Cooperative Minds: Social Interaction and Group Dynamics - Proceedings of the 35th Annual Meeting of the Cognitive Science Society, CogSci 2013,
  • Hill, F., Korhonen, A. and Bentz, C., 2013. Large-Scale Empirical Analyses of the Abstract/Concrete Distinction Cooperative Minds: Social Interaction and Group Dynamics - Proceedings of the 35th Annual Meeting of the Cognitive Science Society, CogSci 2013,
  • Korhonen, A. and Reichart, R., 2013. Improved Lexical Acquisition through DPP-based Verb Clustering Association for Computational Linguistics,
  • Korhonen, A., Van de Cruys, T. and Poibeau, T., 2013. A Tensor-based Factorization Model of Semantic Compositionality http://aclweb.org/anthology/N/N13/N13-1.pdf,
  • 2012 (No publication date)

  • Rimell, L., Poibeau, T. and Korhonen, A., 2012 (No publication date). Merging Lexicons for Higher Precision Subcategorization Frame Acquisition Proceedings of the LREC 2012 Workshop on Language Resource Merging,
  • 2012

  • Kadekar, S., Silins, I., Korhonen, A., Dreij, K., Al-Anati, L., Hogberg, J. and Stenius, U., 2012. Exocrine pancreatic tumorigenesis and autotaxin expression TOXICOLOGY LETTERS, v. 211
    Doi: http://doi.org/10.1016/j.toxlet.2012.03.216
  • Silins, I., Korhonen, A., Sun, L., Hogberg, J. and Stenius, U., 2012. A text mining approach for chemical cancer research and risk assessment TOXICOLOGY LETTERS, v. 211
    Doi: http://doi.org/10.1016/j.toxlet.2012.03.458
  • Korhonen, A. and Reichart, R., 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents Proceedings of the 24th International Conference on Computational Linguistics (COLING),
  • Shutova, E., van de Cruys, T. and Korhonen, A., 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model Proceedings of the 24th International Conference on Computational Linguistics (COLING),
  • Guo, Y., Silins, I., Korhonen, A. and Reichart, R., 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature Proceedings of the 24th International Conference on Computational Linguistics (COLING),
  • Séaghdha, D. and Korhonen, A., 2012. Modelling selectional preferences in a lexical hierarchy *SEM 2012 - 1st Joint Conference on Lexical and Computational Semantics, v. 1
  • Kelly, C., Devereux, B. and Korhonen, A., 2012. Semi-supervised learning for automatic conceptual property extraction Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 2012-June
  • Abend, O., Biemann, C., Korhonen, A., Rappoport, A., Sogaard, A. and Reichart, R., 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
  • Berwick, R., Korhonen, A., Villavicencio, A. and Poibeau, T., 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss
  • Alexopoulou, T., Geertzen, J., Meurers, D. and Korhonen, A., 2012. L1 effects in L2 English relative clauses: evidence from corpus production Abstracts of the 22nd Annual Conference of the European Second Language Association (EUROSLA-22),
  • 2011

  • Abend, O., Korhonen, A., Rappoport, A. and Reichart, R., 2011. Introduction Workshop on Unsupervised Learning in NLP at the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011 - Proceedings,
  • Abend, O., Korhonen, A., Reichart, R. and Rappoport, A., 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP
  • Devereux, B., Tyler, L. and Korhonen, A., 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs International Conference on Cognitive Neuroscience (ICON),
  • Zhuang, J., Devereux, B., Tyler, L. and Korhonen, A., 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics International Conference on Cognitive Neuroscience (ICON),
  • 2010

  • Kadekar, S., Silins, I., Korhonen, A., Hogberg, J., Dreij, K. and Stenius, U., 2010. Carcinogen-induced inflammation and pancreatic cancer Proceedings of the 101st Annual Meeting of the American Association for Cancer Research,
  • Kelly, C., Korhonen, A. and Devereux, B., 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics,
  • Devereux, B., Korhonen, A. and Kelly, C., 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics,
  • Murphy, B., Korhonen, A. and Chang, K-MK., 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics
  • Devereux, B., Kelly, C., Pilkington, N., Korhonen, A. and Poibeau, T., 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives,
  • Moore, S., Buchholz, S. and Korhonen, A., 2010. Annotating the Enron email corpus with number senses Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010,
  • Devereux, B., Pilkington, N., Poibeau, T. and Korhonen, A., 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora COGNITION IN FLUX,
  • Guo, Y., Silins, I., Korhonen, A., Sun, L., Liakata, M. and Stenius, U., 2010. Identifying the information structure of scientific abstracts: An investigation of three different schemes Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • 2009

  • Schuler, KK., Korhonen, A. and Brown, S., 2009. VerbNet overview, extensions, mappings and applications NAACL-HLT 2009 - Human Language Technologies: 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Tutorial Abstracts,
  • Vlachos, A., Korhonen, A. and Ghahramani, Z., 2009. Unsupervised and constrained Dirichlet process mixture models for verb clustering.
  • Moore, S., Buchholz, S. and Korhonen, A., 2009. Number Sense Disambiguation Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics,
  • Sun, L., Korhonen, A., Silins, I. and Stenius, U., 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment Proceedings ...,
  • Kipper-Schuler, K., Korhonen, A. and Brown, S., 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications North American Chapter of the Association for Computational Linguistics - Human Language Technologies NAACL HLT,
  • 2008

  • Sun, L., Korhonen, A. and Krymolowski, Y., 2008. Verb class discovery from rich syntactic data COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, v. 4919
  • Vlachos, A., Ghahramani, Z. and Korhonen, A., 2008. Dirichlet process mixture models for verb clustering
  • Lewin, I., Silins, I., Korhonen, A., Hogberg, J. and Stenius, U., 2008. A New Challenge for Text Mining: Cancer Risk Assessment Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining,
  • Korhonen, A., Lewin, I., Silins, I., Hogberg, J. and Stenius, U., 2008. CRAB - Cancer Risk Assessment and Biomedical Text Mining Proceedings of the European Conference on Computational Biology,
  • Messiant, C., Korhonen, A. and Poibeau, T., 2008. LexSchem: A large subcategorization lexicon for French verbs Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008,
  • Sun, L., Korhonen, A. and Krymolowski, Y., 2008. Automatic classification of English verbs using rich syntactic features IJCNLP 2008 - 3rd International Joint Conference on Natural Language Processing, Proceedings of the Conference, v. 2
  • 2007

  • Buttery, P., Villavicencio, A. and Korhonen, A., 2007. The proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition
  • Preiss, J., Briscoe, T. and Korhonen, A., 2007. A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. ACL,
  • Buttery, P. and Korhonen, A., 2007. I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition,
  • 2006

  • Kipper, K., Korhonen, A., Ryant, N. and Palmer, M., 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes Proceedings of EURALEX,
  • Korhonen, A., Krymolowski, Y. and Briscoe, T., 2006. A large subcategorization lexicon for natural language processing applications Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006,
  • Kipper, K., Korhonen, A., Ryant, N. and Palmer, M., 2006. Extending VerbNet with novel verb classes Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006,
  • Korhonen, A., Krymolowski, Y. and Collier, N., 2006. Automatic Classification of Verbs in Biomedical Texts COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE,
  • 2005

  • Yallop, J., Korhonen, A. and Briscoe, T., 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. ACL,
  • Baldwin, T., Villavicencio, A. and Korhonen, A., 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition
  • 2004

  • Preiss, J. and Korhonen, A., 2004. WSD for subcategorization acquisition task description Proceedings of the SENSEVAL@ACL 2004: 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text - Held in cooperation with ACL 2004,
  • Tanaka, T., Villavicencio, A., Korhonen, A. and Bond, F., 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing
  • Briscoe, T. and Korhonen, A., 2004. Extended Lexical-Semantic Classification of English Verbs Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics,
  • 2003

  • Korhonen, A., Krymolowski, Y. and Marx, Z., 2003. Clustering polysemic subcategorization frame distributions semantically 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE,
  • Korhonen, A. and Preiss, J., 2003. Improving subcategorization acquisition using word sense disambiguation 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE,
  • 2002

  • Preiss, J., Korhonen, A. and Briscoe, T., 2002. Subcategorization Acquisition as an Evaluation Method for WSD. LREC,
  • Korhonen, A. and Krymolowski, Y., 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems Proceedings of the Annual Meeting of the Association for Computational Linguistics,
  • 2000

  • Korhonen, A., Gorrell, G. and McCarthy, D., 2000. Statistical filtering and subcategorization frame acquisition PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA,
  • Korhonen, A., 2000. Using semantically motivated estimates to help subcategorization acquisition PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA,
  • 1998

  • McCarthy, D. and Korhonen, A., 1998. Detecting verbal participation in diathesis alternations Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 2
  • Journal articles

    2024

  • Zhou, H., Wan, X., Vulić, I. and Korhonen, A., 2024. AUTOPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning Transactions of the Association for Computational Linguistics, v. 12
    Doi: http://doi.org/10.1162/tacl_a_00662
  • 2023 (Accepted for publication)

  • Breger, A., Selby, I., Roberts, M., Babar, J., Gkrania-Klotsas, E., Preller, J., Escudero Sanchez, L., Rudd, J., Aston, J., Weir-McCall, J., Sala, E. and Schoenlieb, C., 2023 (Accepted for publication). A pipeline to further enhance quality, integrity and reusability of the NCCID clinical data Scientific data,
    Doi: 10.1038/s41597-023-02340-7
  • Collins, C., Baker, S., Brown, J., Zeng, H., Chan, A., Stenius, U., Narita, M. and Korhonen, A., 2023 (Accepted for publication). Text Mining for Contexts and Relationships in Cancer Genomics Literature Bioinformatics,
    Doi: http://doi.org/10.1093/bioinformatics/btae021
  • 2023

  • Petti, U., Baker, S., Korhonen, A. and Robin, J., 2023. The Generalizability of Longitudinal Changes in Speech Before Alzheimer's Disease Diagnosis. J Alzheimers Dis, v. 92
    Doi: http://doi.org/10.3233/JAD-220847
  • Dittmer, S., Roberts, M., Gilbey, J., Biguri, A., Selby, I., Breger, A., Thorpe, M., Weir-McCall, JR., Gkrania-Klotsas, E., Korhonen, A., Jefferson, E., Langs, G., Yang, G., Prosch, H., Stanczuk, J., Tang, J., Babar, J., Escudero Sánchez, L., Teare, P., Patel, M., Wassin, M., Holzer, M., Walton, N., Lió, P., Shadbahr, T., Sala, E., Preller, J., Rudd, JHF., Aston, JAD. and Schönlieb, CB., 2023. Navigating the development challenges in creating complex data systems Nature Machine Intelligence, v. 5
    Doi: 10.1038/s42256-023-00665-x
  • Schellaert, W., Martínez-Plumed, F., Vold, K., Burden, J., Casares, PAM., Loe, BS., Reichart, R., Héigeartaigh, S., Korhonen, A. and Hernández-Orallo, J., 2023. Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models Journal of Artificial Intelligence Research, v. 77
    Doi: http://doi.org/10.1613/jair.1.14157
  • Hu, S., Zhou, H., Hergul, M., Gritta, M., Zhang, G., Iacobacci, I., Vulić, I. and Korhonen, A., 2023. MULTI³WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems Transactions of the Association for Computational Linguistics, v. 11
    Doi: http://doi.org/10.1162/tacl_a_00609
  • Petti, U., Baker, S., Korhonen, A. and Robin, J., 2023. How Much Speech Data Is Needed for Tracking Language Change in Alzheimer's Disease? A Comparison of Random Length, 5-Min, and 1-Min Spontaneous Speech Samples. Digit Biomark, v. 7
    Doi: http://doi.org/10.1159/000533423
  • Majewska, O., Razumovskaia, E., Ponti, EM., Vulić, I. and Korhonen, A., 2023. Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation Transactions of the Association for Computational Linguistics, v. 11
    Doi: http://doi.org/10.1162/tacl_a_00539
  • Majewska, O. and Korhonen, A., 2023. Verb Classification Across Languages Annual Review of Linguistics, v. 9
    Doi: http://doi.org/10.1146/annurev-linguistics-030521-043632
  • 2022

  • Majewska, O., Razumovskaia, E., Ponti, EM., Vulić, I. and Korhonen, A., 2022. Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
  • Razumovskaia, E., Glavaš, G., Majewska, O., Ponti, EM., Korhonen, A. and Vulić, I., 2022. Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems Journal of Artificial Intelligence Research, v. 74
    Doi: http://doi.org/10.1613/JAIR.1.13083
  • 2021

  • Ponti, EM., Vulić, I., Cotterell, R., Parović, M., Reichart, R. and Korhonen, A., 2021. Parameter space factorization for zero-shot learning across tasks and languages Transactions of the Association for Computational Linguistics, v. 9
    Doi: http://doi.org/10.1162/tacl_a_00374
  • Majewska, O., McCarthy, D., van den Bosch, JJF., Kriegeskorte, N., Vulić, I. and Korhonen, A., 2021. Semantic data set construction from human clustering and spatial arrangement Computational Linguistics, v. 47
    Doi: http://doi.org/10.1162/COLI_a_00396
  • Roberts, M., Driggs, D., Thorpe, M., Gilbey, J., Yeung, M., Ursprung, S., Aviles-Rivero, AI., Etmann, C., McCague, C., Beer, L., Weir-McCall, JR., Teng, Z., Gkrania-Klotsas, E., Ruggiero, A., Korhonen, A., Jefferson, E., Ako, E., Langs, G., Gozaliasl, G., Yang, G., Prosch, H., Preller, J., Stanczuk, J., Tang, J., Hofmanninger, J., Babar, J., Sánchez, LE., Thillai, M., Gonzalez, PM., Teare, P., Zhu, X., Patel, M., Cafolla, C., Azadbakht, H., Jacob, J., Lowe, J., Zhang, K., Bradley, K., Wassin, M., Holzer, M., Ji, K., Ortet, MD., Ai, T., Walton, N., Lio, P., Stranks, S., Shadbahr, T., Lin, W., Zha, Y., Niu, Z., Rudd, JHF., Sala, E. and Schönlieb, CB., 2021. Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans Nature Machine Intelligence, v. 3
    Doi: http://doi.org/10.1038/s42256-021-00307-0
  • Ali, I., Dreij, K., Baker, S., Högberg, J., Korhonen, A. and Stenius, U., 2021. Application of Text Mining in Risk Assessment of Chemical Mixtures: A Case Study of Polycyclic Aromatic Hydrocarbons (PAHs). Environ Health Perspect, v. 129
    Doi: http://doi.org/10.1289/EHP6702
  • Majewska, O., Collins, C., Baker, S., Björne, J., Brown, SW., Korhonen, A. and Palmer, M., 2021. BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine. J Biomed Semantics, v. 12
    Doi: http://doi.org/10.1186/s13326-021-00247-z
  • Huang, Y., Murakami, A., Alexopoulou, T. and Korhonen, A., 2021. Subcategorization frame identification for learner English International Journal of Corpus Linguistics, v. 26
    Doi: http://doi.org/10.1075/ijcl.18097.hua
  • Su, Y., Wang, Y., Cai, D., Baker, S., Korhonen, A. and Collier, N., 2021. PROTOTYPE-TO-STYLE: Dialogue Generation with Style-Aware Editing on Retrieval Memory IEEE/ACM Transactions on Audio Speech and Language Processing, v. 29
    Doi: http://doi.org/10.1109/TASLP.2021.3087948
  • 2020 (Accepted for publication)

  • Vulić, I., Baker, S., Ponti, E., Petti, U., Leviant, I., Wing, K., Majewska, O., Bar, E., Malone, M., Poibeau, T., Reichart, R. and Korhonen, A., 2020 (Accepted for publication). Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity Computational Linguistics,
    Doi: http://doi.org/10.1162/coli_a_00391
  • 2020

  • Crichton, G., Baker, S., Guo, Y. and Korhonen, A., 2020. Neural networks for open and closed Literature-based Discovery. PLoS One, v. 15
    Doi: http://doi.org/10.1371/journal.pone.0232891
  • Petti, U., Baker, S. and Korhonen, A., 2020. A systematic literature review of automatic Alzheimer's disease detection from speech and language. J Am Med Inform Assoc, v. 27
    Doi: http://doi.org/10.1093/jamia/ocaa174
  • 2019

  • Ponti, EM., O'Horan, H., Berzak, Y., Vulić, I., Reichart, R., Poibeau, T., Shutova, E. and Korhonen, A., 2019. Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing COMPUTATIONAL LINGUISTICS, v. 45
    Doi: http://doi.org/10.1162/coli_a_00357
  • Pyysalo, S., Baker, S., Ali, I., Haselwimmer, S., Shah, T., Young, A., Guo, Y., Högberg, J., Stenius, U., Narita, M. and Korhonen, A., 2019. LION LBD: a literature-based discovery system for cancer biology. Bioinformatics, v. 35
    Doi: http://doi.org/10.1093/bioinformatics/bty845
  • 2018 (Accepted for publication)

  • Chiu, HW., Majewska, O., Pyysalo, S., Wey, L., Stenius, U., Korhonen, AL. and Palmer, M., 2018 (Accepted for publication). A Neural Classification Method for Supporting the Creation of BioVerbNet Journal of Biomedical Semantics, v. 10
    Doi: http://doi.org/10.1186/s13326-018-0193-x
  • Huang, Y., Murakami, A., Alexopoulou, T. and Korhonen, A., 2018 (Accepted for publication). Dependency parsing of learner English International Journal of Corpus Linguistics, v. 23
    Doi: http://doi.org/10.1075/ijcl.16080.hua
  • 2018

  • Majewska, O., Vulić, I., McCarthy, D., Huang, Y., Murakami, A., Laippala, V. and Korhonen, A., 2018. Investigating the cross-lingual translatability of VerbNet-style classification. Lang Resour Eval, v. 52
    Doi: http://doi.org/10.1007/s10579-017-9403-x
  • Chiu, B., Pyysalo, S., Vulić, I. and Korhonen, A., 2018. Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine. BMC Bioinformatics, v. 19
    Doi: http://doi.org/10.1186/s12859-018-2039-z
  • Crichton, G., Guo, Y., Pyysalo, S. and Korhonen, A., 2018. Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches. BMC Bioinformatics, v. 19
    Doi: http://doi.org/10.1186/s12859-018-2163-9
  • Gerz, D., Vulić, I., Ponti, E., Naradowsky, J., Reichart, R. and Korhonen, A., 2018. Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction Transactions of the Association for Computational Linguistics, v. 6
    Doi: 10.1162/tacl_a_00032
  • 2017 (Accepted for publication)

  • Baker, S., Ali, I., Silins, I., Pyysalo, S., Guo, Y., Högberg, J., Stenius, U. and Korhonen, A., 2017 (Accepted for publication). Cancer Hallmarks Analytics Tool (CHAT): A text mining approach to organise and evaluate scientific literature on cancer Bioinformatics, v. 33
    Doi: http://doi.org/10.1093/bioinformatics/btx454
  • 2017

  • Lu, Y., Guo, Y. and Korhonen, A., 2017. Link prediction in drug-target interactions network using similarity indices. BMC Bioinformatics, v. 18
    Doi: http://doi.org/10.1186/s12859-017-1460-z
  • Larsson, K., Baker, S., Silins, I., Guo, Y., Stenius, U., Korhonen, A. and Berglund, M., 2017. Text mining for improved exposure assessment PLOS One, v. 12
    Doi: http://doi.org/10.1371/journal.pone.0173132
  • Crichton, G., Pyysalo, S., Chiu, B. and Korhonen, A., 2017. A neural network multi-task learning approach to biomedical named entity recognition BMC Bioinformatics, v. 18
    Doi: http://doi.org/10.1186/s12859-017-1776-8
  • Vulić, I., Gerz, D., Kiela, D., Hill, F. and Korhonen, A., 2017. HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment Computational Linguistics, v. 43
    Doi: http://doi.org/10.1162/COLI_a_00301
  • 2016

  • Hill, F., Cho, K., Korhonen, A. and Bengio, Y., 2016. Learning to Understand Phrases by Embedding the Dictionary Transactions of the Association for Computational Linguistics, v. 4
    Doi: 10.1162/tacl_a_00080
  • Baker, S., Silins, I., Guo, Y., Ali, I., Högberg, J., Stenius, U. and Korhonen, A., 2016. Automatic semantic classification of scientific literature according to the hallmarks of cancer. Bioinformatics, v. 32
    Doi: http://doi.org/10.1093/bioinformatics/btv585
  • Ali, I., Guo, Y., Silins, I., Högberg, J., Stenius, U. and Korhonen, A., 2016. Grouping chemicals for health risk assessment: A text mining-based case study of polychlorinated biphenyls (PCBs). Toxicol Lett, v. 241
    Doi: http://doi.org/10.1016/j.toxlet.2015.11.003
  • Ali, I., Högberg, J., Hsieh, J-H., Auerbach, S., Korhonen, A., Stenius, U. and Silins, I., 2016. Gender differences in cancer susceptibility: role of oxidative stress. Carcinogenesis, v. 37
    Doi: http://doi.org/10.1093/carcin/bgw076
  • 2015

  • Alexopoulou, T., Geertzen, J., Korhonen, A. and Meurers, D., 2015. Exploring big educational learner corpora for SLA research: Perspectives on relative clauses International Journal of Learner Corpus Research, v. 1
    Doi: http://doi.org/10.1075/ijlcr.1.1.04ale
  • Korhonen, A., Baker, S., Silins, I., Guo, Y., Ali, I., Hogberg, J. and Stenius, U., 2015. Automatic Semantic Classification of Scientific Literature According to the Hallmarks of Cancer Bioinformatics,
  • Guo, Y., Reichart, R. and Korhonen, A., 2015. Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents Transactions of the Association for Computational Linguistics, v. 3
  • Kiela, D., Guo, Y., Stenius, U. and Korhonen, A., 2015. Unsupervised discovery of information structure in biomedical documents. Bioinformatics, v. 31
    Doi: http://doi.org/10.1093/bioinformatics/btu758
  • Hill, F., Reichart, R. and Korhonen, A., 2015. SimLex-999: Evaluating semantic models with (genuine) similarity estimation Computational Linguistics, v. 41
    Doi: http://doi.org/10.1162/COLI_a_00237
  • 2014

  • Hill, F., Korhonen, A. and Bentz, C., 2014. A quantitative empirical analysis of the abstract/concrete distinction Cognitive Science, v. 38
    Doi: http://doi.org/10.1111/cogs.12076
  • Kiela, D., Hill, F., Korhonen, A. and Clark, S., 2014. Improving multi-modal representations using image dispersion: Why less is sometimes more 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference, v. 2
    Doi: http://doi.org/10.3115/v1/p14-2135
  • Kelly, C., Devereux, B. and Korhonen, A., 2014. Automatic extraction of property norm-like data from large text corpora Cognitive Science, v. 38
    Doi: http://doi.org/10.1111/cogs.12091
  • Silins, I., Korhonen, A. and Stenius, U., 2014. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool. Front Pharmacol, v. 5
    Doi: http://doi.org/10.3389/fphar.2014.00145
  • Séaghdha, D. and Korhonen, A., 2014. Probabilistic distributional semantics with latent variable models Computational Linguistics, v. 40
    Doi: http://doi.org/10.1162/COLI_a_00194
  • Hill, F., Korhonen, A. and Reichart, R., 2014. Multi-Modal Models for Concrete and Abstract Concept Meaning Transactions of ACL (TACL), v. 2
  • 2013

  • Kelly, C., Devereux, B. and Korhonen, A., 2013. Automatic Extraction of Property Norm-Like Data From Large Text Corpora Cognitive Science,
  • Shutova, E., Devereux, BJ. and Korhonen, A., 2013. Conceptual metaphor theory meets the data: A corpus-based human annotation study Language Resources and Evaluation, v. 47
    Doi: http://doi.org/10.1007/s10579-013-9238-z
  • Lippincott, T., Rimell, L., Verspoor, K. and Korhonen, A., 2013. Approaches to verb subcategorization for biomedicine. J Biomed Inform, v. 46
    Doi: http://doi.org/10.1016/j.jbi.2012.12.001
  • Poibeau, T., Villavicencio, A., Alishahi, A. and Korhonen, A., 2013. Computational Modeling as a Methodology for Studying Human Language Learning Cognitive Aspects of Computational Language Acquisition,
  • Rimell, L., Lippincott, T., Verspoor, K., Johnson, HL. and Korhonen, A., 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. J Biomed Inform, v. 46
    Doi: http://doi.org/10.1016/j.jbi.2013.01.001
  • Guo, Y., Silins, I., Stenius, U. and Korhonen, A., 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. Bioinformatics, v. 29
    Doi: http://doi.org/10.1093/bioinformatics/btt163
  • Kelly, C., Korhonen, A. and Devereux, B., 2013. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations Cognitive Science,
  • Shutova, E., Kaplan, J., Teufel, S. and Korhonen, A., 2013. A computational model of logical metonymy ACM Transactions on Speech and Language Processing, v. 10
    Doi: http://doi.org/10.1145/2483969.2483973
  • 2012

  • Korhonen, A., Séaghdha, DO., Silins, I., Sun, L., Högberg, J. and Stenius, U., 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS One, v. 7
    Doi: http://doi.org/10.1371/journal.pone.0033427
  • Kadekar, S., Silins, I., Korhonen, A., Dreij, K., Al-Anati, L., Hogberg, J. and Stenius, U., 2012. Exocrine Pancreatic Carcinogenesis and Autotaxin Expression PLOS ONE, v. 7
    Doi: http://doi.org/10.1371/journal.pone.0043209
  • Silins, I., Korhonen, A., Högberg, J. and Stenius, U., 2012. Data and literature gathering in chemical cancer risk assessment Integrated Environmental Assessment and Management, v. 8
    Doi: http://doi.org/10.1002/ieam.1278
  • Shutova, E., Teufel, SH. and Korhonen, A., 2012. Statistical Metaphor Processing Computational Linguistics, v. 39
  • Van De Cruys, T., Rimell, L., Poibeau, T. and Korhonen, A., 2012. Multi-way tensor factorization for unsupervised lexical acquisition 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers,
  • Contractor, D., Guo, Y. and Korhonen, A., 2012. Using argumentative zones for extractive summarization of scientific articles 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers,
  • Lippincott, T., Séaghdha, D. and Korhonen, A., 2012. Learning syntactic verb frames using graphical models 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference, v. 1
  • 2011

  • Séaghdha, DO. and Korhonen, A., 2011. Probabilistic models of similarity in syntactic context EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • Guo, Y., Korhonen, A., Liakata, M., Silins, I., Hogberg, J. and Stenius, U., 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics, v. 12
    Doi: http://doi.org/10.1186/1471-2105-12-69
  • Lippincott, T., Séaghdha, DÓ. and Korhonen, A., 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics, v. 12
    Doi: http://doi.org/10.1186/1471-2105-12-212
  • Guo, Y., Korhonen, A., Silins, I. and Stenius, U., 2011. Weakly supervised learning of information structure of scientific abstracts--is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics, v. 27
    Doi: http://doi.org/10.1093/bioinformatics/btr536
  • Van De Cruys, T., Poibeau, T. and Korhonen, A., 2011. Latent vector weighting for word meaning in context EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • Guo, Y., Korhonen, A. and Poibeau, T., 2011. A weakly-supervised approach to Argumentative Zoning of scientific documents EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • Sun, L. and Korhonen, A., 2011. Hierarchical verb clustering using graph factorization EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
  • 2010

  • Korhonen, A., 2010. Automatic lexical classification: bridging research and practice. Philos Trans A Math Phys Eng Sci, v. 368
    Doi: http://doi.org/10.1098/rsta.2010.0039
  • Shutova, E., Sun, L. and Korhonen, A., 2010. Metaphor identification using verb and noun clustering Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, v. 2
  • Sun, L., Korhonen, A., Poibeau, T. and Messiant, C., 2010. Investigating the cross-linguistic potential of VerbNet-style classification Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, v. 2
  • Lippincott, T., Séaghdha, DO., Sun, L. and Korhonen, A., 2010. Exploring variation across biomedical subdomains Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference, v. 2
  • Korhonen, A., 2010. Automatic Lexical Classification - Bridging Research and Practice. In Philosophical Transactions A of the Royal Society, v. 368
  • 2009

  • Korhonen, A., 2009. Automatic lexical classification - Balancing between machine learning and linguistics PACLIC 23 - Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, v. 1
  • Korhonen, A., Silins, I., Sun, L. and Stenius, U., 2009. The first step in the development of Text Mining technology for Cancer Risk Assessment: identifying and organizing scientific evidence in risk assessment literature. BMC Bioinformatics, v. 10
    Doi: http://doi.org/10.1186/1471-2105-10-303
  • Silins, I., Korhonen, A., Hogberg, J., Sun, L. and Stenius, U., 2009. Improved cancer risk assessment using text mining Cancer Research, v. 69
  • Devereux, B., Pilkington, N., Poibeau, T. and Korhonen, A., 2009. Towards Unrestricted, Large-Scale Acquisition of Feature-Based Conceptual Representations from Corpus Data Research on Language and Computation, v. 7
    Doi: http://doi.org/10.1007/s11168-010-9068-8
  • Sun, L. and Korhonen, A., 2009. Improving verb clustering with automatically acquired selectional preferences EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009,
    Doi: http://doi.org/10.3115/1699571.1699596
  • 2008

  • Kipper, K., Korhonen, A., Ryant, N. and Palmer, M., 2008. A large-scale classification of English verbs Lang Resour Eval, v. 42
    Doi: http://doi.org/10.1007/s10579-007-9048-2
  • Korhonen, A., Krymolowski, Y. and Collier, N., 2008. The choice of features for classification of verbs in biomedical texts Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference, v. 1
    Doi: http://doi.org/10.3115/1599081.1599138
  • 2007

  • 2007. Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
  • 2006

  • Mizuta, Y., Korhonen, A., Mullen, T. and Collier, N., 2006. Zone analysis in biology articles as a basis for information extraction. Int J Med Inform, v. 75
    Doi: http://doi.org/10.1016/j.ijmedinf.2005.06.013
  • 2005

  • Yallop, J., Korhonen, A. and Briscoe, T., 2005. Automatic acquisition of adjectival subcategorization from corpora ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference,
    Doi: http://doi.org/10.3115/1219840.1219916
  • Buttery, P. and Korhonen, A., 2005. Large-scale analysis of verb subcategorization differences between child directed speech and adult speech Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes,
  • Villavicencio, A., Bond, F., Korhonen, A. and McCarthy, D., 2005. Introduction to the special issue on multiword expressions: Having a crack at a hard nut Comput Speech Lang, v. 19
    Doi: http://doi.org/10.1016/j.csl.2005.05.001
  • 1999

  • Baljko, M. and Korhonen, A., 1999. Preface to the student session papers Proceedings of the Annual Meeting of the Association for Computational Linguistics, v. 1999-June
  • Theses / dissertations

    2023 (No publication date)

  • Liu, Q., 2023 (No publication date). On the Evaluation and Modelling of Context-sensitive Lexical Semantics
    Doi: http://doi.org/10.17863/CAM.95289
  • Datasets

    2018

  • Chiu, HW., Pyysalo, S., Vulic, I. and Korhonen, A., 2018. Bio-SimVerb
    Doi: http://doi.org/10.17863/CAM.18370
  • 2017 (No publication date)

  • Gerz, DS., Vulic, I., Hill, F., Reichart, R. and Korhonen, A., 2017 (No publication date). Research data supporting "SimVerb-3500: A Large-Scale Evaluation Set of Verb Similarity"
  • Book chapters

    2018

  • Jiang, X., Huang, Y., Guo, Y., Geertzen, J., Alexopoulou, T., Sun, L. and Korhonen, A., 2018. Native language identification on EFCAMDAT
    Doi: http://doi.org/10.1017/9781316676974.007
  • 2013

  • Korhonen, A., 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora
  • Books

    2013

  • Villavicencio, A., Poibeau, T., Alishahi, A. and Korhonen, A., 2013. Cognitive Aspects of Computational Language Acquisition

  • Read more at: Professor Ann Copestake

    Professor Ann Copestake

    Natural Language Processing; computational linguistics; semantics


    What we do

    Cambridge Language Sciences is an Interdisciplinary Research Centre at the University of Cambridge. Our virtual network connects researchers from five schools across the university as well as other world-leading research institutions. Our aim is to strengthen research collaborations and knowledge transfer across disciplines to address large-scale multidisciplinary challenges in language research.
