Corpus Linguistics Summer School 2021

Monday 5 July (09:00) - Thursday 8 July 2021 (17:00)


Our Corpus Linguistics summer school is open to undergraduate, postgraduate, and doctoral students, as well as researchers who want to improve their skills to apply corpus methods in their own research.

The fifth Corpus Linguistics Summer School will be entirely online and consist of synchronous and asynchronous elements. The synchronous part will take place from 5 to 8 July, 2021. Prior to the synchronous part, participants will be expected to complete the asynchronous part consisting of self-study video lectures and hands-on materials. Please note that the synchronous sessions will not be recorded.

Our summer school aims to equip participants with critical expertise in both the theory and practice of corpus-based linguistic research. Building on the strengths of our Centre for Corpus Research (CCR) and our invited speakers, we strive to offer participants a learning experience that is both beneficial for their own specific research needs and enriching to them as language researchers at large. There will also be the opportunity for participants to present their own work and receive feedback from our expert team. Given the specialised nature of the programme, a basic understanding of corpus linguistics is highly recommended.


The complete programme of the summer school will be available nearer the time.

Over the course of four days, participants will be actively involved in two kinds of sessions. First, hands-on sessions will put the emphasis on the learning of practical skills for the purpose of extracting and analysing corpus data of various kinds, and the application of this knowledge to specific research projects. Second, participants will also learn about current corpus-based research from our local staff of corpus linguists.

The programme will cover the following topics (non-exhaustive list):

  • Corpus tools and specialized software
  • Data exploration
  • Creating a custom corpus
  • Phraseology
  • Introduction to R and tidyverse
  • Web scraping with R
  • Statistics in corpus linguistics
  • Sign language corpora
  • Corpora and legal research

The programmes of previous years are available for your reference:

Invited speakers:

They will be joined by our local team of corpus linguists, including Jason Grafmiller, Jack Grieve, Nicholas Groom, Sten Hansson, Karen McAuliffe, Michaela Mahlberg, Petar Milin, Akira Murakami, Ruth Page, Adam Schembri, Paul Thompson, and Bodo Winter.

As every year, the annual Sinclair Lecture will take place during the Summer School. This year, the Lecture will be delivered by Dagmar Divjak (University of Birmingham) on Thursday 8 July , entitled 'Of wo/men and machines: an interdisciplinary take on language in use'. Further details can be found on our Sinclair Lecture 2021 webpage.


Pre-recorded video lectures in the asynchronous part of the summer school and the recording of the Sinclair Lecture will be captioned. Synchronous sessions will be auto-captioned.


A talk at the Corpus Linguistics Summer School 2019

Culture and collections

Schools, institutes and departments

Services and facilities