The London-Lund Corpus 2 of spoken British English (LLC 2)

More than 50 years ago, the London-Lund Corpus of spoken British English (LLC 1), a remarkable machine-readable database of half a million words, was launched. The purpose of this project is to compile LLC 2, 50 years on, in the same way and in the same place, to make research on contemporary speech and recent language change possible.

The main goal of this project is to compile a new spoken language corpus, entitled the London-Lund Corpus 2 of spoken British English (LLC 2), designed in accordance with the principles of the London-Lund Corpus (LLC 1), launched in 1975. Compiling a corpus comparable to LLC 1 will facilitate principled research on recent changes in contemporary spoken English. Unlike written discourse, spoken discourse has not been documented on a regular basis since the 1990s. This has left a gap in the investigation of naturally occurring language developments in contemporary language use. The three fundamental stages of corpus compilation for which funding is sought are data collection, transcription and annotation. As with LLC 1, the data for LLC 2 will be recorded at University College London, and the main emphasis will be on capturing spontaneous face-to-face conversations between educated adults. Following the data collection process, the recordings will be transcribed and annotated by research assistants. After the completion of these stages, the corpus will be made available to the linguistics community, with the aim of stimulating new and exciting research in the field.

The compilation of the corpus has been funded by grants from:

The Erik Philip-Sörensen Foundation and

The Linnaeus Centre: Thinking in Time: Cognition, Communication, and Learning, financed by the Swedish Research Council, Grant No. 349-2007-
