In this course, participants will learn what corpus linguistics is and why it matters. Corpus linguistics is an empirical method for studying language through collections of naturally occurring texts (corpora), allowing researchers to analyze real-world language use systematically. Students will learn how to build and manage a corpus and will gain hands-on experience analyzing linguistic data with tools such as AntConc and Python.
The course connects data patterns retrieved from corpora to linguistic theory, showing how empirical findings can challenge or support existing models in syntax, semantics, and pragmatics. Students will also explore the practical applications of corpus insights in computational linguistics, natural language processing, and language teaching.
A special focus will be placed on future directions in the field, including integration with large language models such as BERT or ChatGPT. Throughout the course, learners will participate in weekly mini-labs, take part in online discussions based on readings, and maintain a corpus diary to reflect on their learning and their observations of real-world language use.
- Instructor: Hiwa Asadpour