Encyclopedia of Chinese Language and Linguistics

Get access

Lancaster Corpus of Mandarin Chinese
(2,113 words)

1. Introduction

The Lancaster Corpus of Mandarin Chinese (LCMC) is a one million-word balanced corpus that represents written Mandarin. The corpus is designed as a Chinese match for the FLOB (Hundt, Sand and Siemund 1998) and Frown (Hundt, Sand and Skandera 1999) corpora of British and American English. It was created as part of the research project “Contrastive English and Chinese”…

Cite this page
Richard XIAO, “Lancaster Corpus of Mandarin Chinese”, in: Encyclopedia of Chinese Language and Linguistics, General Editor Rint Sybesma. Consulted online on 23 March 2023 <http://dx.doi.org/10.1163/2210-7363_ecll_COM_00000208>
First published online: 2015



▲   Back to top   ▲