Linguistic Data Consortium

The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations and government research laboratories. LDC was formed in 1992 to address the critical data shortage then facing language technology research and development. LDC is hosted by the University of Pennsylvania and is a center within the University’s School of Arts and Sciences. 


The University of Toronto subscribes to the Linguistic Data Consortium, which licenses language corpora and other language resources. For more information about the LDC, please visit its website.

Date of collection
1993 to 2019