Linguistic Data Consortium

The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for linguistics research and development purposes. The University of Pennsylvania is the LDC's host institution. The LDC was founded in 1992 with a grant from the Advanced Research Projects Agency (ARPA), and is partly supported by grant IRI-9528587 from the Information and Intelligent Systems division of the National Science Foundation.

External links

* [http://www.ldc.upenn.edu/ LDC Website]


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Corpus linguistics — is the study of language as expressed in samples (corpora) or real world text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally …   Wikipedia

  • Brian MacWhinney — Brian James MacWhinney (born August 22, 1945) is Professor of Psychology at Carnegie Mellon University, MacWhinney co founded the CHILDES (Child Language Data Exchange System) Project in 1984 with Catherine Snow. He also directs the TalkBank… …   Wikipedia

  • Text corpus — In linguistics, a corpus (plural corpora ) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or… …   Wikipedia

  • Computational lexicology — is that branch of computational linguistics, which is concerned with the use of computers in the study of lexicon. It has been more narrowly described by some scholars (Amsler, 1980) as the use of computers in the study of machine readable… …   Wikipedia

  • N-gram — An n gram is a sub sequence of n items from a given sequence. n grams are used in various areas of statistical natural language processing and genetic sequence analysis. The items in question can be phonemes, syllables, letters, words or base… …   Wikipedia

  • Text analytics — The term text analytics describes a set of linguistic, lexical, pattern recognition,extraction, tagging/structuring, visualization, and predictive techniques. The termalso describes processes that apply these techniques, whether independently or… …   Wikipedia

  • Media bias in the United States — Journalism News · Writing style Ethics · Objectivity Values · …   Wikipedia

  • Fox News Channel controversies — Critics of Fox News Channel have accused the network of having a bias favoring the political right and the Republican Party. Fox News has publicly denied such charges,[1] stating that the reporters in the newsroom provide separate, neutral… …   Wikipedia

  • Adsotrans — Adso is an open source Chinese to English dictionary and natural language processing engine for Chinese text. The Adso project started in 2001. Its gist translation and dictionary interface are online at the Adsotrans website… …   Wikipedia

  • Mark Liberman — (pronounced /ˈlɪbərmən/) is an American linguist. He has a dual appointment at the University of Pennsylvania, as Trustee Professor of Phonetics in the Department of Linguistics, and as a professor in the Department of Computer and… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”

We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.