Keyword (linguistics)

In corpus linguistics a keyword is a word which occurs in a text more often than we would expect to occur by chance alone. Keywords are calculated by carrying out a statistical test (e.g., loglinear) which compares the word frequencies in a text against their expected frequencies derived in a much larger corpus, which acts as a reference for general language use. AntConc is a freely available text analysis tool capable of calculating keywords.

ee also

* Collocation

External links

* [http://nflrc.hawaii.edu/RFL/April2005/chujo/chujo.html Understanding the role of text length, sample size and vocabulary size in determining text coverage] , by Kiyomi Chujo and Masao Utiyama
* [http://language.tiu.ac.jp/flc/index.html Frequency Level Checker]


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Keyword — A keyword is a word or concept with special significance, in particular any word used as the key to a code or used in a reference work to link to other words or other information.The term keyword in reference to Search Engine Marketing (SEM)… …   Wikipedia

  • Corpus linguistics — is the study of language as expressed in samples (corpora) or real world text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally …   Wikipedia

  • Key — may refer to: Building* Key, Carpentry: timber or metal wedges used across or between two or more members to act as a tightening agent. * Key, Painting: to rough the surface of previous coats of paint to allow a secure bond for the next or top… …   Wikipedia

  • Natural language user interface — Natural Language User Interfaces (LUI) are a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying data in software applications. In interface… …   Wikipedia

  • Concept Search — A concept search (or conceptual search) is an automated information retrieval method that is used to search electronically stored unstructured text (for example, digital archives, email, scientific literature, etc.) for information that is… …   Wikipedia

  • Semantic change — Semantic change, also known as semantic shift or semantic progression describes the evolution of word usage usually to the point that the modern meaning is radically different from the original usage. In diachronic (or historical) linguistics,… …   Wikipedia

  • Wizard of Oz experiment — In the field of human computer interaction, a Wizard of Oz experiment is a research experiment in which subjects interact with a computer system that subjects believe to be autonomous, but which is actually being operated or partially operated by …   Wikipedia

  • Corpus-assisted discourse studies — Corpus assisted discourse studies, or CADS, is related historically and methodologically to the discipline of corpus linguistics. The principal endeavor of corpus assisted discourse studies is the investigation, and comparison of features of… …   Wikipedia

  • Asterisk — This article is about the typographical symbol. For other uses, see Asterisk (disambiguation). See also: * (disambiguation) * Asterisk …   Wikipedia

  • Question answering — (QA) is a type of information retrieval. Given a collection of documents (such as the World Wide Web or a local collection) the system should be able to retrieve answers to questions posed in natural language. QA is regarded as requiring more… …   Wikipedia


Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”

We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.