Constraint Grammar

Constraint Grammar (CG) is a methodological paradigm for natural language processing (NLP). Linguist-written, context-dependent rules are compiled into a grammar that assigns grammatical tags ("readings") to words or other tokens in running text. Typical tags address lemmatisation (lexeme or base form), inflexion, derivation, syntactic function, dependency, valency, case roles, semantic type, etc. Each rule adds, removes, selects or replaces a tag or a set of grammatical tags in a given sentence context. Context conditions can be linked to any tag or tag set of any word anywhere in the sentence, either locally (at defined distances) or globally (at undefined distances). Context conditions in the same rule may be linked, i.e. conditioned upon each other, negated, or blocked by interfering words or tags. Typical CGs consist of thousands of rules, which are applied set-wise in progressive steps, covering ever more advanced levels of analysis. Within each level, safe rules are used before heuristic rules, and no rule is allowed to remove the last reading of a given kind, which gives the approach a high degree of robustness.
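The rule mechanism described above can be illustrated with a minimal Python sketch. It is not the syntax or implementation of any actual CG compiler: the names `Cohort` and `apply_rule` are hypothetical, a reading is modelled simply as a set of tags, and only two rule types (REMOVE and SELECT) with a single local context condition are covered.

```python
from dataclasses import dataclass


@dataclass
class Cohort:
    """One token plus its remaining readings (hypothetical name)."""
    form: str
    readings: list  # each reading is a frozenset of tags


def apply_rule(sentence, op, target, offset, context_tag):
    """Apply one REMOVE or SELECT rule to every cohort in the sentence.

    The rule fires at position i only if the cohort at i + offset exists
    and all of its remaining readings carry context_tag (an unambiguous
    context). Returns True if any cohort was changed.
    """
    changed = False
    for i, cohort in enumerate(sentence):
        j = i + offset
        if not (0 <= j < len(sentence)):
            continue  # context position falls outside the sentence
        if not all(context_tag in r for r in sentence[j].readings):
            continue  # context condition not met
        if op == "REMOVE":
            kept = [r for r in cohort.readings if target not in r]
        elif op == "SELECT":
            kept = [r for r in cohort.readings if target in r]
        else:
            raise ValueError(f"unsupported rule type: {op}")
        # Robustness: a rule may never discard the last reading.
        if kept and len(kept) < len(cohort.readings):
            cohort.readings = kept
            changed = True
    return changed


# "walks" is ambiguous between a noun and a verb reading.
sentence = [
    Cohort("the", [frozenset({"the", "DET"})]),
    Cohort("walks", [frozenset({"walk", "N", "PL"}),
                     frozenset({"walk", "V", "PRES"})]),
]

# Remove verb readings directly after an unambiguous determiner.
apply_rule(sentence, "REMOVE", "V", -1, "DET")
print([sorted(r) for r in sentence[1].readings])
```

After the rule is applied, only the noun reading of "walks" remains. If a cohort had a verb reading as its only reading, the same rule would leave it untouched, which mirrors the constraint that the last reading is never removed.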

The Constraint Grammar concept was launched by Fred Karlsson in 1990 (Karlsson 1990; Karlsson et al., eds, 1995), and CG taggers and parsers have since been written for a large variety of languages, routinely achieving F-scores of over 99% for PoS (word class) tagging[1]. A number of syntactic CG systems have reported F-scores of around 95% for syntactic function labels. CG systems can be used to create full syntactic trees in other formalisms by adding small, non-terminal based phrase structure grammars or dependency grammars, and a number of corpus/treebank projects have used Constraint Grammar for automatic annotation. CG methodology has also been used in a number of language technology applications, such as spell checkers and machine translation systems.

Footnotes

  1. For English, see for example Tapanainen and Voutilainen 1994.

Wikimedia Foundation. 2010.
