home cs/pos edit page issue tracker

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in

To auto je zelené.  “The car is green.”

The ADJ tag is intended for ordinary adjectives only. See DET for determiners and NUM for cardinal numerals.

In accord with the UD approach, adjectival ordinal numerals (první, sedmý, stopadesátý)  are tagged as adjectives, although the traditional grammar classifies them as numerals. They behave like adjectives both morphologically and syntactically, with the exception that they cannot be compared and negated.

Most Czech adjectives inflect for cs-feat/Gender (velký – velká – velké)  “big”, cs-feat/Number (velký – velcí),  cs-feat/Case (velký – velkého – velkému – velkém – velkým),  cs-feat/Degree (velký – větší – největší),  and Negation (velký – nevelký). 

Examples

Border cases

Passive participles lie on the border between verbs and adjectives. Core participial forms (ending in consonant or short vowel) are tagged VERB. Long forms are participial adjectives and they are tagged ADJ. For example:

Their meaning is almost identical but the usage slightly varies. Both groups can be used in nominal predication with copula. Only true participles (verbs) can be used to form the passive voice (but it may be sometimes difficult to distinguish from copula constructions, see AUX). On the other hand, the participial adjectives inflect for case and thus can modify nouns.

There is an analogy with some adjectives that preserved so called nominal (short) forms. And these adjectives are not derived from verbs. Example:

Here both groups are ADJ. The nominal forms are used in predication, the standard forms both in predication and to modify nouns.

References


Treebank Statistics (UD_Czech)

There are 14158 ADJ lemmas (24%), 36819 ADJ types (28%) and 180811 ADJ tokens (12%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: český, velký, nový, další, první, jiný, druhý, vysoký, dobrý, celý

The 10 most frequent ADJ types: první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní

The 10 most frequent ambiguous lemmas: velký (ADJ 2468, ADV 1), obchodní (ADJ 588, ADV 1), starý (ADJ 567, NOUN 5), známý (ADJ 560, NOUN 21), domácí (ADJ 515, NOUN 5), mladý (ADJ 443, NOUN 3), třeba (ADJ 409, ADV 404), blízký (ADJ 314, NOUN 2), vedoucí (ADJ 156, NOUN 145), spolkový (ADJ 117, NOUN 1)

The 10 most frequent ambiguous types: vlastní (ADJ 464, VERB 76), třeba (ADJ 408, ADV 372), hlavní (ADJ 298, NOUN 3), tzv (ADJ 359, ADV 1), domácí (ADJ 230, NOUN 2), dobré (ADJ 211, NOUN 1), vysoké (ADJ 190, NOUN 1), a (CONJ 31068, ADJ 183, NOUN 49, ADP 7), lepší (ADJ 169, VERB 2), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4)

Morphology

The form / lemma ratio of ADJ is 2.600579 (the average of all parts of speech is 2.195970).

The 1st highest number of forms (32) was observed with the lemma “známý”: nejznámější, nejznámějších, nejznámějším, neznáma, neznámo, neznámou, neznámá, neznámé, neznámého, neznámém, neznámí, neznámý, neznámých, neznámým, neznámými, znám, známa, známi, známo, známou, známy, známá, známé, známého, známém, známému, známí, známý, známých, známým, známými, známější

The 2nd highest number of forms (31) was observed with the lemma “dobrý”: Dobrú, dobrou, dobrá, dobré, dobrého, dobrém, dobrému, dobrý, dobrých, dobrým, dobrými, dobří, lepší, lepších, lepšího, lepším, lepšími, lepšímu, nedobrou, nedobrá, nedobré, nedobrého, nedobrý, nedobrých, nejlepší, nejlepších, nejlepšího, nejlepším, nejlepšími, nejlepšímu, nelepší

The 3rd highest number of forms (31) was observed with the lemma “velký”: největší, největších, největšího, největším, největšími, největšímu, nevelkou, nevelká, nevelké, nevelkého, nevelký, nevelkých, nevelkým, nevelkými, velcí, velkou, velká, velké, velkého, velkém, velkému, velký, velkých, velkým, velkými, větší, větších, většího, větším, většími, většímu

ADJ occurs with 20 features: cs-feat/Number (176213; 97% instances), cs-feat/Gender (176190; 97% instances), cs-feat/Case (174220; 96% instances), cs-feat/Negative (173109; 96% instances), cs-feat/Degree (166322; 92% instances), cs-feat/Animacy (73924; 41% instances), cs-feat/NumType (4990; 3% instances), cs-feat/NameType (4756; 3% instances), cs-feat/Aspect (4498; 2% instances), cs-feat/Tense (4498; 2% instances), cs-feat/VerbForm (4498; 2% instances), cs-feat/Voice (4498; 2% instances), cs-feat/Gender[psor] (2707; 1% instances), cs-feat/Poss (2707; 1% instances), cs-feat/Foreign (2669; 1% instances), cs-feat/Variant (1889; 1% instances), cs-feat/Abbr (1714; 1% instances), cs-feat/Hyph (398; 0% instances), cs-feat/Style (62; 0% instances), cs-feat/NumValue (30; 0% instances)

ADJ occurs with 61 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Com, NameType=Com,Geo, NameType=Com,Giv, NameType=Com,Oth, NameType=Com,Pro, NameType=Com,Pro,Sur, NameType=Com,Sur, NameType=Geo, NameType=Geo,Giv, NameType=Geo,Oth, NameType=Geo,Pro, NameType=Geo,Sur, NameType=Giv, NameType=Giv,Sur, NameType=Nat, NameType=Oth, NameType=Oth,Sur, NameType=Pro, NameType=Sur, Negative=Neg, Negative=Pos, NumType=Gen, NumType=Ord, NumType=Sets, NumValue=1, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Poss=Yes, Style=Arch, Style=Coll, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Part, Voice=Act

ADJ occurs with 761 feature combinations. The most frequent feature combination is Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Sing (13492 tokens). Examples: české, evropské, nové, národní, politické, slovenské, státní, světové, celé, velké

Relations

ADJ nodes are attached to their parents using 25 different relations: cs-dep/amod (157491; 87% instances), cs-dep/conj (7778; 4% instances), cs-dep/root (4894; 3% instances), cs-dep/foreign (1691; 1% instances), cs-dep/dep (1489; 1% instances), cs-dep/xcomp (1193; 1% instances), cs-dep/acl (966; 1% instances), cs-dep/nsubj (955; 1% instances), cs-dep/ccomp (939; 1% instances), cs-dep/dobj (761; 0% instances), cs-dep/advmod (723; 0% instances), cs-dep/advcl (692; 0% instances), cs-dep/iobj (472; 0% instances), cs-dep/appos (335; 0% instances), cs-dep/csubj (181; 0% instances), cs-dep/parataxis (79; 0% instances), cs-dep/name (56; 0% instances), cs-dep/cc (53; 0% instances), cs-dep/nsubjpass (32; 0% instances), cs-dep/nmod (15; 0% instances), cs-dep/advmod:emph (6; 0% instances), cs-dep/cop (4; 0% instances), cs-dep/mwe (3; 0% instances), cs-dep/case (2; 0% instances), cs-dep/vocative (1; 0% instances)

Parents of ADJ nodes belong to 16 different parts of speech: NOUN (153461; 85% instances), ADJ (6932; 4% instances), VERB (6653; 4% instances), PROPN (6485; 4% instances), ROOT (4894; 3% instances), PRON (1201; 1% instances), NUM (784; 0% instances), ADV (221; 0% instances), DET (88; 0% instances), PART (42; 0% instances), ADP (18; 0% instances), SYM (14; 0% instances), CONJ (12; 0% instances), INTJ (3; 0% instances), SCONJ (2; 0% instances), PUNCT (1; 0% instances)

147540 (82%) ADJ nodes are leaves.

14379 (8%) ADJ nodes have one child.

7405 (4%) ADJ nodes have two children.

11487 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 21.

Children of ADJ nodes are attached using 30 different relations: cs-dep/punct (17759; 21% instances), cs-dep/advmod (11333; 14% instances), cs-dep/cop (9168; 11% instances), cs-dep/conj (7680; 9% instances), cs-dep/nmod (7507; 9% instances), cs-dep/cc (6782; 8% instances), cs-dep/nsubj (5591; 7% instances), cs-dep/dobj (3891; 5% instances), cs-dep/mark (2244; 3% instances), cs-dep/csubj (2052; 2% instances), cs-dep/case (2005; 2% instances), cs-dep/advcl (1461; 2% instances), cs-dep/advmod:emph (1140; 1% instances), cs-dep/dep (635; 1% instances), cs-dep/xcomp (623; 1% instances), cs-dep/expl (538; 1% instances), cs-dep/aux (530; 1% instances), cs-dep/appos (408; 0% instances), cs-dep/foreign (308; 0% instances), cs-dep/amod (268; 0% instances), cs-dep/nummod (260; 0% instances), cs-dep/acl (153; 0% instances), cs-dep/parataxis (123; 0% instances), cs-dep/ccomp (106; 0% instances), cs-dep/det (81; 0% instances), cs-dep/neg (66; 0% instances), cs-dep/name (42; 0% instances), cs-dep/discourse (22; 0% instances), cs-dep/auxpass:reflex (5; 0% instances), cs-dep/vocative (2; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: PUNCT (17759; 21% instances), NOUN (15088; 18% instances), VERB (14143; 17% instances), ADV (11950; 14% instances), ADJ (6932; 8% instances), CONJ (6287; 8% instances), PRON (2820; 3% instances), SCONJ (2240; 3% instances), ADP (1964; 2% instances), PROPN (1700; 2% instances), NUM (773; 1% instances), AUX (530; 1% instances), PART (482; 1% instances), DET (106; 0% instances), INTJ (5; 0% instances), SYM (4; 0% instances)


ADJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]