home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: ADJ

There are 6580 ADJ lemmas (24%), 15117 ADJ types (28%) and 40774 ADJ tokens (12%). Out of 17 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: český, velký, nový, další, první, jiný, druhý, vysoký, celý, poslední

The 10 most frequent ADJ types: první, další, české, nové, druhé, poslední, státní, možné, dalších, vlastní

The 10 most frequent ambiguous lemmas: známý (ADJ 115, NOUN 6), rád (ADJ 57, ADV 21), vedoucí (ADJ 36, NOUN 27), účetní (ADJ 16, NOUN 4), nevidomý (NOUN 28, ADJ 12), nemocný (NOUN 11, ADJ 7), dospělý (ADJ 5, NOUN 5), cestující (ADJ 3, NOUN 1), radní (ADJ 3, NOUN 3), ex (ADJ 2, X 1)

The 10 most frequent ambiguous types: vlastní (ADJ 110, VERB 18), a (CCONJ 6945, ADJ 32, NOUN 17, X 6), vedoucí (ADJ 20, NOUN 18), možná (ADV 45, ADJ 9, PART 2), znám (ADJ 10, VERB 2), jazykové (ADJ 9, NOUN 2), o (ADP 2182, NOUN 33, ADJ 6), stará (ADJ 5, VERB 3), známým (ADJ 6, NOUN 1), účetní (ADJ 6, NOUN 4)

Morphology

The form / lemma ratio of ADJ is 2.297416 (the average of all parts of speech is 1.964432).

The 1st highest number of forms (24) was observed with the lemma “velký”: Nevelká, Velcí, největší, největších, největšího, největším, největšími, největšímu, nevelké, velkou, velká, velké, velkého, velkém, velkému, velký, velkých, velkým, velkými, větší, větších, většího, větším, většímu.

The 2nd highest number of forms (24) was observed with the lemma “známý”: nejznámější, nejznámějších, nejznámějším, neznámo, neznámá, neznámé, neznámého, neznámí, neznámý, neznámých, neznámým, neznámými, znám, známa, známo, známou, známy, známá, známé, známého, známí, známý, známých, známým.

The 3rd highest number of forms (22) was observed with the lemma “malý”: malou, malá, malé, malého, malém, malému, malí, malý, malých, malým, malými, menší, menších, menšího, menším, menšími, menšímu, nejmenší, nejmenším, nemalou, nemalé, nemalými.

ADJ occurs with 19 features: Number (40456; 99% instances), Gender (40447; 99% instances), Polarity (39019; 96% instances), Case (38032; 93% instances), Degree (35543; 87% instances), Animacy (16398; 40% instances), VerbForm (5919; 15% instances), Voice (5919; 15% instances), Aspect (3031; 7% instances), Variant (2426; 6% instances), NumType (1216; 3% instances), Tense (931; 2% instances), Gender[psor] (538; 1% instances), Poss (538; 1% instances), NameType (474; 1% instances), Abbr (182; 0% instances), Hyph (119; 0% instances), Style (22; 0% instances), Typo (2; 0% instances)

ADJ occurs with 46 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Imp,Perf, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Geo,Giv, NameType=Giv, NameType=Nat, NumType=Mult,Sets, NumType=Ord, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Polarity=Neg, Polarity=Pos, Poss=Yes, Style=Coll, Style=Expr, Style=Slng, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Part, Voice=Act, Voice=Pass

ADJ occurs with 570 feature combinations. The most frequent feature combination is Case=Gen|Degree=Pos|Gender=Fem|Number=Sing|Polarity=Pos (2734 tokens). Examples: české, evropské, nové, národní, slovenské, politické, celé, velké, světové, ekonomické

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (34242; 84% instances), conj (1983; 5% instances), root (1948; 5% instances), ccomp (531; 1% instances), xcomp (366; 1% instances), dep (335; 1% instances), advcl (264; 1% instances), acl:relcl (205; 1% instances), obl:arg (187; 0% instances), obl (167; 0% instances), nsubj (131; 0% instances), obj (94; 0% instances), acl (90; 0% instances), appos (81; 0% instances), csubj (58; 0% instances), orphan (39; 0% instances), parataxis (22; 0% instances), csubj:pass (9; 0% instances), nsubj:pass (7; 0% instances), flat (6; 0% instances), iobj (5; 0% instances), advmod (1; 0% instances), fixed (1; 0% instances), nmod (1; 0% instances), vocative (1; 0% instances)

Parents of ADJ nodes belong to 14 different parts of speech: NOUN (33666; 83% instances), (1948; 5% instances), VERB (1890; 5% instances), ADJ (1725; 4% instances), PROPN (866; 2% instances), DET (255; 1% instances), NUM (144; 0% instances), PRON (118; 0% instances), ADV (70; 0% instances), X (64; 0% instances), PART (11; 0% instances), AUX (10; 0% instances), SYM (4; 0% instances), CCONJ (3; 0% instances)

30504 (75%) ADJ nodes are leaves.

4754 (12%) ADJ nodes have one child.

1290 (3%) ADJ nodes have two children.

4226 (10%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 35 different relations: punct (5544; 20% instances), advmod (3163; 12% instances), obl (3119; 11% instances), cop (2572; 9% instances), conj (2004; 7% instances), nsubj (1751; 6% instances), cc (1721; 6% instances), aux:pass (1339; 5% instances), obl:arg (1177; 4% instances), nsubj:pass (1017; 4% instances), mark (850; 3% instances), advcl (452; 2% instances), case (422; 2% instances), csubj (351; 1% instances), advmod:emph (268; 1% instances), aux (233; 1% instances), xcomp (197; 1% instances), obj (195; 1% instances), dep (179; 1% instances), expl:pv (104; 0% instances), appos (100; 0% instances), nmod (88; 0% instances), orphan (61; 0% instances), ccomp (59; 0% instances), parataxis (52; 0% instances), nummod (43; 0% instances), acl:relcl (31; 0% instances), det (25; 0% instances), amod (23; 0% instances), csubj:pass (13; 0% instances), flat (11; 0% instances), discourse (5; 0% instances), expl:pass (2; 0% instances), acl (1; 0% instances), vocative (1; 0% instances)

Children of ADJ nodes belong to 17 different parts of speech: NOUN (6295; 23% instances), PUNCT (5544; 20% instances), AUX (4163; 15% instances), ADV (3371; 12% instances), ADJ (1725; 6% instances), CCONJ (1698; 6% instances), VERB (1235; 5% instances), SCONJ (830; 3% instances), DET (581; 2% instances), PRON (433; 2% instances), PROPN (423; 2% instances), ADP (417; 2% instances), PART (220; 1% instances), NUM (205; 1% instances), X (22; 0% instances), SYM (10; 0% instances), INTJ (1; 0% instances)