home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDTC: POS Tags: ADJ

There are 17762 ADJ lemmas (20%), 54112 ADJ types (28%) and 361037 ADJ tokens (10%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 1 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: velký, další, nový, první, český, jiný, vysoký, druhý, celý, malý

The 10 most frequent ADJ types: další, první, nové, poslední, české, velké, dalších, cenných, obchodní, hlavní

The 10 most frequent ambiguous lemmas: rád (ADJ 1984, ADV 335), starý (ADJ 1944, NOUN 13), mladý (ADJ 1386, NOUN 55), známý (ADJ 915, NOUN 170), domácí (ADJ 728, NOUN 83), vedoucí (ADJ 606, NOUN 423), místní (ADJ 536, NOUN 5), bílý (ADJ 515, NOUN 10), blízký (ADJ 507, NOUN 10), černý (ADJ 457, NOUN 16)

The 10 most frequent ambiguous types: hlavní (ADJ 841, NOUN 6), vlastní (ADJ 855, VERB 380), starší (ADJ 513, NOUN 1), vysoké (ADJ 433, NOUN 7), celou (ADJ 439, NOUN 1), lepší (ADJ 394, VERB 2), tzv (ADJ 396, ADV 7), vedoucí (ADJ 320, NOUN 259), domácí (ADJ 341, NOUN 42), místní (ADJ 256, NOUN 3)

Morphology

The form / lemma ratio of ADJ is 3.046504 (the average of all parts of speech is 2.169184).

The 1st highest number of forms (35) was observed with the lemma “známý”: nejznámější, nejznámějších, nejznámějším, nejznámějšími, neznáma, neznámo, neznámou, neznámá, neznámé, neznámého, neznámém, neznámému, neznámí, neznámý, neznámých, neznámým, neznámými, znám, známa, známi, známo, známou, známy, známá, známé, známého, známém, známému, známí, známý, známých, známým, známými, známější, známějších.

The 2nd highest number of forms (33) was observed with the lemma “schopný”: nejneschopnějších, nejschopnější, nejschopnějších, nejschopnějším, nejschopnějšímu, neschopen, neschopna, neschopni, neschopnou, neschopná, neschopné, neschopného, neschopní, neschopný, neschopných, neschopným, neschopnými, schopen, schopna, schopni, schopno, schopnou, schopny, schopná, schopné, schopného, schopnému, schopní, schopný, schopných, schopným, schopnými, schopnější.

The 3rd highest number of forms (32) was observed with the lemma “malý”: malej, malou, malym, malá, malé, malého, malém, malému, malí, malý, malých, malým, malými, menší, menších, menšího, menším, menšími, menšímu, nejmenší, nejmenších, nejmenšího, nejmenším, nejmenšími, nejmenšímu, nemalou, nemalá, nemalé, nemalých, nemalým, nemalými, nemenší.

ADJ occurs with 19 features: Number (358889; 99% instances), Gender (358829; 99% instances), Polarity (346146; 96% instances), Case (337099; 93% instances), Degree (336415; 93% instances), Animacy (151657; 42% instances), VerbForm (55612; 15% instances), Voice (55612; 15% instances), Aspect (28096; 8% instances), Variant (21806; 6% instances), Tense (10138; 3% instances), NumType (10119; 3% instances), Gender[psor] (4763; 1% instances), Poss (4762; 1% instances), NameType (4113; 1% instances), Abbr (1150; 0% instances), Hyph (827; 0% instances), Style (230; 0% instances), Typo (83; 0% instances)

ADJ occurs with 52 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Imp,Perf, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Geo,Giv, NameType=Geo,Giv,Oth, NameType=Giv, NameType=Giv,Nat, NameType=Nat, NumType=Mult, NumType=Mult,Sets, NumType=Ord, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Polarity=Neg, Polarity=Pos, Poss=Yes, Style=Coll, Style=Expr, Style=Slng, Style=Vrnc, Style=Vulg, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Part, Voice=Act, Voice=Pass

ADJ occurs with 927 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Fem|Number=Sing|Polarity=Pos (21154 tokens). Examples: česká, velká, nová, celá, další, malá, americká, poslední, federální, stará

Relations

ADJ nodes are attached to their parents using 26 different relations: amod (298283; 83% instances), root (18817; 5% instances), conj (15262; 4% instances), ccomp (5738; 2% instances), acl:relcl (3517; 1% instances), advcl (3079; 1% instances), advcl:pred (2849; 1% instances), dep (2516; 1% instances), obj (1434; 0% instances), nmod (1373; 0% instances), acl (1323; 0% instances), obl:arg (1251; 0% instances), xcomp (976; 0% instances), obl (869; 0% instances), nsubj (786; 0% instances), appos (727; 0% instances), iobj (651; 0% instances), csubj (458; 0% instances), parataxis (343; 0% instances), orphan (250; 0% instances), flat (243; 0% instances), csubj:pass (130; 0% instances), advmod:emph (98; 0% instances), nsubj:pass (59; 0% instances), fixed (3; 0% instances), vocative (2; 0% instances)

Parents of ADJ nodes belong to 17 different parts of speech: NOUN (295024; 82% instances), VERB (20346; 6% instances), (18817; 5% instances), ADJ (12747; 4% instances), PROPN (6820; 2% instances), DET (2298; 1% instances), NUM (1621; 0% instances), PRON (1189; 0% instances), ADV (978; 0% instances), X (418; 0% instances), AUX (394; 0% instances), PART (199; 0% instances), SYM (140; 0% instances), CCONJ (33; 0% instances), ADP (6; 0% instances), INTJ (6; 0% instances), SCONJ (1; 0% instances)

265260 (73%) ADJ nodes are leaves.

43600 (12%) ADJ nodes have one child.

11142 (3%) ADJ nodes have two children.

41035 (11%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 13.

Children of ADJ nodes are attached using 38 different relations: punct (53048; 21% instances), advmod (26496; 10% instances), obl (25701; 10% instances), cop (23138; 9% instances), conj (15896; 6% instances), aux:pass (15422; 6% instances), nsubj (15155; 6% instances), obl:arg (13097; 5% instances), cc (12958; 5% instances), nsubj:pass (11787; 5% instances), advmod:emph (9488; 4% instances), mark (8919; 3% instances), advcl (4162; 2% instances), case (2868; 1% instances), aux (2583; 1% instances), csubj (2554; 1% instances), obj (2159; 1% instances), expl:pv (1506; 1% instances), xcomp (1312; 1% instances), dep (996; 0% instances), compound (959; 0% instances), advcl:pred (851; 0% instances), appos (802; 0% instances), amod (630; 0% instances), ccomp (547; 0% instances), parataxis (423; 0% instances), nmod (419; 0% instances), orphan (344; 0% instances), csubj:pass (251; 0% instances), nummod (162; 0% instances), acl:relcl (120; 0% instances), discourse (119; 0% instances), flat (63; 0% instances), det (49; 0% instances), expl:pass (15; 0% instances), vocative (10; 0% instances), det:nummod (2; 0% instances), iobj (1; 0% instances)

Children of ADJ nodes belong to 17 different parts of speech: NOUN (56291; 22% instances), PUNCT (53048; 21% instances), AUX (41384; 16% instances), ADV (29206; 11% instances), CCONJ (13473; 5% instances), ADJ (12747; 5% instances), VERB (10822; 4% instances), SCONJ (8729; 3% instances), DET (7993; 3% instances), PART (7297; 3% instances), PRON (4553; 2% instances), PROPN (3745; 1% instances), ADP (2862; 1% instances), NUM (2344; 1% instances), X (346; 0% instances), SYM (154; 0% instances), INTJ (18; 0% instances)