home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: NUM

There are 309 NUM lemmas (4%), 337 NUM types (2%) and 1157 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: один, два, 3, три, 5, 20, 1, 15, 2, 30

The 10 most frequent NUM types: два, один, 3, три, 5, одну, 20, двох, 1, 15

The 10 most frequent ambiguous lemmas: один (NUM 146, DET 42, PRON 25), 3 (NUM 43, ADJ 14), 5 (NUM 26, ADJ 7), 20 (NUM 25, ADJ 2), 1 (ADJ 33, NUM 22), 15 (NUM 21, ADJ 7), 2 (NUM 20, ADJ 8), 30 (NUM 17, ADJ 6), 10 (NUM 16, ADJ 7), 14 (NUM 15, ADJ 1)

The 10 most frequent ambiguous types: один (NUM 48, PRON 12, DET 8), 3 (NUM 43, ADJ 14), 5 (NUM 26, ADJ 7), 20 (NUM 25, ADJ 2), 1 (ADJ 33, NUM 22), 15 (NUM 20, ADJ 7), 2 (NUM 20, ADJ 8), одна (NUM 15, DET 1), одне (NUM 18, NOUN 5, DET 2, PRON 1), 30 (NUM 17, ADJ 6)

Morphology

The form / lemma ratio of NUM is 1.090615 (the average of all parts of speech is 1.931827).

The 1st highest number of forms (12) was observed with the lemma “один”: один, одна, одне, одним, одно, одного, одному, одну, одні, одній, однією, однієї.

The 2nd highest number of forms (4) was observed with the lemma “два”: два, двома, двох, дві.

The 3rd highest number of forms (3) was observed with the lemma “17”: 17, 17-ти, 17-ть.

NUM occurs with 9 features: Case (1135; 98% instances), NumType (1127; 97% instances), Gender (288; 25% instances), Number (175; 15% instances), Animacy (26; 2% instances), BadStyle (9; 1% instances), Abbr (4; 0% instances), Uninflect (3; 0% instances), ExtPos (1; 0% instances)

NUM occurs with 18 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, BadStyle=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=DET, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Uninflect=Yes

NUM occurs with 56 feature combinations. The most frequent feature combination is Case=Nom|NumType=Card (531 tokens). Examples: 3, три, 20, 15, 0, 5, 14, 19, 30, 4

Relations

NUM nodes are attached to their parents using 25 different relations: nummod (404; 35% instances), nummod:gov (287; 25% instances), orphan (229; 20% instances), root (43; 4% instances), appos (39; 3% instances), conj (29; 3% instances), nmod (29; 3% instances), parataxis (18; 2% instances), nsubj (16; 1% instances), obj (14; 1% instances), amod (13; 1% instances), flat (8; 1% instances), obl (6; 1% instances), compound (5; 0% instances), list (4; 0% instances), acl (3; 0% instances), ccomp (2; 0% instances), advcl (1; 0% instances), det (1; 0% instances), fixed (1; 0% instances), flat:title (1; 0% instances), iobj (1; 0% instances), nsubj:pass (1; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (846; 73% instances), ADP (86; 7% instances), PROPN (56; 5% instances), VERB (52; 4% instances), (43; 4% instances), ADJ (33; 3% instances), NUM (33; 3% instances), PRON (5; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), X (1; 0% instances)

694 (60%) NUM nodes are leaves.

364 (31%) NUM nodes have one child.

52 (4%) NUM nodes have two children.

47 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 28 different relations: punct (358; 55% instances), advmod (44; 7% instances), nmod (39; 6% instances), case (30; 5% instances), conj (26; 4% instances), nsubj (24; 4% instances), amod (19; 3% instances), advmod:emph (16; 2% instances), parataxis (15; 2% instances), cc (13; 2% instances), orphan (10; 2% instances), mark (7; 1% instances), advmod:neg (6; 1% instances), cop (6; 1% instances), discourse (5; 1% instances), expl (4; 1% instances), list (4; 1% instances), fixed (3; 0% instances), nummod:gov (3; 0% instances), obl (3; 0% instances), acl:relcl (2; 0% instances), appos (2; 0% instances), compound (2; 0% instances), flat:range (2; 0% instances), nummod (2; 0% instances), vocative (2; 0% instances), flat (1; 0% instances), reparandum (1; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: PUNCT (358; 55% instances), NOUN (65; 10% instances), ADV (53; 8% instances), ADP (43; 7% instances), NUM (33; 5% instances), PART (29; 4% instances), PRON (18; 3% instances), CCONJ (14; 2% instances), VERB (12; 2% instances), SCONJ (7; 1% instances), AUX (6; 1% instances), ADJ (5; 1% instances), PROPN (5; 1% instances), DET (1; 0% instances)