home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-IU: POS Tags: NUM

There are 303 NUM lemmas (2%), 354 NUM types (1%) and 1638 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 7 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: два, 1, 2, один, 5, три, 3, 7, 4, 10

The 10 most frequent NUM types: 1, 2, 5, 3, 7, 4, три, двох, 10, 6

The 10 most frequent ambiguous lemmas: 1 (NUM 100, ADJ 11, NOUN 1), 2 (NUM 87, ADJ 7), один (DET 221, NUM 83, ADJ 8), 5 (NUM 59, ADJ 5, NOUN 1), 3 (NUM 56, ADJ 11), 7 (NUM 48, ADJ 8), 4 (NUM 46, ADJ 5, NOUN 1), 10 (NUM 37, ADJ 5), 6 (NUM 37, ADJ 2), 8 (NUM 34, ADJ 6, NOUN 1)

The 10 most frequent ambiguous types: 1 (NUM 99, ADJ 11, NOUN 1), 2 (NUM 83, ADJ 7), 5 (NUM 58, ADJ 5, NOUN 1), 3 (NUM 55, ADJ 11), 7 (NUM 48, ADJ 8), 4 (NUM 46, ADJ 5, NOUN 1), 10 (NUM 37, ADJ 5), 6 (NUM 37, ADJ 2), 8 (NUM 34, ADJ 6, NOUN 1), один (DET 43, NUM 29, ADJ 1)

Morphology

The form / lemma ratio of NUM is 1.168317 (the average of all parts of speech is 1.738445).

The 1st highest number of forms (11) was observed with the lemma “один”: Одно, один, одна, одне, одним, одного, одному, одної, одну, одній, однієї.

The 2nd highest number of forms (4) was observed with the lemma “два”: два, двома, двох, дві.

The 3rd highest number of forms (4) was observed with the lemma “двоє”: Двом, двома, двох, двоє.

NUM occurs with 7 features: Case (1638; 100% instances), NumType (1638; 100% instances), Uninflect (1157; 71% instances), Gender (405; 25% instances), Number (59; 4% instances), Orth (24; 1% instances), Animacy (14; 1% instances)

NUM occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Orth=Alt, Uninflect=Yes

NUM occurs with 49 feature combinations. The most frequent feature combination is Case=Nom|NumType=Card|Uninflect=Yes (466 tokens). Examples: 3, 7, 5, 4, 6, 8, 10, 00, 15, 2017

Relations

NUM nodes are attached to their parents using 23 different relations: nummod (490; 30% instances), nummod:gov (431; 26% instances), compound (214; 13% instances), flat:title (188; 11% instances), root (97; 6% instances), flat:range (54; 3% instances), conj (43; 3% instances), nsubj (20; 1% instances), obl (20; 1% instances), nmod (16; 1% instances), obj (16; 1% instances), parataxis (13; 1% instances), appos (7; 0% instances), list (7; 0% instances), flat (5; 0% instances), discourse (4; 0% instances), orphan (4; 0% instances), advcl:sp (2; 0% instances), amod (2; 0% instances), flat:abs (2; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), fixed (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (1094; 67% instances), NUM (284; 17% instances), (97; 6% instances), VERB (60; 4% instances), PROPN (38; 2% instances), ADJ (34; 2% instances), X (24; 1% instances), PRON (4; 0% instances), DET (3; 0% instances)

845 (52%) NUM nodes are leaves.

662 (40%) NUM nodes have one child.

97 (6%) NUM nodes have two children.

34 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 11.

Children of NUM nodes are attached using 25 different relations: punct (481; 49% instances), compound (199; 20% instances), case (72; 7% instances), flat:range (51; 5% instances), conj (49; 5% instances), advmod (35; 4% instances), nmod (26; 3% instances), discourse (23; 2% instances), cc (16; 2% instances), appos (5; 1% instances), cop (3; 0% instances), det (3; 0% instances), nsubj (3; 0% instances), orphan (3; 0% instances), acl:relcl (2; 0% instances), csubj (2; 0% instances), flat:abs (2; 0% instances), flat:title (2; 0% instances), goeswith (2; 0% instances), obj (2; 0% instances), parataxis (2; 0% instances), amod (1; 0% instances), expl (1; 0% instances), list (1; 0% instances), mark (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: PUNCT (481; 49% instances), NUM (284; 29% instances), ADP (72; 7% instances), ADV (34; 3% instances), NOUN (32; 3% instances), PART (24; 2% instances), CCONJ (16; 2% instances), PROPN (9; 1% instances), SYM (7; 1% instances), VERB (7; 1% instances), DET (6; 1% instances), PRON (5; 1% instances), ADJ (4; 0% instances), AUX (3; 0% instances), SCONJ (2; 0% instances), X (1; 0% instances)