This page pertains to UD version 2.

Treebank Statistics: UD_Georgian-GNC: POS Tags: NUM

There are 49 NUM lemmas (1%), 77 NUM types (1%) and 249 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 12 in number of tokens.
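Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes a type is a distinct word form exactly as written (no case folding), and both the function name `upos_counts` and the two-sentence sample are hypothetical, not taken from UD_Georgian-GNC.

```python
# Hand-made CoNLL-U sample (hypothetical; columns are tab-separated).
SAMPLE = (
    "1\tერთი\tერთი\tNUM\t_\tCase=Nom|NumType=Card\t2\tnummod\t_\t_\n"
    "2\tკაცი\tკაცი\tNOUN\t_\tCase=Nom\t0\troot\t_\t_\n"
    "\n"
    "1\tერთ\tერთი\tNUM\t_\tCase=Dat|NumType=Card\t2\tnummod\t_\t_\n"
    "2\tკაცს\tკაცი\tNOUN\t_\tCase=Dat\t0\troot\t_\t_\n"
)


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank separator lines and sentence-level comments.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


NUM_COUNTS = upos_counts(SAMPLE, "NUM")
```

On the sample, `NUM_COUNTS` holds one lemma, two types and two tokens, because the two forms ერთი and ერთ share the lemma ერთი.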

The 10 most frequent NUM lemmas: ერთი, მეორე, ორი, სამი, პირველი, ოცი, ხუთი, ოთხი, ათასი, მესამე

The 10 most frequent NUM types: ერთი, ერთ, მეორე, ორი, სამი, ორივე, ოცი, პირველი, პირველ, ხუთი

The most frequent ambiguous lemmas (only 2 observed): ერთი (NUM 73, INTJ 1), ნახევარი (NUM 3, ADJ 2)

The most frequent ambiguous types (only 1 observed): ერთი (NUM 41, INTJ 1)

Morphology

The form / lemma ratio of NUM is 1.571429 (the average of all parts of speech is 1.616911).
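The reported NUM ratio coincides with the number of NUM types divided by the number of NUM lemmas given earlier on this page (77 and 49). The sketch below checks that arithmetic. Whether the statistic is defined exactly this way is an assumption, and `form_lemma_ratio` is a hypothetical helper name.

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# 77 NUM types and 49 NUM lemmas, as reported on this page.
NUM_RATIO = form_lemma_ratio(77, 49)
NUM_RATIO_TEXT = f"{NUM_RATIO:.6f}"  # six decimals, as on this page
```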

The 1st highest number of forms (6) was observed with the lemma “ერთი”: ერთ, ერთი, ერთის, ერთიც, ერთმა, ერთს.

The 2nd highest number of forms (5) was observed with the lemma “სამი”: სამ, სამა, სამად, სამი, სამს.

The 3rd highest number of forms (4) was observed with the lemma “ორი”: ორ, ორთა, ორი, ორივე.

NUM occurs with 3 features: NumType (249; 100% instances), Case (241; 97% instances), Encl (12; 5% instances)

NUM occurs with 11 feature-value pairs: Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Encl=C, Encl=Ve, NumType=Card, NumType=Frac, NumType=Ord

NUM occurs with 19 feature combinations. The most frequent feature combination is Case=Nom|NumType=Card (70 tokens). Examples: ერთი, ორი, სამი, ათიოდე, ასი, თორმეტი, ოთხი, ორიოდე, ოცი, რვა
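Feature statistics like these can be derived from the FEATS column of CoNLL-U. The following is a minimal sketch under stated assumptions: the input list is a hypothetical sample of FEATS strings, not data from this treebank, and `feature_stats` is an illustrative name.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count feature names, feature-value pairs and full combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        if feats == "_":  # "_" marks a token without features
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


# Hypothetical FEATS values of four NUM tokens.
FEATS = [
    "Case=Nom|NumType=Card",
    "Case=Nom|NumType=Card",
    "Case=Dat|NumType=Card",
    "NumType=Ord",
]
NAMES, PAIRS, COMBOS = feature_stats(FEATS)
```

Here `NAMES` counts how many tokens carry each feature, `len(PAIRS)` gives the number of distinct feature-value pairs, and `COMBOS.most_common(1)` yields the most frequent combination.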

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (206; 83% instances), nsubj (9; 4% instances), conj (8; 3% instances), obl (6; 2% instances), obj (5; 2% instances), iobj (4; 2% instances), orphan (3; 1% instances), amod (2; 1% instances), appos (2; 1% instances), nmod (1; 0% instances), parataxis (1; 0% instances), root (1; 0% instances), xcomp (1; 0% instances)
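As a consistency check, the relation counts above should add up to the 249 NUM tokens, and each percentage should follow from its count. The sketch below copies the counts from this page and assumes the percentages are rounded to the nearest integer. The names are illustrative.

```python
# Counts copied from the relation list above.
RELATION_COUNTS = {
    "nummod": 206, "nsubj": 9, "conj": 8, "obl": 6, "obj": 5, "iobj": 4,
    "orphan": 3, "amod": 2, "appos": 2, "nmod": 1, "parataxis": 1,
    "root": 1, "xcomp": 1,
}


def percentages(counts):
    """Share of each key in percent, rounded to the nearest integer."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


TOTAL = sum(RELATION_COUNTS.values())
SHARES = percentages(RELATION_COUNTS)
```

Relations with a single instance come out as 0% because 1 / 249 is about 0.4%.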

Parents of NUM nodes belong to 9 different parts of speech: NOUN (205; 82% instances), VERB (24; 10% instances), NUM (8; 3% instances), ADJ (6; 2% instances), ADV (2; 1% instances), DET (1; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances), no part of speech, i.e. the artificial root node (1; 0% instances)

212 (85%) NUM nodes are leaves.

25 (10%) NUM nodes have one child.

9 (4%) NUM nodes have two children.

3 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.
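The child degree of a node is the number of tokens whose HEAD points to it. The following is a minimal sketch of how such degrees could be computed for one sentence. The row format `(id, upos, head)` and the five-token sentence are hypothetical simplifications of CoNLL-U, not data from this treebank.

```python
from collections import Counter


def child_degrees(sentence_rows, upos):
    """Return the number of children of every node with the given UPOS.

    sentence_rows: list of (id, upos, head) tuples for one sentence,
    where head 0 denotes the artificial root.
    """
    n_children = Counter(head for _id, _upos, head in sentence_rows)
    return [
        n_children[node_id]
        for node_id, node_upos, _head in sentence_rows
        if node_upos == upos
    ]


# Hypothetical sentence: token 2 (NUM) has one child, token 4 (NUM) is a leaf.
ROWS = [
    (1, "ADV", 2),
    (2, "NUM", 3),
    (3, "NOUN", 0),
    (4, "NUM", 3),
    (5, "PUNCT", 3),
]
DEGREES = child_degrees(ROWS, "NUM")
DEGREE_HISTOGRAM = Counter(DEGREES)
```

Aggregating `DEGREE_HISTOGRAM` over all sentences would give a leaf / one child / two children breakdown of the kind shown above.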

Children of NUM nodes are attached using 14 different relations: punct (14; 27% instances), advmod (10; 19% instances), case (8; 15% instances), cc (3; 6% instances), conj (3; 6% instances), obl (3; 6% instances), orphan (3; 6% instances), cop (2; 4% instances), advmod:neg (1; 2% instances), amod (1; 2% instances), det (1; 2% instances), nmod (1; 2% instances), nsubj (1; 2% instances), nummod (1; 2% instances)

Children of NUM nodes belong to 11 different parts of speech: PUNCT (14; 27% instances), ADV (11; 21% instances), ADP (8; 15% instances), NUM (8; 15% instances), CCONJ (3; 6% instances), AUX (2; 4% instances), NOUN (2; 4% instances), ADJ (1; 2% instances), DET (1; 2% instances), PRON (1; 2% instances), PROPN (1; 2% instances)
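The figures in this section can be cross-checked against each other. Since the highest child degree is 3, the nodes with "three or more" children have exactly three, so the total number of children follows from the degree breakdown and should equal the totals of both child lists. The sketch below uses only numbers copied from this page. The variable names are illustrative.

```python
# Child-degree breakdown of NUM nodes: degree -> number of nodes.
DEGREE_NODES = {0: 212, 1: 25, 2: 9, 3: 3}

# Children of NUM nodes by relation and by part of speech.
CHILD_RELATIONS = {
    "punct": 14, "advmod": 10, "case": 8, "cc": 3, "conj": 3, "obl": 3,
    "orphan": 3, "cop": 2, "advmod:neg": 1, "amod": 1, "det": 1,
    "nmod": 1, "nsubj": 1, "nummod": 1,
}
CHILD_UPOS = {
    "PUNCT": 14, "ADV": 11, "ADP": 8, "NUM": 8, "CCONJ": 3, "AUX": 2,
    "NOUN": 2, "ADJ": 1, "DET": 1, "PRON": 1, "PROPN": 1,
}

NUM_NODES = sum(DEGREE_NODES.values())
NUM_CHILDREN = sum(degree * n for degree, n in DEGREE_NODES.items())
```

All three totals agree: 249 NUM nodes have 52 children in total.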