Treebank Statistics: UD_Georgian-GLC: POS Tags: NUM
There are 11 NUM
lemmas (1%), 15 NUM
types (1%) and 21 NUM
tokens (1%).
Out of 13 observed tags, the rank of NUM
is: 9 in number of lemmas, 9 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: ერთი, ორი, ცოტა, 100.000, 1907, 2003, 2004-2005, 30, 363, სამი
The 10 most frequent NUM
types: ერთი, მეორე, ცოტა, 100.000, 1907, 2003, 2004-2005, 30, 363, ერთ
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 1.363636 (the average of all parts of speech is 1.235874).
The 1st highest number of forms (3) was observed with the lemma “ერთი”: ერთ, ერთი, პირველივე.
The 2nd highest number of forms (3) was observed with the lemma “ორი”: მეორე, ორ, ორი.
The 3rd highest number of forms (1) was observed with the lemma “100.000”: 100.000.
NUM
occurs with 5 features: NumType (21; 100% instances), Case (15; 71% instances), Number (15; 71% instances), NumForm (6; 29% instances), PartType (1; 5% instances)
NUM
occurs with 9 feature-value pairs: Case=Dat
, Case=Gen
, Case=Ins
, Case=Nom
, NumForm=Digit
, NumType=Card
, NumType=Ord
, Number=Sing
, PartType=Emp
NUM
occurs with 8 feature combinations.
The most frequent feature combination is NumForm=Digit|NumType=Card
(6 tokens).
Examples: 100.000, 1907, 2003, 2004-2005, 30, 363
Relations
NUM
nodes are attached to their parents using 2 different relations: nummod (20; 95% instances), nmod (1; 5% instances)
Parents of NUM
nodes belong to 2 different parts of speech: NOUN (19; 90% instances), VERB (2; 10% instances)
20 (95%) NUM
nodes are leaves.
1 (5%) NUM
nodes have one child.
The highest child degree of a NUM
node is 1.
Children of NUM
nodes are attached using 1 different relations: advmod (1; 100% instances)
Children of NUM
nodes belong to 1 different parts of speech: ADV (1; 100% instances)