Treebank Statistics: UD_Veps-VWT: POS Tags: NUM
There are 9 NUM lemmas (2%), 10 NUM types (2%) and 11 NUM tokens (1%).
Out of 13 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 12 in number of tokens.
The 9 NUM lemmas, most frequent first: 40, kaksʼ, 15, 2017, 23., kahesa, koume, üks, üksʼ
The 10 most frequent NUM types: 40, 15, 2017, 23., kahesa, kaht, kaksʼ, koume, ühtes, üksʼ
The 10 most frequent ambiguous lemmas: (none listed)
The 10 most frequent ambiguous types: (none listed)
Morphology
The form / lemma ratio of NUM is 1.111111 (the average of all parts of speech is 1.550649).
The highest number of forms (2) was observed with the lemma “kaksʼ”: kaht, kaksʼ.
The 2nd highest number of forms (1) was observed with the lemma “15”: 15.
The 3rd highest number of forms (1) was observed with the lemma “2017”: 2017.
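The form / lemma ratio reported above is the number of distinct NUM word forms (types) divided by the number of distinct NUM lemmas. A minimal Python check, using only the counts given in this section (10 types, 9 lemmas); the variable names are illustrative and not part of any UD tooling:

```python
# Counts taken from the NUM statistics above.
num_types = 10   # distinct NUM word forms (types)
num_lemmas = 9   # distinct NUM lemmas

# Form / lemma ratio, rounded to six decimals as in the report.
ratio = round(num_types / num_lemmas, 6)
print(ratio)  # 1.111111
```

The ratio is close to 1 because only one lemma, “kaksʼ”, is attested with more than one form.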
NUM occurs with 3 features: Case (11; 100% of instances), NumForm (11; 100% of instances), NumType (11; 100% of instances)
NUM occurs with 8 feature-value pairs: Case=Ade, Case=Ine, Case=Nom, Case=Par, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Ord
NUM occurs with 5 feature combinations.
The most frequent feature combination is Case=Nom|NumForm=Word|NumType=Card (4 tokens).
Examples: kahesa, kaksʼ, koume, üksʼ
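A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as in the FEATS column of a CoNLL-U file. The sketch below splits the most frequent combination into its pairs and checks each against the 8 feature-value pairs listed for NUM. The helper name `parse_feats` is hypothetical, not a function of any UD library:

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a FEATS string like 'Case=Nom|NumForm=Word' into a dict.

    An empty string or '_' (the CoNLL-U placeholder) yields an empty dict.
    """
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The 8 feature-value pairs reported for NUM in this section.
observed_pairs = {
    "Case=Ade", "Case=Ine", "Case=Nom", "Case=Par",
    "NumForm=Digit", "NumForm=Word",
    "NumType=Card", "NumType=Ord",
}

# The most frequent feature combination reported for NUM.
most_frequent = "Case=Nom|NumForm=Word|NumType=Card"
feats = parse_feats(most_frequent)
print(feats)

# Every pair in the combination should be one of the observed pairs.
all_known = all(f"{name}={value}" in observed_pairs for name, value in feats.items())
print(all_known)
```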
Relations
NUM nodes are attached to their parents using 1 relation: nummod (11; 100% of instances)
Parents of NUM nodes belong to 2 different parts of speech: NOUN (10; 91% of instances), PRON (1; 9% of instances)
8 (73%) NUM nodes are leaves.
3 (27%) NUM nodes have one child.
The highest child degree of a NUM node is 1.
Children of NUM nodes are attached using 1 relation: advmod (3; 100% of instances)
Children of NUM nodes belong to 1 part of speech: ADV (3; 100% of instances)
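The percentages in the Relations section are counts over the 11 NUM tokens, shown as whole percent. The sketch below reproduces them from the reported counts. Rounding to the nearest integer is an assumption about how the report was generated, and the helper `pct` is hypothetical:

```python
def pct(count: int, total: int) -> int:
    """Whole-number percentage of count out of total (assumed rounding rule)."""
    return round(100 * count / total)


total_num = 11  # NUM tokens in the treebank, as reported above

# Counts copied from the Relations section.
parents = {"NOUN": 10, "PRON": 1}
child_degree = {"leaves": 8, "one child": 3}

percentages = {
    label: pct(count, total_num)
    for label, count in {**parents, **child_degree}.items()
}
for label, value in percentages.items():
    print(f"{label}: {value}%")
```

Each group of counts sums to 11, so the parent and child-degree breakdowns each cover every NUM token.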