Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: NUM
There are 6 NUM lemmas (2%), 6 NUM types (1%) and 6 NUM tokens (1%).
Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 11 in number of tokens.
The 10 most frequent NUM lemmas: දෙක, දෙදෙනා, පස්, හත්සීයක්, හැට, හැටපස්
The 10 most frequent NUM types: දෙක, දෙන්නෙක්, පස්, හත්සීයකට, හැට, හැටපස්දෙනෙක්
The 10 most frequent ambiguous lemmas: (none)
The 10 most frequent ambiguous types: (none)
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.100000).
The 1st highest number of forms (1) was observed with the lemma “දෙක”: දෙක.
The 2nd highest number of forms (1) was observed with the lemma “දෙදෙනා”: දෙන්නෙක්.
The 3rd highest number of forms (1) was observed with the lemma “පස්”: පස්.
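The form / lemma ratio above can be reproduced with a short sketch. It assumes the ratio is the number of distinct word forms per lemma, averaged over all NUM lemmas; the exact definition used by the statistics generator is not stated here. The first three form-lemma pairings come from the lines above; the remaining three are assumed from the parallel ordering of the lemma and type lists. The names `num_tokens`, `forms_per_lemma` and `form_lemma_ratio` are hypothetical.

```python
from collections import defaultdict

# (form, lemma) pairs for the NUM tokens. The first three pairings are stated
# in the text; the last three are ASSUMED from the order of the two lists.
num_tokens = [
    ("දෙක", "දෙක"),
    ("දෙන්නෙක්", "දෙදෙනා"),
    ("පස්", "පස්"),
    ("හත්සීයකට", "හත්සීයක්"),
    ("හැට", "හැට"),
    ("හැටපස්දෙනෙක්", "හැටපස්"),
]


def forms_per_lemma(tokens):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in tokens:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(tokens):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_per_lemma(tokens)
    total_forms = sum(len(form_set) for form_set in forms.values())
    return total_forms / len(forms)


print(f"{form_lemma_ratio(num_tokens):.6f}")  # prints 1.000000
```

Every NUM lemma appears with exactly one form, which is why the ratio is exactly 1 and every entry in the ranking above reports a single form.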
NUM occurs with 1 feature: NumType (5; 83% instances)
NUM occurs with 1 feature-value pair: NumType=Card
NUM occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card (5 tokens).
Examples: දෙක, පස්, හත්සීයකට, හැට, හැටපස්දෙනෙක්
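The feature counts above can be checked with a minimal sketch. It assumes that the sixth NUM token carries no features at all, and that the empty feature set counts as a combination of its own; both are inferred from the figures (5 of 6 tokens, 2 combinations) rather than stated. The `"_"` placeholder follows the CoNLL-U convention for an empty FEATS column, and `feature_stats` is a hypothetical helper.

```python
from collections import Counter

# FEATS values of the 6 NUM tokens: 5 carry NumType=Card; 1 is ASSUMED to be
# empty ("_" is the CoNLL-U placeholder for an empty FEATS column).
feats = ["NumType=Card"] * 5 + ["_"]


def feature_stats(feats_column):
    """Count whole feature combinations and individual feature-value pairs."""
    combos = Counter(feats_column)
    pairs = Counter()
    for feats_str in feats_column:
        if feats_str == "_":
            continue  # no features on this token
        for pair in feats_str.split("|"):
            pairs[pair] += 1
    return combos, pairs


combos, pairs = feature_stats(feats)
share = round(100 * pairs["NumType=Card"] / len(feats))
print(len(combos), len(pairs), share)  # prints 2 1 83
```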
Relations
NUM nodes are attached to their parents using 2 different relations: nummod (5; 83% instances), obl (1; 17% instances)
Parents of NUM nodes belong to 2 different parts of speech: NOUN (5; 83% instances), VERB (1; 17% instances)
5 (83%) NUM nodes are leaves.
0 (0%) NUM nodes have one child.
1 (17%) NUM nodes have two children.
The highest child degree of a NUM node is 2.
Children of NUM nodes are attached using 2 different relations: advmod (1; 50% instances), nmod (1; 50% instances)
Children of NUM nodes belong to 2 different parts of speech: ADV (1; 50% instances), NOUN (1; 50% instances)
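The percentages in this section are plain shares of the 6 NUM nodes, or of the 2 children of NUM nodes, rounded to whole numbers. The sketch below rebuilds them from the counts given above; the lists are reconstructed from those counts, not read from the treebank, and `percentages` is a hypothetical helper.

```python
from collections import Counter

# Reconstructed from the counts in the text, not read from the treebank.
parent_deprels = ["nummod"] * 5 + ["obl"]  # relation of each NUM node to its parent
child_counts = [0] * 5 + [2]               # number of children of each NUM node
child_deprels = ["advmod", "nmod"]         # relations of the children of NUM nodes


def percentages(values):
    """Return {value: (count, rounded percent of all values)}."""
    counts = Counter(values)
    total = len(values)
    return {key: (n, round(100 * n / total)) for key, n in counts.items()}


print(percentages(parent_deprels))  # {'nummod': (5, 83), 'obl': (1, 17)}
print(percentages(child_counts))    # {0: (5, 83), 2: (1, 17)}
print(percentages(child_deprels))   # {'advmod': (1, 50), 'nmod': (1, 50)}
print(max(child_counts))            # 2, the highest child degree
```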