Treebank Statistics: UD_Pashto-Sikaram: POS Tags: NUM
There are 16 NUM lemmas (1%), 21 NUM types (1%) and 69 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: یو, دوه, _, 0053, 1, 1032, 1044, 1075, 1100, 1106
The 10 most frequent NUM types: یوه, یو, یوې, دوو, 0053, 1, 1032, 1044, 1075, 1100
The 10 most frequent ambiguous lemmas: _ (NOUN 21, ADJ 14, VERB 9, X 8, PROPN 3, ADP 2, NUM 2, PART 1, PRON 1, SYM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 1.312500 (the average of all parts of speech is 1.318390).
The 1st highest number of forms (3) was observed with the lemma “دوه”: دوه, دوو, دوې.
The 2nd highest number of forms (3) was observed with the lemma “یو”: یو, یوه, یوې.
The 3rd highest number of forms (2) was observed with the lemma “_”: 30, 40.
NUM occurs with 4 features: NumType (69; 100% instances), Case (56; 81% instances), Gender (52; 75% instances), Typo (2; 3% instances)
NUM occurs with 8 feature-value pairs: Case=Abl, Case=Acc, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, Typo=Yes
NUM occurs with 11 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Fem|NumType=Card (19 tokens).
Examples: یوه, دوې
Relations
NUM nodes are attached to their parents using 4 different relations: nummod (58; 84% instances), conj (4; 6% instances), parataxis (4; 6% instances), root (3; 4% instances)
Parents of NUM nodes belong to 7 different parts of speech: NOUN (53; 77% instances), PROPN (5; 7% instances), NUM (4; 6% instances), (3; 4% instances), ADJ (2; 3% instances), PRON (1; 1% instances), SYM (1; 1% instances)
57 (83%) NUM nodes are leaves.
4 (6%) NUM nodes have one child.
4 (6%) NUM nodes have two children.
4 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 4.
Children of NUM nodes are attached using 4 different relations: punct (18; 64% instances), conj (5; 18% instances), nmod (4; 14% instances), parataxis (1; 4% instances)
Children of NUM nodes belong to 4 different parts of speech: PUNCT (18; 64% instances), ADJ (5; 18% instances), NUM (4; 14% instances), DET (1; 4% instances)