home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Pashto-Sikaram: POS Tags: NUM

There are 16 NUM lemmas (1%), 21 NUM types (1%) and 69 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: یو, دوه, _, 0053, 1, 1032, 1044, 1075, 1100, 1106

The 10 most frequent NUM types: یوه, یو, یوې, دوو, 0053, 1, 1032, 1044, 1075, 1100

The 10 most frequent ambiguous lemmas: _ (NOUN 21, ADJ 14, VERB 9, X 8, PROPN 3, ADP 2, NUM 2, PART 1, PRON 1, SYM 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.312500 (the average of all parts of speech is 1.318390).

The 1st highest number of forms (3) was observed with the lemma “دوه”: دوه, دوو, دوې.

The 2nd highest number of forms (3) was observed with the lemma “یو”: یو, یوه, یوې.

The 3rd highest number of forms (2) was observed with the lemma “_”: 30, 40.

NUM occurs with 4 features: NumType (69; 100% instances), Case (56; 81% instances), Gender (52; 75% instances), Typo (2; 3% instances)

NUM occurs with 8 feature-value pairs: Case=Abl, Case=Acc, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, Typo=Yes

NUM occurs with 11 feature combinations. The most frequent feature combination is Case=Nom|Gender=Fem|NumType=Card (19 tokens). Examples: یوه, دوې

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (58; 84% instances), conj (4; 6% instances), parataxis (4; 6% instances), root (3; 4% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (53; 77% instances), PROPN (5; 7% instances), NUM (4; 6% instances), (3; 4% instances), ADJ (2; 3% instances), PRON (1; 1% instances), SYM (1; 1% instances)

57 (83%) NUM nodes are leaves.

4 (6%) NUM nodes have one child.

4 (6%) NUM nodes have two children.

4 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 4 different relations: punct (18; 64% instances), conj (5; 18% instances), nmod (4; 14% instances), parataxis (1; 4% instances)

Children of NUM nodes belong to 4 different parts of speech: PUNCT (18; 64% instances), ADJ (5; 18% instances), NUM (4; 14% instances), DET (1; 4% instances)