Treebank Statistics: UD_Beja-NSC: POS Tags: NUM
There are 1 NUM
lemmas (6%), 5 NUM
types (1%) and 12 NUM
tokens (1%).
Out of 16 observed tags, the rank of NUM
is: 9 in number of lemmas, 14 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: _
The 10 most frequent NUM
types: mhaj, gaːl, mhaja, -a, malia
The 10 most frequent ambiguous lemmas: _ (VERB 242, PUNCT 241, DET 176, NOUN 168, PRON 106, SCONJ 68, ADP 39, AUX 38, CCONJ 34, PART 28, ADV 18, ADJ 17, NUM 12, INTJ 8, X 7, PROPN 4)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 5.000000 (the average of all parts of speech is 26.312500).
The 1st highest number of forms (5) was observed with the lemma “_”: -a, gaːl, malia, mhaj, mhaja.
NUM
occurs with 1 features: NumType (1; 8% instances)
NUM
occurs with 1 feature-value pairs: NumType=Ord
NUM
occurs with 2 feature combinations.
The most frequent feature combination is _
(11 tokens).
Examples: mhaj, gaːl, mhaja, malia
Relations
NUM
nodes are attached to their parents using 6 different relations: nmod (4; 33% instances), nsubj (2; 17% instances), nummod (2; 17% instances), obj (2; 17% instances), nummod:det (1; 8% instances), obl:mod (1; 8% instances)
Parents of NUM
nodes belong to 3 different parts of speech: NOUN (7; 58% instances), VERB (4; 33% instances), PART (1; 8% instances)
5 (42%) NUM
nodes are leaves.
5 (42%) NUM
nodes have one child.
2 (17%) NUM
nodes have two children.
The highest child degree of a NUM
node is 2.
Children of NUM
nodes are attached using 3 different relations: det (7; 78% instances), nmod (1; 11% instances), punct (1; 11% instances)
Children of NUM
nodes belong to 3 different parts of speech: DET (7; 78% instances), ADJ (1; 11% instances), PUNCT (1; 11% instances)