Treebank Statistics: UD_Beja-Autogramm: POS Tags: NUM
There are 1 NUM lemmas (6%), 15 NUM types (1%) and 59 NUM tokens (0%).
Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 14 in number of types and 16 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: mhaj, gaːl, mhall, alif, asarama, gali, gaːt, mhali, awwal, faɖig
The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1736, NOUN 1719, PRON 819, ADP 766, SCONJ 594, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)
The 10 most frequent ambiguous types: ʔawwal (ADV 1, NUM 1)
- ʔawwal
Morphology
The form / lemma ratio of NUM is 15.000000 (the average of all parts of speech is 126.875000).
The 1st highest number of forms (15) was observed with the lemma “_”: alif, asarama, awwal, faɖig, gali, gaːl, gaːlnaːj, gaːt, mhaj, mhali, mhall, mhallaː, mhaloː, ʃeː, ʔawwal.
NUM occurs with 2 features: Gender (1; 2% instances), Number (1; 2% instances)
NUM occurs with 2 feature-value pairs: Gender=Fem, Number=Plur
NUM occurs with 3 feature combinations.
The most frequent feature combination is _ (57 tokens).
Examples: mhaj, gaːl, mhall, alif, asarama, mhali, gali, gaːt, awwal, faɖig
Relations
NUM nodes are attached to their parents using 11 different relations: nummod (29; 49% instances), nmod (12; 20% instances), dep:comp (6; 10% instances), dislocated:mod (3; 5% instances), nsubj (2; 3% instances), obj (2; 3% instances), dislocated:obj (1; 2% instances), dislocated:subj (1; 2% instances), obl:arg (1; 2% instances), obl:mod (1; 2% instances), root (1; 2% instances)
Parents of NUM nodes belong to 6 different parts of speech: NOUN (42; 71% instances), VERB (8; 14% instances), ADP (6; 10% instances), ADJ (1; 2% instances), AUX (1; 2% instances), (1; 2% instances)
21 (36%) NUM nodes are leaves.
23 (39%) NUM nodes have one child.
11 (19%) NUM nodes have two children.
4 (7%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
Children of NUM nodes are attached using 10 different relations: det (27; 44% instances), punct (14; 23% instances), dep (7; 11% instances), nmod (4; 7% instances), cop (2; 3% instances), nmod:poss (2; 3% instances), nsubj (2; 3% instances), acl:relcl (1; 2% instances), cc (1; 2% instances), dep:flat (1; 2% instances)
Children of NUM nodes belong to 8 different parts of speech: DET (27; 44% instances), PUNCT (14; 23% instances), PRON (7; 11% instances), ADJ (5; 8% instances), SCONJ (3; 5% instances), AUX (2; 3% instances), NOUN (2; 3% instances), CCONJ (1; 2% instances)