
This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: NUM

There is 1 NUM lemma (6%), 11 NUM types (1%) and 26 NUM tokens (under 1%). Out of 16 observed tags, NUM ranks 9th in number of lemmas, 15th in number of types and 15th in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: mhaj, gaːl, gali, gaːt, mhaja, awwal, malia, mhali, mhall, mhallaː

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: malia (PART 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 11.000000 (the average of all parts of speech is 76.500000).
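As a rough illustration of how such a ratio can be obtained, the sketch below divides the number of distinct forms by the number of distinct lemmas for each tag. It assumes the input is a list of (form, lemma, UPOS) triples and that forms are compared exactly as written; the function name `form_lemma_ratio` is hypothetical, and the actual statistics script may normalise forms differently. The sample data is the list of NUM forms reported on this page, all sharing the placeholder lemma “_”.

```python
from collections import defaultdict


def form_lemma_ratio(tokens):
    """Return {upos: distinct forms / distinct lemmas} for (form, lemma, upos) triples."""
    forms = defaultdict(set)
    lemmas = defaultdict(set)
    for form, lemma, upos in tokens:
        forms[upos].add(form)
        lemmas[upos].add(lemma)
    return {upos: len(forms[upos]) / len(lemmas[upos]) for upos in forms}


# The 11 NUM forms listed on this page; every one carries the lemma "_".
num_forms = [
    "awwal", "gali", "gaːl", "gaːt", "malia", "mhaj",
    "mhaja", "mhali", "mhall", "mhallaː", "ʃeː",
]
tokens = [(form, "_", "NUM") for form in num_forms]

ratios = form_lemma_ratio(tokens)
print(f"NUM form/lemma ratio: {ratios['NUM']:.6f}")
```

With a single lemma and 11 forms, the ratio equals the number of forms.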

The highest number of forms (11) was observed with the lemma “_”: awwal, gali, gaːl, gaːt, malia, mhaj, mhaja, mhali, mhall, mhallaː, ʃeː.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 6 different relations: nmod (9; 35% instances), nummod (9; 35% instances), dep:comp (3; 12% instances), nsubj (2; 8% instances), obj (2; 8% instances), dislocated:obj (1; 4% instances)
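The percentages in this list appear to be each count's share of the 26 NUM tokens, rounded to a whole number. The sketch below reproduces them under that assumption; `percent_shares` is a hypothetical helper, not part of any UD tool.

```python
def percent_shares(counts):
    """Map each label to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Relation counts for NUM nodes as reported on this page.
relation_counts = {
    "nmod": 9,
    "nummod": 9,
    "dep:comp": 3,
    "nsubj": 2,
    "obj": 2,
    "dislocated:obj": 1,
}

shares = percent_shares(relation_counts)
for relation, count in relation_counts.items():
    print(f"{relation} ({count}; {shares[relation]}% instances)")
```

Because each share is rounded independently, the printed percentages need not add up to exactly 100.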

Parents of NUM nodes belong to 4 different parts of speech: NOUN (17; 65% instances), VERB (5; 19% instances), ADP (3; 12% instances), ADJ (1; 4% instances)

11 (42%) NUM nodes are leaves.

11 (42%) NUM nodes have one child.

3 (12%) NUM nodes have two children.

1 (4%) NUM node has three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 5 different relations: det (12; 60% instances), punct (5; 25% instances), cc (1; 5% instances), nmod (1; 5% instances), obl:mod (1; 5% instances)

Children of NUM nodes belong to 5 different parts of speech: DET (12; 60% instances), PUNCT (5; 25% instances), ADJ (1; 5% instances), CCONJ (1; 5% instances), PRON (1; 5% instances)
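The child-degree figures and the child counts above can be cross-checked against each other: the sum of the degrees of all NUM nodes should equal the number of child attachments. The sketch below does this arithmetic, assuming the one node with "three or more" children has exactly three, which follows from the reported highest child degree of 3. The variable names are illustrative only.

```python
# Number of NUM nodes per child degree, as reported on this page.
degree_histogram = {0: 11, 1: 11, 2: 3, 3: 1}

# Children of NUM nodes by relation, as reported on this page.
child_relation_counts = {"det": 12, "punct": 5, "cc": 1, "nmod": 1, "obl:mod": 1}

# Total NUM nodes: one per histogram entry.
num_nodes = sum(degree_histogram.values())

# Total child attachments: each node contributes its degree.
child_edges = sum(degree * nodes for degree, nodes in degree_histogram.items())

print(f"NUM nodes: {num_nodes}")
print(f"Child attachments from degrees: {child_edges}")
print(f"Child attachments from relations: {sum(child_relation_counts.values())}")
```

Both totals come to 20 child attachments over 26 NUM nodes, so the two sets of figures are consistent.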