home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: NUM

There are 1 NUM lemmas (6%), 5 NUM types (1%) and 12 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 14 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: mhaj, gaːl, mhaja, -a, malia

The 10 most frequent ambiguous lemmas: _ (VERB 242, PUNCT 241, DET 176, NOUN 168, PRON 106, SCONJ 68, ADP 39, AUX 38, CCONJ 34, PART 28, ADV 18, ADJ 17, NUM 12, INTJ 8, X 7, PROPN 4)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 5.000000 (the average of all parts of speech is 26.312500).

The 1st highest number of forms (5) was observed with the lemma “_”: -a, gaːl, malia, mhaj, mhaja.

NUM occurs with 1 features: NumType (1; 8% instances)

NUM occurs with 1 feature-value pairs: NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (11 tokens). Examples: mhaj, gaːl, mhaja, malia

Relations

NUM nodes are attached to their parents using 6 different relations: nmod (4; 33% instances), nsubj (2; 17% instances), nummod (2; 17% instances), obj (2; 17% instances), nummod:det (1; 8% instances), obl:mod (1; 8% instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (7; 58% instances), VERB (4; 33% instances), PART (1; 8% instances)

5 (42%) NUM nodes are leaves.

5 (42%) NUM nodes have one child.

2 (17%) NUM nodes have two children.

The highest child degree of a NUM node is 2.

Children of NUM nodes are attached using 3 different relations: det (7; 78% instances), nmod (1; 11% instances), punct (1; 11% instances)

Children of NUM nodes belong to 3 different parts of speech: DET (7; 78% instances), ADJ (1; 11% instances), PUNCT (1; 11% instances)