Treebank Statistics: UD_Tupinamba-TuDeT: POS Tags: NUM
There are 8 NUM lemmas (1%), 9 NUM types (0%) and 17 NUM tokens (0%).
Out of 14 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent NUM lemmas: mokõj, mosapɨr, _, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk
The 10 most frequent NUM types: mokõj, mosapɨr, Quatro, cento, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk
The 10 most frequent ambiguous lemmas: _ (NOUN 87, VERB 34, PUNCT 12, ADP 9, PRON 9, PROPN 9, PART 6, ADV 5, NUM 2, DET 1, X 1), ojepe (NOUN 1, NUM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 1.125000 (the average of all parts of speech is 1.577170).
The 1st highest number of forms (2) was observed with the lemma “_”: Quatro, cento.
The 2nd highest number of forms (1) was observed with the lemma “mokõj”: mokõj.
The 3rd highest number of forms (1) was observed with the lemma “mosapɨ”: mosapɨ.
NUM occurs with 1 features: NumType (13; 76% instances)
NUM occurs with 1 feature-value pairs: NumType=Card
NUM occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card (13 tokens).
Examples: mokõj, mosapɨr, mosapɨ, mosapɨrɨ, ojepe
Relations
NUM nodes are attached to their parents using 5 different relations: nummod (12; 71% instances), root (2; 12% instances), appos (1; 6% instances), nmod (1; 6% instances), nsubj (1; 6% instances)
Parents of NUM nodes belong to 5 different parts of speech: NOUN (10; 59% instances), NUM (3; 18% instances), (2; 12% instances), PROPN (1; 6% instances), VERB (1; 6% instances)
13 (76%) NUM nodes are leaves.
0 (0%) NUM nodes have one child.
1 (6%) NUM nodes have two children.
3 (18%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
Children of NUM nodes are attached using 7 different relations: advmod (4; 29% instances), punct (4; 29% instances), nmod (2; 14% instances), appos (1; 7% instances), dep (1; 7% instances), nummod (1; 7% instances), obl (1; 7% instances)
Children of NUM nodes belong to 5 different parts of speech: ADV (4; 29% instances), PUNCT (4; 29% instances), NUM (3; 21% instances), NOUN (2; 14% instances), PART (1; 7% instances)