Treebank Statistics: UD_Tupinamba-TuDeT: POS Tags: NUM
There are 8 NUM
lemmas (1%), 9 NUM
types (0%) and 17 NUM
tokens (0%).
Out of 14 observed tags, the rank of NUM
is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent NUM
lemmas: mokõj, mosapɨr, _, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk
The 10 most frequent NUM
types: mokõj, mosapɨr, Quatro, cento, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk
The 10 most frequent ambiguous lemmas: _ (NOUN 87, VERB 34, PUNCT 12, ADP 9, PRON 9, PROPN 9, PART 6, ADV 5, NUM 2, DET 1, X 1), ojepe (NOUN 1, NUM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 1.125000 (the average of all parts of speech is 1.577170).
The 1st highest number of forms (2) was observed with the lemma “_”: Quatro, cento.
The 2nd highest number of forms (1) was observed with the lemma “mokõj”: mokõj.
The 3rd highest number of forms (1) was observed with the lemma “mosapɨ”: mosapɨ.
NUM
occurs with 1 features: NumType (13; 76% instances)
NUM
occurs with 1 feature-value pairs: NumType=Card
NUM
occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card
(13 tokens).
Examples: mokõj, mosapɨr, mosapɨ, mosapɨrɨ, ojepe
Relations
NUM
nodes are attached to their parents using 5 different relations: nummod (12; 71% instances), root (2; 12% instances), appos (1; 6% instances), nmod (1; 6% instances), nsubj (1; 6% instances)
Parents of NUM
nodes belong to 5 different parts of speech: NOUN (10; 59% instances), NUM (3; 18% instances), (2; 12% instances), PROPN (1; 6% instances), VERB (1; 6% instances)
13 (76%) NUM
nodes are leaves.
0 (0%) NUM
nodes have one child.
1 (6%) NUM
nodes have two children.
3 (18%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 7 different relations: advmod (4; 29% instances), punct (4; 29% instances), nmod (2; 14% instances), appos (1; 7% instances), dep (1; 7% instances), nummod (1; 7% instances), obl (1; 7% instances)
Children of NUM
nodes belong to 5 different parts of speech: ADV (4; 29% instances), PUNCT (4; 29% instances), NUM (3; 21% instances), NOUN (2; 14% instances), PART (1; 7% instances)