Treebank Statistics: UD_Western_Sierra_Puebla_Nahuatl-ITML: POS Tags: NUM
There are 31 NUM
lemmas (2%), 33 NUM
types (1%) and 74 NUM
tokens (1%).
Out of 15 observed tags, the rank of NUM
is: 9 in number of lemmas, 10 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: ome, quince, yeyi, caxtol, ce, o, cuatro, ocho, tres, dieciocho
The 10 most frequent NUM
types: quince, ome, yeyi, caxtol, ocho, ce, cuatro, dieciocho, omi, tres
The 10 most frequent ambiguous lemmas: ce (DET 18, NUM 4, PRON 1), o (AUX 689, CCONJ 24, ADV 4, NUM 4, SCONJ 3, INTJ 1), se (DET 115, PRON 5, NUM 1), semeh (PRON 3, NUM 1)
The 10 most frequent ambiguous types: ce (DET 16, NUM 4, PRON 1), millones (NOUN 1, NUM 1), se (DET 106, PRON 3, NUM 1), semeh (PRON 3, NUM 1)
- ce
- millones
- se
- semeh
Morphology
The form / lemma ratio of NUM
is 1.064516 (the average of all parts of speech is 1.474576).
The 1st highest number of forms (3) was observed with the lemma “yeyi”: yeyen, yeyi, yeyin.
The 2nd highest number of forms (2) was observed with the lemma “o”: ocho, omi.
The 3rd highest number of forms (2) was observed with the lemma “ocho”: ocho, tiochoque.
NUM
occurs with 4 features: Foreign (9; 12% instances), Number (4; 5% instances), Number[subj] (1; 1% instances), Person[subj] (1; 1% instances)
NUM
occurs with 4 feature-value pairs: Foreign=Yes
, Number=Plur
, Number[subj]=Plur
, Person[subj]=1
NUM
occurs with 5 feature combinations.
The most frequent feature combination is _
(61 tokens).
Examples: ome, quince, yeyi, caxtol, ce, cuatro, ocho, omi, tres, dieciocho
Relations
NUM
nodes are attached to their parents using 9 different relations: nummod (39; 53% instances), nmod (11; 15% instances), conj (7; 9% instances), nsubj (5; 7% instances), obl (3; 4% instances), root (3; 4% instances), acl (2; 3% instances), compound (2; 3% instances), flat (2; 3% instances)
Parents of NUM
nodes belong to 5 different parts of speech: NOUN (47; 64% instances), NUM (15; 20% instances), VERB (8; 11% instances), (3; 4% instances), PRON (1; 1% instances)
36 (49%) NUM
nodes are leaves.
19 (26%) NUM
nodes have one child.
13 (18%) NUM
nodes have two children.
6 (8%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 15 different relations: det (14; 20% instances), conj (8; 11% instances), nmod (8; 11% instances), punct (7; 10% instances), nummod (6; 9% instances), case (5; 7% instances), cc (5; 7% instances), advmod (4; 6% instances), compound (3; 4% instances), aux (2; 3% instances), cop (2; 3% instances), flat (2; 3% instances), nsubj (2; 3% instances), acl (1; 1% instances), csubj (1; 1% instances)
Children of NUM
nodes belong to 11 different parts of speech: NUM (15; 21% instances), DET (14; 20% instances), NOUN (9; 13% instances), PUNCT (7; 10% instances), ADV (5; 7% instances), CCONJ (5; 7% instances), ADP (4; 6% instances), AUX (4; 6% instances), VERB (4; 6% instances), PRON (2; 3% instances), PROPN (1; 1% instances)