Treebank Statistics: UD_Western_Sierra_Puebla_Nahuatl-MesoTree: POS Tags: NUM
There are 42 NUM lemmas (1%), 48 NUM types (1%) and 208 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: se, ome, yeyi, mahtlactl, quince, caxtol, ce, mil, o, sempoual
The 10 most frequent NUM types: se, ome, yeyi, quince, omeh, caxtol, ocho, ce, mahtlactl, mil
The 10 most frequent ambiguous lemmas: se (DET 126, NUM 83, PRON 5, NOUN 1), ce (DET 18, NUM 4, PRON 1), o (AUX 31, CCONJ 31, ADV 4, NUM 4, SCONJ 3, INTJ 1), ocse (DET 5, NUM 1), semeh (PRON 3, NUM 1)
The 10 most frequent ambiguous types: se (DET 116, NUM 82, PRON 3, X 3, NOUN 1, VERB 1), ce (DET 16, NUM 4, PRON 1), millones (NOUN 1, NUM 1), ocse (DET 1, NUM 1), semeh (PRON 3, NUM 1)
- se
- ce
- millones
- ocse
- semeh
Morphology
The form / lemma ratio of NUM is 1.142857 (the average of all parts of speech is 1.597641).
The 1st highest number of forms (3) was observed with the lemma “mahtlactl”: mahtlactl, majtlactli, matlactli.
The 2nd highest number of forms (3) was observed with the lemma “ome”: ome, omeh, tiomen.
The 3rd highest number of forms (3) was observed with the lemma “yeyi”: yeyen, yeyi, yeyin.
NUM occurs with 2 features: Foreign (9; 4% instances), Number (4; 2% instances)
NUM occurs with 2 feature-value pairs: Foreign=Yes, Number=Plur
NUM occurs with 4 feature combinations.
The most frequent feature combination is _ (196 tokens).
Examples: se, ome, yeyi, omeh, caxtol, quince, ce, mahtlactl, mil, simpohual
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (162; 78% instances), nmod (12; 6% instances), nsubj (11; 5% instances), conj (9; 4% instances), compound (3; 1% instances), obl (3; 1% instances), root (3; 1% instances), acl (2; 1% instances), flat (2; 1% instances), obj (1; 0% instances)
Parents of NUM nodes belong to 5 different parts of speech: NOUN (171; 82% instances), NUM (21; 10% instances), VERB (12; 6% instances), (3; 1% instances), PRON (1; 0% instances)
150 (72%) NUM nodes are leaves.
35 (17%) NUM nodes have one child.
17 (8%) NUM nodes have two children.
6 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
Children of NUM nodes are attached using 15 different relations: det (18; 19% instances), nmod (11; 12% instances), advmod (10; 11% instances), conj (10; 11% instances), nummod (9; 10% instances), cc (8; 9% instances), punct (7; 7% instances), case (5; 5% instances), compound (4; 4% instances), aux (3; 3% instances), acl (2; 2% instances), cop (2; 2% instances), flat (2; 2% instances), nsubj (2; 2% instances), csubj (1; 1% instances)
Children of NUM nodes belong to 11 different parts of speech: NUM (21; 22% instances), DET (18; 19% instances), ADV (11; 12% instances), NOUN (11; 12% instances), CCONJ (8; 9% instances), PUNCT (7; 7% instances), AUX (5; 5% instances), VERB (5; 5% instances), ADP (4; 4% instances), PRON (3; 3% instances), PROPN (1; 1% instances)