Treebank Statistics: UD_Highland_Puebla_Nahuatl-ITML: POS Tags: PROPN
There are 102 PROPN
lemmas (7%), 110 PROPN
types (6%) and 201 PROPN
tokens (2%).
Out of 15 observed tags, the rank of PROPN
is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent PROPN
lemmas: Cuetzalan, Miguel, San, María, Tzinacapan, Centauri, Próxima, Osollo, Salazar, Tonalix
The 10 most frequent PROPN
types: Kuesalan, Miguel, San, Tzinacapan, Próxima, centauri, Osollo, Salazar, Tonalix, Vazquez
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN
is 1.078431 (the average of all parts of speech is 1.381087).
The 1st highest number of forms (3) was observed with the lemma “María”: Maria, María, niMaria.
The 2nd highest number of forms (2) was observed with the lemma “Cuetzalan”: Kuesalan, Kwesalan.
The 3rd highest number of forms (2) was observed with the lemma “Ernesto”: Ernesto, niErnesto.
PROPN
occurs with 4 features: Gender (59; 29% instances), Number[subj] (6; 3% instances), Person[subj] (6; 3% instances), Number (3; 1% instances)
PROPN
occurs with 5 feature-value pairs: Gender=Fem
, Gender=Masc
, Number=Sing
, Number[subj]=Sing
, Person[subj]=1
PROPN
occurs with 7 feature combinations.
The most frequent feature combination is _
(142 tokens).
Examples: Kuesalan, San, Tzinacapan, Próxima, centauri, Osollo, Salazar, Tonalix, Vazquez, Xaltipan
Relations
PROPN
nodes are attached to their parents using 11 different relations: flat (80; 40% instances), obl (46; 23% instances), nmod (19; 9% instances), obj (12; 6% instances), root (12; 6% instances), conj (11; 5% instances), parataxis (9; 4% instances), appos (8; 4% instances), vocative (2; 1% instances), dislocated (1; 0% instances), nsubj (1; 0% instances)
Parents of PROPN
nodes belong to 5 different parts of speech: PROPN (93; 46% instances), VERB (49; 24% instances), NOUN (25; 12% instances), ADV (22; 11% instances), (12; 6% instances)
122 (61%) PROPN
nodes are leaves.
26 (13%) PROPN
nodes have one child.
26 (13%) PROPN
nodes have two children.
27 (13%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 10.
Children of PROPN
nodes are attached using 17 different relations: flat (83; 45% instances), punct (35; 19% instances), case (17; 9% instances), conj (10; 5% instances), nsubj (8; 4% instances), appos (7; 4% instances), advmod (4; 2% instances), cc (4; 2% instances), det (4; 2% instances), discourse (2; 1% instances), nmod (2; 1% instances), obl (2; 1% instances), parataxis (2; 1% instances), acl:relcl (1; 1% instances), dep (1; 1% instances), dislocated (1; 1% instances), reparandum (1; 1% instances)
Children of PROPN
nodes belong to 11 different parts of speech: PROPN (93; 51% instances), PUNCT (35; 19% instances), ADP (18; 10% instances), NOUN (14; 8% instances), ADV (5; 3% instances), X (5; 3% instances), CCONJ (4; 2% instances), DET (4; 2% instances), VERB (3; 2% instances), INTJ (2; 1% instances), PRON (1; 1% instances)