
This page pertains to UD version 2.

Treebank Statistics: UD_Highland_Puebla_Nahuatl-ITML: POS Tags: PROPN

There are 102 PROPN lemmas (7%), 109 PROPN types (6%) and 201 PROPN tokens (2%). Out of 15 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent PROPN lemmas: Cuetzalan, Miguel, San, María, Tzinacapan, Centauri, Próxima, Osollo, Salazar, Tonalix

The 10 most frequent PROPN types: Kuesalan, Miguel, San, Tzinacapan, Próxima, centauri, Osollo, Salazar, Tonalix, Vazquez

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.068627 (the average of all parts of speech is 1.380113).
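The ratio above appears to be the number of distinct PROPN forms (types) divided by the number of distinct PROPN lemmas. That definition is an assumption, but the counts reported on this page (109 types, 102 lemmas) reproduce the figure. A minimal check in Python:

```python
# Counts taken from the PROPN summary on this page.
propn_types = 109   # distinct word forms tagged PROPN
propn_lemmas = 102  # distinct lemmas tagged PROPN

# Assumed definition: form/lemma ratio = distinct forms / distinct lemmas.
form_lemma_ratio = propn_types / propn_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 1.068627
```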

The highest number of forms (3) was observed with the lemma “María”: Maria, María, niMaria.

The 2nd highest number of forms (2) was observed with the lemma “Cuetzalan”: Kuesalan, Kwesalan.

The 3rd highest number of forms (2) was observed with the lemma “Ernesto”: Ernesto, niErnesto.

PROPN occurs with 4 features: Gender (59; 29% instances), Number[subj] (6; 3% instances), Person[subj] (6; 3% instances), Number (3; 1% instances)

PROPN occurs with 5 feature-value pairs: Gender=Fem, Gender=Masc, Number=Sing, Number[subj]=Sing, Person[subj]=1

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (142 tokens). Examples: Kuesalan, San, Tzinacapan, Próxima, centauri, Osollo, Salazar, Tonalix, Vazquez, Xaltipan

Relations

PROPN nodes are attached to their parents using 11 different relations: flat (80; 40% instances), appos (28; 14% instances), obl (26; 13% instances), nmod (19; 9% instances), obj (12; 6% instances), root (12; 6% instances), conj (11; 5% instances), parataxis (9; 4% instances), vocative (2; 1% instances), dislocated (1; 0% instances), nsubj (1; 0% instances)

Parents of PROPN nodes belong to 5 different parts of speech: PROPN (93; 46% instances), VERB (49; 24% instances), NOUN (25; 12% instances), ADV (22; 11% instances), no parent, i.e. the PROPN node is the root (12; 6% instances)

122 (61%) PROPN nodes are leaves.

26 (13%) PROPN nodes have one child.

26 (13%) PROPN nodes have two children.

27 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.
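The leaf / one child / two children / three or more breakdown above can be recomputed from a CoNLL-U file by counting how often each node ID appears in the HEAD column. The sketch below is one possible way to do this, not necessarily how the statistics on this page were generated. The function name `child_degree_buckets` and the three-token sample are hypothetical and serve only as an illustration; the sample is not taken from the treebank.

```python
from collections import Counter

def child_degree_buckets(conllu_text, upos="PROPN"):
    """Bucket nodes with the given UPOS by their number of children.

    Returns (buckets, max_degree). Hypothetical helper for illustration.
    """
    buckets = Counter()
    max_degree = 0
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if not cols[0].isdigit():
                continue  # skip multiword ranges (1-2) and empty nodes (1.1)
            rows.append(cols)
        # Column 7 (index 6) is HEAD: count the children of each node ID.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] != upos:  # column 4 (index 3) is UPOS
                continue
            degree = n_children[cols[0]]
            max_degree = max(max_degree, degree)
            key = {0: "leaf", 1: "one", 2: "two"}.get(degree, "three_or_more")
            buckets[key] += 1
    return buckets, max_degree

# Made-up sample sentence (not from the treebank): a flat three-part name.
sample = "\n".join("\t".join(row) for row in [
    ["1", "San", "San", "PROPN", "_", "_", "0", "root", "_", "_"],
    ["2", "Miguel", "Miguel", "PROPN", "_", "_", "1", "flat", "_", "_"],
    ["3", "Tzinacapan", "Tzinacapan", "PROPN", "_", "_", "1", "flat", "_", "_"],
])

buckets, max_degree = child_degree_buckets(sample)
print(dict(buckets), max_degree)
```

On the page's own figures, the four buckets sum to the PROPN token count (122 + 26 + 26 + 27 = 201), which is a quick consistency check for any recomputation.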

Children of PROPN nodes are attached using 17 different relations: flat (83; 45% instances), punct (35; 19% instances), case (17; 9% instances), conj (10; 5% instances), nsubj (8; 4% instances), appos (7; 4% instances), advmod (4; 2% instances), cc (4; 2% instances), det (4; 2% instances), discourse (2; 1% instances), nmod (2; 1% instances), obl (2; 1% instances), parataxis (2; 1% instances), acl:relcl (1; 1% instances), dep (1; 1% instances), dislocated (1; 1% instances), reparandum (1; 1% instances)

Children of PROPN nodes belong to 11 different parts of speech: PROPN (93; 51% instances), PUNCT (35; 19% instances), ADP (18; 10% instances), NOUN (14; 8% instances), ADV (5; 3% instances), X (5; 3% instances), CCONJ (4; 2% instances), DET (4; 2% instances), VERB (3; 2% instances), INTJ (2; 1% instances), PRON (1; 1% instances)