home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Guajajara-TuDeT: POS Tags: PROPN

There are 21 PROPN lemmas (3%), 27 PROPN types (2%) and 169 PROPN tokens (2%). Out of 15 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Zuze, Mari, Kari, Tupan, mair, Fábio, Pedro, Purutu, Siba, Zahɨ

The 10 most frequent PROPN types: Zuze, Mari, Kari, Maripe, Karipe, Mair, Tupan, Zuzepe, Fábio, Mairaʔi

The 10 most frequent ambiguous lemmas: maizu (NOUN 1, PROPN 1), zezu (NOUN 4, PROPN 2)

The 10 most frequent ambiguous types: Zahɨ (PROPN 2, NOUN 1), zezu (NOUN 2, PROPN 1), Tentehar (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.285714 (the average of all parts of speech is 1.933709).

The 1st highest number of forms (2) was observed with the lemma “Kari”: Kari, Karipe.

The 2nd highest number of forms (2) was observed with the lemma “Mair”: Mair, Mairaʔi.

The 3rd highest number of forms (2) was observed with the lemma “Mari”: Mari, Maripe.

PROPN occurs with 3 features: Case (15; 9% instances), Degree (2; 1% instances), Emph (2; 1% instances)

PROPN occurs with 3 feature-value pairs: Case=Dat, Degree=Dim, Emph=Yes

PROPN occurs with 4 feature combinations. The most frequent feature combination is _ (150 tokens). Examples: Zuze, Mari, Kari, Mair, Tupan, Fábio, Pedro, Purutu, Siba, Zahɨ

Relations

PROPN nodes are attached to their parents using 8 different relations: obl:subj (78; 46% instances), nmod (68; 40% instances), obl (14; 8% instances), root (3; 2% instances), appos (2; 1% instances), obj (2; 1% instances), flat (1; 1% instances), iobj (1; 1% instances)

Parents of PROPN nodes belong to 4 different parts of speech: NOUN (91; 54% instances), VERB (74; 44% instances), (3; 2% instances), PRON (1; 1% instances)

147 (87%) PROPN nodes are leaves.

14 (8%) PROPN nodes have one child.

4 (2%) PROPN nodes have two children.

4 (2%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 7 different relations: discourse (18; 46% instances), punct (11; 28% instances), obl (4; 10% instances), appos (2; 5% instances), obl:subj (2; 5% instances), case (1; 3% instances), ccomp (1; 3% instances)

Children of PROPN nodes belong to 6 different parts of speech: PRON (15; 38% instances), PUNCT (11; 28% instances), NOUN (8; 21% instances), PART (3; 8% instances), ADP (1; 3% instances), VERB (1; 3% instances)