Treebank Statistics: UD_Guajajara-TuDeT: POS Tags: PROPN
There are 21 PROPN
lemmas (3%), 27 PROPN
types (2%) and 169 PROPN
tokens (2%).
Out of 15 observed tags, the rank of PROPN
is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
The 10 most frequent PROPN
lemmas: Zuze, Mari, Kari, Tupan, mair, Fábio, Pedro, Purutu, Siba, Zahɨ
The 10 most frequent PROPN
types: Zuze, Mari, Kari, Maripe, Karipe, Mair, Tupan, Zuzepe, Fábio, Mairaʔi
The 10 most frequent ambiguous lemmas: maizu (NOUN 1, PROPN 1), zezu (NOUN 4, PROPN 2)
The 10 most frequent ambiguous types: Zahɨ (PROPN 2, NOUN 1), zezu (NOUN 2, PROPN 1), Tentehar (NOUN 1, PROPN 1)
- Zahɨ
- zezu
- Tentehar
Morphology
The form / lemma ratio of PROPN
is 1.285714 (the average of all parts of speech is 1.933709).
The 1st highest number of forms (2) was observed with the lemma “Kari”: Kari, Karipe.
The 2nd highest number of forms (2) was observed with the lemma “Mair”: Mair, Mairaʔi.
The 3rd highest number of forms (2) was observed with the lemma “Mari”: Mari, Maripe.
PROPN
occurs with 3 features: Case (15; 9% instances), Degree (2; 1% instances), Emph (2; 1% instances)
PROPN
occurs with 3 feature-value pairs: Case=Dat
, Degree=Dim
, Emph=Yes
PROPN
occurs with 4 feature combinations.
The most frequent feature combination is _
(150 tokens).
Examples: Zuze, Mari, Kari, Mair, Tupan, Fábio, Pedro, Purutu, Siba, Zahɨ
Relations
PROPN
nodes are attached to their parents using 8 different relations: obl:subj (78; 46% instances), nmod (68; 40% instances), obl (14; 8% instances), root (3; 2% instances), appos (2; 1% instances), obj (2; 1% instances), flat (1; 1% instances), iobj (1; 1% instances)
Parents of PROPN
nodes belong to 4 different parts of speech: NOUN (91; 54% instances), VERB (74; 44% instances), (3; 2% instances), PRON (1; 1% instances)
147 (87%) PROPN
nodes are leaves.
14 (8%) PROPN
nodes have one child.
4 (2%) PROPN
nodes have two children.
4 (2%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 5.
Children of PROPN
nodes are attached using 7 different relations: discourse (18; 46% instances), punct (11; 28% instances), obl (4; 10% instances), appos (2; 5% instances), obl:subj (2; 5% instances), case (1; 3% instances), ccomp (1; 3% instances)
Children of PROPN
nodes belong to 6 different parts of speech: PRON (15; 38% instances), PUNCT (11; 28% instances), NOUN (8; 21% instances), PART (3; 8% instances), ADP (1; 3% instances), VERB (1; 3% instances)