home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: PRON

There are 32 PRON lemmas (7%), 49 PRON types (6%) and 174 PRON tokens (9%). Out of 16 observed tags, the rank of PRON is: 4 in number of lemmas, 5 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: u, i, e, _, a, imi, pagi, inoba, iogudy, kabo

The 10 most frequent PRON types: ure, ere, imi, are, ire, kaboba, imode, Inure, pagi, inoba

The 10 most frequent ambiguous lemmas: u (PRON 31, INTJ 2, VERB 2, NOUN 1, X 1), i (PRON 27, ADP 4, NOUN 2, X 2, ADV 1), _ (VERB 46, NOUN 41, ADV 24, PRON 18, PROPN 17, X 16, ADP 14, PUNCT 13, PART 3, DET 2), a (PRON 15, NOUN 2, X 2), ce (PRON 3, X 1), re (PART 31, PRON 3), kai (PRON 2, ADP 1), pa (PRON 2, X 1), ta (PRON 2, X 1), tu (VERB 9, PRON 2)

The 10 most frequent ambiguous types: ure (PART 32, PRON 24, X 3, NOUN 1, VERB 1), inoba (ADV 2, PRON 2), emode (PRON 3, X 3), amode (X 2, PRON 1), Birimodo (PROPN 2, PRON 1), eiamedu (NOUN 1, PRON 1), jamedy (ADV 7, NOUN 1, PRON 1, X 1), pui (ADP 1, PRON 1), uture (VERB 2, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.531250 (the average of all parts of speech is 1.661638).

The 1st highest number of forms (11) was observed with the lemma “_”: Birimodo, Ioguduba, Pare, Tui, are, eiamedu, ema, kaboba, pudumi, umode, ure.

The 2nd highest number of forms (6) was observed with the lemma “i”: Inure, ido, iia, ikadykigodykare, imode, ire.

The 3rd highest number of forms (6) was observed with the lemma “u”: udo, ukare, umode, umodykare, ure, uture.

PRON occurs with 17 features: PronType (111; 64% instances), Number (110; 63% instances), Person (110; 63% instances), Mood (97; 56% instances), Tense (18; 10% instances), Number[subj] (14; 8% instances), Person[subj] (14; 8% instances), Clusivity (13; 7% instances), Aspect (8; 5% instances), Reflex (8; 5% instances), Voice (5; 3% instances), Int (4; 2% instances), Polarity (4; 2% instances), Nomzr (2; 1% instances), Number[obj] (2; 1% instances), Person[obj] (2; 1% instances), Subord (1; 1% instances)

PRON occurs with 30 feature-value pairs: Aspect=Hab, Aspect=Prog, Clusivity=Ex, Clusivity=In, Int=Yes, Mood=Ind, Mood=Irr, Nomzr=Rel, Number=Plur, Number=Sing, Number[obj]=Plur, Number[subj]=Plur, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[obj]=1, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, PronType=Bi, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Tot, Reflex=Yes, Subord=Yes, Tense=Fut, Voice=Cau

PRON occurs with 67 feature combinations. The most frequent feature combination is PronType=Int (18 tokens). Examples: kaboba, inoba, Ioguduba, iogudyba, Kaiba, Kodiba

Relations

PRON nodes are attached to their parents using 12 different relations: nsubj (101; 58% instances), dep (31; 18% instances), obj (14; 8% instances), obl (10; 6% instances), nmod (6; 3% instances), root (6; 3% instances), advcl (1; 1% instances), ccomp (1; 1% instances), det (1; 1% instances), discourse (1; 1% instances), parataxis (1; 1% instances), xcomp (1; 1% instances)

Parents of PRON nodes belong to 7 different parts of speech: VERB (140; 80% instances), NOUN (20; 11% instances), (6; 3% instances), PRON (4; 2% instances), PART (2; 1% instances), ADV (1; 1% instances), X (1; 1% instances)

150 (86%) PRON nodes are leaves.

18 (10%) PRON nodes have one child.

4 (2%) PRON nodes have two children.

2 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 11 different relations: case (8; 22% instances), advmod (7; 19% instances), punct (5; 14% instances), dep (3; 8% instances), nsubj (3; 8% instances), obl (3; 8% instances), xcomp (3; 8% instances), advcl (2; 5% instances), compound (1; 3% instances), obj (1; 3% instances), parataxis (1; 3% instances)

Children of PRON nodes belong to 8 different parts of speech: ADP (10; 27% instances), ADV (6; 16% instances), PUNCT (5; 14% instances), NOUN (4; 11% instances), PRON (4; 11% instances), VERB (4; 11% instances), PART (2; 5% instances), X (2; 5% instances)