home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: PRON

There are 52 PRON lemmas (4%), 106 PRON types (5%) and 599 PRON tokens (9%). Out of 16 observed tags, the rank of PRON is: 6 in number of lemmas, 6 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: u, e, i, a, _, re, imi, pa, ta, ty

The 10 most frequent PRON types: ure, ere, imi, awo, ire, imode, are, ukare, inure, pamode

The 10 most frequent ambiguous lemmas: u (PRON 164, INTJ 3, VERB 3, X 2, NOUN 1), i (PRON 62, ADP 10, NOUN 7, X 2, ADV 1), a (PRON 59, X 3, NOUN 2), _ (NOUN 201, VERB 142, ADV 84, PUNCT 64, X 56, ADP 44, PRON 42, PROPN 36, DET 10, PART 6, SCONJ 6, CCONJ 2, ADJ 1), re (PRON 33, VERB 2, NOUN 1, PART 1, X 1), pa (PRON 16, NOUN 1, VERB 1, X 1), ta (PRON 14, VERB 3, X 3, NOUN 2), ty (VERB 66, ADV 22, PRON 8, NOUN 2, X 1), ce (PRON 6, NOUN 4, ADP 3, X 1), inoba (PRON 5, ADV 4)

The 10 most frequent ambiguous types: ure (PRON 128, X 3, NOUN 1, PART 1, VERB 1), are (PRON 13, NOUN 1), ukare (PRON 10, VERB 1), pamode (PRON 3, X 1), emode (PRON 8, X 3), amode (PRON 3, X 2), ema (PRON 5, NOUN 1), inoba (ADV 3, PRON 3), bure (PRON 4, NOUN 2), ikare (PRON 1, X 1)

Morphology

The form / lemma ratio of PRON is 2.038462 (the average of all parts of speech is 1.661916).

The 1st highest number of forms (25) was observed with the lemma “_”: Birimodo, Imode, Ioguduba, Pare, are, eiamedu, eiamedy, ema, ere, ewo, imi, imie, imireo, ire, jiboe, kaboba, pudui, pudumi, pui, tagireo, tui, tuie, tuwo, umode, ure.

The 2nd highest number of forms (20) was observed with the lemma “u”: Eerydyre, awure, boecodure, bure, dure, ekure, inodure, kaworure, kudugodure, kujagure, padure, re, udo, ukare, umode, umodykare, unure, ure, utyre, uwo.

The 3rd highest number of forms (9) was observed with the lemma “i”: ido, iia, ikadykigodykare, ikare, imode, imodykare, inure, ire, iwo.

PRON occurs with 20 features: Number (489; 82% instances), Person (484; 81% instances), Mood (423; 71% instances), PronType (165; 28% instances), Tense (63; 11% instances), Clusivity (33; 6% instances), Polarity (22; 4% instances), Aspect (18; 3% instances), Reflex (18; 3% instances), Number[subj] (14; 2% instances), Person[subj] (14; 2% instances), Voice (13; 2% instances), Int (4; 1% instances), Nomzr (3; 1% instances), Number[obj] (2; 0% instances), Person[obj] (2; 0% instances), Definite (1; 0% instances), Speech (1; 0% instances), Subord (1; 0% instances), VerbForm (1; 0% instances)

PRON occurs with 40 feature-value pairs: Aspect=Inc, Aspect=IncProg, Aspect=Prog, Clusivity=Ex, Clusivity=In, Definite=Ind, Int=Yes, Mood=Imp, Mood=Ind, Mood=Irr, Mood=Sub, Nomzr=Rel, Number=Plur, Number=Sing, Number[obj]=Plur, Number[subj]=Plur, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[obj]=1, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, PronType=Art, PronType=Bi, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Speech=Ind, Subord=Yes, Tense=Fut, VerbForm=Ger, Voice=Cau

PRON occurs with 121 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3 (138 tokens). Examples: ure, bure, kudugodure, padure, Eerydyre, awure, boecodure, dure, ekure, kaworure

Relations

PRON nodes are attached to their parents using 16 different relations: nsubj (402; 67% instances), dep (91; 15% instances), obj (21; 4% instances), obl (17; 3% instances), nmod (15; 3% instances), conj (12; 2% instances), parataxis (11; 2% instances), root (10; 2% instances), discourse (5; 1% instances), ccomp (4; 1% instances), xcomp (3; 1% instances), advcl (2; 0% instances), det (2; 0% instances), dislocated (2; 0% instances), appos (1; 0% instances), compound (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: VERB (474; 79% instances), NOUN (70; 12% instances), PRON (19; 3% instances), ADV (12; 2% instances), (10; 2% instances), ADP (6; 1% instances), X (4; 1% instances), PART (2; 0% instances), PROPN (2; 0% instances)

502 (84%) PRON nodes are leaves.

66 (11%) PRON nodes have one child.

13 (2%) PRON nodes have two children.

18 (3%) PRON nodes have three or more children.

The highest child degree of a PRON node is 14.

Children of PRON nodes are attached using 19 different relations: dep (46; 26% instances), punct (30; 17% instances), conj (20; 11% instances), advmod (19; 11% instances), case (13; 7% instances), obl (12; 7% instances), nsubj (9; 5% instances), parataxis (7; 4% instances), advcl (5; 3% instances), xcomp (5; 3% instances), det (2; 1% instances), discourse (2; 1% instances), mark (2; 1% instances), obj (2; 1% instances), appos (1; 1% instances), compound (1; 1% instances), dislocated (1; 1% instances), nmod (1; 1% instances), vocative (1; 1% instances)

Children of PRON nodes belong to 13 different parts of speech: NOUN (47; 26% instances), PUNCT (30; 17% instances), ADV (23; 13% instances), VERB (21; 12% instances), PRON (19; 11% instances), ADP (15; 8% instances), PROPN (7; 4% instances), X (7; 4% instances), DET (3; 2% instances), INTJ (2; 1% instances), PART (2; 1% instances), SCONJ (2; 1% instances), NUM (1; 1% instances)