home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: PRON

There are 3 PRON lemmas (0%), 130 PRON types (0%) and 7368 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 13 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: _, eu, ele

The 10 most frequent PRON types: que, se, ele, isso, o, ela, um, eu, eles, quem

The 10 most frequent ambiguous lemmas: _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1)

The 10 most frequent ambiguous types: que (PRON 2962, CCONJ 2230, ADP 115, DET 7, NOUN 3, SCONJ 2, X 1), se (PRON 748, PART 390, CCONJ 187, ADP 3), o (DET 16553, PRON 226, ADP 1, PROPN 1, X 1), ela (PRON 135, NOUN 3), um (DET 1701, PRON 175, NUM 120, NOUN 1), você (PRON 81, PROPN 1), uma (DET 1630, NUM 89, PRON 87), qual (PRON 80, DET 9), me (PRON 71, ADP 1, NOUN 1), nós (PRON 39, NOUN 10)

Morphology

The form / lemma ratio of PRON is 43.333333 (the average of all parts of speech is 3.372737).

The 1st highest number of forms (130) was observed with the lemma “_”: Agra, Almeida, Big, Elano, Gu, Hiato, Lynn, Maxim, Merss, Mosquini, OQ, Odenville, PMs, Paraisópolis, Quantos, Sharapova, Tidico, Top, Vos, Xandele, a, algo, alguma, algumas, alguns, alguém, ambas, ambos, aquela, aquelas, aquele, aqueles, aquilo, as, bastante, cada, de, dele, demais, diferencial, duque, ela, elas, ele, eles, elle, essa, essas, esse, esses, esta, estas, este, estes, eu, hoc, isso, isto, la, las, latim, lhe, lhes, lo, los, me, mesma, mesmas, mesmo, mesmos, mim, minha, muitas, muito, muitos, nada, nenhum, nenhuma, ninguém, no, nos, nossa, nosso, nós, o, os, outos, outra, outras, outro, outros, poucas, pouco, poucos, próprio, quais, qual, qualquer, quanto, que, quebra, quem, quê, se, seu, si, sua, tais, tal, tanto, te, that, they, toda, todas, todo, todos, tudo, ue, um, uma, umas, uns, vive, você, vocês, várias, vários, vós, which.

The 2nd highest number of forms (1) was observed with the lemma “ele”: Eles.

The 3rd highest number of forms (1) was observed with the lemma “eu”: mim.

PRON occurs with 3 features: Number (1; 0% instances), Person (1; 0% instances), PronType (1; 0% instances)

PRON occurs with 3 feature-value pairs: Number=Sing, Person=1, PronType=Prs

PRON occurs with 2 feature combinations. The most frequent feature combination is _ (7367 tokens). Examples: que, se, ele, isso, o, ela, um, eu, eles, quem

Relations

PRON nodes are attached to their parents using 26 different relations: nsubj (3956; 54% instances), obj (1317; 18% instances), nmod (822; 11% instances), nsubj:pass (311; 4% instances), expl:pv (269; 4% instances), root (165; 2% instances), iobj (116; 2% instances), appos (113; 2% instances), conj (81; 1% instances), ccomp (46; 1% instances), dep (46; 1% instances), mark (38; 1% instances), acl:relcl (21; 0% instances), det (19; 0% instances), advcl (12; 0% instances), parataxis (12; 0% instances), acl (7; 0% instances), det:poss (5; 0% instances), fixed (3; 0% instances), cc (2; 0% instances), csubj (2; 0% instances), advmod (1; 0% instances), expl (1; 0% instances), flat (1; 0% instances), obl (1; 0% instances), xcomp (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (6271; 85% instances), NOUN (647; 9% instances), (165; 2% instances), PRON (104; 1% instances), ADJ (79; 1% instances), PROPN (60; 1% instances), ADV (16; 0% instances), NUM (10; 0% instances), PART (5; 0% instances), ADP (4; 0% instances), AUX (3; 0% instances), SYM (2; 0% instances), DET (1; 0% instances), X (1; 0% instances)

5554 (75%) PRON nodes are leaves.

869 (12%) PRON nodes have one child.

530 (7%) PRON nodes have two children.

415 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 20.

Children of PRON nodes are attached using 28 different relations: case (854; 24% instances), punct (658; 18% instances), nmod (528; 15% instances), det (365; 10% instances), cop (257; 7% instances), acl:relcl (248; 7% instances), nsubj (177; 5% instances), conj (82; 2% instances), advmod (81; 2% instances), cc (80; 2% instances), appos (54; 1% instances), amod (52; 1% instances), acl (42; 1% instances), csubj (37; 1% instances), advcl (27; 1% instances), mark (27; 1% instances), parataxis (10; 0% instances), aux (9; 0% instances), acl:inf (8; 0% instances), ccomp (5; 0% instances), aux:pass (4; 0% instances), expl:pv (4; 0% instances), dep (3; 0% instances), obj (3; 0% instances), fixed (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), det:poss (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (852; 24% instances), PUNCT (658; 18% instances), NOUN (569; 16% instances), VERB (392; 11% instances), DET (352; 10% instances), AUX (267; 7% instances), PROPN (129; 4% instances), PRON (104; 3% instances), CCONJ (98; 3% instances), ADV (89; 2% instances), ADJ (56; 2% instances), X (38; 1% instances), NUM (7; 0% instances), SYM (6; 0% instances), PART (5; 0% instances)