This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home pt/pos issue tracker

PRON: pronoun

Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.

Lemmatization rules = ?

Examples

clitic pronouns: se, me, te, lhe (including reflexive pronouns)

demonstrative pronouns: isto, esse, aquilo

personal pronouns: eu, tu, ele, vocês

indefinite pronouns: um, outro, qualquer

possessive pronouns: meu, seu, dele

interrogative pronouns: que, quanto, qual

relative pronouns: que, cujo, qual

totality pronouns: todo, todas

negative pronouns: nenhum, ninguém


Treebank Statistics (UD_Portuguese)

There are 66 PRON lemmas (0%), 136 PRON types (0%) and 6718 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.

The 10 most frequent PRON lemmas: que, se, ele, o, eu, isso, ela, quem, eles, tudo

The 10 most frequent PRON types: que, se, -se, ele, o, isso, quem, tudo, eles, eu

The 10 most frequent ambiguous lemmas: que (PRON 2331, SCONJ 1996, ADV 90, DET 20, ADP 3, X 1), se (PRON 1444, SCONJ 261, NOUN 21), o (DET 28065, PRON 220, ADP 3, NOUN 1), isso (PRON 171, NOUN 1), ela (PRON 170, NOUN 1), quem (PRON 142, ADV 1), tudo (PRON 114, DET 3), qual (PRON 94, DET 8, INTJ 1, ADV 1), outro (DET 266, PRON 85, ADJ 20), este (DET 597, PRON 83)

The 10 most frequent ambiguous types: que (PRON 2326, SCONJ 1988, ADV 90, DET 18, ADP 3, X 1), se (PRON 805, SCONJ 172, NOUN 21), o (DET 10652, PRON 222, NOUN 1), isso (PRON 158, NOUN 1), quem (PRON 110, ADV 1), tudo (PRON 95, DET 3), ela (PRON 79, NOUN 1), a (DET 9811, ADP 3672, PRON 91, PROPN 1, ADV 1), os (DET 3342, PRON 64), nada (PRON 58, NOUN 6, ADV 4)

Morphology

The form / lemma ratio of PRON is 2.060606 (the average of all parts of speech is 1.432674).

The 1st highest number of forms (13) was observed with the lemma “ele”: -lhe, -lo, -no, -o, Ihe, ela, elas, ele, eles, lhe, lhe-, lo, o.

The 2nd highest number of forms (9) was observed with the lemma “ela”: -a, -la, -las, -lhe, -na, a, ela, la-, lhe.

The 3rd highest number of forms (7) was observed with the lemma “elas”: -as, -las, -lhes, -nas, as, elas, lhes.

PRON occurs with 14 features: pt-feat/PronType (6503; 97% instances), pt-feat/Number (6339; 94% instances), pt-feat/Gender (5977; 89% instances), pt-feat/Person (2763; 41% instances), pt-feat/Case (2742; 41% instances), pt-feat/Reflex (799; 12% instances), pt-feat/NumType (463; 7% instances), pt-feat/PrepCase (206; 3% instances), pt-feat/Degree (123; 2% instances), pt-feat/Definite (28; 0% instances), pt-feat/Poss (23; 0% instances), pt-feat/Number[psor] (16; 0% instances), pt-feat/Hyph (13; 0% instances), pt-feat/Typo (1; 0% instances)

PRON occurs with 31 feature-value pairs: Case=Acc, Case=Acc,Dat, Case=Acc,Nom, Case=Dat, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Gender=Fem, Gender=Masc, Hyph=Yes, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PrepCase=Pre, PronType=Dem, PronType=Ind, PronType=Ind,Neg,Tot, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 176 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Rel (1071 tokens). Examples: que, quem, qual, quanto, Nada, cujo

Relations

PRON nodes are attached to their parents using 24 different relations: pt-dep/nsubj (2971; 44% instances), pt-dep/dobj (1964; 29% instances), pt-dep/nmod (576; 9% instances), pt-dep/iobj (386; 6% instances), pt-dep/advmod (193; 3% instances), pt-dep/nsubjpass (154; 2% instances), pt-dep/root (120; 2% instances), pt-dep/conj (103; 2% instances), pt-dep/compound (86; 1% instances), pt-dep/auxpass:reflex (27; 0% instances), pt-dep/dep (25; 0% instances), pt-dep/mwe (24; 0% instances), pt-dep/ccomp (18; 0% instances), pt-dep/xcomp (13; 0% instances), pt-dep/appos (11; 0% instances), pt-dep/advcl (10; 0% instances), pt-dep/parataxis (10; 0% instances), pt-dep/acl (9; 0% instances), pt-dep/cop (6; 0% instances), pt-dep/mark (4; 0% instances), pt-dep/case (3; 0% instances), pt-dep/aux (2; 0% instances), pt-dep/cc (2; 0% instances), pt-dep/csubj (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (5668; 84% instances), NOUN (361; 5% instances), ADJ (185; 3% instances), ROOT (120; 2% instances), ADV (87; 1% instances), DET (85; 1% instances), PRON (78; 1% instances), NUM (56; 1% instances), ADP (30; 0% instances), PROPN (27; 0% instances), SYM (15; 0% instances), SCONJ (4; 0% instances), CONJ (1; 0% instances), INTJ (1; 0% instances)

5352 (80%) PRON nodes are leaves.

828 (12%) PRON nodes have one child.

285 (4%) PRON nodes have two children.

253 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.

Children of PRON nodes are attached using 25 different relations: pt-dep/case (794; 32% instances), pt-dep/punct (316; 13% instances), pt-dep/nmod (294; 12% instances), pt-dep/acl (255; 10% instances), pt-dep/cop (183; 7% instances), pt-dep/det (135; 5% instances), pt-dep/nsubj (113; 5% instances), pt-dep/advmod (95; 4% instances), pt-dep/compound (81; 3% instances), pt-dep/cc (52; 2% instances), pt-dep/conj (51; 2% instances), pt-dep/mark (31; 1% instances), pt-dep/amod (25; 1% instances), pt-dep/advcl (24; 1% instances), pt-dep/neg (13; 1% instances), pt-dep/appos (10; 0% instances), pt-dep/dobj (10; 0% instances), pt-dep/dep (9; 0% instances), pt-dep/nummod (6; 0% instances), pt-dep/advmod:emph (3; 0% instances), pt-dep/ccomp (3; 0% instances), pt-dep/csubj (3; 0% instances), pt-dep/aux (1; 0% instances), pt-dep/parataxis (1; 0% instances), pt-dep/xcomp (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (777; 31% instances), VERB (450; 18% instances), NOUN (355; 14% instances), PUNCT (316; 13% instances), DET (192; 8% instances), ADV (137; 5% instances), PRON (78; 3% instances), PROPN (71; 3% instances), CONJ (49; 2% instances), ADJ (40; 2% instances), SCONJ (31; 1% instances), NUM (11; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)


Treebank Statistics (UD_Portuguese-Bosque)

There are 59 PRON lemmas (0%), 115 PRON types (0%) and 7058 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 8 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent PRON lemmas: que, se, ele, o, eu, ela, isso, quem, eles, tudo

The 10 most frequent PRON types: que, se, o, ele, isso, quem, lhe, tudo, eles, eu

The 10 most frequent ambiguous lemmas: que (PRON 2657, SCONJ 1615, ADV 88, NOUN 53, DET 21, ADP 9, PROPN 4, X 1), se (PRON 1431, SCONJ 278, ADP 2), o (DET 27984, PRON 325, PROPN 21, NOUN 4, ADP 3), ela (PRON 171, NOUN 1), isso (PRON 166, NOUN 1), quem (PRON 143, ADV 1), tudo (PRON 112, DET 3), outro (DET 279, PRON 97, NOUN 1), este (DET 574, PRON 88), qual (PRON 85, DET 19, ADV 1)

The 10 most frequent ambiguous types: que (PRON 2652, SCONJ 1607, ADV 88, NOUN 53, DET 18, ADP 9, PROPN 4, X 1, VERB 1), se (PRON 1403, SCONJ 188, ADP 2), o (DET 10520, PRON 344, PROPN 21, NOUN 4), isso (PRON 153, NOUN 1), quem (PRON 111, ADV 1), tudo (PRON 95, DET 3), ela (PRON 80, NOUN 1), me (PRON 88, PROPN 1, INTJ 1), a (DET 9579, ADP 4007, PRON 89, PROPN 27, NOUN 4, ADV 2), os (DET 3324, PRON 76, ADP 5, PROPN 4)

Morphology

The form / lemma ratio of PRON is 1.949153 (the average of all parts of speech is 1.449059).

The 1st highest number of forms (10) was observed with the lemma “ele”: Ihe, ela, elas, ele, eles, lhe, lhe-, lo, no, o.

The 2nd highest number of forms (7) was observed with the lemma “ela”: a, ela, la, la-, las, lhe, na.

The 3rd highest number of forms (6) was observed with the lemma “se”: s, se, se-, se-á, se-ão, si.

PRON occurs with 7 features: pt-feat/Gender (7034; 100% instances), pt-feat/Number (6738; 95% instances), pt-feat/PronType (6565; 93% instances), pt-feat/Case (2542; 36% instances), pt-feat/Person (2426; 34% instances), pt-feat/Definite (22; 0% instances), pt-feat/VerbForm (1; 0% instances)

PRON occurs with 20 feature-value pairs: Case=Acc, Case=Dat, Case=Nom, Definite=Def, Gender=Fem, Gender=Masc, Gender=Unsp, Number=Plur, Number=Sing, Number=Unsp, Person=1, Person=2, Person=3, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, VerbForm=Ger

PRON occurs with 107 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Rel (1301 tokens). Examples: que, quem, tudo, qual, quanto, Nada

Relations

PRON nodes are attached to their parents using 24 different relations: pt-dep/nsubj (3213; 46% instances), pt-dep/dobj (2044; 29% instances), pt-dep/nmod (831; 12% instances), pt-dep/iobj (231; 3% instances), pt-dep/mwe (198; 3% instances), pt-dep/root (125; 2% instances), pt-dep/det (114; 2% instances), pt-dep/conj (73; 1% instances), pt-dep/dep (52; 1% instances), pt-dep/mark (41; 1% instances), pt-dep/xcomp (30; 0% instances), pt-dep/vocative (18; 0% instances), pt-dep/appos (15; 0% instances), pt-dep/ccomp (15; 0% instances), pt-dep/acl:relcl (14; 0% instances), pt-dep/parataxis (14; 0% instances), pt-dep/nmod:npmod (12; 0% instances), pt-dep/advcl (5; 0% instances), pt-dep/neg (5; 0% instances), pt-dep/advmod (2; 0% instances), pt-dep/cop (2; 0% instances), pt-dep/dislocated (2; 0% instances), pt-dep/csubj (1; 0% instances), pt-dep/remnant (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (5860; 83% instances), NOUN (550; 8% instances), PRON (184; 3% instances), ADJ (174; 2% instances), ROOT (125; 2% instances), ADV (74; 1% instances), PROPN (35; 0% instances), NUM (33; 0% instances), DET (15; 0% instances), ADP (3; 0% instances), SYM (3; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

5268 (75%) PRON nodes are leaves.

958 (14%) PRON nodes have one child.

534 (8%) PRON nodes have two children.

298 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 12.

Children of PRON nodes are attached using 26 different relations: pt-dep/case (1064; 33% instances), pt-dep/det (408; 13% instances), pt-dep/punct (385; 12% instances), pt-dep/nmod (327; 10% instances), pt-dep/acl:relcl (220; 7% instances), pt-dep/cop (177; 5% instances), pt-dep/mwe (145; 4% instances), pt-dep/nsubj (139; 4% instances), pt-dep/advmod (103; 3% instances), pt-dep/cc (52; 2% instances), pt-dep/conj (43; 1% instances), pt-dep/acl (29; 1% instances), pt-dep/appos (28; 1% instances), pt-dep/advcl (23; 1% instances), pt-dep/mark (23; 1% instances), pt-dep/neg (19; 1% instances), pt-dep/aux (13; 0% instances), pt-dep/dep (11; 0% instances), pt-dep/nmod:npmod (11; 0% instances), pt-dep/amod (10; 0% instances), pt-dep/parataxis (10; 0% instances), pt-dep/xcomp (9; 0% instances), pt-dep/csubj (4; 0% instances), pt-dep/ccomp (3; 0% instances), pt-dep/dobj (2; 0% instances), pt-dep/remnant (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (1077; 33% instances), VERB (468; 14% instances), DET (442; 14% instances), PUNCT (385; 12% instances), NOUN (383; 12% instances), PRON (184; 6% instances), ADV (122; 4% instances), PROPN (68; 2% instances), CONJ (52; 2% instances), ADJ (36; 1% instances), SCONJ (18; 1% instances), AUX (13; 0% instances), NUM (11; 0% instances)


Treebank Statistics (UD_Portuguese-BR)

There are 1 PRON lemmas (7%), 132 PRON types (0%) and 7392 PRON tokens (2%). Out of 14 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: que, se, ele, isso, o, ela, um, eu, eles, quem

The 10 most frequent ambiguous lemmas: _ (NOUN 57316, ADP 51928, PUNCT 42033, PROPN 32948, VERB 29700, DET 26122, ADJ 15107, CONJ 10984, ADV 9773, NUM 8491, PRON 7392, AUX 5242, PART 748, X 539)

The 10 most frequent ambiguous types: que (PRON 2970, CONJ 2237, ADP 113, DET 7, NOUN 3, X 1), se (PRON 755, PART 392, CONJ 186, ADP 3, PROPN 1), o (DET 6544, PRON 226, ADP 1, PROPN 1, X 1), ela (PRON 135, NOUN 3), um (DET 1704, PRON 176, NUM 121, NOUN 1), você (PRON 87, PROPN 1), uma (DET 1631, NUM 89, PRON 87), qual (PRON 80, DET 9), me (PRON 71, NOUN 1, ADP 1), nós (PRON 39, NOUN 10)

Morphology

The form / lemma ratio of PRON is 132.000000 (the average of all parts of speech is 2514.000000).

The 1st highest number of forms (132) was observed with the lemma “_”: Agra, Almeida, Big, Como, Elano, Gu, Hiato, Lynn, Maxim, Merss, Mosquini, OQ, Odenville, PMs, Paraisópolis, Quantos, Sharapova, Tidico, Top, Vos, Xandele, a, algo, alguma, algumas, alguns, alguém, ambas, ambos, ao, aquela, aquelas, aquele, aqueles, aquilo, as, bastante, cada, de, demais, dessa, diferencial, duque, ela, elas, ele, eles, elle, essa, essas, esse, esses, esta, estas, este, estes, eu, hoc, isso, isto, la, las, latim, lhe, lhes, lo, los, me, mesma, mesmas, mesmo, mesmos, mim, minha, muitas, muito, muitos, nada, nenhum, nenhuma, ninguém, no, nos, nossa, nosso, nós, o, os, outos, outra, outras, outro, outros, poucas, pouco, poucos, próprio, quais, qual, qualquer, quanto, que, quebra, quem, quê, se, seu, si, sua, tais, tal, tanto, te, that, they, toda, todas, todo, todos, tudo, ue, um, uma, umas, uns, vive, você, vocês, várias, vários, vós, which.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 24 different relations: pt-dep/nsubj (3967; 54% instances), pt-dep/dobj (1323; 18% instances), pt-dep/nmod (826; 11% instances), pt-dep/nsubjpass (312; 4% instances), pt-dep/expl (270; 4% instances), pt-dep/root (166; 2% instances), pt-dep/appos (113; 2% instances), pt-dep/iobj (113; 2% instances), pt-dep/conj (82; 1% instances), pt-dep/ccomp (46; 1% instances), pt-dep/dep (46; 1% instances), pt-dep/mark (43; 1% instances), pt-dep/acl:relcl (21; 0% instances), pt-dep/det (19; 0% instances), pt-dep/advcl (12; 0% instances), pt-dep/parataxis (12; 0% instances), pt-dep/acl:part (7; 0% instances), pt-dep/det:poss (3; 0% instances), pt-dep/mwe (3; 0% instances), pt-dep/advmod (2; 0% instances), pt-dep/cc (2; 0% instances), pt-dep/csubj (2; 0% instances), pt-dep/name (1; 0% instances), pt-dep/xcomp (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (6289; 85% instances), NOUN (651; 9% instances), ROOT (166; 2% instances), PRON (106; 1% instances), ADJ (79; 1% instances), PROPN (60; 1% instances), ADV (16; 0% instances), NUM (10; 0% instances), PART (5; 0% instances), ADP (4; 0% instances), AUX (4; 0% instances), DET (1; 0% instances), X (1; 0% instances)

5589 (76%) PRON nodes are leaves.

971 (13%) PRON nodes have one child.

442 (6%) PRON nodes have two children.

390 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 36.

Children of PRON nodes are attached using 29 different relations: pt-dep/case (855; 24% instances), pt-dep/punct (667; 19% instances), pt-dep/nmod (527; 15% instances), pt-dep/det (260; 7% instances), pt-dep/cop (258; 7% instances), pt-dep/acl:relcl (248; 7% instances), pt-dep/nsubj (178; 5% instances), pt-dep/conj (83; 2% instances), pt-dep/advmod (70; 2% instances), pt-dep/cc (68; 2% instances), pt-dep/appos (54; 2% instances), pt-dep/amod (52; 1% instances), pt-dep/acl:part (43; 1% instances), pt-dep/csubj (37; 1% instances), pt-dep/mark (30; 1% instances), pt-dep/advcl (27; 1% instances), pt-dep/aux (11; 0% instances), pt-dep/neg (11; 0% instances), pt-dep/parataxis (10; 0% instances), pt-dep/acl:inf (8; 0% instances), pt-dep/ccomp (5; 0% instances), pt-dep/auxpass (4; 0% instances), pt-dep/dobj (4; 0% instances), pt-dep/expl (4; 0% instances), pt-dep/dep (3; 0% instances), pt-dep/mwe (2; 0% instances), pt-dep/nsubjpass (2; 0% instances), pt-dep/xcomp:adj (2; 0% instances), pt-dep/det:poss (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (853; 24% instances), PUNCT (667; 19% instances), VERB (649; 18% instances), NOUN (576; 16% instances), DET (247; 7% instances), PROPN (129; 4% instances), PRON (106; 3% instances), ADV (89; 3% instances), CONJ (87; 2% instances), ADJ (56; 2% instances), X (38; 1% instances), AUX (15; 0% instances), NUM (7; 0% instances), PART (5; 0% instances)


PRON in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]