home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: PROPN

There are 3968 PROPN lemmas (28%), 3968 PROPN types (19%) and 10905 PROPN tokens (6%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent PROPN types: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent ambiguous lemmas: the (PROPN 4, X 1), in (PROPN 18, X 3), a (ADP 2153, PROPN 3, NOUN 2), PM (PROPN 6, NOUN 1), and (PROPN 4, X 1), to (PROPN 4, X 1), 5 (NUM 25, PROPN 3), 55 (NUM 5, PROPN 2), se (PRON 731, SCONJ 275, PROPN 1), 157 (NUM 6, PROPN 1)

The 10 most frequent ambiguous types: São (PROPN 135, AUX 35, VERB 4), Justiça (PROPN 48, NOUN 1), Polícia (PROPN 30, NOUN 2), Copa (PROPN 27, NOUN 4), the (PROPN 4, X 1), Norte (PROPN 22, NOUN 1), Brasileiro (PROPN 20, NOUN 1), in (PROPN 18, X 3), União (PROPN 19, NOUN 2), Exército (PROPN 17, NOUN 2)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.491519).

The 1st highest number of forms (1) was observed with the lemma “#Tamojunto”: #Tamojunto.

The 2nd highest number of forms (1) was observed with the lemma “1-Azul”: 1-Azul.

The 3rd highest number of forms (1) was observed with the lemma “157”: 157.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 22 different relations: flat:name (3355; 31% instances), nmod (3039; 28% instances), nsubj (1825; 17% instances), obl (987; 9% instances), conj (584; 5% instances), obj (316; 3% instances), appos (296; 3% instances), obl:agent (139; 1% instances), parataxis (118; 1% instances), root (82; 1% instances), nsubj:pass (64; 1% instances), advcl (34; 0% instances), xcomp (23; 0% instances), list (9; 0% instances), vocative (9; 0% instances), ccomp:speech (6; 0% instances), dislocated (5; 0% instances), acl:relcl (4; 0% instances), orphan (4; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), csubj (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: PROPN (4232; 39% instances), NOUN (3166; 29% instances), VERB (3043; 28% instances), ADJ (145; 1% instances), (82; 1% instances), PRON (76; 1% instances), ADV (74; 1% instances), NUM (37; 0% instances), X (31; 0% instances), SYM (15; 0% instances), AUX (3; 0% instances), INTJ (1; 0% instances)

3904 (36%) PROPN nodes are leaves.

2201 (20%) PROPN nodes have one child.

2679 (25%) PROPN nodes have two children.

2121 (19%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 24 different relations: case (4002; 26% instances), det (3393; 22% instances), flat:name (3356; 22% instances), punct (2162; 14% instances), conj (616; 4% instances), cc (451; 3% instances), appos (390; 3% instances), nmod (342; 2% instances), parataxis (181; 1% instances), acl:relcl (162; 1% instances), amod (145; 1% instances), cop (89; 1% instances), nsubj (64; 0% instances), advmod (57; 0% instances), acl (54; 0% instances), mark (41; 0% instances), list (17; 0% instances), nummod (16; 0% instances), orphan (15; 0% instances), advcl (6; 0% instances), ccomp (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), expl:impers (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (4232; 27% instances), ADP (4012; 26% instances), DET (3393; 22% instances), PUNCT (2162; 14% instances), NOUN (552; 4% instances), CCONJ (448; 3% instances), VERB (203; 1% instances), NUM (168; 1% instances), ADJ (155; 1% instances), AUX (89; 1% instances), ADV (71; 0% instances), PRON (30; 0% instances), SCONJ (20; 0% instances), SYM (19; 0% instances), X (9; 0% instances)