home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PUD: POS Tags: PROPN

There are 993 PROPN lemmas (25%), 993 PROPN types (16%) and 1393 PROPN tokens (6%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: China, Trump, Mediterrâneo, América, the, Austrália, Europa, França, Grécia, Hong

The 10 most frequent PROPN types: China, Trump, Mediterrâneo, América, the, Austrália, Europa, França, Grécia, Hong

The 10 most frequent ambiguous lemmas: (PUNCT 4, PROPN 3), - (PUNCT 63, PROPN 2), a (ADP 154, PROPN 2, SCONJ 2), de (ADP 1736, PROPN 1), t (PROPN 2, NOUN 1), ele (PRON 14, PROPN 1)

The 10 most frequent ambiguous types: Mar (NOUN 7, PROPN 5), (PUNCT 4, PROPN 3), - (PUNCT 63, PROPN 2), Andes (PROPN 2, VERB 1), Balcãs (PROPN 2, NOUN 1), Carolina (PROPN 2, NOUN 1), a (DET 972, ADP 277, PRON 8, SCONJ 3, PROPN 2), and (PROPN 2, NOUN 1), de (ADP 1732, PROPN 1), Estados (NOUN 8, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.570742).

The 1st highest number of forms (1) was observed with the lemma “&”: &.

The 2nd highest number of forms (1) was observed with the lemma “’”: .

The 3rd highest number of forms (1) was observed with the lemma “-”: -.

PROPN occurs with 5 features: Gender (1393; 100% instances), Number (1393; 100% instances), Foreign (164; 12% instances), Case (1; 0% instances), Person (1; 0% instances)

PROPN occurs with 7 feature-value pairs: Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3

PROPN occurs with 8 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (685 tokens). Examples: Trump, Mediterrâneo, Caribe, Hong, Donald, Joseph, Mar, The, Rafferty, Bogd

Relations

PROPN nodes are attached to their parents using 14 different relations: nmod (369; 26% instances), nsubj (222; 16% instances), appos (167; 12% instances), obl (165; 12% instances), flat (158; 11% instances), flat:name (152; 11% instances), conj (70; 5% instances), obj (52; 4% instances), nsubj:pass (27; 2% instances), root (7; 1% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: NOUN (525; 38% instances), VERB (426; 31% instances), PROPN (401; 29% instances), ADJ (18; 1% instances), ADV (8; 1% instances), (7; 1% instances), ADP (4; 0% instances), NUM (2; 0% instances), PRON (2; 0% instances)

521 (37%) PROPN nodes are leaves.

341 (24%) PROPN nodes have one child.

332 (24%) PROPN nodes have two children.

199 (14%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 9.

Children of PROPN nodes are attached using 21 different relations: case (542; 31% instances), det (360; 21% instances), punct (181; 10% instances), flat (158; 9% instances), flat:name (152; 9% instances), conj (87; 5% instances), cc (52; 3% instances), appos (48; 3% instances), nmod (46; 3% instances), amod (42; 2% instances), acl:relcl (19; 1% instances), compound (17; 1% instances), advmod (10; 1% instances), acl (9; 1% instances), cop (8; 0% instances), nsubj (7; 0% instances), obl:tmod (2; 0% instances), cc:preconj (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances), orphan (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: ADP (543; 31% instances), PROPN (401; 23% instances), DET (360; 21% instances), PUNCT (181; 10% instances), NOUN (94; 5% instances), ADJ (56; 3% instances), CCONJ (52; 3% instances), VERB (25; 1% instances), NUM (10; 1% instances), ADV (9; 1% instances), AUX (8; 0% instances), X (3; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)