This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

PROPN: proper noun

Definition

A proper noun is a noun that is the name (or part of the name) of a unique entity, be it an individual, a place, or an object.

Acronyms of proper nouns, such as UN and NATO, are also tagged as PROPN.

Corresponding language-specific part-of-speech tags

SP: Proper noun

Examples


Treebank Statistics (UD_Italian)

There are 5610 PROPN lemmas (28%), 5656 PROPN types (19%) and 14760 PROPN tokens (5%). Out of 17 observed tags, PROPN ranks 2nd in number of lemmas, 3rd in number of types and 7th in number of tokens.
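Counts like these can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOSTAG, ...), treats a "type" as a distinct case-sensitive word form, and uses a tiny made-up sample in place of the real UD_Italian data. The names `tag_counts`, `make_line` and `SAMPLE` are hypothetical.

```python
def tag_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one universal POS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines and multiword token ranges such as "1-2".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return len(lemmas), len(types), tokens


def make_line(idx, form, lemma, upos):
    """Build one CoNLL-U token line, leaving unused columns as '_'."""
    return "\t".join([idx, form, lemma, upos] + ["_"] * 6)


# Made-up sample data, for illustration only.
SAMPLE = "\n".join([
    "# sent_id = 1",
    make_line("1", "Italia", "Italia", "PROPN"),
    make_line("2", "e", "e", "CONJ"),
    make_line("3", "AIDS", "Aids", "PROPN"),
    "",
    "# sent_id = 2",
    make_line("1", "Aids", "Aids", "PROPN"),
    make_line("2", "in", "in", "ADP"),
    make_line("3", "Italia", "Italia", "PROPN"),
])

print(tag_counts(SAMPLE, "PROPN"))  # (2, 3, 4)
```

In the sample, the two spellings AIDS and Aids share one lemma, so there are more types than lemmas.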

The 10 most frequent PROPN lemmas: Italia, Shakespeare, Balzac, Roma, Europa, Stati, Uniti, San, Albania, Marco

The 10 most frequent PROPN types: Italia, Shakespeare, Balzac, Europa, Roma, Stati, Uniti, San, Albania, Marco

The 10 most frequent ambiguous lemmas: de (ADP 38, PROPN 6, X 3, DET 3), Unione (PROPN 43, NOUN 6), europea (ADJ 21, PROPN 5), Germania (PROPN 32, NOUN 1), a (ADP 7523, NOUN 10, X 8, PROPN 3, DET 3, ADV 2, CONJ 1), Stato (PROPN 24, NOUN 12), Camera (PROPN 23, NOUN 9), Nord (PROPN 20, NOUN 1), repubblica (NOUN 105, PROPN 1), Papa (PROPN 17, NOUN 1)

The 10 most frequent ambiguous types: Stati (PROPN 79, NOUN 49), San (PROPN 78, ADJ 3), de (ADP 43, PROPN 6, DET 2, X 2), Unione (PROPN 43, NOUN 29), europea (ADJ 54, PROPN 5), Stato (NOUN 117, PROPN 24), a (ADP 6766, NOUN 10, X 8, PROPN 2, ADV 2), Camera (NOUN 32, PROPN 23), nazioni (NOUN 9, PROPN 1), Broglio (PROPN 20, NOUN 2)
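An "ambiguous" lemma or type here is one that occurs with PROPN and with at least one other tag. The sketch below shows one way to find such items from (item, tag) pairs. It is an illustration under assumptions: the function name `ambiguous_items` and the toy counts are hypothetical, and the ranking used for the "most frequent" lists above is not stated on this page, so the sketch does not rank the results.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, upos):
    """Map each item seen with `upos` and at least one other tag
    to its (tag, count) list, most frequent tag first."""
    tags_by_item = defaultdict(Counter)
    for item, tag in pairs:
        tags_by_item[item][tag] += 1
    return {
        item: counts.most_common()
        for item, counts in tags_by_item.items()
        if upos in counts and len(counts) > 1
    }


# Toy counts, for illustration only (not the treebank figures).
pairs = (
    [("Unione", "PROPN")] * 3
    + [("Unione", "NOUN")]
    + [("Italia", "PROPN")] * 2
    + [("casa", "NOUN")]
)
print(ambiguous_items(pairs, "PROPN"))
# {'Unione': [('PROPN', 3), ('NOUN', 1)]}
```

Passing lemmas as the items gives ambiguous lemmas; passing word forms gives ambiguous types.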

Morphology

The form / lemma ratio of PROPN is 1.008200 (the average of all parts of speech is 1.488836).
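The page does not say how this ratio is computed. Dividing the number of PROPN types by the number of PROPN lemmas reported above reproduces the figure, which suggests that "form" here means a distinct word form (type). A quick check, with variable names chosen for illustration:

```python
# Figures taken from the treebank statistics above.
propn_types = 5656
propn_lemmas = 5610

# Assumption: the ratio is distinct forms divided by distinct lemmas.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # 1.008200
```

A ratio this close to 1 means that almost every proper-noun lemma appears in a single form.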

The highest number of forms (4) was observed with the lemma “il”: L’, il, la, le.

The second highest number of forms (2) was observed with the lemma “Aids”: AIDS, Aids.

The third highest number of forms (2) was observed with the lemma “As”: AS, As.

PROPN occurs with 4 features: it-feat/Gender (3; 0% instances), it-feat/Number (3; 0% instances), it-feat/NumType (2; 0% instances), it-feat/Degree (1; 0% instances)

PROPN occurs with 5 feature-value pairs: Degree=Abs, Gender=Fem, NumType=Card, Number=Plur, Number=Sing

PROPN occurs with 5 feature combinations. The most frequent feature combination is _ (14754 tokens). Examples: Italia, Shakespeare, Balzac, Europa, Roma, Stati, Uniti, San, Albania, Marco

Relations

PROPN nodes are attached to their parents using 18 different relations: it-dep/nmod (7117; 48% instances), it-dep/name (3512; 24% instances), it-dep/nsubj (2146; 15% instances), it-dep/conj (832; 6% instances), it-dep/dobj (393; 3% instances), it-dep/root (247; 2% instances), it-dep/appos (242; 2% instances), it-dep/nsubjpass (156; 1% instances), it-dep/xcomp (66; 0% instances), it-dep/parataxis (15; 0% instances), it-dep/vocative (13; 0% instances), it-dep/advcl (7; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/acl:relcl (3; 0% instances), it-dep/case (2; 0% instances), it-dep/csubj (2; 0% instances), it-dep/compound (1; 0% instances), it-dep/foreign (1; 0% instances)

Parents of PROPN nodes belong to 17 different parts of speech: NOUN (5137; 35% instances), PROPN (4930; 33% instances), VERB (3934; 27% instances), PRON (253; 2% instances), ROOT (247; 2% instances), ADJ (193; 1% instances), NUM (18; 0% instances), ADV (17; 0% instances), X (6; 0% instances), AUX (5; 0% instances), INTJ (4; 0% instances), PUNCT (4; 0% instances), SYM (4; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), CONJ (2; 0% instances), SCONJ (1; 0% instances)

5470 (37%) PROPN nodes are leaves.

4055 (27%) PROPN nodes have one child.

2697 (18%) PROPN nodes have two children.

2538 (17%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 38.
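The four child-degree counts above add up to the total number of PROPN tokens (14760), and the percentages follow from rounding each share to a whole number. A small check of that arithmetic, with the dictionary name chosen for illustration:

```python
# Counts taken from the statements above; keys are numbers of children.
child_degree_counts = {"0": 5470, "1": 4055, "2": 2697, "3+": 2538}

total = sum(child_degree_counts.values())
shares = {
    degree: round(100 * count / total)
    for degree, count in child_degree_counts.items()
}

print(total)   # 14760
print(shares)  # {'0': 37, '1': 27, '2': 18, '3+': 17}
```

The rounded shares sum to 99 rather than 100 because each one is rounded independently.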

Children of PROPN nodes are attached using 30 different relations: it-dep/case (5625; 29% instances), it-dep/name (3575; 18% instances), it-dep/det (3196; 16% instances), it-dep/punct (2830; 14% instances), it-dep/nmod (1222; 6% instances), it-dep/conj (917; 5% instances), it-dep/cc (587; 3% instances), it-dep/amod (411; 2% instances), it-dep/appos (331; 2% instances), it-dep/acl:relcl (222; 1% instances), it-dep/nummod (173; 1% instances), it-dep/acl (166; 1% instances), it-dep/advmod (145; 1% instances), it-dep/cop (68; 0% instances), it-dep/nsubj (51; 0% instances), it-dep/parataxis (24; 0% instances), it-dep/advcl (18; 0% instances), it-dep/det:predet (16; 0% instances), it-dep/mark (10; 0% instances), it-dep/neg (10; 0% instances), it-dep/det:poss (9; 0% instances), it-dep/aux (8; 0% instances), it-dep/mwe (8; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/discourse (3; 0% instances), it-dep/foreign (2; 0% instances), it-dep/vocative (2; 0% instances), it-dep/compound (1; 0% instances), it-dep/dobj (1; 0% instances), it-dep/xcomp (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (5573; 28% instances), PROPN (4930; 25% instances), DET (3224; 16% instances), PUNCT (2831; 14% instances), NOUN (1044; 5% instances), CONJ (585; 3% instances), VERB (468; 2% instances), ADJ (431; 2% instances), NUM (249; 1% instances), ADV (211; 1% instances), PRON (50; 0% instances), SCONJ (11; 0% instances), AUX (8; 0% instances), SYM (8; 0% instances), X (7; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances)


PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]