
PROPN: proper noun

Definition

A proper noun is a noun that is the name (or part of the name) of a unique entity, be it an individual, a place, or an object.

Acronyms of proper nouns, such as UN and NATO, are also tagged as PROPN.

Corresponding language-specific part-of-speech tags

SP: Proper noun

Examples


Treebank Statistics (UD_Italian)

There are 5328 PROPN lemmas (27% of all lemmas), 5372 PROPN types (19% of all types) and 13401 PROPN tokens (5% of all tokens). Out of 17 observed tags, PROPN ranks 2nd by number of lemmas, 3rd by number of types and 7th by number of tokens.
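Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct case-sensitive word form and that multiword-token ranges and empty nodes are skipped. The sample rows are invented for illustration and are not taken from UD_Italian.

```python
# Hypothetical CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
ROWS = [
    ("1", "Roma", "Roma", "PROPN", "SP", "_", "2", "nsubj", "_", "_"),
    ("2", "ama", "amare", "VERB", "V", "_", "0", "root", "_", "_"),
    ("3", "Marco", "Marco", "PROPN", "SP", "_", "2", "dobj", "_", "_"),
    (),  # blank line separates sentences
    ("1", "ROMA", "Roma", "PROPN", "SP", "_", "2", "nsubj", "_", "_"),
    ("2", "dorme", "dormire", "VERB", "V", "_", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, tag):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, upos in read_tokens(conllu_text):
        if upos == tag:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared case-sensitively
            tokens += 1
    return len(lemmas), len(types), tokens


print(tag_counts(SAMPLE, "PROPN"))
```

In the sample, "Roma" and "ROMA" share one lemma but count as two types, which is why the type count can exceed the lemma count.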

The 10 most frequent PROPN lemmas: Shakespeare, Balzac, Italia, Stati, Europa, San, Roma, Uniti, Albania, Marco

The 10 most frequent PROPN types: Shakespeare, Balzac, Italia, stati, Europa, San, Uniti, Albania, Marco, Roma

The 10 most frequent ambiguous lemmas: de (ADP 34, PROPN 8, X 3, DET 1), Stato (PROPN 40, NOUN 3), Germania (PROPN 28, NOUN 1), a (ADP 7044, NOUN 10, X 8, PROPN 3, DET 3, ADV 2, CONJ 1), Grande (PROPN 13, ADJ 1), Mondiali (PROPN 12, NOUN 2), Regione (PROPN 11, NOUN 1), di (ADP 17740, DET 27, PROPN 3, NOUN 1), C (PROPN 9, X 1), Ministro (PROPN 9, NOUN 1)

The 10 most frequent ambiguous types: stati (AUX 104, NOUN 46, VERB 26, PROPN 1), de (ADP 39, PROPN 8, X 2), Stato (NOUN 88, PROPN 40), Unione (PROPN 37, NOUN 6), europea (ADJ 31, PROPN 5), Camera (NOUN 23, PROPN 23), a (ADP 6336, NOUN 10, X 8, ADV 2, PROPN 2), nazioni (NOUN 7, PROPN 1), Broglio (PROPN 20, NOUN 2), Nord (PROPN 20, NOUN 3)
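An ambiguity list like the two above could be derived by grouping tag counts per lemma (or per form) and keeping the entries that occur with PROPN and at least one other tag. This is a sketch under that assumption; the function name `ambiguous_items` and the input pairs are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, tag="PROPN"):
    """pairs: iterable of (lemma_or_form, upos).

    Returns a dict mapping each item seen with `tag` and at least one
    other tag to its (upos, count) list, most frequent tag first.
    """
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    return {
        item: tags.most_common()
        for item, tags in by_item.items()
        if tag in tags and len(tags) > 1
    }


# Invented counts for illustration only.
pairs = [("Stato", "PROPN")] * 3 + [("Stato", "NOUN")] + [("Roma", "PROPN")]
print(ambiguous_items(pairs))
```

"Roma" is dropped from the result because it occurs with a single tag, while "Stato" is kept with its per-tag counts.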

Morphology

The form / lemma ratio of PROPN is 1.008258 (the average of all parts of speech is 1.491677).
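The reported ratio matches the number of PROPN types divided by the number of PROPN lemmas given in the treebank statistics above. That reading of "form / lemma ratio" is an assumption, but it reproduces the published figure:

```python
propn_types = 5372   # distinct PROPN word forms reported above
propn_lemmas = 5328  # distinct PROPN lemmas reported above

ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # 1.008258
```

A value this close to 1 means that almost every proper-noun lemma appears in exactly one form, in contrast to the average over all parts of speech.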

No PROPN lemma has more than 2 forms. The top three entries, each with 2 forms, are “Aids” (AIDS, Aids), “As” (AS, As) and “Bogotà” (BOGOTÀ, Bogotà).

PROPN occurs with 3 features: it-feat/Degree (1; 0% instances), it-feat/Gender (1; 0% instances), it-feat/Number (1; 0% instances)

PROPN occurs with 3 feature-value pairs: Degree=Abs, Gender=Fem, Number=Plur

PROPN occurs with 3 feature combinations. The most frequent feature combination is _ (no features; 13399 tokens). Examples: Shakespeare, Balzac, Italia, stati, Europa, San, Uniti, Albania, Marco, Roma

Relations

PROPN nodes are attached to their parents using 16 different relations: it-dep/nmod (6489; 48% instances), it-dep/name (3183; 24% instances), it-dep/nsubj (1974; 15% instances), it-dep/conj (723; 5% instances), it-dep/dobj (356; 3% instances), it-dep/root (222; 2% instances), it-dep/appos (209; 2% instances), it-dep/nsubjpass (140; 1% instances), it-dep/xcomp (58; 0% instances), it-dep/parataxis (15; 0% instances), it-dep/vocative (13; 0% instances), it-dep/advcl (8; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/acl:relcl (3; 0% instances), it-dep/csubj (2; 0% instances), it-dep/punct (1; 0% instances)
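The percentages in this list are shares of all PROPN tokens, rounded to whole numbers, so any relation with fewer than about 0.5% of the instances is shown as 0%. A short check using the counts listed above (the rounding rule is an assumption that is consistent with the reported values):

```python
# Counts of incoming relations for PROPN nodes, copied from the list above.
counts = {
    "nmod": 6489, "name": 3183, "nsubj": 1974, "conj": 723,
    "dobj": 356, "root": 222, "appos": 209, "nsubjpass": 140,
    "xcomp": 58, "parataxis": 15, "vocative": 13, "advcl": 8,
    "ccomp": 5, "acl:relcl": 3, "csubj": 2, "punct": 1,
}

total = sum(counts.values())  # every PROPN token has exactly one incoming relation
shares = {rel: round(100 * n / total) for rel, n in counts.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The counts sum to 13401, the number of PROPN tokens, and `xcomp` (58 instances, about 0.4%) rounds down to 0%.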

Parents of PROPN nodes belong to 16 different parts of speech: NOUN (4656; 35% instances), PROPN (4447; 33% instances), VERB (3589; 27% instances), PRON (251; 2% instances), ROOT (222; 2% instances), ADJ (181; 1% instances), ADV (17; 0% instances), NUM (15; 0% instances), AUX (4; 0% instances), INTJ (4; 0% instances), PUNCT (4; 0% instances), SYM (4; 0% instances), DET (3; 0% instances), CONJ (2; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

4933 (37%) PROPN nodes are leaves.

3588 (27%) PROPN nodes have one child.

2498 (19%) PROPN nodes have two children.

2382 (18%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 38.
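The child-degree figures above can be obtained by counting, for each PROPN node, how many tokens in the same sentence name it as their head. The sketch below assumes a sentence is already parsed into `(id, upos, head)` tuples; the function name and the example sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="PROPN"):
    """sentence: list of (token_id, upos, head_id) with integer ids.

    Returns a Counter mapping 0, 1, 2 and 3 (meaning three or more)
    to the number of `tag` nodes with that many children.
    """
    n_children = Counter(head for _, _, head in sentence)
    hist = Counter()
    for tok_id, upos, _ in sentence:
        if upos == tag:
            hist[min(n_children[tok_id], 3)] += 1  # bucket 3 = three or more
    return hist


# Invented four-token sentence: token 1 heads tokens 2 and 4, token 4 heads token 3.
sentence = [(1, "PROPN", 0), (2, "PROPN", 1), (3, "ADP", 4), (4, "PROPN", 1)]
print(child_degree_histogram(sentence))
```

Here token 2 is a leaf, token 4 has one child and token 1 has two, so each of the buckets 0, 1 and 2 receives one PROPN node.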

Children of PROPN nodes are attached using 30 different relations: it-dep/case (5111; 28% instances), it-dep/name (3261; 18% instances), it-dep/det (3009; 17% instances), it-dep/punct (2627; 15% instances), it-dep/nmod (1139; 6% instances), it-dep/conj (812; 4% instances), it-dep/cc (535; 3% instances), it-dep/amod (407; 2% instances), it-dep/appos (300; 2% instances), it-dep/acl:relcl (217; 1% instances), it-dep/nummod (183; 1% instances), it-dep/acl (163; 1% instances), it-dep/advmod (136; 1% instances), it-dep/cop (42; 0% instances), it-dep/nsubj (25; 0% instances), it-dep/parataxis (23; 0% instances), it-dep/advcl (18; 0% instances), it-dep/det:predet (16; 0% instances), it-dep/mwe (16; 0% instances), it-dep/mark (11; 0% instances), it-dep/neg (11; 0% instances), it-dep/det:poss (9; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/aux (4; 0% instances), it-dep/discourse (3; 0% instances), it-dep/xcomp (3; 0% instances), it-dep/foreign (2; 0% instances), it-dep/vocative (2; 0% instances), it-dep/dep (1; 0% instances), it-dep/dobj (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (5073; 28% instances), PROPN (4447; 25% instances), DET (3034; 17% instances), PUNCT (2632; 15% instances), NOUN (976; 5% instances), CONJ (533; 3% instances), VERB (435; 2% instances), ADJ (428; 2% instances), NUM (241; 1% instances), ADV (194; 1% instances), PRON (46; 0% instances), SYM (25; 0% instances), SCONJ (11; 0% instances), X (6; 0% instances), AUX (4; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances)

