PROPN
: proper noun
This document is a placeholder for the language-specific documentation
for PROPN
.
Treebank Statistics (UD_Polish)
There are 1800 PROPN
lemmas (14%), 2136 PROPN
types (9%) and 2959 PROPN
tokens (4%).
Out of 15 observed tags, the rank of PROPN
is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.
The 10 most frequent PROPN
lemmas: Polska, Warszawa, Polak, Piotr, Adam, Europa, Bóg, Jan, Jerzy, Poznań
The 10 most frequent PROPN
types: Polski, Polsce, Warszawie, Polska, Andrzej, Jan, Jerzy, SLD, Warszawy, Adam
The 10 most frequent ambiguous lemmas: SA (PROPN 5, X 1), C (PROPN 1, X 1), D (X 1, PROPN 1), M (X 5, PROPN 1), Nocny (ADJ 1, PROPN 1)
The 10 most frequent ambiguous types: Polski (PROPN 34, ADJ 3), Polska (PROPN 15, ADJ 3), Bóg (PROPN 7, NOUN 3), SA (PROPN 5, X 1), PO (PROPN 4, ADP 1), Pan (NOUN 46, PROPN 4), Panie (NOUN 8, PROPN 4), Woli (PROPN 4, NOUN 1), Zachodu (PROPN 3, NOUN 2), Tygodnia (PROPN 2, NOUN 1)
- Polski
- Polska
- Bóg
- SA
- PO
- Pan
- Panie
- Woli
- Zachodu
- Tygodnia
Morphology
The form / lemma ratio of PROPN
is 1.186667 (the average of all parts of speech is 1.801337).
The 1st highest number of forms (5) was observed with the lemma “Bóg”: Boga, Bogiem, Bogu, Boże, Bóg.
The 2nd highest number of forms (5) was observed with the lemma “Jezus”: Jezus, Jezusa, Jezusem, Jezusie, Jezusowi.
The 3rd highest number of forms (5) was observed with the lemma “Kaczyński”: Kaczyńscy, Kaczyński, Kaczyńskich, Kaczyńskiego, Kaczyńskiemu.
PROPN
occurs with 4 features: Case (2959; 100% instances), Gender (2959; 100% instances), Number (2959; 100% instances), Animacy (1862; 63% instances)
PROPN
occurs with 15 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Animacy=Nhum
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
PROPN
occurs with 52 feature combinations.
The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing
(744 tokens).
Examples: Andrzej, Jan, Jerzy, Adam, Grzegorz, Piotr, Bóg, Maciej, Stefan, Tomasz
Relations
PROPN
nodes are attached to their parents using 12 different relations: nmod (1480; 50% instances), nsubj (721; 24% instances), dobj (303; 10% instances), appos (245; 8% instances), conj (122; 4% instances), iobj (52; 2% instances), nsubjpass (19; 1% instances), root (7; 0% instances), case (4; 0% instances), name (3; 0% instances), ccomp (2; 0% instances), advcl (1; 0% instances)
Parents of PROPN
nodes belong to 14 different parts of speech: VERB (1242; 42% instances), NOUN (1097; 37% instances), PROPN (476; 16% instances), ADJ (47; 2% instances), X (31; 1% instances), NUM (17; 1% instances), PUNCT (13; 0% instances), ADP (7; 0% instances), PART (7; 0% instances), ROOT (7; 0% instances), PRON (6; 0% instances), ADV (4; 0% instances), AUX (3; 0% instances), DET (2; 0% instances)
1599 (54%) PROPN
nodes are leaves.
1048 (35%) PROPN
nodes have one child.
208 (7%) PROPN
nodes have two children.
104 (4%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 11.
Children of PROPN
nodes are attached using 16 different relations: case (760; 41% instances), nmod (435; 23% instances), amod (186; 10% instances), punct (157; 8% instances), conj (132; 7% instances), cc (89; 5% instances), acl (53; 3% instances), appos (25; 1% instances), cop (12; 1% instances), nsubj (7; 0% instances), det (6; 0% instances), advmod (5; 0% instances), name (3; 0% instances), neg (3; 0% instances), nummod (2; 0% instances), mark (1; 0% instances)
Children of PROPN
nodes belong to 14 different parts of speech: ADP (752; 40% instances), PROPN (476; 25% instances), ADJ (187; 10% instances), PUNCT (157; 8% instances), CONJ (90; 5% instances), NOUN (83; 4% instances), VERB (64; 3% instances), X (34; 2% instances), PART (16; 1% instances), DET (6; 0% instances), NUM (4; 0% instances), PRON (4; 0% instances), SCONJ (2; 0% instances), ADV (1; 0% instances)
PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]