
This page pertains to UD version 2.

Treebank Statistics: UD_Polish-MPDT: POS Tags: PROPN

There are 888 PROPN lemmas (11%), 1087 PROPN types (8%) and 1462 PROPN tokens (3%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 11 in number of tokens.
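As a rough illustration of how lemma, type and token counts like these could be derived, the sketch below reads CoNLL-U text (ten tab-separated columns per token line) and counts distinct lemmas, distinct word forms and token occurrences for one tag. The helper names `read_tokens`, `pos_counts` and `row`, and the toy data, are assumptions made for this example. This is not the script that generated the page.

```python
def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct word forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # types are case-sensitive, as on this page
            tokens += 1
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos):
    """Build one 10-column CoNLL-U line; unused columns are left as "_"."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Hypothetical toy data, not taken from UD_Polish-MPDT.
SAMPLE = "\n".join([
    "# text = toy example",
    row(1, "Bóg", "Bóg", "PROPN"),
    row(2, "Boga", "Bóg", "PROPN"),
    row(3, "Jan", "Jan", "PROPN"),
    row(4, "Bóg", "Bóg", "PROPN"),
    row(5, "idzie", "iść", "VERB"),
])

print(pos_counts(SAMPLE, "PROPN"))  # (2, 3, 4)
```

In the toy data, two lemmas (Bóg, Jan) appear in three distinct forms across four PROPN tokens.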

The 10 most frequent PROPN lemmas: Bóg, Polska, Jan, Chrystus, Jezus, Warszawa, Rzeczpospolita, Turek, Paweł, Polak

The 10 most frequent PROPN types: Boga, Bóg, Bogu, Boże, Warszawie, Jan, Rzeczypospolitej, m, BÓG, B

The most frequent ambiguous lemmas (only 2 observed): Panna (ADV 1, PROPN 1), V (X 4, PROPN 1)

The 10 most frequent ambiguous types: Bóg (PROPN 30, NOUN 1), Boże (PROPN 12, ADJ 1), Rzeczypospolitej (PROPN 9, NOUN 1), m (AUX 83, ADV 2, NOUN 1, PROPN 1), Polski (ADJ 6, PROPN 6), Księżyca (PROPN 4, NOUN 1), N (PROPN 4, ADV 2), A (CCONJ 99, PART 6, PROPN 3), BOGA (PROPN 2, NOUN 1), K (ADV 7, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.224099 (the average of all parts of speech is 1.675682).
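The reported ratio is consistent with dividing the number of distinct PROPN word forms (types) by the number of distinct PROPN lemmas given earlier on this page. A minimal check, assuming that is how the ratio is defined:

```python
propn_types = 1087   # PROPN types reported on this page
propn_lemmas = 888   # PROPN lemmas reported on this page

# Assumed definition: distinct word forms divided by distinct lemmas.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # 1.224099
```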

The highest number of forms (11) was observed with the lemma “Bóg”: BOGA, BOGIEM, BOGU, BOŻE, BOże, Boga, Bogiem, Bogu, Boże, BÓG, Bóg.

The 2nd highest number of forms (6) was observed with the lemma “Chrystus”: CHRYSTUS, CHRYSTUSA, CHRYSTUSIE, Chrystus, Chrystusa, Chrystusie.

The 3rd highest number of forms (6) was observed with the lemma “Jezus”: JEZU, JEZUS, JEZUSA, JEZUSOWI, JEzus, Jezus.

PROPN occurs with 5 features: Case (1434; 98% instances), Gender (1434; 98% instances), Number (1434; 98% instances), Animacy (93; 6% instances), Abbr (28; 2% instances)

PROPN occurs with 16 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Nhum, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Ptan, Number=Sing

PROPN occurs with 36 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (385 tokens). Examples: Bóg, Jan, BÓG, JEZUS, Józef, Mojżesz, Stanisław, Chrystus, Fołtyn, Jacko
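The feature, feature-value and feature-combination counts above can be thought of as three tallies over the FEATS column of the PROPN tokens. The sketch below shows one way to compute them. The `parse_feats` helper and the sample values are assumptions for illustration and are not drawn from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":  # "_" marks a token with no features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values for four PROPN tokens (illustrative only).
feats_column = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Gen|Gender=Fem|Number=Sing",
    "Abbr=Yes",
]

# Tokens per feature, per feature-value pair, and per full combination.
feature_counts = Counter(name for f in feats_column for name in parse_feats(f))
pair_counts = Counter(item for f in feats_column for item in parse_feats(f).items())
combo_counts = Counter(feats_column)

print(feature_counts["Case"])       # 3
print(len(pair_counts))             # 6
print(combo_counts.most_common(1))  # [('Case=Nom|Gender=Masc|Number=Sing', 2)]
```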

Relations

PROPN nodes are attached to their parents using 26 different relations: appos (250; 17% instances), obl (239; 16% instances), nsubj (225; 15% instances), conj (169; 12% instances), flat (119; 8% instances), nmod (106; 7% instances), iobj (58; 4% instances), obj (49; 3% instances), obl:arg (49; 3% instances), nmod:poss (40; 3% instances), root (37; 3% instances), nmod:arg (29; 2% instances), vocative (28; 2% instances), obl:agent (19; 1% instances), obl:cmpr (9; 1% instances), orphan (9; 1% instances), list (8; 1% instances), nsubj:pass (6; 0% instances), dep (3; 0% instances), parataxis:insert (3; 0% instances), acl:relcl (2; 0% instances), ccomp:cleft (1; 0% instances), ccomp:obj (1; 0% instances), parataxis:obj (1; 0% instances), xcomp (1; 0% instances), xcomp:pred (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: VERB (566; 39% instances), NOUN (446; 31% instances), PROPN (278; 19% instances), ADJ (90; 6% instances), artificial root node with no part of speech (37; 3% instances), ADV (19; 1% instances), PRON (11; 1% instances), X (8; 1% instances), NUM (3; 0% instances), DET (2; 0% instances), SCONJ (2; 0% instances)
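The entry with 37 instances in the list above matches the 37 PROPN nodes attached with the root relation, whose parent is the artificial root and therefore carries no part of speech. The sketch below shows how such a tally could arise. The function name `parent_pos_counts`, the tuple layout and the toy sentences are assumptions for this example.

```python
from collections import Counter


def parent_pos_counts(sentence, upos):
    """Count the UPOS of the parent of every node tagged `upos`.

    `sentence` is a list of (id, upos, head) tuples; head 0 is the artificial root.
    """
    pos_by_id = {node_id: tag for node_id, tag, _head in sentence}
    # Head 0 has no entry, so the parent of a root node gets the empty label "".
    return Counter(pos_by_id.get(head, "") for _id, tag, head in sentence if tag == upos)


# Hypothetical toy sentences (illustrative only).
s1 = [(1, "VERB", 0), (2, "ADP", 3), (3, "PROPN", 1), (4, "CCONJ", 5), (5, "PROPN", 3)]
s2 = [(1, "PROPN", 0), (2, "PUNCT", 1)]

totals = parent_pos_counts(s1, "PROPN") + parent_pos_counts(s2, "PROPN")
print(dict(totals))  # {'VERB': 1, 'PROPN': 1, '': 1}
```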

535 (37%) PROPN nodes are leaves.

589 (40%) PROPN nodes have one child.

195 (13%) PROPN nodes have two children.

143 (10%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 12.
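The child-degree figures above can be reproduced in principle by counting, for each PROPN node, how many other nodes name it as their head. The sketch below does this for a toy tree and then checks that the four reported buckets add up to the 1462 PROPN tokens and round to the percentages shown. The function name `child_degrees` and the toy tree are assumptions for illustration.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Return the number of children of each node tagged `upos`.

    `sentence` is a list of (id, upos, head) tuples for one dependency tree.
    """
    children = Counter(head for _id, _tag, head in sentence)
    return [children[node_id] for node_id, tag, _head in sentence if tag == upos]


# Hypothetical toy tree (illustrative only).
sentence = [(1, "VERB", 0), (2, "ADP", 3), (3, "PROPN", 1), (4, "CCONJ", 5), (5, "PROPN", 3)]
degrees = child_degrees(sentence, "PROPN")
print(degrees)  # [2, 1]: node 3 has two children, node 5 has one

# Consistency check on the figures reported on this page.
reported = {"0": 535, "1": 589, "2": 195, "3+": 143}
total = sum(reported.values())
shares = {bucket: round(100 * n / total) for bucket, n in reported.items()}
print(total, shares)  # 1462 {'0': 37, '1': 40, '2': 13, '3+': 10}
```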

Children of PROPN nodes are attached using 33 different relations: case (402; 26% instances), punct (250; 16% instances), conj (198; 13% instances), appos (139; 9% instances), flat (120; 8% instances), amod (113; 7% instances), cc (68; 4% instances), acl:relcl (42; 3% instances), nmod (30; 2% instances), acl (29; 2% instances), amod:flat (20; 1% instances), advmod:emph (19; 1% instances), det (18; 1% instances), det:poss (18; 1% instances), orphan (18; 1% instances), mark (8; 1% instances), nsubj (6; 0% instances), advmod (5; 0% instances), cop (5; 0% instances), parataxis:obj (5; 0% instances), nmod:flat (4; 0% instances), cc:preconj (3; 0% instances), nummod (3; 0% instances), advcl (2; 0% instances), dep (2; 0% instances), list (2; 0% instances), nmod:arg (2; 0% instances), parataxis:insert (2; 0% instances), advmod:neg (1; 0% instances), aux (1; 0% instances), aux:cnd (1; 0% instances), discourse:intj (1; 0% instances), nmod:poss (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: ADP (402; 26% instances), PROPN (278; 18% instances), PUNCT (250; 16% instances), NOUN (199; 13% instances), ADJ (179; 12% instances), CCONJ (68; 4% instances), VERB (45; 3% instances), DET (39; 3% instances), PART (23; 1% instances), ADV (16; 1% instances), X (15; 1% instances), SCONJ (8; 1% instances), AUX (7; 0% instances), NUM (7; 0% instances), INTJ (1; 0% instances), PRON (1; 0% instances)