
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-FicTree: POS Tags: PROPN

There are 423 PROPN lemmas (3%), 692 PROPN types (3%) and 2255 PROPN tokens (1%). Out of 16 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 13 in number of tokens.
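The three counts above can be reproduced from a CoNLL-U file by collecting distinct lemmas, distinct word forms (types) and a running token total for one tag. The following is a minimal sketch, not the script that generated this page. The function name `upos_counts` and the toy sentences are made up for illustration, and forms are compared case-sensitively, which is an assumption based on "HAVEL" and "Havel" being listed as separate forms further down.

```python
def upos_counts(conllu_lines, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Toy input (hypothetical sentences, not taken from the treebank).
sample = [
    "# sent_id = toy-1",
    "1\tAlžběta\tAlžběta\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\tviděla\tvidět\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tHavla\tHavel\tPROPN\t_\t_\t2\tobj\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\tHavel\tHavel\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\tzná\tznát\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tAlžbětu\tAlžběta\tPROPN\t_\t_\t2\tobj\t_\t_",
]

counts = upos_counts(sample, "PROPN")
```

On the toy input this gives 2 lemmas, 4 types and 4 tokens.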

The 10 most frequent PROPN lemmas: Valentýna, Láďa, Havel, Leoš, Alžběta, Eduard, Flajšman, Veronika, Filip, Havlena

The 10 most frequent PROPN types: Láďa, Leoš, Valentýna, Eduard, Havel, Alžběta, Flajšman, Veronika, Havlena, Filip

The 10 most frequent ambiguous lemmas (only one was observed): O (PROPN 1, X 1)

The 10 most frequent ambiguous types: K (ADP 19, PROPN 19), V (ADP 222, PROPN 6), A (CCONJ 577, PROPN 5, INTJ 1), Krásné (PROPN 5, ADJ 1), Krásná (PROPN 3, ADJ 1), S (ADP 39, PROPN 3), U (ADP 18, PROPN 2), Mašínovi (ADJ 3, PROPN 1), O (ADP 25, PROPN 1, X 1), Z (ADP 37, PROPN 1)
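A type is ambiguous here when the same word form is observed with more than one tag. A small sketch of how such a list could be derived is shown below. The helper name `ambiguous_types` is hypothetical, the counts for "K" are taken from the list above, and the count for "Láďa" is made up purely to show an unambiguous form being filtered out.

```python
from collections import Counter, defaultdict


def ambiguous_types(pairs):
    """Map each word form seen with more than one UPOS tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    return {form: dict(tags) for form, tags in by_form.items() if len(tags) > 1}


# (form, UPOS) observations; the "Láďa" count is illustrative only.
pairs = [("K", "ADP")] * 19 + [("K", "PROPN")] * 19 + [("Láďa", "PROPN")] * 3

result = ambiguous_types(pairs)
```

Here `result` contains only "K", with 19 ADP and 19 PROPN occurrences.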

Morphology

The form / lemma ratio of PROPN is 1.635934 (the average of all parts of speech is 1.970842).
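The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. This reading is inferred from the numbers, not from a documented definition, so treat it as an assumption.

```python
# Figures taken from the summary at the top of this page.
propn_types = 692
propn_lemmas = 423

# Assumed definition: distinct forms per distinct lemma.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # prints 1.635934
```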

The 1st highest number of forms (6) was observed with the lemma “Alžběta”: Alžběta, Alžběto, Alžbětou, Alžbětu, Alžběty, Alžbětě.

The 2nd highest number of forms (6) was observed with the lemma “Hanička”: Haničce, Hanička, Haničko, Haničkou, Haničku, Haničky.

The 3rd highest number of forms (6) was observed with the lemma “Havel”: HAVEL, Havel, Havla, Havle, Havlem, Havlovi.

PROPN occurs with 7 features: Case (2255; 100% instances), Gender (2255; 100% instances), Number (2255; 100% instances), Polarity (2255; 100% instances), NameType (1994; 88% instances), Animacy (1391; 62% instances), Abbr (111; 5% instances)

PROPN occurs with 21 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NameType=Com, NameType=Geo, NameType=Giv, NameType=Nat, NameType=Sur, Number=Plur, Number=Sing, Polarity=Pos

PROPN occurs with 98 feature combinations. The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|Polarity=Pos (449 tokens). Examples: Láďa, Leoš, Eduard, Filip, Honza, Pavel, Don, Ivan, Juan, Marek
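A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as found in the FEATS column of a CoNLL-U file. The sketch below splits it into a dictionary; the function name `parse_feats` is chosen for illustration.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if not feats or feats == "_":
        return {}  # "_" marks a token with no features
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = parse_feats(
    "Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|Polarity=Pos"
)
```

For the most frequent combination, `most_frequent["NameType"]` is `"Giv"` and the dictionary has six entries.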

Relations

PROPN nodes are attached to their parents using 18 different relations: nsubj (891; 40% instances), flat (301; 13% instances), nmod (236; 10% instances), obl (236; 10% instances), obl:arg (138; 6% instances), obj (135; 6% instances), conj (97; 4% instances), vocative (78; 3% instances), root (71; 3% instances), dep (22; 1% instances), appos (13; 1% instances), nsubj:pass (12; 1% instances), orphan (10; 0% instances), advcl (7; 0% instances), iobj (4; 0% instances), ccomp (2; 0% instances), csubj (1; 0% instances), xcomp (1; 0% instances)
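The percentages above can be checked against the raw counts. The sketch below copies the counts from the list, confirms that they add up to the 2255 PROPN tokens, and recomputes each share rounded to a whole percent. That the page rounds to the nearest integer is an assumption, although it matches every value listed.

```python
# Counts copied from the list of relations above.
deprel_counts = {
    "nsubj": 891, "flat": 301, "nmod": 236, "obl": 236, "obl:arg": 138,
    "obj": 135, "conj": 97, "vocative": 78, "root": 71, "dep": 22,
    "appos": 13, "nsubj:pass": 12, "orphan": 10, "advcl": 7, "iobj": 4,
    "ccomp": 2, "csubj": 1, "xcomp": 1,
}

total = sum(deprel_counts.values())

# Share of each relation in whole percent (assumed rounding to nearest).
shares = {rel: round(100 * n / total) for rel, n in deprel_counts.items()}
```

This also explains entries such as "orphan (10; 0% instances)": 10 of 2255 is about 0.4%, which rounds down to 0.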

Parents of PROPN nodes belong to 12 different parts of speech (counting the empty tag of the artificial root as one): VERB (1393; 62% instances), NOUN (462; 20% instances), PROPN (197; 9% instances), [root, no tag] (71; 3% instances), ADJ (61; 3% instances), ADV (27; 1% instances), PRON (24; 1% instances), PART (15; 1% instances), NUM (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), X (1; 0% instances)

1367 (61%) PROPN nodes are leaves.

587 (26%) PROPN nodes have one child.

162 (7%) PROPN nodes have two children.

139 (6%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.
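The child-degree figures above count, for every PROPN node, how many other nodes name it as their head. A minimal sketch of that tally follows. Each sentence is represented as a list of `(id, upos, head)` tuples, which is a simplification of CoNLL-U chosen for this example, and the toy sentence is hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="PROPN"):
    """Count how many nodes with the given UPOS have 0, 1, 2, ... children."""
    hist = Counter()
    for sent in sentences:
        # Number of children per node id, derived from the HEAD values.
        n_children = Counter(head for _tok_id, _tok_upos, head in sent)
        for tok_id, tok_upos, _head in sent:
            if tok_upos == upos:
                hist[n_children[tok_id]] += 1
    return hist


# Toy sentence: a PROPN subject (leaf) and a PROPN with one ADP child.
toy = [[
    (1, "PROPN", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PROPN", 2),
    (5, "PUNCT", 2),
]]

hist = child_degree_histogram(toy)
```

In the toy sentence one PROPN node is a leaf and one has a single child. On this page the four reported buckets (1367, 587, 162 and 139) add up to the 2255 PROPN tokens.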

Children of PROPN nodes are attached using 26 different relations: case (426; 29% instances), punct (298; 20% instances), nmod (104; 7% instances), flat (100; 7% instances), amod (95; 6% instances), conj (85; 6% instances), cc (71; 5% instances), advmod:emph (47; 3% instances), cop (33; 2% instances), xcomp (31; 2% instances), det (28; 2% instances), acl:relcl (26; 2% instances), nsubj (24; 2% instances), mark (21; 1% instances), appos (18; 1% instances), orphan (16; 1% instances), advmod (10; 1% instances), dep (10; 1% instances), advcl (6; 0% instances), nummod (5; 0% instances), aux (4; 0% instances), obl (4; 0% instances), parataxis (3; 0% instances), det:numgov (2; 0% instances), acl (1; 0% instances), vocative (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (423; 29% instances), PUNCT (298; 20% instances), PROPN (197; 13% instances), NOUN (131; 9% instances), ADJ (108; 7% instances), CCONJ (78; 5% instances), VERB (52; 4% instances), DET (46; 3% instances), AUX (37; 3% instances), PART (33; 2% instances), ADV (30; 2% instances), SCONJ (21; 1% instances), PRON (7; 0% instances), NUM (6; 0% instances), X (2; 0% instances)