home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: PROPN

There are 309 PROPN lemmas (8%), 348 PROPN types (6%) and 758 PROPN tokens (3%). Out of 16 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.

The 10 most frequent PROPN lemmas: [name:personal], [name:surname], Slovenija, Irak, Jones, Tom, [name:address], Božje, David, Healy

The 10 most frequent PROPN types: [name:personal], [name:surname], slovenija, sloveniji, [name:address], jones, slovenije, tom, [name:organisation], david

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: božje (ADJ 1, PROPN 1), maja (NOUN 2, PROPN 1), mark (NOUN 3, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.126214 (the average of all parts of speech is 1.573353).

The 1st highest number of forms (4) was observed with the lemma “Ljubljana”: ljubljana, ljubljane, ljubljani, ljubljano.

The 2nd highest number of forms (3) was observed with the lemma “Ana”: ana, ani, ano.

The 3rd highest number of forms (3) was observed with the lemma “Irak”: irak, iraka, iraku.

PROPN occurs with 4 features: Case (444; 59% instances), Gender (444; 59% instances), Number (444; 59% instances), Animacy (25; 3% instances)

PROPN occurs with 14 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

PROPN occurs with 28 feature combinations. The most frequent feature combination is _ (314 tokens). Examples: [name:personal], [name:surname], [name:address], [name:organisation], [name:place]

Relations

PROPN nodes are attached to their parents using 19 different relations: nmod (139; 18% instances), flat:name (125; 16% instances), nsubj (115; 15% instances), obl (103; 14% instances), root (88; 12% instances), vocative (52; 7% instances), conj (51; 7% instances), parataxis (29; 4% instances), obj (27; 4% instances), appos (10; 1% instances), dislocated (7; 1% instances), amod (2; 0% instances), discourse (2; 0% instances), iobj (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), flat:foreign (1; 0% instances), parataxis:restart (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: VERB (268; 35% instances), PROPN (193; 25% instances), NOUN (155; 20% instances), (88; 12% instances), ADJ (23; 3% instances), DET (12; 2% instances), PRON (8; 1% instances), ADV (5; 1% instances), NUM (3; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)

361 (48%) PROPN nodes are leaves.

248 (33%) PROPN nodes have one child.

88 (12%) PROPN nodes have two children.

61 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 28 different relations: case (165; 24% instances), flat:name (124; 18% instances), conj (57; 8% instances), advmod (53; 8% instances), discourse (34; 5% instances), cc (32; 5% instances), cop (27; 4% instances), parataxis (27; 4% instances), punct (26; 4% instances), amod (22; 3% instances), nmod (19; 3% instances), det (17; 2% instances), discourse:filler (17; 2% instances), nsubj (17; 2% instances), appos (13; 2% instances), reparandum (8; 1% instances), acl (4; 1% instances), flat:foreign (4; 1% instances), mark (4; 1% instances), aux (3; 0% instances), nummod (3; 0% instances), parataxis:discourse (3; 0% instances), conj:extend (2; 0% instances), advcl (1; 0% instances), cc:preconj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (193; 28% instances), ADP (163; 24% instances), ADV (40; 6% instances), CCONJ (38; 6% instances), PART (38; 6% instances), DET (34; 5% instances), AUX (30; 4% instances), PUNCT (26; 4% instances), ADJ (24; 3% instances), NOUN (24; 3% instances), INTJ (23; 3% instances), VERB (21; 3% instances), X (13; 2% instances), SCONJ (8; 1% instances), PRON (7; 1% instances), NUM (4; 1% instances)