Treebank Statistics: UD_Slovenian-SST: POS Tags: PROPN
There are 309 PROPN
lemmas (8%), 348 PROPN
types (6%) and 758 PROPN
tokens (3%).
Out of 16 observed tags, the rank of PROPN
is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
The 10 most frequent PROPN
lemmas: [name:personal], [name:surname], Slovenija, Irak, Jones, Tom, [name:address], Božje, David, Healy
The 10 most frequent PROPN
types: [name:personal], [name:surname], slovenija, sloveniji, [name:address], jones, slovenije, tom, [name:organisation], david
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: božje (ADJ 1, PROPN 1), maja (NOUN 2, PROPN 1), mark (NOUN 3, PROPN 1)
- božje
- maja
- mark
Morphology
The form / lemma ratio of PROPN
is 1.126214 (the average of all parts of speech is 1.573353).
The 1st highest number of forms (4) was observed with the lemma “Ljubljana”: ljubljana, ljubljane, ljubljani, ljubljano.
The 2nd highest number of forms (3) was observed with the lemma “Ana”: ana, ani, ano.
The 3rd highest number of forms (3) was observed with the lemma “Irak”: irak, iraka, iraku.
PROPN
occurs with 4 features: Case (444; 59% instances), Gender (444; 59% instances), Number (444; 59% instances), Animacy (25; 3% instances)
PROPN
occurs with 14 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
PROPN
occurs with 28 feature combinations.
The most frequent feature combination is _
(314 tokens).
Examples: [name:personal], [name:surname], [name:address], [name:organisation], [name:place]
Relations
PROPN
nodes are attached to their parents using 20 different relations: nmod (139; 18% instances), flat:name (125; 16% instances), nsubj (115; 15% instances), obl (103; 14% instances), root (88; 12% instances), vocative (52; 7% instances), conj (49; 6% instances), parataxis (29; 4% instances), obj (27; 4% instances), appos (10; 1% instances), dislocated (7; 1% instances), amod (2; 0% instances), discourse (2; 0% instances), fixed (2; 0% instances), iobj (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), flat:foreign (1; 0% instances), parataxis:restart (1; 0% instances)
Parents of PROPN
nodes belong to 11 different parts of speech: VERB (268; 35% instances), PROPN (193; 25% instances), NOUN (155; 20% instances), (88; 12% instances), ADJ (23; 3% instances), DET (12; 2% instances), PRON (8; 1% instances), ADV (5; 1% instances), NUM (3; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)
361 (48%) PROPN
nodes are leaves.
248 (33%) PROPN
nodes have one child.
88 (12%) PROPN
nodes have two children.
61 (8%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 10.
Children of PROPN
nodes are attached using 29 different relations: case (165; 24% instances), flat:name (124; 18% instances), conj (55; 8% instances), advmod (53; 8% instances), discourse (34; 5% instances), cc (32; 5% instances), cop (27; 4% instances), parataxis (27; 4% instances), punct (26; 4% instances), amod (22; 3% instances), nmod (19; 3% instances), det (17; 2% instances), discourse:filler (17; 2% instances), nsubj (17; 2% instances), appos (13; 2% instances), reparandum (8; 1% instances), acl (4; 1% instances), flat:foreign (4; 1% instances), mark (4; 1% instances), aux (3; 0% instances), nummod (3; 0% instances), parataxis:discourse (3; 0% instances), conj:extend (2; 0% instances), fixed (2; 0% instances), advcl (1; 0% instances), cc:preconj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances)
Children of PROPN
nodes belong to 16 different parts of speech: PROPN (193; 28% instances), ADP (163; 24% instances), ADV (40; 6% instances), CCONJ (38; 6% instances), PART (38; 6% instances), DET (34; 5% instances), AUX (30; 4% instances), PUNCT (26; 4% instances), ADJ (24; 3% instances), NOUN (24; 3% instances), INTJ (23; 3% instances), VERB (21; 3% instances), X (13; 2% instances), SCONJ (8; 1% instances), PRON (7; 1% instances), NUM (4; 1% instances)