Treebank Statistics: UD_Slovenian-SSJ: POS Tags: PROPN
There are 5051 PROPN
lemmas (20%), 6250 PROPN
types (12%) and 10239 PROPN
tokens (4%).
Out of 17 observed tags, the rank of PROPN
is: 3 in number of lemmas, 4 in number of types and 9 in number of tokens.
The 10 most frequent PROPN
lemmas: Slovenija, Ljubljana, Evropa, EU, Maribor, ZDA, Amerika, Nemčija, Slovenec, Italija
The 10 most frequent PROPN
types: Slovenije, Sloveniji, EU, Slovenija, ZDA, Evropi, Ljubljana, Ljubljani, Evrope, Slovenijo
The 10 most frequent ambiguous lemmas: New (PROPN 20, X 12), London (PROPN 17, X 1), Berlin (PROPN 15, X 1), Marija (PROPN 14, X 1), York (PROPN 14, X 10), Milan (PROPN 13, X 1), Sonce (PROPN 13, NOUN 1), Windows (PROPN 13, X 4), Koper (PROPN 12, X 1), Microsoft (PROPN 10, X 1)
The 10 most frequent ambiguous types: New (PROPN 20, X 12), Windows (PROPN 13, X 5), Ali (ADV 44, CCONJ 16, PROPN 10, X 1), Milan (PROPN 9, X 1), Yorku (PROPN 9, X 1), Zemlja (PROPN 8, NOUN 1), Hrvaške (PROPN 7, ADJ 1), Microsoft (PROPN 7, X 1), Union (PROPN 7, X 2), Dolenjske (PROPN 6, ADJ 3)
- New
- Windows
- Ali
- ADV 44: Ali gre za prve znake infekcijeske bolezni ali tumorja ?
- CCONJ 16: Ali natančneje : z veseljem vozili .
- PROPN 10: Ali je hitro pogledala mlajšo čarodejko .
- X 1: Če se pod odrom , na katerem “ igra ” Loop Guru , zbere več ljudi kot pod tistim , na katerem poje Nusrat Fateh Ali Khan , če ljudi bolj od Cesarie Evore gane Natacha Atlas , potem nisem ostarela jaz , ampak so posiveli moderni , mladi časi .
- Milan
- PROPN 9: Zaključnega srečanja sta se udeležila tudi dr. Dušan Plut in predsednik RS Milan Kučan .
- X 1: Zanimanje za Beckhama sta ob koncu minule sezone kazala tudi italijanska kluba Inter in AC Milan , pred tednom dni pa je United sporočil , da se je dogovoril za možnost prodaje nogometaša v Barcelono .
- Yorku
- Zemlja
- Hrvaške
- Microsoft
- Union
- Dolenjske
Morphology
The form / lemma ratio of PROPN
is 1.237379 (the average of all parts of speech is 1.932008).
The 1st highest number of forms (6) was observed with the lemma “Francoz”: Francoz, Francoza, Francoze, Francozi, Francozom, Francozov.
The 2nd highest number of forms (6) was observed with the lemma “Hrvat”: Hrvat, Hrvate, Hrvati, Hrvatom, Hrvatov, Hrvatu.
The 3rd highest number of forms (6) was observed with the lemma “Ljubljančan”: Ljubljančan, Ljubljančana, Ljubljančane, Ljubljančani, Ljubljančanom, Ljubljančanov.
PROPN
occurs with 4 features: Case (10239; 100% instances), Gender (10239; 100% instances), Number (10239; 100% instances), Animacy (420; 4% instances)
PROPN
occurs with 14 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
PROPN
occurs with 35 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing
(4085 tokens).
Examples: Maribor, Janez, New, Bojan, Jože, Boris, John, Peter, Dušan, Gregor
Relations
PROPN
nodes are attached to their parents using 20 different relations: nmod (3466; 34% instances), nsubj (1761; 17% instances), flat:name (1603; 16% instances), obl (1002; 10% instances), conj (933; 9% instances), appos (368; 4% instances), obj (281; 3% instances), list (272; 3% instances), parataxis (197; 2% instances), root (142; 1% instances), iobj (109; 1% instances), orphan (41; 0% instances), vocative (25; 0% instances), acl (17; 0% instances), xcomp (7; 0% instances), advcl (6; 0% instances), ccomp (4; 0% instances), amod (3; 0% instances), csubj (1; 0% instances), dep (1; 0% instances)
Parents of PROPN
nodes belong to 13 different parts of speech: NOUN (3577; 35% instances), PROPN (3103; 30% instances), VERB (2903; 28% instances), ADJ (323; 3% instances), (142; 1% instances), X (122; 1% instances), DET (22; 0% instances), ADV (15; 0% instances), NUM (15; 0% instances), PRON (8; 0% instances), PART (4; 0% instances), SYM (3; 0% instances), AUX (2; 0% instances)
4517 (44%) PROPN
nodes are leaves.
3340 (33%) PROPN
nodes have one child.
1458 (14%) PROPN
nodes have two children.
924 (9%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 32.
Children of PROPN
nodes are attached using 25 different relations: case (2003; 20% instances), punct (1932; 19% instances), flat:name (1631; 16% instances), conj (988; 10% instances), nmod (590; 6% instances), nummod (531; 5% instances), amod (479; 5% instances), cc (438; 4% instances), appos (357; 4% instances), list (324; 3% instances), advmod (235; 2% instances), acl (204; 2% instances), orphan (85; 1% instances), cop (55; 1% instances), det (52; 1% instances), parataxis (42; 0% instances), nsubj (39; 0% instances), dep (22; 0% instances), mark (21; 0% instances), obl (18; 0% instances), aux (14; 0% instances), cc:preconj (9; 0% instances), advcl (3; 0% instances), discourse (2; 0% instances), flat (2; 0% instances)
Children of PROPN
nodes belong to 17 different parts of speech: PROPN (3103; 31% instances), ADP (1980; 20% instances), PUNCT (1932; 19% instances), NUM (575; 6% instances), NOUN (566; 6% instances), ADJ (503; 5% instances), CCONJ (456; 5% instances), X (284; 3% instances), VERB (207; 2% instances), PART (166; 2% instances), ADV (83; 1% instances), AUX (69; 1% instances), DET (68; 1% instances), SCONJ (52; 1% instances), PRON (17; 0% instances), SYM (11; 0% instances), INTJ (4; 0% instances)