Treebank Statistics: UD_Slovenian-SST: POS Tags: PRON
There are 18 PRON
lemmas (0%), 79 PRON
types (1%) and 1641 PRON
tokens (6%).
Out of 16 observed tags, the rank of PRON
is: 13 in number of lemmas, 9 in number of types and 10 in number of tokens.
The 10 most frequent PRON
lemmas: se, jaz, on, kaj, ti, kar, kdo, nekdo, zame, karkoli
The 10 most frequent PRON
types: se, kaj, jaz, mi, ti, ga, jih, si, jo, kar
The 10 most frequent ambiguous lemmas: on (PRON 308, X 2), kaj (PRON 197, ADV 43, X 1), ti (PRON 194, INTJ 1, X 1), kar (ADV 71, PRON 36)
The 10 most frequent ambiguous types: se (PRON 398, X 1), kaj (PRON 186, ADV 43, X 1), ti (PRON 100, DET 15, INTJ 1, X 1), ga (PRON 60, X 1), si (AUX 55, PRON 49, VERB 16), jo (PRON 37, INTJ 1), kar (ADV 71, PRON 34), on (PRON 24, X 2), ona (PRON 23, DET 6), te (DET 37, ADV 21, PRON 13)
- se
- kaj
- ti
- ga
- si
- jo
- kar
- on
- ona
- te
Morphology
The form / lemma ratio of PRON
is 4.388889 (the average of all parts of speech is 1.573353).
The 1st highest number of forms (22) was observed with the lemma “on”: ga, je, ji, jih, jim, jo, mu, nje, njega, njej, njem, njemu, njih, njim, njima, njimi, njo, on, ona, onadva, one, oni.
The 2nd highest number of forms (12) was observed with the lemma “jaz”: jaz, mano, me, mene, meni, mi, midva, midve, nam, nama, nami, nas.
The 3rd highest number of forms (12) was observed with the lemma “ti”: tabo, te, tebe, tebi, ti, vaju, vam, vami, vas, vi, vidva, vidve.
PRON
occurs with 7 features: PronType (1641; 100% instances), Case (1243; 76% instances), Number (1179; 72% instances), Person (894; 54% instances), Variant (804; 49% instances), Gender (682; 42% instances), Reflex (462; 28% instances)
PRON
occurs with 23 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Prs
, PronType=Rel
, Reflex=Yes
, Variant=Bound
, Variant=Short
PRON
occurs with 102 feature combinations.
The most frequent feature combination is PronType=Prs|Reflex=Yes|Variant=Short
(398 tokens).
Examples: se
Relations
PRON
nodes are attached to their parents using 21 different relations: obj (434; 26% instances), nsubj (428; 26% instances), expl (419; 26% instances), iobj (88; 5% instances), obl (81; 5% instances), root (58; 4% instances), reparandum (24; 1% instances), nmod (21; 1% instances), cc (16; 1% instances), discourse (13; 1% instances), parataxis (12; 1% instances), conj:extend (11; 1% instances), conj (10; 1% instances), orphan (6; 0% instances), ccomp (5; 0% instances), acl (4; 0% instances), dislocated (3; 0% instances), fixed (3; 0% instances), vocative (3; 0% instances), advcl (1; 0% instances), appos (1; 0% instances)
Parents of PRON
nodes belong to 12 different parts of speech: VERB (1382; 84% instances), NOUN (59; 4% instances), (58; 4% instances), ADJ (51; 3% instances), PRON (19; 1% instances), DET (18; 1% instances), X (17; 1% instances), AUX (11; 1% instances), ADV (9; 1% instances), PART (9; 1% instances), PROPN (7; 0% instances), NUM (1; 0% instances)
1385 (84%) PRON
nodes are leaves.
152 (9%) PRON
nodes have one child.
45 (3%) PRON
nodes have two children.
59 (4%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 8.
Children of PRON
nodes are attached using 29 different relations: case (108; 23% instances), advmod (69; 15% instances), punct (51; 11% instances), reparandum (27; 6% instances), cc (26; 6% instances), cop (19; 4% instances), fixed (19; 4% instances), nmod (19; 4% instances), nsubj (18; 4% instances), discourse (17; 4% instances), parataxis (16; 3% instances), discourse:filler (11; 2% instances), conj (8; 2% instances), advcl (7; 2% instances), orphan (7; 2% instances), acl (6; 1% instances), obj (6; 1% instances), parataxis:discourse (5; 1% instances), appos (4; 1% instances), det (4; 1% instances), vocative (4; 1% instances), amod (3; 1% instances), parataxis:restart (3; 1% instances), dislocated (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), cc:preconj (1; 0% instances), csubj (1; 0% instances), obl (1; 0% instances)
Children of PRON
nodes belong to 16 different parts of speech: ADP (104; 22% instances), PUNCT (51; 11% instances), VERB (46; 10% instances), CCONJ (42; 9% instances), ADV (39; 8% instances), PART (36; 8% instances), NOUN (26; 6% instances), DET (21; 5% instances), AUX (19; 4% instances), PRON (19; 4% instances), X (15; 3% instances), SCONJ (13; 3% instances), INTJ (12; 3% instances), ADJ (10; 2% instances), PROPN (8; 2% instances), NUM (5; 1% instances)