Variant
: alternative form of word
In Slovenian, the Variant
feature is either a lexical or inflectional features of some pronouns.
Bound
: bound form
This value is assigned as a lexical feature of fused combinations of prepositions and personal pronouns that are currently tokenized as one word form and annotated as personal pronouns.
Examples
- To je zame preveč. “This is too much for me”
- Skozenj je stekel električni tok. “Electric current ran through him.”
- Prednjo so postavili kozarec vina. “A glass of wine was put in front of her.”
Short
: clitic form
This value is assigned as an inflectional feature to clitic personal pronouns in genitive, dative and accusative to distinguish them from their longer counterparts with the same lemma and set of features.
Examples
- ga (“him”, variant of njega)
- jih (“them”, variant of njih)
- jo (“her”, variant of njo)
- mi (“to me”, variant of meni) *se (“oneself”, used either as a variant of reflexive pronoun sebi or as an obligatory free morpheme with pseudo-reflexive verbs, such as smejati se “to laugh”)
Treebank Statistics (UD_Slovenian)
This feature is language-specific.
It occurs with 2 different values: Bound
, Short
.
3917 tokens (3%) have a non-empty value of Variant
.
57 types (0%) occur at least once with a non-empty value of Variant
.
15 lemmas (0%) occur at least once with a non-empty value of Variant
.
The feature is used with 1 part-of-speech tags: sl-pos/PRON (3917; 3% instances).
PRON
3917 sl-pos/PRON tokens (56% of all PRON
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which PRON
and Variant
co-occurred: PronType=Prs (3917; 100%), Gender=EMPTY (2575; 66%), Person=EMPTY (2328; 59%), Reflex=Yes (2328; 59%), Number=EMPTY (2328; 59%), Case=EMPTY (2062; 53%).
PRON
tokens may have the following values of Variant
:
Bound
(118; 3% of non-emptyVariant
): zanj, zame, zase, nanjo, zanjo, nanj, vanjo, vanj, zanje, nameShort
(3799; 97% of non-emptyVariant
): se, ga, jih, si, jo, mu, mi, ji, me, jim
Variant
seems to be lexical feature of PRON
. 100% lemmas (15) occur only with one value of Variant
.
Treebank Statistics (UD_Slovenian-SST)
This feature is language-specific.
It occurs with 2 different values: Bound
, Short
.
806 tokens (3%) have a non-empty value of Variant
.
19 types (0%) occur at least once with a non-empty value of Variant
.
7 lemmas (0%) occur at least once with a non-empty value of Variant
.
The feature is used with 1 part-of-speech tags: sl-pos/PRON (806; 3% instances).
PRON
806 sl-pos/PRON tokens (31% of all PRON
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which PRON
and Variant
co-occurred: PronType=Prs (806; 100%), Gender=EMPTY (596; 74%), Number=EMPTY (447; 55%), Person=EMPTY (447; 55%).
PRON
tokens may have the following values of Variant
:
Bound
(7; 1% of non-emptyVariant
): zanjo, vanj, zame, zanj, zase, zateShort
(799; 99% of non-emptyVariant
): se, mi, ga, jih, si, ti, jo, me, jim, mu