Treebank Statistics: UD_Romanian-RRT: Features: Variant
This feature is language-specific.
It occurs with 1 value: Short.
3271 tokens (1%) have a non-empty value of Variant.
311 types (1%) occur at least once with a non-empty value of Variant.
237 lemmas (1%) occur at least once with a non-empty value of Variant.
The feature is used with 11 part-of-speech tags: PRON (2062; 1% instances), ADP (575; 0% instances), VERB (270; 0% instances), PART (140; 0% instances), DET (115; 0% instances), AUX (70; 0% instances), ADV (12; 0% instances), SCONJ (11; 0% instances), NOUN (10; 0% instances), ADJ (3; 0% instances), CCONJ (3; 0% instances).
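Per-tag counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column tab-separated CoNLL-U layout (UPOS in column 4, FEATS in column 6). The helper names `parse_feats` and `variant_counts_by_upos` and the three-token `SAMPLE` are hypothetical illustrations, not taken from the treebank or from the script that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Acc|Variant=Short' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def variant_counts_by_upos(conllu_text):
    """Count tokens with a non-empty Variant feature, grouped by UPOS."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if "Variant" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts


# Hypothetical fragment for illustration only; the annotation is an assumption.
SAMPLE = "\n".join([
    "# text = s-a dus",
    "1\ts-\tsine\tPRON\t_\tCase=Acc|Person=3|PronType=Prs|Reflex=Yes|Strength=Weak|Variant=Short\t3\texpl:pv\t_\t_",
    "2\ta\tavea\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres\t3\taux\t_\t_",
    "3\tdus\tduce\tVERB\t_\tGender=Masc|Number=Sing|VerbForm=Part\t0\troot\t_\t_",
])

print(variant_counts_by_upos(SAMPLE))  # Counter({'PRON': 1})
```

Run over the full treebank files, the same counter would give one total per part-of-speech tag; dividing by a per-tag token count would give shares such as those reported in the per-tag sections.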
PRON
2062 PRON tokens (17% of all PRON tokens) have a non-empty value of Variant.
The most frequent other feature values with which PRON and Variant co-occurred: PronType=Prs (2057; 100%), Strength=Weak (2055; 100%), Person=3 (1740; 84%), Gender=EMPTY (1629; 79%), Case=Acc (1332; 65%), Number=EMPTY (1114; 54%), Reflex=Yes (1114; 54%).
PRON tokens may have the following values of Variant:
Short (2062; 100% of non-empty Variant): s-, -și, -l, și-, -i, -se, -o, l-, i-, m-
ADP
575 ADP tokens (2% of all ADP tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADP and Variant co-occurred: AdpType=Prep (575; 100%), Case=Acc (574; 100%), ExtPos=EMPTY (555; 97%).
ADP tokens may have the following values of Variant:
Short (575; 100% of non-empty Variant): într-, dintr-, de-, printr-, -n, pe-, d-, n, p-, c-
VERB
270 VERB tokens (1% of all VERB tokens) have a non-empty value of Variant.
The most frequent other feature values with which VERB and Variant co-occurred: Gender=EMPTY (268; 99%), Number=EMPTY (237; 88%), Mood=EMPTY (224; 83%), Person=EMPTY (224; 83%), Tense=EMPTY (224; 83%), VerbForm=Ger (221; 82%).
VERB tokens may have the following values of Variant:
Short (270; 100% of non-empty Variant): făcându, dându, -i, asigurându, tăindu, lovindu, mișcându, rupându, transformându, întărindu
Variant seems to be a lexical feature of VERB: 100% of lemmas (187) occur with only one value of Variant.
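The lexical-feature check above can be sketched as follows. This is an illustration under a stated assumption: only non-empty values of the feature are counted per lemma, which may differ from how the page's generator treats tokens without the feature. Under that assumption, a feature with a single possible value (here Short) yields 100% by construction. The function name `single_value_lemma_share` and the sample observations are hypothetical.

```python
from collections import defaultdict


def single_value_lemma_share(observations, feature="Variant"):
    """Return (lemmas seen with the feature, share of them with exactly one value).

    observations: iterable of (lemma, feats_dict) pairs for one part-of-speech tag.
    Assumption: tokens lacking the feature do not count as a separate value.
    """
    values_by_lemma = defaultdict(set)
    for lemma, feats in observations:
        if feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    total = len(values_by_lemma)
    if total == 0:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return total, single / total


# Hypothetical VERB observations for illustration only.
VERB_OBSERVATIONS = [
    ("face", {"VerbForm": "Ger", "Variant": "Short"}),
    ("face", {"VerbForm": "Ger", "Variant": "Short"}),
    ("da", {"VerbForm": "Ger", "Variant": "Short"}),
    ("da", {"VerbForm": "Inf"}),  # no Variant: ignored under the assumption above
]

print(single_value_lemma_share(VERB_OBSERVATIONS))  # (2, 1.0)
```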
PART
140 PART tokens (3% of all PART tokens) have a non-empty value of Variant.
The most frequent other feature values with which PART and Variant co-occurred: PartType=EMPTY (139; 99%), Mood=EMPTY (99; 71%), Polarity=Neg (98; 70%).
PART tokens may have the following values of Variant:
Short (140; 100% of non-empty Variant): n-, s-, -a
DET
115 DET tokens (1% of all DET tokens) have a non-empty value of Variant.
The most frequent other feature values with which DET and Variant co-occurred: Person=EMPTY (115; 100%), Position=EMPTY (115; 100%), Poss=EMPTY (115; 100%), PronType=Art (110; 96%), Number=Sing (106; 92%), Gender=Masc (84; 73%), Case=EMPTY (66; 57%).
DET tokens may have the following values of Variant:
Short (115; 100% of non-empty Variant): -lea, -ul, -a, -ului, -uri, -urilor, -ilor, -urile
AUX
70 AUX tokens (1% of all AUX tokens) have a non-empty value of Variant.
The most frequent other feature values with which AUX and Variant co-occurred: Number=Sing (56; 80%), Person=3 (56; 80%), Mood=EMPTY (49; 70%), Tense=EMPTY (49; 70%), VerbForm=EMPTY (47; 67%).
AUX tokens may have the following values of Variant:
Short (70; 100% of non-empty Variant): -a, -i, -au, -ai, -aș, -ar, E-, fiindu, -am, -ați
ADV
12 ADV tokens (0% of all ADV tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADV and Variant co-occurred: Degree=Pos (11; 92%), PronType=EMPTY (11; 92%).
ADV tokens may have the following values of Variant:
Short (12; 100% of non-empty Variant): așa-, -nainte, -aici, -așa, -ncoace, -ntotdeauna, cân’
SCONJ
11 SCONJ tokens (1% of all SCONJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which SCONJ and Variant co-occurred: Polarity=Pos (11; 100%).
SCONJ tokens may have the following values of Variant:
Short (11; 100% of non-empty Variant): c-, de-, dac-
NOUN
10 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Variant.
The most frequent other feature values with which NOUN and Variant co-occurred: Number=Sing (10; 100%), Gender=Masc (9; 90%), Case=Acc,Nom (7; 70%), Definite=Def (7; 70%).
NOUN tokens may have the following values of Variant:
Short (10; 100% of non-empty Variant): rându, -mai, -nceput, -nlăuntrul, -ntinderea, -ntuneric, dracu, sufletu, timpu’
ADJ
3 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADJ and Variant co-occurred: Case=EMPTY (3; 100%), Definite=Ind (3; 100%), Degree=Pos (3; 100%), Number=Plur (3; 100%), Gender=EMPTY (2; 67%).
ADJ tokens may have the following values of Variant:
Short (3; 100% of non-empty Variant): -ntregi, -ndelungate
CCONJ
3 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which CCONJ and Variant co-occurred: Polarity=Pos (3; 100%).
CCONJ tokens may have the following values of Variant:
Short (3; 100% of non-empty Variant): da’, Ș-
Relations with Agreement in Variant
The most frequent relations (at most 10 are listed) where the parent and child nodes agree in Variant:
PRON –[iobj]–> PRON (1; 100%).
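Agreement counts of this kind can be approximated by comparing each token's Variant value with that of its head. The sketch below assumes that "agreement" means both nodes carry the same non-empty value; the exact definition and the percentage used by the page's generator are not stated here. The token dictionaries, the function name `agreement_relations`, and the sample sentence are hypothetical.

```python
from collections import Counter


def agreement_relations(tokens, feature="Variant"):
    """Count (parent UPOS, deprel, child UPOS) triples whose nodes share a feature value.

    tokens: list of dicts with keys id, upos, head, deprel, feats (one sentence).
    Assumption: agreement requires the same non-empty value on parent and child.
    """
    by_id = {token["id"]: token for token in tokens}
    agree = Counter()
    for child in tokens:
        parent = by_id.get(child["head"])
        if parent is None:  # the root (head 0) has no parent token
            continue
        child_value = child["feats"].get(feature)
        parent_value = parent["feats"].get(feature)
        if child_value is not None and child_value == parent_value:
            agree[(parent["upos"], child["deprel"], child["upos"])] += 1
    return agree


# Hypothetical sentence for illustration only.
SENTENCE = [
    {"id": 1, "upos": "PRON", "head": 2, "deprel": "iobj", "feats": {"Variant": "Short"}},
    {"id": 2, "upos": "PRON", "head": 0, "deprel": "root", "feats": {"Variant": "Short"}},
    {"id": 3, "upos": "VERB", "head": 2, "deprel": "acl", "feats": {"VerbForm": "Part"}},
]

print(agreement_relations(SENTENCE))  # Counter({('PRON', 'iobj', 'PRON'): 1})
```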