Treebank Statistics: UD_Romanian-RRT: Features: Variant
This feature is language-specific.
It occurs with 1 different values: Short.
3331 tokens (2%) have a non-empty value of Variant.
319 types (1%) occur at least once with a non-empty value of Variant.
251 lemmas (1%) occur at least once with a non-empty value of Variant.
The feature is used with 12 part-of-speech tags: PRON (2113; 1% instances), ADP (578; 0% instances), VERB (271; 0% instances), PART (140; 0% instances), DET (115; 0% instances), AUX (57; 0% instances), ADV (18; 0% instances), NOUN (11; 0% instances), SCONJ (10; 0% instances), NUM (8; 0% instances), ADJ (7; 0% instances), CCONJ (3; 0% instances).
PRON
2113 PRON tokens (17% of all PRON tokens) have a non-empty value of Variant.
The most frequent other feature values with which PRON and Variant co-occurred: PronType=Prs (2106; 100%), Strength=Weak (2104; 100%), Person=3 (1775; 84%), Gender=EMPTY (1672; 79%), Case=Acc (1340; 63%), Number=EMPTY (1120; 53%), Reflex=Yes (1120; 53%).
PRON tokens may have the following values of Variant:
Short(2113; 100% of non-emptyVariant): s-, -l, -și, -i, și-, -se, -o, l-, i-, m-
ADP
578 ADP tokens (2% of all ADP tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADP and Variant co-occurred: AdpType=Prep (578; 100%), Case=Acc (577; 100%), ExtPos=EMPTY (555; 96%).
ADP tokens may have the following values of Variant:
Short(578; 100% of non-emptyVariant): într-, dintr-, de-, printr-, -n, d-, p-, c-, pân’, -mpotriva
Variant seems to be lexical feature of ADP. 100% lemmas (10) occur only with one value of Variant.
VERB
271 VERB tokens (1% of all VERB tokens) have a non-empty value of Variant.
The most frequent other feature values with which VERB and Variant co-occurred: Gender=EMPTY (269; 99%), Number=EMPTY (238; 88%), Mood=EMPTY (225; 83%), Person=EMPTY (225; 83%), Tense=EMPTY (225; 83%), VerbForm=Ger (222; 82%).
VERB tokens may have the following values of Variant:
Short(271; 100% of non-emptyVariant): făcându, dându, -i, asigurându, tăindu, lovindu, mișcându, rupându, transformându, întărindu
Variant seems to be lexical feature of VERB. 100% lemmas (188) occur only with one value of Variant.
PART
140 PART tokens (3% of all PART tokens) have a non-empty value of Variant.
The most frequent other feature values with which PART and Variant co-occurred: PartType=EMPTY (140; 100%), Mood=EMPTY (99; 71%), Polarity=Neg (98; 70%).
PART tokens may have the following values of Variant:
Short(140; 100% of non-emptyVariant): n-, s-, -o
DET
115 DET tokens (1% of all DET tokens) have a non-empty value of Variant.
The most frequent other feature values with which DET and Variant co-occurred: Person=EMPTY (115; 100%), Position=EMPTY (115; 100%), Poss=EMPTY (115; 100%), Number=Sing (106; 92%), PronType=Art (106; 92%), Gender=Masc (85; 74%), Case=EMPTY (62; 54%).
DET tokens may have the following values of Variant:
Short(115; 100% of non-emptyVariant): -lea, -ul, -a, -ului, -uri, -o, -urilor, -ilor, -un, -urile
AUX
57 AUX tokens (1% of all AUX tokens) have a non-empty value of Variant.
The most frequent other feature values with which AUX and Variant co-occurred: Number=Sing (44; 77%), Person=3 (44; 77%), Mood=EMPTY (37; 65%), Tense=EMPTY (37; 65%), VerbForm=EMPTY (35; 61%).
AUX tokens may have the following values of Variant:
Short(57; 100% of non-emptyVariant): -i, -a, -ai, -am, -au, -aș, -ar, -or, fiindu, -oi
ADV
18 ADV tokens (0% of all ADV tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADV and Variant co-occurred: PronType=EMPTY (17; 94%), Degree=Pos (16; 89%).
ADV tokens may have the following values of Variant:
Short(18; 100% of non-emptyVariant): așa-, -așa, -nainte, ne-, -acu, -apoi, -napoi, -ncet, -ncoace, -ntotdeauna
Variant seems to be lexical feature of ADV. 100% lemmas (11) occur only with one value of Variant.
NOUN
11 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Variant.
The most frequent other feature values with which NOUN and Variant co-occurred: Number=Sing (10; 91%), Case=Acc,Nom (9; 82%), Definite=Def (9; 82%), Gender=Masc (9; 82%).
NOUN tokens may have the following values of Variant:
Short(11; 100% of non-emptyVariant): rându, -nceput, -nlăuntrul, -npicioarele, -ntinderea, -ntuneric, dracu, mezu-, sufletu, timpu’
Variant seems to be lexical feature of NOUN. 100% lemmas (10) occur only with one value of Variant.
SCONJ
10 SCONJ tokens (1% of all SCONJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which SCONJ and Variant co-occurred: Polarity=Pos (10; 100%).
SCONJ tokens may have the following values of Variant:
Short(10; 100% of non-emptyVariant): c-, dac-, de-
NUM
8 NUM tokens (0% of all NUM tokens) have a non-empty value of Variant.
The most frequent other feature values with which NUM and Variant co-occurred: Gender=Masc (8; 100%), NumForm=Word (8; 100%), NumType=Ord (8; 100%), Number=Sing (8; 100%).
NUM tokens may have the following values of Variant:
Short(8; 100% of non-emptyVariant): prim-
ADJ
7 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which ADJ and Variant co-occurred: Definite=Ind (7; 100%), Degree=Pos (7; 100%), Case=EMPTY (6; 86%), Gender=EMPTY (5; 71%).
ADJ tokens may have the following values of Variant:
Short(7; 100% of non-emptyVariant): -ntregi, -ndelungate, -nșelată, ex-, pre-, supra-
CCONJ
3 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Variant.
The most frequent other feature values with which CCONJ and Variant co-occurred: Polarity=Pos (3; 100%).
CCONJ tokens may have the following values of Variant:
Short(3; 100% of non-emptyVariant): da’, Ș-
Relations with Agreement in Variant
The 10 most frequent relations where parent and child node agree in Variant:
NOUN –[det]–> PRON (1; 100%),
PRON –[iobj]–> PRON (1; 100%).