
This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: Case

This feature is universal. It occurs with 5 different values: Acc, Dat, Gen, Nom, Voc. Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom, Dat|Gen.

93973 tokens (43%) have a non-empty value of Case. 15031 types (48%) occur at least once with a non-empty value of Case. 7817 lemmas (45%) occur at least once with a non-empty value of Case. The feature is used with 7 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (36891; 17%), ADP (31054; 14%), PRON (11659; 5%), DET (7942; 4%), ADJ (5612; 3%), NUM (495; <1%), PROPN (320; <1%).
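Counts of this kind can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that types are counted as case-sensitive word forms and that multiword-token ranges and empty nodes are skipped. The names `parse_feats`, `case_counts` and `SAMPLE` are hypothetical, and the sample sentence is made up for illustration.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Acc,Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def case_counts(conllu_text):
    """Count tokens, word forms and lemmas that carry a non-empty Case value."""
    total = 0
    with_case = 0
    types, lemmas = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        total += 1
        if "Case" in parse_feats(cols[5]):
            with_case += 1
            types.add(cols[1])    # FORM column
            lemmas.add(cols[2])   # LEMMA column
            by_upos[cols[3]] += 1  # UPOS column
    return total, with_case, len(types), len(lemmas), by_upos

# Hypothetical two-token sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = țara merge",
    "1\tțara\tțară\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Gender=Fem|Number=Sing\t0\troot\t_\t_",
    "2\tmerge\tmerge\tVERB\t_\tNumber=Sing|Person=3\t1\tacl\t_\t_",
])

total, with_case, n_types, n_lemmas, by_upos = case_counts(SAMPLE)
print(total, with_case, n_types, n_lemmas, dict(by_upos))
```

Note that a combined value is stored in CoNLL-U as a single comma-separated string (`Case=Acc,Nom`), so the sketch treats it as one value rather than splitting it.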

NOUN

36891 NOUN tokens (68% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (29994; 81%), Definite=Def (27196; 74%), Gender=Fem (27019; 73%).

NOUN tokens may have the following values of Case:

Paradigm țară, by value of Case:
Definite=Def|Number=Sing: țara (Acc,Nom); țării, țărei (Dat,Gen)
Definite=Def|Number=Plur: țările (Acc,Nom); țărilor (Dat,Gen)
Definite=Ind|Number=Sing: țară (Acc,Nom); țări (Dat,Gen); țări (Acc)

ADP

31054 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (31054; 100%).

ADP tokens may have the following values of Case:

Case seems to be a lexical feature of ADP: 100% of lemmas (57) occur with only one value of Case.
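The "lexical feature" observation rests on a simple check: for each lemma of a given part of speech, collect the distinct Case values it occurs with, then report how many lemmas have exactly one. A minimal sketch of that check follows. It assumes that only lemmas with at least one non-empty Case value are counted, which may differ from the script behind this page. The function name `single_value_share` and the rows in `ROWS` are hypothetical illustrations, not treebank data.

```python
from collections import defaultdict

def single_value_share(rows, upos, feature="Case"):
    """rows: iterable of (lemma, upos, feats_dict).

    Returns (number of lemmas seen with the feature,
             share of those lemmas that occur with exactly one value).
    """
    values = defaultdict(set)
    for lemma, tag, feats in rows:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)

# Hypothetical rows for illustration only.
ROWS = [
    ("în", "ADP", {"Case": "Acc"}),
    ("în", "ADP", {"Case": "Acc"}),
    ("asupra", "ADP", {"Case": "Gen"}),
    ("țară", "NOUN", {"Case": "Acc,Nom"}),
    ("țară", "NOUN", {"Case": "Dat,Gen"}),
]

adp_lemmas, adp_share = single_value_share(ROWS, "ADP")
noun_lemmas, noun_share = single_value_share(ROWS, "NOUN")
print(adp_lemmas, adp_share, noun_lemmas, noun_share)
```

In this toy input each ADP lemma keeps a single Case value, while the NOUN lemma inflects for it, which is the contrast the statement above describes.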

PRON

11659 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: Person=3 (10601; 91%), Variant=EMPTY (9597; 82%), Gender=EMPTY (8599; 74%), Reflex=EMPTY (7723; 66%), PronType=Prs (7100; 61%), Number=EMPTY (6685; 57%), Strength=Weak (6197; 53%).
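Values such as `Variant=EMPTY` indicate tokens that have Case but lack the other feature altogether. One way such a co-occurrence list could be computed is sketched below. It makes two assumptions that are not confirmed by this page: that a feature counts as EMPTY when it is attested somewhere with the same part of speech but absent on the token, and that only values covering more than 50% of the Case-bearing tokens are listed. The name `cooccurring_values` and the `TOKENS` sample are hypothetical.

```python
from collections import Counter

def cooccurring_values(tokens, upos, feature="Case"):
    """tokens: list of (upos, feats_dict).

    Returns {"Feature=Value": (count, percent)} for values that co-occur with
    `feature` on more than half of the tokens of `upos` that carry `feature`.
    """
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    # Features attested with this UPOS anywhere; a missing one is counted as EMPTY.
    attested = {name for tag, feats in tokens if tag == upos for name in feats}
    attested.discard(feature)
    counts = Counter()
    for feats in selected:
        for name in sorted(attested):
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    n = len(selected)
    return {fv: (c, round(100 * c / n)) for fv, c in counts.items() if 2 * c > n}

# Hypothetical tokens for illustration only.
TOKENS = [
    ("PRON", {"Case": "Acc", "Person": "3", "Strength": "Weak"}),
    ("PRON", {"Case": "Dat", "Person": "3", "Strength": "Weak", "Number": "Sing"}),
    ("PRON", {"Case": "Acc,Nom", "Person": "3", "Strength": "Strong",
              "Number": "Sing", "Gender": "Masc"}),
    ("PRON", {"Person": "1"}),  # no Case, so it is not selected
]

result = cooccurring_values(TOKENS, "PRON")
print(result)
```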

PRON tokens may have the following values of Case:

Paradigm el (Case columns: Acc,Nom; Dat,Gen; Acc; Dat). Forms that belong to different columns are separated by a slash.
Gender=Masc|Number=Sing|Strength=Strong: el / lui
Gender=Masc|Number=Sing|Strength=Weak: îl
Gender=Masc|Number=Sing|Strength=Weak|Variant=Short: -l, l-, l
Gender=Masc|Number=Plur|Strength=Strong: ei
Gender=Masc|Number=Plur|Strength=Weak: îi, i
Gender=Masc|Number=Plur|Strength=Weak|Variant=Short: -i, i- / -i
Gender=Fem|Number=Sing|Strength=Strong: ea / ei
Gender=Fem|Number=Sing|Strength=Weak: o
Gender=Fem|Number=Sing|Strength=Weak|Variant=Short: -o
Gender=Fem|Number=Plur|Strength=Strong: ele
Gender=Fem|Number=Plur|Strength=Weak: le
Gender=Fem|Number=Plur|Strength=Weak|Variant=Short: le-, -le
Number=Sing|Strength=Weak: îi, i
Number=Sing|Strength=Weak|Variant=Short: i- / -i, i-
Number=Plur|Strength=Strong: lor
Number=Plur|Strength=Weak: le, li
Number=Plur|Strength=Weak|Variant=Short: le-, -le, -li

DET

7942 DET tokens (66% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (7742; 97%), Number=Sing (6390; 80%), Position=EMPTY (6056; 76%), PronType=Ind (5659; 71%), Person=EMPTY (5258; 66%).

DET tokens may have the following values of Case:

Paradigm un, by value of Case:
Gender=Masc|Number=Sing: un, -un (Acc,Nom); unui (Dat,Gen)
Gender=Masc|Number=Plur|Person=3|Position=Prenom: unii (Acc,Nom)
Gender=Fem|Number=Sing: o, -o (Acc,Nom); unei (Dat,Gen)
Gender=Fem|Number=Plur|Person=3|Position=Prenom: unele (Acc,Nom)
Number=Plur: unor (Dat,Gen)

ADJ

5612 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (5596; 100%), Number=Sing (5413; 96%), Gender=Fem (5182; 92%), Definite=Ind (4718; 84%).

ADJ tokens may have the following values of Case:

Paradigm mare, by value of Case:
Definite=Def|Gender=Masc|Number=Sing: marele (Acc,Nom); marelui (Dat,Gen)
Definite=Def|Gender=Masc|Number=Plur: marii (Acc,Nom)
Definite=Def|Gender=Fem|Number=Sing: marea (Acc,Nom); Marii (Dat,Gen)
Definite=Def|Gender=Fem|Number=Plur: marile (Acc,Nom)
Definite=Def|Number=Plur: marilor (Dat,Gen)
Definite=Ind|Gender=Fem|Number=Sing: mari (Dat,Gen)

NUM

495 NUM tokens (9% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (447; 90%), NumType=Ord (316; 64%), Gender=Fem (298; 60%), Number=Sing (297; 60%).

NUM tokens may have the following values of Case:

Paradigm prim, by value of Case:
Definite=Def|Gender=Masc|Number=Sing: primul (Acc,Nom); primului (Dat,Gen)
Definite=Def|Gender=Masc|Number=Plur: primii (Acc,Nom); primilor (Dat,Gen)
Definite=Def|Gender=Fem|Number=Sing: prima (Acc,Nom); primei (Dat,Gen)
Definite=Def|Gender=Fem|Number=Plur: primele (Acc,Nom); primelor (Dat,Gen)
Definite=Ind|Gender=Fem|Number=Sing: primă (Acc,Nom); prime (Dat,Gen)

PROPN

320 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Case.

PROPN tokens may have the following values of Case:

Paradigm București, by value of Case:
Bucureștiul (Acc,Nom); Bucureștiului (Dat,Gen)

Case seems to be a lexical feature of PROPN: 93% of lemmas (95) occur with only one value of Case.

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[conj]–> NOUN (1950; 76%), ADP –[fixed]–> ADP (1284; 99%), ADJ –[conj]–> ADJ (200; 89%), NOUN –[nsubj]–> NOUN (161; 51%), PRON –[fixed]–> PRON (67; 100%), NOUN –[nsubj]–> PRON (56; 56%), NOUN –[flat]–> ADJ (52; 76%), NOUN –[fixed]–> ADJ (50; 78%), NOUN –[fixed]–> NOUN (47; 52%), NOUN –[conj]–> PRON (44; 65%).
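Agreement figures like these can be approximated by walking every dependency edge and comparing the Case values of parent and child. The sketch below is a hedged illustration, not the code behind this page: it assumes that the percentage is taken over edges of the same type where both nodes have a non-empty Case, and that combined values such as `Acc,Nom` must match as whole strings. The name `case_agreement` and the sample sentence are hypothetical.

```python
from collections import Counter

def case_agreement(sentences):
    """sentences: list of sentences, each a list of
    (id, upos, case_or_None, head_id, deprel) tuples.

    Returns {(parent_upos, deprel, child_upos): (agreeing_edges, percent)}.
    """
    agree, both = Counter(), Counter()
    for sent in sentences:
        by_id = {tok[0]: tok for tok in sent}
        for _tid, upos, case, head, deprel in sent:
            parent = by_id.get(head)  # None for the root (head 0)
            if parent is None or case is None or parent[2] is None:
                continue
            key = (parent[1], deprel, upos)
            both[key] += 1
            if case == parent[2]:
                agree[key] += 1
    return {key: (agree[key], round(100 * agree[key] / both[key])) for key in both}

# Hypothetical sentence for illustration only.
SENTENCES = [[
    (1, "NOUN", "Acc,Nom", 0, "root"),
    (2, "NOUN", "Acc,Nom", 1, "conj"),
    (3, "NOUN", "Dat,Gen", 1, "conj"),
    (4, "ADJ", None, 1, "amod"),
]]

stats = case_agreement(SENTENCES)
print(stats)
```

Here one of the two NOUN-to-NOUN `conj` edges agrees in Case; the `amod` edge is ignored because the child has no Case value.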