home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: Case

This feature is universal. It occurs with 5 different values: Acc, Dat, Gen, Nom, Voc. Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom, Dat|Gen.

94508 tokens (43%) have a non-empty value of Case. 15028 types (48%) occur at least once with a non-empty value of Case. 7803 lemmas (45%) occur at least once with a non-empty value of Case. The feature is used with 7 part-of-speech tags: NOUN (36892; 17% instances), ADP (31052; 14% instances), PRON (12180; 6% instances), DET (7965; 4% instances), ADJ (5604; 3% instances), NUM (495; 0% instances), PROPN (320; 0% instances).

NOUN

36892 NOUN tokens (68% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (29938; 81%), Definite=Def (27201; 74%), Gender=Fem (27017; 73%).

NOUN tokens may have the following values of Case:

Paradigm țarăAcc,NomDat,GenAcc
Definite=Def|Number=Singțarațării, țărei
Definite=Def|Number=Plurțărilețărilor
Definite=Ind|Number=Singțarățărițări

ADP

31052 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (31052; 100%), ExtPos=EMPTY (27747; 89%).

ADP tokens may have the following values of Case:

Paradigm îndărătulAccGen
îndărătul
Variant=Short-ndărătul

Case seems to be lexical feature of ADP. 98% lemmas (53) occur only with one value of Case.

PRON

12180 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: Person=3 (11122; 91%), Variant=EMPTY (10067; 83%), Gender=EMPTY (8781; 72%), Reflex=EMPTY (8244; 68%), PronType=Prs (7618; 63%), Number=EMPTY (6687; 55%), Strength=Weak (6202; 51%).

PRON tokens may have the following values of Case:

Paradigm elAcc,NomDat,GenAccDat
Gender=Masc|Number=Sing|Strength=Strongellui
Gender=Masc|Number=Sing|Strength=Weakîl
Gender=Masc|Number=Sing|Strength=Weak|Variant=Short-l, l-
Gender=Masc|Number=Plur|Strength=Strongei
Gender=Masc|Number=Plur|Strength=Weakîi
Gender=Masc|Number=Plur|Strength=Weak|Variant=Short-i, i--i
Gender=Fem|Number=Sing|Strength=Strongeaei
Gender=Fem|Number=Sing|Strength=Weako
Gender=Fem|Number=Sing|Strength=Weak|Variant=Short-o
Gender=Fem|Number=Plur|Strength=Strongele
Gender=Fem|Number=Plur|Strength=Weakle
Gender=Fem|Number=Plur|Strength=Weak|Variant=Shortle-, -le
Number=Sing|Strength=Weakîi, i
Number=Sing|Strength=Weak|Variant=Shorti--i, i-
Number=Plur|Strength=Stronglor
Number=Plur|Strength=Weakle, li
Number=Plur|Strength=Weak|Variant=Shortle-, -le, -li

DET

7965 DET tokens (69% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (7766; 98%), Number=Sing (6413; 81%), Position=EMPTY (6079; 76%), PronType=Ind (5663; 71%), Person=EMPTY (5282; 66%).

DET tokens may have the following values of Case:

Paradigm unAcc,NomDat,Gen
ExtPos=ADV|Gender=Masc|Number=Singun
ExtPos=ADV|Gender=Fem|Number=Singo
Gender=Masc|Number=Singununui
Gender=Masc|Number=Sing|Variant=Short-un
Gender=Masc|Number=Plur|Person=3|Position=Prenomunii
Gender=Fem|Number=Singounei
Gender=Fem|Number=Sing|Variant=Short-o
Gender=Fem|Number=Plur|Person=3|Position=Prenomunele
Number=Plurunor

ADJ

5604 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (5588; 100%), Number=Sing (5406; 96%), Gender=Fem (5175; 92%), Definite=Ind (4711; 84%).

ADJ tokens may have the following values of Case:

Paradigm mareAcc,NomDat,Gen
Definite=Def|Gender=Masc|Number=Singmarelemarelui
Definite=Def|Gender=Masc|Number=Plurmarii
Definite=Def|Gender=Fem|Number=SingmareaMarii
Definite=Def|Gender=Fem|Number=Plurmarile
Definite=Def|Number=Plurmarilor
Definite=Ind|Gender=Fem|Number=Singmari

NUM

495 NUM tokens (9% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (447; 90%), NumType=Ord (316; 64%), Gender=Fem (298; 60%), Number=Sing (297; 60%).

NUM tokens may have the following values of Case:

Paradigm primAcc,NomDat,Gen
Definite=Def|Gender=Masc|Number=Singprimulprimului
Definite=Def|Gender=Masc|Number=Plurprimiiprimilor
Definite=Def|Gender=Fem|Number=Singprimaprimei
Definite=Def|Gender=Fem|Number=Plurprimeleprimelor
Definite=Ind|Gender=Fem|Number=Singprimăprime

PROPN

320 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Case.

PROPN tokens may have the following values of Case:

Paradigm BucureștiAcc,NomDat,Gen
BucureștiulBucureștiului

Case seems to be lexical feature of PROPN. 93% lemmas (95) occur only with one value of Case.

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[conj]–> NOUN (1952; 76%), ADP –[fixed]–> ADP (1266; 99%), ADJ –[conj]–> ADJ (201; 89%), NOUN –[nsubj]–> NOUN (169; 52%), PRON –[fixed]–> PRON (66; 100%), NOUN –[nsubj]–> PRON (59; 58%), NOUN –[flat]–> ADJ (52; 76%), NOUN –[conj]–> PRON (43; 62%), PRON –[nsubj]–> NOUN (37; 80%), PRON –[nmod]–> PRON (30; 94%).