Treebank Statistics: UD_Romanian-RRT: Features: Case
This feature is universal.
It occurs with 5 different values: Acc, Dat, Gen, Nom, Voc.
Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom, Dat|Gen.
94508 tokens (43%) have a non-empty value of Case.
15028 types (48%) occur at least once with a non-empty value of Case.
7803 lemmas (45%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (36892; 17% instances), ADP (31052; 14% instances), PRON (12180; 6% instances), DET (7965; 4% instances), ADJ (5604; 3% instances), NUM (495; 0% instances), PROPN (320; 0% instances).
NOUN
36892 NOUN tokens (68% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (29938; 81%), Definite=Def (27201; 74%), Gender=Fem (27017; 73%).
NOUN tokens may have the following values of Case:
Acc(1; 0% of non-emptyCase): țăriAcc,Nom(28809; 78% of non-emptyCase): cazul, conformitate, timpul, statele, Comisia, parte, față, cadrul, partea, fațaDat,Gen(8034; 22% of non-emptyCase): comisiei, consiliului, Uniunii, comunității, tratamentului, partidului, statului, țării, produselor, statelorVoc(48; 0% of non-emptyCase): domnule, Marino, Graham, Porcule, tovarășe, Labrador, bowling, doamne, Adonis, BenjaminEMPTY(17365): ani, timp, loc, membre, mod, acord, art., b, lucru, a.
| Paradigm țară | Acc,Nom | Dat,Gen | Acc |
|---|---|---|---|
| Definite=Def|Number=Sing | țara | țării, țărei | |
| Definite=Def|Number=Plur | țările | țărilor | |
| Definite=Ind|Number=Sing | țară | țări | țări |
ADP
31052 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (31052; 100%), ExtPos=EMPTY (27747; 89%).
ADP tokens may have the following values of Case:
Acc(30637; 99% of non-emptyCase): de, în, la, cu, din, pe, pentru, prin, după, caDat(116; 0% of non-emptyCase): conform, datorită, potrivit, aidoma, grațieGen(299; 1% of non-emptyCase): asupra, împotriva, deasupra, înaintea, dinaintea, contra, împrejurul, înlăuntrul, -mpotriva, înafaraEMPTY(1): pre
| Paradigm îndărătul | Acc | Gen |
|---|---|---|
| îndărătul | ||
| Variant=Short | -ndărătul |
Case seems to be lexical feature of ADP. 98% lemmas (53) occur only with one value of Case.
PRON
12180 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Person=3 (11122; 91%), Variant=EMPTY (10067; 83%), Gender=EMPTY (8781; 72%), Reflex=EMPTY (8244; 68%), PronType=Prs (7618; 63%), Number=EMPTY (6687; 55%), Strength=Weak (6202; 51%).
PRON tokens may have the following values of Case:
Acc(4836; 40% of non-emptyCase): se, s-, -l, o, îl, le, -se, mă, te, -oAcc,Nom(4978; 41% of non-emptyCase): care, ce, el, ea, ceea, aceasta, acestea, unul, una, eiDat(1442; 12% of non-emptyCase): își, -și, și-, îi, -i, i, i-, -mi, mi-, leDat,Gen(839; 7% of non-emptyCase): lui, lor, ei, acestuia, acestora, celor, acesteia, cărora, căruia, căreiaNom(85; 1% of non-emptyCase): eu, tuEMPTY(133): dumneavoastră, ș.a., sale, dvs., nostru, dumnealui, săi, tale, dumneaei, dumnealor
| Paradigm el | Acc,Nom | Dat,Gen | Acc | Dat |
|---|---|---|---|---|
| Gender=Masc|Number=Sing|Strength=Strong | el | lui | ||
| Gender=Masc|Number=Sing|Strength=Weak | îl | |||
| Gender=Masc|Number=Sing|Strength=Weak|Variant=Short | -l, l- | |||
| Gender=Masc|Number=Plur|Strength=Strong | ei | |||
| Gender=Masc|Number=Plur|Strength=Weak | îi | |||
| Gender=Masc|Number=Plur|Strength=Weak|Variant=Short | -i, i- | -i | ||
| Gender=Fem|Number=Sing|Strength=Strong | ea | ei | ||
| Gender=Fem|Number=Sing|Strength=Weak | o | |||
| Gender=Fem|Number=Sing|Strength=Weak|Variant=Short | -o | |||
| Gender=Fem|Number=Plur|Strength=Strong | ele | |||
| Gender=Fem|Number=Plur|Strength=Weak | le | |||
| Gender=Fem|Number=Plur|Strength=Weak|Variant=Short | le-, -le | |||
| Number=Sing|Strength=Weak | îi, i | |||
| Number=Sing|Strength=Weak|Variant=Short | i- | -i, i- | ||
| Number=Plur|Strength=Strong | lor | |||
| Number=Plur|Strength=Weak | le, li | |||
| Number=Plur|Strength=Weak|Variant=Short | le-, -le, -li |
DET
7965 DET tokens (69% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (7766; 98%), Number=Sing (6413; 81%), Position=EMPTY (6079; 76%), PronType=Ind (5663; 71%), Person=EMPTY (5282; 66%).
DET tokens may have the following values of Case:
Acc,Nom(6381; 80% of non-emptyCase): o, un, acest, cel, orice, toate, această, aceste, cele, alteDat,Gen(1584; 20% of non-emptyCase): lui, unei, unui, unor, acestor, acestei, acestui, tuturor, celor, altorEMPTY(3559): a, al, ale, multe, său, ai, anumite, sale, niște, -lea
| Paradigm un | Acc,Nom | Dat,Gen |
|---|---|---|
| ExtPos=ADV|Gender=Masc|Number=Sing | un | |
| ExtPos=ADV|Gender=Fem|Number=Sing | o | |
| Gender=Masc|Number=Sing | un | unui |
| Gender=Masc|Number=Sing|Variant=Short | -un | |
| Gender=Masc|Number=Plur|Person=3|Position=Prenom | unii | |
| Gender=Fem|Number=Sing | o | unei |
| Gender=Fem|Number=Sing|Variant=Short | -o | |
| Gender=Fem|Number=Plur|Person=3|Position=Prenom | unele | |
| Number=Plur | unor |
ADJ
5604 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (5588; 100%), Number=Sing (5406; 96%), Gender=Fem (5175; 92%), Definite=Ind (4711; 84%).
ADJ tokens may have the following values of Case:
Acc,Nom(4481; 80% of non-emptyCase): prezentul, prezenta, europeană, mică, română, maximă, necesară, românească, nouă, bunăDat,Gen(1123; 20% of non-emptyCase): europene, prezentului, prezentei, naționale, publice, române, românești, umane, comunitare, politiceEMPTY(9682): mare, asemenea, nou, necesare, mari, european, general, mici, vechi, chimice
| Paradigm mare | Acc,Nom | Dat,Gen |
|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | marele | marelui |
| Definite=Def|Gender=Masc|Number=Plur | marii | |
| Definite=Def|Gender=Fem|Number=Sing | marea | Marii |
| Definite=Def|Gender=Fem|Number=Plur | marile | |
| Definite=Def|Number=Plur | marilor | |
| Definite=Ind|Gender=Fem|Number=Sing | mari |
NUM
495 NUM tokens (9% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (447; 90%), NumType=Ord (316; 64%), Gender=Fem (298; 60%), Number=Sing (297; 60%).
NUM tokens may have the following values of Case:
Acc,Nom(450; 91% of non-emptyCase): primul, prima, primele, milioane, o, ambele, ultimii, un, ultimul, unuDat,Gen(45; 9% of non-emptyCase): primului, primei, primelor, ambelor, ultimelor, ultimei, ultimilor, primilor, prime, suteEMPTY(5057): 1, 2, 3, două, 4, trei, 5, 6, doi, 7
| Paradigm prim | Acc,Nom | Dat,Gen |
|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | primul | primului |
| Definite=Def|Gender=Masc|Number=Plur | primii | primilor |
| Definite=Def|Gender=Fem|Number=Sing | prima | primei |
| Definite=Def|Gender=Fem|Number=Plur | primele | primelor |
| Definite=Ind|Gender=Fem|Number=Sing | primă | prime |
PROPN
320 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Case.
PROPN tokens may have the following values of Case:
Acc,Nom(38; 12% of non-emptyCase): Banatul, Iașii, Israelul, Carpații, Contemporanul, Dunărea, Ierusalimul, Irakul, Brașovul, BrâncovanulDat,Gen(282; 88% of non-emptyCase): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, GermanieiEMPTY(5563): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
| Paradigm București | Acc,Nom | Dat,Gen |
|---|---|---|
| Bucureștiul | Bucureștiului |
Case seems to be lexical feature of PROPN. 93% lemmas (95) occur only with one value of Case.
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[conj]–> NOUN (1952; 76%),
ADP –[fixed]–> ADP (1266; 99%),
ADJ –[conj]–> ADJ (201; 89%),
NOUN –[nsubj]–> NOUN (169; 52%),
PRON –[fixed]–> PRON (66; 100%),
NOUN –[nsubj]–> PRON (59; 58%),
NOUN –[flat]–> ADJ (52; 76%),
NOUN –[conj]–> PRON (43; 62%),
PRON –[nsubj]–> NOUN (37; 80%),
PRON –[nmod]–> PRON (30; 94%).