Treebank Statistics: UD_Romanian-RRT: Features: Case
This feature is universal.
It occurs with 5 different values: Acc
, Dat
, Gen
, Nom
, Voc
.
Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom
, Dat|Gen
.
93973 tokens (43%) have a non-empty value of Case
.
15031 types (48%) occur at least once with a non-empty value of Case
.
7817 lemmas (45%) occur at least once with a non-empty value of Case
.
The feature is used with 7 part-of-speech tags: NOUN (36891; 17% instances), ADP (31054; 14% instances), PRON (11659; 5% instances), DET (7942; 4% instances), ADJ (5612; 3% instances), NUM (495; 0% instances), PROPN (320; 0% instances).
NOUN
36891 NOUN tokens (68% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Number=Sing (29994; 81%), Definite=Def (27196; 74%), Gender=Fem (27019; 73%).
NOUN
tokens may have the following values of Case
:
Acc
(1; 0% of non-emptyCase
): țăriAcc,Nom
(28805; 78% of non-emptyCase
): cazul, conformitate, timpul, statele, Comisia, parte, față, cadrul, partea, fațaDat,Gen
(8036; 22% of non-emptyCase
): comisiei, consiliului, Uniunii, comunității, tratamentului, partidului, statului, țării, produselor, statelorNom
(1; 0% of non-emptyCase
): niVoc
(48; 0% of non-emptyCase
): domnule, Marino, Graham, Porcule, tovarășe, Labrador, bowling, doamne, Adonis, BenjaminEMPTY
(17367): ani, timp, loc, membre, mod, acord, art., b, lucru, a.
Paradigm țară | Acc,Nom | Dat,Gen | Acc |
---|---|---|---|
Definite=Def|Number=Sing | țara | țării, țărei | |
Definite=Def|Number=Plur | țările | țărilor | |
Definite=Ind|Number=Sing | țară | țări | țări |
ADP
31054 ADP tokens (100% of all ADP
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADP
and Case
co-occurred: AdpType=Prep (31054; 100%).
ADP
tokens may have the following values of Case
:
Acc
(30640; 99% of non-emptyCase
): de, în, la, cu, din, pe, pentru, prin, după, caDat
(115; 0% of non-emptyCase
): conform, datorită, potrivit, aidoma, grațieGen
(299; 1% of non-emptyCase
): asupra, împotriva, deasupra, înaintea, dinaintea, contra, împrejurul, înlăuntrul, -mpotriva, înafaraEMPTY
(1): pre
Case
seems to be lexical feature of ADP
. 100% lemmas (57) occur only with one value of Case
.
PRON
11659 PRON tokens (99% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Person=3 (10601; 91%), Variant=EMPTY (9597; 82%), Gender=EMPTY (8599; 74%), Reflex=EMPTY (7723; 66%), PronType=Prs (7100; 61%), Number=EMPTY (6685; 57%), Strength=Weak (6197; 53%).
PRON
tokens may have the following values of Case
:
Acc
(4835; 41% of non-emptyCase
): se, s-, o, -l, îl, le, -se, mă, te, -oAcc,Nom
(4977; 43% of non-emptyCase
): care, ce, el, ea, ceea, aceasta, acestea, unul, una, eiDat
(1438; 12% of non-emptyCase
): își, -și, și-, îi, -i, i, i-, -mi, ne, leDat,Gen
(324; 3% of non-emptyCase
): acestuia, acestora, celor, acesteia, lui, cărora, căruia, căreia, celui, eiNom
(85; 1% of non-emptyCase
): eu, tuEMPTY
(148): dumneavoastră, lui, lor, sale, ș.a., dvs., nostru, dumnealui, săi, tale
Paradigm el | Acc,Nom | Dat,Gen | Acc | Dat |
---|---|---|---|---|
Gender=Masc|Number=Sing|Strength=Strong | el | lui | ||
Gender=Masc|Number=Sing|Strength=Weak | îl | |||
Gender=Masc|Number=Sing|Strength=Weak|Variant=Short | -l, l-, l | |||
Gender=Masc|Number=Plur|Strength=Strong | ei | |||
Gender=Masc|Number=Plur|Strength=Weak | îi, i | |||
Gender=Masc|Number=Plur|Strength=Weak|Variant=Short | -i, i- | -i | ||
Gender=Fem|Number=Sing|Strength=Strong | ea | ei | ||
Gender=Fem|Number=Sing|Strength=Weak | o | |||
Gender=Fem|Number=Sing|Strength=Weak|Variant=Short | -o | |||
Gender=Fem|Number=Plur|Strength=Strong | ele | |||
Gender=Fem|Number=Plur|Strength=Weak | le | |||
Gender=Fem|Number=Plur|Strength=Weak|Variant=Short | le-, -le | |||
Number=Sing|Strength=Weak | îi, i | |||
Number=Sing|Strength=Weak|Variant=Short | i- | -i, i- | ||
Number=Plur|Strength=Strong | lor | |||
Number=Plur|Strength=Weak | le, li | |||
Number=Plur|Strength=Weak|Variant=Short | le-, -le, -li |
DET
7942 DET tokens (66% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Poss=EMPTY (7742; 97%), Number=Sing (6390; 80%), Position=EMPTY (6056; 76%), PronType=Ind (5659; 71%), Person=EMPTY (5258; 66%).
DET
tokens may have the following values of Case
:
Acc,Nom
(6380; 80% of non-emptyCase
): o, un, acest, cel, orice, toate, această, aceste, cele, alteDat,Gen
(1562; 20% of non-emptyCase
): lui, unei, unui, unor, acestor, acestei, acestui, tuturor, celor, altorEMPTY
(4083): a, al, ale, lui, lor, ei, multe, său, ai, anumite
Paradigm un | Acc,Nom | Dat,Gen |
---|---|---|
Gender=Masc|Number=Sing | un, -un | unui |
Gender=Masc|Number=Plur|Person=3|Position=Prenom | unii | |
Gender=Fem|Number=Sing | o, -o | unei |
Gender=Fem|Number=Plur|Person=3|Position=Prenom | unele | |
Number=Plur | unor |
ADJ
5612 ADJ tokens (37% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Degree=Pos (5596; 100%), Number=Sing (5413; 96%), Gender=Fem (5182; 92%), Definite=Ind (4718; 84%).
ADJ
tokens may have the following values of Case
:
Acc,Nom
(4488; 80% of non-emptyCase
): prezentul, prezenta, europeană, mică, română, maximă, necesară, românească, bună, nouăDat,Gen
(1124; 20% of non-emptyCase
): europene, prezentului, prezentei, naționale, publice, române, românești, umane, comunitare, politiceEMPTY
(9686): mare, asemenea, nou, necesare, mari, european, general, mici, vechi, chimice
Paradigm mare | Acc,Nom | Dat,Gen |
---|---|---|
Definite=Def|Gender=Masc|Number=Sing | marele | marelui |
Definite=Def|Gender=Masc|Number=Plur | marii | |
Definite=Def|Gender=Fem|Number=Sing | marea | Marii |
Definite=Def|Gender=Fem|Number=Plur | marile | |
Definite=Def|Number=Plur | marilor | |
Definite=Ind|Gender=Fem|Number=Sing | mari |
NUM
495 NUM tokens (9% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumForm=Word (447; 90%), NumType=Ord (316; 64%), Gender=Fem (298; 60%), Number=Sing (297; 60%).
NUM
tokens may have the following values of Case
:
Acc,Nom
(450; 91% of non-emptyCase
): primul, prima, primele, milioane, o, ambele, ultimii, un, ultimul, unuDat,Gen
(45; 9% of non-emptyCase
): primului, primei, primelor, ambelor, ultimelor, ultimei, ultimilor, primilor, prime, suteEMPTY
(5054): 1, 2, 3, două, 4, trei, 5, 6, doi, 7
Paradigm prim | Acc,Nom | Dat,Gen |
---|---|---|
Definite=Def|Gender=Masc|Number=Sing | primul | primului |
Definite=Def|Gender=Masc|Number=Plur | primii | primilor |
Definite=Def|Gender=Fem|Number=Sing | prima | primei |
Definite=Def|Gender=Fem|Number=Plur | primele | primelor |
Definite=Ind|Gender=Fem|Number=Sing | primă | prime |
PROPN
320 PROPN tokens (5% of all PROPN
tokens) have a non-empty value of Case
.
PROPN
tokens may have the following values of Case
:
Acc,Nom
(38; 12% of non-emptyCase
): Banatul, Iașii, Israelul, Carpații, Contemporanul, Dunărea, Ierusalimul, Irakul, Brașovul, BrâncovanulDat,Gen
(282; 88% of non-emptyCase
): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, GermanieiEMPTY
(5565): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
Paradigm București | Acc,Nom | Dat,Gen |
---|---|---|
Bucureștiul | Bucureștiului |
Case
seems to be lexical feature of PROPN
. 93% lemmas (95) occur only with one value of Case
.
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[conj]–> NOUN (1950; 76%),
ADP –[fixed]–> ADP (1284; 99%),
ADJ –[conj]–> ADJ (200; 89%),
NOUN –[nsubj]–> NOUN (161; 51%),
PRON –[fixed]–> PRON (67; 100%),
NOUN –[nsubj]–> PRON (56; 56%),
NOUN –[flat]–> ADJ (52; 76%),
NOUN –[fixed]–> ADJ (50; 78%),
NOUN –[fixed]–> NOUN (47; 52%),
NOUN –[conj]–> PRON (44; 65%).