Treebank Statistics: UD_Romanian-SiMoNERo: Features: Case
This feature is universal.
It occurs with 5 different values: Acc, Dat, Gen, Nom, Voc.
Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom, Dat|Gen.
63929 tokens (44%) have a non-empty value of Case.
9392 types (52%) occur at least once with a non-empty value of Case.
5092 lemmas (48%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (28429; 19% instances), ADP (20075; 14% instances), ADJ (6828; 5% instances), PRON (4199; 3% instances), DET (4148; 3% instances), NUM (237; 0% instances), PROPN (13; 0% instances).
NOUN
28429 NOUN tokens (67% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (23175; 82%), Definite=Def (21800; 77%), Gender=Fem (20957; 74%).
NOUN tokens may have the following values of Case:
Gen(7796; 27% of non-emptyCase): pacienților, diabetului, insulinei, tratamentului, bolii, celulelor, glucozei, riscului, pacientului, funcțieiNom(20629; 73% of non-emptyCase): pacienții, nivelul, cazul, insulină, creșterea, tratamentul, vârsta, scăderea, creștere, risculVoc(4; 0% of non-emptyCase): postoperator, prolactina, retinopatia, trimetorpinEMPTY(14269): pacienți, ani, diabet, risc, tip, tratament, timp, studiu, cazuri, mg
| Paradigm retinopatie | Nom | Gen | Voc |
|---|---|---|---|
| Definite=Def | retinopatiei | retinopatia | |
| Definite=Ind | retinopatie |
ADP
20075 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (20075; 100%).
ADP tokens may have the following values of Case:
Acc(19774; 99% of non-emptyCase): de, în, la, cu, din, pentru, prin, pe, dintre, dupăDat(120; 1% of non-emptyCase): datorită, conform, potrivit, coform, grațieGen(181; 1% of non-emptyCase): asupra, înaintea, împotriva, deasupraEMPTY(3): vs.
Case seems to be lexical feature of ADP. 100% lemmas (40) occur only with one value of Case.
ADJ
6828 ADJ tokens (40% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (6806; 100%), Gender=Fem (6762; 99%), Number=Sing (6726; 99%), Definite=Ind (6600; 97%).
ADJ tokens may have the following values of Case:
Gen(1404; 21% of non-emptyCase): cardiace, ventriculare, arteriale, renale, aortice, cronice, diabetice, coronariene, orale, &b.beta;-celulareNom(5424; 79% of non-emptyCase): mare, cardiacă, renală, cronică, severă, chirurgicală, crescută, aortică, mică, necesarăEMPTY(10224): vârstnici, crescut, zaharat, mici, clinice, mare, mari, important, clinic, adverse
| Paradigm mare | Nom | Gen |
|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | Marele | |
| Definite=Def|Gender=Fem|Number=Sing | marea | marii |
| Definite=Def|Gender=Fem|Number=Plur | marile | |
| Definite=Ind|Gender=Fem|Number=Sing | mare | mari |
PRON
4199 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Person=3 (4156; 99%), Gender=EMPTY (3135; 75%), Number=EMPTY (2919; 70%), Reflex=EMPTY (2884; 69%), Strength=EMPTY (2581; 61%).
PRON tokens may have the following values of Case:
Acc(1399; 33% of non-emptyCase): se, s-, o, îl, le, ne, sine, l-, vă, lDat(103; 2% of non-emptyCase): își, li, și-, i, le, îi, se, i-, le-, neGen(194; 5% of non-emptyCase): acestora, celor, acestuia, acesteia, cărora, celei, căreia, căruia, celui, luiNom(2503; 60% of non-emptyCase): care, ce, ceea, acestea, cei, cea, cele, aceasta, cel, aceeaEMPTY(4): dumneavoastră, lor, lui, sale
| Paradigm el | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Gender=Masc|Number=Sing|Strength=Strong | el | lui | ||
| Gender=Masc|Number=Sing|Strength=Weak | îl | |||
| Gender=Masc|Number=Sing|Strength=Weak|Variant=Short | l-, l | |||
| Gender=Masc|Number=Plur|Strength=Strong | ei | |||
| Gender=Masc|Number=Plur|Strength=Weak | îi | |||
| Gender=Masc|Number=Plur|Strength=Weak|Variant=Short | i, i-, l | |||
| Gender=Fem|Number=Sing|Strength=Strong | ea | ei | ||
| Gender=Fem|Number=Sing|Strength=Weak | o | |||
| Gender=Fem|Number=Sing|Strength=Weak|Variant=Short | o | |||
| Gender=Fem|Number=Plur|Strength=Strong | ele | |||
| Gender=Fem|Number=Plur|Strength=Weak | le | |||
| Gender=Fem|Number=Plur|Strength=Weak|Variant=Short | le-, le | |||
| Number=Sing|Strength=Weak | i, îi | |||
| Number=Sing|Strength=Weak|Variant=Short | i-, l | |||
| Number=Plur|Strength=Strong | lor | |||
| Number=Plur|Strength=Weak | li, le | |||
| Number=Plur|Strength=Weak|Variant=Short | le, le-, i |
DET
4148 DET tokens (56% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (4105; 99%), Number=Sing (3182; 77%), Position=EMPTY (2969; 72%), PronType=Ind (2840; 68%), Person=EMPTY (2765; 67%).
DET tokens may have the following values of Case:
Dat,Gen(6; 0% of non-emptyCase): căruiGen(834; 20% of non-emptyCase): unui, unei, unor, acestor, acestei, acestui, celor, altor, lui, cărorNom(3308; 80% of non-emptyCase): o, un, acest, această, cel, cele, aceste, alte, cea, toateEMPTY(3276): a, al, ale, lor, multe, ai, săi, alt, ei, său
| Paradigm care | Dat,Gen | Nom | Gen |
|---|---|---|---|
| Gender=Masc|Number=Sing | cărui | ||
| Gender=Fem|Number=Sing | cărei | ||
| Number=Plur | căror | ||
| care |
NUM
237 NUM tokens (5% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (202; 85%), NumType=Ord (194; 82%), Number=Sing (143; 60%).
NUM tokens may have the following values of Case:
Acc,Nom(1; 0% of non-emptyCase): unulGen(22; 9% of non-emptyCase): primei, ambelor, ultimilor, ultimului, primului, ambilor, primilor, ultimeiNom(214; 90% of non-emptyCase): primul, prima, ambele, primele, primă, ultimii, ultimul, ultima, ultimele, primiiEMPTY(4368): 2, 1, două, 3, 4, 5, 30, 10, 20, 6
Case seems to be lexical feature of NUM. 100% lemmas (27) occur only with one value of Case.
PROPN
13 PROPN tokens (2% of all PROPN tokens) have a non-empty value of Case.
PROPN tokens may have the following values of Case:
Gen(12; 92% of non-emptyCase): Americii, Asiei, Europei, Franței, Greciei, RomânieiNom(1; 8% of non-emptyCase): AmericăEMPTY(704): Graves-Basedow, Doppler, Rubino, Europa, România, Langerhans, Paulescu, Pendred, Britanie, Esnaola
| Paradigm America | Nom | Gen |
|---|---|---|
| Definite=Def | Americii | |
| Definite=Ind | Americă |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (5529; 54%),
NOUN –[conj]–> NOUN (2491; 69%),
ADP –[fixed]–> ADP (533; 100%),
ADJ –[conj]–> ADJ (223; 89%),
PRON –[fixed]–> PRON (86; 100%),
NOUN –[conj]–> PRON (49; 77%),
NOUN –[case]–> NOUN (39; 65%),
PRON –[nsubj]–> NOUN (37; 84%),
ADJ –[nsubj:pass]–> NOUN (21; 55%),
NOUN –[nsubj]–> PRON (21; 54%).