Treebank Statistics: UD_Romanian-SiMoNERo: Features: Case
This feature is universal.
It occurs with 5 different values: Acc, Dat, Gen, Nom, Voc.
Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom, Dat|Gen.
63929 tokens (44%) have a non-empty value of Case.
9392 types (52%) occur at least once with a non-empty value of Case.
5092 lemmas (48%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (28429; 19% instances), ADP (20075; 14% instances), ADJ (6828; 5% instances), PRON (4199; 3% instances), DET (4148; 3% instances), NUM (237; 0% instances), PROPN (13; 0% instances).
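Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample rows are hand-made and hypothetical, the helper names (`parse_feats`, `read_tokens`, `feature_summary`) are invented for illustration, and types are assumed to be case-sensitive word forms.

```python
from collections import Counter

# Hypothetical CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
# They are illustrative only and are not copied from the treebank.
ROWS = [
    ["1", "pacienții", "pacient", "NOUN", "_",
     "Case=Nom|Definite=Def|Gender=Masc|Number=Plur", "0", "root", "_", "_"],
    ["2", "cu", "cu", "ADP", "_", "AdpType=Prep|Case=Acc", "3", "case", "_", "_"],
    ["3", "diabet", "diabet", "NOUN", "_", "Definite=Ind|Number=Sing", "1", "nmod", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every basic token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_summary(conllu_text, feature="Case"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    total = with_value = 0
    types, lemmas = set(), set()
    per_upos = Counter()
    for form, lemma, upos, feats in read_tokens(conllu_text):
        total += 1
        if feature in feats:
            with_value += 1
            types.add(form)
            lemmas.add(lemma)
            per_upos[upos] += 1
    return {
        "total": total,
        "tokens": with_value,
        "types": len(types),
        "lemmas": len(lemmas),
        "per_upos": dict(per_upos),
    }


summary = feature_summary(SAMPLE)
print(summary)
```

On a real treebank file, the same function would be called on the file contents, and the percentages reported above would correspond to ratios such as `summary["tokens"] / summary["total"]`.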
NOUN
28429 NOUN tokens (67% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (23175; 82%), Definite=Def (21800; 77%), Gender=Fem (20957; 74%).
NOUN tokens may have the following values of Case:
Gen (7796; 27% of non-empty Case): pacienților, diabetului, insulinei, tratamentului, bolii, celulelor, glucozei, riscului, pacientului, funcției
Nom (20629; 73% of non-empty Case): pacienții, nivelul, cazul, insulină, creșterea, tratamentul, vârsta, scăderea, creștere, riscul
Voc (4; 0% of non-empty Case): postoperator, prolactina, retinopatia, trimetorpin
EMPTY (14269): pacienți, ani, diabet, risc, tip, tratament, timp, studiu, cazuri, mg
Paradigm retinopatie | Nom | Gen | Voc |
---|---|---|---|
Definite=Def | | retinopatiei | retinopatia |
Definite=Ind | retinopatie | | |
ADP
20075 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (20075; 100%).
ADP tokens may have the following values of Case:
Acc (19774; 99% of non-empty Case): de, în, la, cu, din, pentru, prin, pe, dintre, după
Dat (120; 1% of non-empty Case): datorită, conform, potrivit, coform, grație
Gen (181; 1% of non-empty Case): asupra, înaintea, împotriva, deasupra
EMPTY (3): vs.
Case seems to be a lexical feature of ADP: 100% of lemmas (40) occur with only one value of Case.
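The "lexical feature" observation amounts to checking how many lemmas are ever seen with more than one value. A minimal sketch of that check follows; the observation list is hypothetical (the prepositions come from the lists above, but the counts are invented), and `single_value_share` is an illustrative name rather than part of any existing tool.

```python
from collections import defaultdict

# Hypothetical (lemma, Case value) pairs for ADP tokens.
# The words appear in the value lists above; the number of observations is made up.
OBSERVATIONS = [
    ("de", "Acc"), ("de", "Acc"), ("în", "Acc"),
    ("datorită", "Dat"), ("asupra", "Gen"),
]


def single_value_share(observations):
    """Return (lemmas with exactly one value, all lemmas with a value)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_value_share(OBSERVATIONS)
print(f"{100 * single / total:.0f}% of lemmas ({total}) occur with only one value of Case")
# prints: 100% of lemmas (4) occur with only one value of Case
```

A lemma that occurred with both Nom and Gen, as many nouns do, would lower the share and suggest that the feature is inflectional rather than lexical for that part of speech.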
ADJ
6828 ADJ tokens (40% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (6806; 100%), Gender=Fem (6762; 99%), Number=Sing (6726; 99%), Definite=Ind (6600; 97%).
ADJ tokens may have the following values of Case:
Gen (1404; 21% of non-empty Case): cardiace, ventriculare, arteriale, renale, aortice, cronice, diabetice, coronariene, orale, &b.beta;-celulare
Nom (5424; 79% of non-empty Case): mare, cardiacă, renală, cronică, severă, chirurgicală, crescută, aortică, mică, necesară
EMPTY (10224): vârstnici, crescut, zaharat, mici, clinice, mare, mari, important, clinic, adverse
Paradigm mare | Nom | Gen |
---|---|---|
Definite=Def|Gender=Masc|Number=Sing | Marele | |
Definite=Def|Gender=Fem|Number=Sing | marea | marii |
Definite=Def|Gender=Fem|Number=Plur | marile | |
Definite=Ind|Gender=Fem|Number=Sing | mare | mari |
PRON
4199 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Person=3 (4156; 99%), Gender=EMPTY (3135; 75%), Number=EMPTY (2919; 70%), Reflex=EMPTY (2884; 69%), Strength=EMPTY (2581; 61%).
PRON tokens may have the following values of Case:
Acc (1399; 33% of non-empty Case): se, s-, o, îl, le, ne, sine, l-, vă, l
Dat (103; 2% of non-empty Case): își, li, și-, i, le, îi, se, i-, le-, ne
Gen (194; 5% of non-empty Case): acestora, celor, acestuia, acesteia, cărora, celei, căreia, căruia, celui, lui
Nom (2503; 60% of non-empty Case): care, ce, ceea, acestea, cei, cea, cele, aceasta, cel, aceea
EMPTY (4): dumneavoastră, lor, lui, sale
Paradigm el | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Gender=Masc|Number=Sing|Strength=Strong | el | lui | ||
Gender=Masc|Number=Sing|Strength=Weak | îl | |||
Gender=Masc|Number=Sing|Strength=Weak|Variant=Short | l-, l | |||
Gender=Masc|Number=Plur|Strength=Strong | ei | |||
Gender=Masc|Number=Plur|Strength=Weak | îi | |||
Gender=Masc|Number=Plur|Strength=Weak|Variant=Short | i, i-, l | |||
Gender=Fem|Number=Sing|Strength=Strong | ea | ei | ||
Gender=Fem|Number=Sing|Strength=Weak | o | |||
Gender=Fem|Number=Sing|Strength=Weak|Variant=Short | o | |||
Gender=Fem|Number=Plur|Strength=Strong | ele | |||
Gender=Fem|Number=Plur|Strength=Weak | le | |||
Gender=Fem|Number=Plur|Strength=Weak|Variant=Short | le-, le | |||
Number=Sing|Strength=Weak | i, îi | |||
Number=Sing|Strength=Weak|Variant=Short | i-, l | |||
Number=Plur|Strength=Strong | lor | |||
Number=Plur|Strength=Weak | li, le | |||
Number=Plur|Strength=Weak|Variant=Short | le, le-, i |
DET
4148 DET tokens (56% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (4105; 99%), Number=Sing (3182; 77%), Position=EMPTY (2969; 72%), PronType=Ind (2840; 68%), Person=EMPTY (2765; 67%).
DET tokens may have the following values of Case:
Dat,Gen (6; 0% of non-empty Case): cărui
Gen (834; 20% of non-empty Case): unui, unei, unor, acestor, acestei, acestui, celor, altor, lui, căror
Nom (3308; 80% of non-empty Case): o, un, acest, această, cel, cele, aceste, alte, cea, toate
EMPTY (3276): a, al, ale, lor, multe, ai, săi, alt, ei, său
Paradigm care | Dat,Gen | Nom | Gen |
---|---|---|---|
Gender=Masc|Number=Sing | cărui | | |
Gender=Fem|Number=Sing | | | cărei |
Number=Plur | | | căror |
| | care | |
NUM
237 NUM tokens (5% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (202; 85%), NumType=Ord (194; 82%), Number=Sing (143; 60%).
NUM tokens may have the following values of Case:
Acc,Nom (1; 0% of non-empty Case): unul
Gen (22; 9% of non-empty Case): primei, ambelor, ultimilor, ultimului, primului, ambilor, primilor, ultimei
Nom (214; 90% of non-empty Case): primul, prima, ambele, primele, primă, ultimii, ultimul, ultima, ultimele, primii
EMPTY (4368): 2, 1, două, 3, 4, 5, 30, 10, 20, 6
Case seems to be a lexical feature of NUM: 100% of lemmas (27) occur with only one value of Case.
PROPN
13 PROPN tokens (2% of all PROPN tokens) have a non-empty value of Case.
PROPN tokens may have the following values of Case:
Gen (12; 92% of non-empty Case): Americii, Asiei, Europei, Franței, Greciei, României
Nom (1; 8% of non-empty Case): Americă
EMPTY (704): Graves-Basedow, Doppler, Rubino, Europa, România, Langerhans, Paulescu, Pendred, Britanie, Esnaola
Paradigm America | Nom | Gen |
---|---|---|
Definite=Def | | Americii |
Definite=Ind | Americă | |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (5529; 54%),
NOUN –[conj]–> NOUN (2491; 69%),
ADP –[fixed]–> ADP (533; 100%),
ADJ –[conj]–> ADJ (223; 89%),
PRON –[fixed]–> PRON (86; 100%),
NOUN –[conj]–> PRON (49; 77%),
NOUN –[case]–> NOUN (39; 65%),
PRON –[nsubj]–> NOUN (37; 84%),
ADJ –[nsubj:pass]–> NOUN (21; 55%),
NOUN –[nsubj]–> PRON (21; 54%).
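An agreement table like the one above could be computed roughly as follows. This is a sketch under stated assumptions: the edge list is hypothetical, `agreement_table` is an illustrative name, and the percentage is assumed to be the share of all edges of a given type (parent tag, relation, child tag) in which both nodes carry the same Case value. The page does not state the denominator it actually uses.

```python
from collections import Counter

# Hypothetical dependency edges:
# (parent UPOS, parent Case, relation, child UPOS, child Case); None means no Case value.
EDGES = [
    ("NOUN", "Nom", "amod", "ADJ", "Nom"),
    ("NOUN", "Gen", "amod", "ADJ", "Gen"),
    ("NOUN", "Nom", "amod", "ADJ", None),    # child has no Case, so no agreement
    ("NOUN", "Nom", "conj", "NOUN", "Gen"),  # both have Case, but the values differ
]


def agreement_table(edges):
    """Map each (parent UPOS, relation, child UPOS) to (agreeing edges, percent of edges)."""
    agree, seen = Counter(), Counter()
    for parent_upos, parent_case, relation, child_upos, child_case in edges:
        key = (parent_upos, relation, child_upos)
        seen[key] += 1
        # Assumption: agreement requires a non-empty, identical value on both nodes.
        if parent_case is not None and parent_case == child_case:
            agree[key] += 1
    return {key: (agree[key], round(100 * agree[key] / seen[key])) for key in seen}


table = agreement_table(EDGES)
for (parent_upos, relation, child_upos), (count, percent) in sorted(
    table.items(), key=lambda item: -item[1][0]
):
    print(f"{parent_upos} -[{relation}]-> {child_upos} ({count}; {percent}%)")
```

Combined values such as Acc,Nom are compared as whole strings here; a more lenient check could treat two nodes as agreeing when their value sets overlap.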