Treebank Statistics: UD_Romanian-SiMoNERo: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
57191 tokens (39%) have a non-empty value of Definite.
13826 types (77%) occur at least once with a non-empty value of Definite.
7041 lemmas (66%) occur at least once with a non-empty value of Definite.
The feature is used with 5 part-of-speech tags: NOUN (39982; 27% of all tokens), ADJ (16947; 12%), NUM (208; 0%), DET (41; 0%), PROPN (13; 0%).
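Counts like the ones above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the generator that produced this page: the function name `feature_coverage` and the three-token sample are made up for illustration, and it assumes that only regular word lines (integer IDs) are counted.

```python
from collections import Counter


def feature_coverage(conllu_text, feature="Definite"):
    """Return (total tokens, Counter of UPOS -> tokens with a non-empty `feature`)."""
    total = 0
    with_feature = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Assumption: skip multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        upos, feats = cols[3], cols[5]
        if feats != "_":
            names = {pair.split("=", 1)[0] for pair in feats.split("|")}
            if feature in names:
                with_feature[upos] += 1
    return total, with_feature


# Toy sample for illustration only; not taken from the treebank.
sample = "\n".join([
    "# text = pacientul mare .",
    "1\tpacientul\tpacient\tNOUN\t_\tCase=Nom|Definite=Def|Number=Sing\t0\troot\t_\t_",
    "2\tmare\tmare\tADJ\t_\tDefinite=Ind|Degree=Pos\t1\tamod\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

total, counts = feature_coverage(sample)
for upos, n in counts.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

Run over the full treebank file, the per-tag counts should be comparable to the figures listed above, provided the counting conventions match.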
NOUN
39982 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (29256; 73%), Gender=Fem (24935; 62%), Case=Nom (20629; 52%).
NOUN tokens may have the following values of Definite:
Def (21800; 55% of non-empty Definite): pacienții, nivelul, cazul, creșterea, tratamentul, vârsta, pacienților, scăderea, riscul, diabetului
Ind (18182; 45% of non-empty Definite): pacienți, ani, diabet, risc, insulină, tip, tratament, timp, studiu, cazuri
EMPTY (2716): mg, IC, vs, HTA, TA, DZ, FA, AVC, dl, EI
Paradigm pacient | Ind | Def |
---|---|---|
Case=Gen\|Number=Sing | | pacientului |
Case=Gen\|Number=Plur | | pacienților |
Case=Nom\|Number=Sing | | pacientul |
Case=Nom\|Number=Plur | | pacienții, pacientii |
Number=Sing | pacient | |
Number=Plur | pacienți, pacienții | |
ADJ
16947 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (16909; 100%), Number=Sing (11572; 68%), Gender=Fem (10848; 64%), Case=EMPTY (10119; 60%).
ADJ tokens may have the following values of Definite:
Def (228; 1% of non-empty Definite): principalul, principala, marea, principalele, următoarele, singura, diferitelor, întreaga, diversele, numita
Ind (16719; 99% of non-empty Definite): mare, vârstnici, crescut, zaharat, clinice, mici, cardiacă, cardiace, mari, important
EMPTY (105): precoce, standard, online, postpartum, pre-test, AV, eficace, pretest, anume, aparte
Paradigm mare | Ind | Def |
---|---|---|
Case=Gen\|Gender=Fem\|Number=Sing | mari | marii |
Case=Nom\|Gender=Masc\|Number=Sing | | Marele |
Case=Nom\|Gender=Fem\|Number=Sing | mare | marea |
Case=Nom\|Gender=Fem\|Number=Plur | | marile |
Gender=Masc\|Number=Sing | mare | |
Number=Plur | mari | |
Definite seems to be a lexical feature of ADJ. 98% of lemmas (2815) occur with only one value of Definite.
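The "lexical feature" observation rests on how many lemmas are seen with a single value of the feature. A minimal sketch of that measurement follows; the helper name `single_value_share` and the toy observations are hypothetical, and neither the threshold the report uses to call a feature lexical nor its treatment of EMPTY values is stated in this page.

```python
from collections import defaultdict


def single_value_share(observations):
    """observations: iterable of (lemma, value) pairs for one part of speech,
    restricted to tokens with a non-empty value of the feature.
    Returns (number of lemmas seen with exactly one value, their share)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, single / len(values_by_lemma)


# Toy observations for illustration only; not real treebank counts.
toy = [
    ("mare", "Ind"), ("mare", "Def"),
    ("clinic", "Ind"), ("cardiac", "Ind"),
    ("principal", "Def"),
]
single, share = single_value_share(toy)
print(f"{share:.0%} of lemmas ({single}) occur with only one value")
```

A share close to 100% suggests that the value is tied to the lemma rather than chosen by inflection, which is the reading the statement above proposes.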
NUM
208 NUM tokens (5% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (208; 100%), NumType=Ord (197; 95%), Number=Sing (146; 70%).
NUM tokens may have the following values of Definite:
Def (175; 84% of non-empty Definite): primul, prima, primele, ultimii, ultimul, ultima, primei, ultimele, primii, ultimilor
Ind (33; 16% of non-empty Definite): primă, milioane, treime, ultimă, mii, prim, miliarde, ultim, zeci
EMPTY (4397): 2, 1, două, 3, 4, 5, 30, 10, 20, 6
Paradigm prim | Ind | Def |
---|---|---|
Case=Nom\|Gender=Fem | | prima |
Gender=Masc | prim | |
Definite seems to be a lexical feature of NUM. 96% of lemmas (23) occur with only one value of Definite.
DET
41 DET tokens (1% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Person=EMPTY (41; 100%), Position=EMPTY (41; 100%), Poss=EMPTY (41; 100%), PronType=Art (41; 100%), Number=Sing (39; 95%), Case=Gen (25; 61%), Gender=EMPTY (22; 54%).
DET tokens may have the following values of Definite:
Def (41; 100% of non-empty Definite): lui, ul, a, urile
EMPTY (7383): a, o, un, al, ale, unui, acest, această, unei, cel
PROPN
13 PROPN tokens (2% of all PROPN tokens) have a non-empty value of Definite.
PROPN tokens may have the following values of Definite:
Def (12; 92% of non-empty Definite): Americii, Asiei, Europei, Franței, Greciei, României
Ind (1; 8% of non-empty Definite): Americă
EMPTY (704): Graves-Basedow, Doppler, Rubino, Europa, România, Langerhans, Paulescu, Pendred, Britanie, Esnaola
Paradigm America | Ind | Def |
---|---|---|
Case=Gen | | Americii |
Case=Nom | Americă | |
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod]–> NOUN (8910; 53%),
NOUN –[conj]–> NOUN (3358; 78%),
ADJ –[conj]–> ADJ (670; 99%),
ADJ –[conj]–> NOUN (126; 78%),
ADJ –[amod]–> ADJ (55; 93%),
NOUN –[conj]–> ADJ (48; 65%),
ADJ –[advcl]–> ADJ (34; 97%),
ADJ –[obl:agent]–> NOUN (33; 60%),
ADJ –[amod]–> NOUN (25; 61%),
ADJ –[appos]–> NOUN (23; 72%).
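
The agreement counts above can be sketched as a pass over parent-child pairs. This is an illustrative reconstruction, not the report's own code: it assumes the percentage is the share of agreeing pairs among pairs where both nodes carry a non-empty value of the feature, and the names `get_feature` and `agreement_stats` as well as the toy sentence are hypothetical.

```python
from collections import Counter


def get_feature(feats, name):
    """Return the value of `name` in a CoNLL-U FEATS string, or None if absent."""
    if feats == "_":
        return None
    for pair in feats.split("|"):
        key, _, value = pair.partition("=")
        if key == name:
            return value
    return None


def agreement_stats(rows, feature="Definite"):
    """rows: (id, upos, feats, head, deprel) tuples for one sentence.
    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    agreeing pairs, and all pairs where both nodes have the feature."""
    by_id = {row[0]: row for row in rows}
    agree, both = Counter(), Counter()
    for _tid, upos, feats, head, deprel in rows:
        parent = by_id.get(head)
        if parent is None:  # root (head 0) or head outside this sentence
            continue
        child_val = get_feature(feats, feature)
        parent_val = get_feature(parent[2], feature)
        if child_val is None or parent_val is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_val == parent_val:
            agree[key] += 1
    return agree, both


# Toy sentence for illustration only; not taken from the treebank.
rows = [
    (1, "NOUN", "Case=Nom|Definite=Def|Number=Sing", 0, "root"),
    (2, "NOUN", "Case=Gen|Definite=Def|Number=Sing", 1, "nmod"),
    (3, "NOUN", "Definite=Ind|Number=Sing", 1, "nmod"),
    (4, "ADJ", "Definite=Ind|Degree=Pos", 3, "amod"),
]
agree, both = agreement_stats(rows)
for (parent_pos, deprel, child_pos), n in agree.most_common():
    share = 100 * n / both[(parent_pos, deprel, child_pos)]
    print(f"{parent_pos} -[{deprel}]-> {child_pos} ({n}; {share:.0f}%)")
```

For a whole treebank, the two counters would be accumulated across all sentences before the ranking is printed.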