Treebank Statistics: UD_Romanian-RRT: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
69057 tokens (32%) have a non-empty value of Definite.
22189 types (70%) occur at least once with a non-empty value of Definite.
10814 lemmas (63%) occur at least once with a non-empty value of Definite.
The feature is used with 5 part-of-speech tags: NOUN (52849; 24% instances), ADJ (15009; 7% instances), NUM (470; 0% instances), DET (407; 0% instances), PROPN (322; 0% instances).
NOUN
52849 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (38591; 73%), Gender=Fem (32519; 62%), Case=Acc,Nom (28805; 55%).
NOUN tokens may have the following values of Definite:
Def(27197; 51% of non-emptyDefinite): cazul, timpul, statele, Comisia, cadrul, partea, fața, comisiei, anul, articolulInd(25652; 49% of non-emptyDefinite): ani, timp, conformitate, loc, membre, mod, acord, parte, b, lucruEMPTY(1407): art., a., nr., CE, b., mg, lit., alin., ml, CEE
| Paradigm an | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Number=Sing | anul | |
| Case=Acc,Nom|Number=Plur | anii | |
| Case=Dat,Gen|Number=Sing | anului | |
| Case=Dat,Gen|Number=Plur | anilor | |
| Number=Sing | an | |
| Number=Plur | ani |
ADJ
15009 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (14969; 100%), Number=Sing (9818; 65%), Case=EMPTY (9396; 63%), Gender=Fem (9208; 61%).
ADJ tokens may have the following values of Definite:
Def(895; 6% of non-emptyDefinite): prezentul, prezenta, prezentului, prezentei, întreaga, următoarele, noul, noua, fosta, principaleleInd(14114; 94% of non-emptyDefinite): mare, europene, nou, necesare, europeană, mari, european, mică, naționale, generalEMPTY(288): asemenea, standard, anume, așa, n., aparte, atare, eficace, roz, e-
| Paradigm mare | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Number=Sing | marele | |
| Case=Acc,Nom|Gender=Masc|Number=Plur | marii | |
| Case=Acc,Nom|Gender=Fem|Number=Sing | marea | |
| Case=Acc,Nom|Gender=Fem|Number=Plur | marile | |
| Case=Dat,Gen|Gender=Masc|Number=Sing | marelui | |
| Case=Dat,Gen|Gender=Fem|Number=Sing | mari | Marii |
| Case=Dat,Gen|Number=Plur | marilor | |
| Number=Sing | mare | |
| Number=Plur | mari |
Definite seems to be lexical feature of ADJ. 96% lemmas (3277) occur only with one value of Definite.
NUM
470 NUM tokens (8% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (470; 100%), NumType=Ord (336; 71%), Gender=Fem (293; 62%), Number=Sing (277; 59%).
NUM tokens may have the following values of Definite:
Def(311; 66% of non-emptyDefinite): primul, prima, primele, ultimii, ultimul, primului, ultimele, ultima, primii, întâiaInd(159; 34% of non-emptyDefinite): milioane, mii, miliarde, o, sute, prim-, primă, zeci, sută, milionEMPTY(5079): 1, 2, 3, două, 4, trei, 5, 6, doi, 7
| Paradigm prim | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Number=Sing | primul | |
| Case=Acc,Nom|Gender=Masc|Number=Plur | primii | |
| Case=Acc,Nom|Gender=Fem|Number=Sing | primă | prima |
| Case=Acc,Nom|Gender=Fem|Number=Plur | primele | |
| Case=Dat,Gen|Gender=Masc|Number=Sing | primului | |
| Case=Dat,Gen|Gender=Masc|Number=Plur | primilor | |
| Case=Dat,Gen|Gender=Fem|Number=Sing | prime | primei |
| Case=Dat,Gen|Gender=Fem|Number=Plur | primelor | |
| Gender=Masc|Number=Sing | prim-, prim |
DET
407 DET tokens (3% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Person=EMPTY (407; 100%), Position=EMPTY (407; 100%), Poss=EMPTY (407; 100%), PronType=Art (407; 100%), Number=Sing (403; 99%), Case=Dat,Gen (318; 78%), Gender=EMPTY (297; 73%).
DET tokens may have the following values of Definite:
Def(407; 100% of non-emptyDefinite): lui, -lea, -ul, -a, -ului, -urilor, -ilor, -urileEMPTY(11617): o, un, a, al, ale, unei, unui, acest, lui, cel
PROPN
322 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Definite.
PROPN tokens may have the following values of Definite:
Def(314; 98% of non-emptyDefinite): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, GermanieiInd(8; 2% of non-emptyDefinite): Britanii, Americi, Eladă, Făt-frumos, Iugoslavie, Mediterane, NapoleonEMPTY(5563): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
| Paradigm Iugoslavia | Ind | Def |
|---|---|---|
| Case=Acc,Nom | Iugoslavie | |
| Case=Dat,Gen | Iugoslaviei |
Definite seems to be lexical feature of PROPN. 98% lemmas (102) occur only with one value of Definite.
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod]–> NOUN (9516; 56%),
NOUN –[conj]–> NOUN (3046; 89%),
ADJ –[conj]–> ADJ (713; 99%),
NOUN –[appos]–> NOUN (279; 51%),
ADJ –[conj]–> NOUN (86; 80%),
NOUN –[conj]–> ADJ (32; 73%),
ADJ –[amod]–> ADJ (28; 67%),
NOUN –[acl]–> NOUN (27; 55%),
ADJ –[advmod]–> ADJ (26; 96%),
ADJ –[xcomp]–> NOUN (24; 73%).