Treebank Statistics: UD_Romanian-Nonstandard: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
134887 tokens (24%) have a non-empty value of Definite.
21212 types (67%) occur at least once with a non-empty value of Definite.
9344 lemmas (76%) occur at least once with a non-empty value of Definite.
The feature is used with 6 part-of-speech tags: NOUN (96782; 17% instances), PROPN (19968; 3% instances), ADJ (11426; 2% instances), DET (3246; 1% instances), PRON (2607; 0% instances), NUM (858; 0% instances).
NOUN
96782 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Case=Acc,Nom (87450; 90%), Number=Sing (71135; 74%), Gender=Fem (49597; 51%).
NOUN tokens may have the following values of Definite:
Def(44543; 46% of non-emptyDefinite): domnul, țara, omul, domnului, cuvîntul, împăratul, turcii, oamenii, numele, fiiulInd(52239; 54% of non-emptyDefinite): vodă, doamne, țară, om, oaste, lume, oameni, pace, parte, baniEMPTY(1): neamure
| Paradigm domn | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Number=Sing | domnu, domn | domnul, domnu, domnu-, Domnulu |
| Case=Acc,Nom|Gender=Masc|Number=Plur | domni, domnu | domnii, domnu |
| Case=Dat,Gen|Gender=Masc|Number=Sing | Domnului | domnului, Domn, Domnul, Domnunlui, Domului |
| Case=Dat,Gen|Gender=Masc|Number=Plur | domnilor | |
| Case=Dat,Gen|Gender=Fem|Number=Sing | domnii | |
| Case=Voc|Gender=Masc|Number=Sing | doamne | Doamne |
| Case=Voc|Gender=Masc|Number=Plur | domnilor |
PROPN
19968 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (19121; 96%), Case=Acc,Nom (18754; 94%), Gender=Masc (16817; 84%).
PROPN tokens may have the following values of Definite:
Def(4690; 23% of non-emptyDefinite): Duca, Moldova, Evangheliia, Brîncovanul, Tighine, Lupul, Dumitraşco-, Ducăi, Gruia, MosculuiInd(15278; 77% of non-emptyDefinite): dumnezău, Hristos, Iisus, Pavel, David, Poartă, Pătru, Ioan, Mihai-, CostantinEMPTY(167): tîrgu, târgu, greșală, Dunărea, războiu, boer, iarăș, Catargiul, Chipru, Filimon
| Paradigm Dumnezeu | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Number=Sing | dumnezău, Dumnedzău, Dumnezeu, Dumnedzeu, Dumnădzău, Dumnezăul, Dumnădzeu, Dumnedzeul, Dumnăzău | Dumnezăul, Dumnedzăul, Dumnedzeul, Dumnezău |
| Case=Acc,Nom|Number=Plur | Dumnezăi, Dumnezei | Dumnezăii |
| Case=Dat,Gen|Number=Sing | Dumnezăului, Dumnedzeu, Dumnedzeului | |
| Case=Dat,Gen|Number=Plur | Dumnezeilor | |
| Case=Voc|Number=Sing | Dumnezeule, Dumnezăule |
ADJ
11426 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (11425; 100%), Case=Acc,Nom (10719; 94%), Number=Sing (8625; 75%).
ADJ tokens may have the following values of Definite:
Def(812; 7% of non-emptyDefinite): svînta, svîntul, svintei, svintele, mișelul, svîntului, bietul, cinstitul, sfîntul, buneInd(10614; 93% of non-emptyDefinite): mare, bună, bun, mari, svinte, verde, sfînt, datoriu, mic, svîntăEMPTY(237): vel, vel-, biv-, -vel, așa, biv, -vel-, adevărat, anume, baș
| Paradigm mare | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Number=Sing | mare | marele, mareli, marili |
| Case=Acc,Nom|Gender=Masc|Number=Plur | mari, mare | marii |
| Case=Acc,Nom|Gender=Fem|Number=Sing | mare | marea |
| Case=Acc,Nom|Number=Sing | mare, mari | |
| Case=Dat,Gen|Gender=Masc|Number=Sing | marelui | |
| Case=Dat,Gen|Gender=Fem|Number=Sing | mari | marei |
| Gender=Masc|Number=Plur | mari | |
| Number=Plur | mari |
DET
3246 DET tokens (14% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Number[psor]=EMPTY (3246; 100%), Person=EMPTY (3246; 100%), Poss=EMPTY (3246; 100%), PronType=Art (3246; 100%), Number=Sing (3245; 100%), Gender=EMPTY (3104; 96%), Case=Dat,Gen (3103; 96%).
DET tokens may have the following values of Definite:
Def(3246; 100% of non-emptyDefinite): lui, -lea, -a, lu, un, -le, iui, niște, -luiEMPTY(20612): a, un, o, toată, ta, toate, tot, al, cel, cea
PRON
2607 PRON tokens (4% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Person=3 (2607; 100%), Strength=EMPTY (2607; 100%), Case=Acc,Nom (2605; 100%), PronType=Int,Rel (2340; 90%), Gender=Masc (2006; 77%), Number=Sing (1682; 65%).
PRON tokens may have the following values of Definite:
Def(2607; 100% of non-emptyDefinite): carele, carii, carea, unii, totul, alții, toții, unul, toțîi, alțîiEMPTY(62021): să, ce, lui, el, -i, -l, s-, lor, ei, le
Definite seems to be lexical feature of PRON. 100% lemmas (10) occur only with one value of Definite.
NUM
858 NUM tokens (17% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (843; 98%), Case=Acc,Nom (624; 73%), Gender=Fem (544; 63%), NumType=Card (495; 58%), Number=Sing (430; 50%).
NUM tokens may have the following values of Definite:
Def(133; 16% of non-emptyDefinite): doilea, treile, doa, treilea, doile, triile, întîia, întîiul, patrulea, unInd(725; 84% of non-emptyDefinite): mii, doao, mie, sute, doo, sută, un, giumătate, întîi, întăiEMPTY(4315): trei, doi, 2, cinci, patru, 3, întîiu, treia, 4, 7
| Paradigm doi | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Number=Sing|NumType=Ord | doilea, doile, doiele, doili | |
| Case=Acc,Nom|Gender=Fem|Number=Sing|NumType=Card | doao, doo, doă, doaă | |
| Case=Acc,Nom|Gender=Fem|Number=Sing|NumType=Ord | doo, doao, DOA | doa, doao |
| Case=Acc,Nom|Gender=Fem|Number=Plur|NumType=Card | doao, doo, doauă, doă, da, dao, doua, douo | |
| Case=Acc,Nom|Gender=Fem|Number=Plur|NumType=Ord | doo | |
| Gender=Masc|Number=Sing|NumType=Ord | doile |
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod]–> NOUN (8673; 75%),
NOUN –[conj]–> NOUN (5774; 85%),
NOUN –[amod]–> ADJ (4401; 58%),
PROPN –[nmod]–> NOUN (1931; 64%),
NOUN –[appos]–> NOUN (769; 73%),
PROPN –[conj]–> PROPN (751; 75%),
PROPN –[nmod]–> PROPN (708; 66%),
ADJ –[conj]–> ADJ (640; 99%),
PROPN –[appos]–> NOUN (550; 54%),
ADJ –[obl]–> NOUN (347; 60%).