Treebank Statistics: UD_Romanian-Nonstandard: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
180626 tokens (32%) have a non-empty value of Gender.
23602 types (74%) occur at least once with a non-empty value of Gender.
10137 lemmas (82%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: NOUN (96782; 17% instances), PRON (27320; 5% instances), PROPN (19968; 3% instances), DET (19701; 3% instances), ADJ (10019; 2% instances), VERB (4616; 1% instances), NUM (2215; 0% instances), AUX (4; 0% instances), ADV (1; 0% instances).
NOUN
96782 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Acc,Nom (87450; 90%), Number=Sing (71135; 74%), Definite=Ind (52239; 54%).
NOUN tokens may have the following values of Gender:
Fem(49597; 51% of non-emptyGender): țara, țară, oaste, lume, pace, parte, casa, credință, vreme, casăMasc(47185; 49% of non-emptyGender): vodă, domnul, doamne, omul, om, domnului, cuvîntul, oameni, împăratul, turciiEMPTY(1): neamure
| Paradigm domn | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def|Number=Sing | domnul, domnu, domnu-, Domnulu | |
| Case=Acc,Nom|Definite=Def|Number=Plur | domnii, domnu | |
| Case=Acc,Nom|Definite=Ind|Number=Sing | domnu, domn | |
| Case=Acc,Nom|Definite=Ind|Number=Plur | domni, domnu | |
| Case=Dat,Gen|Definite=Def|Number=Sing | domnului, Domn, Domnul, Domnunlui, Domului | domnii |
| Case=Dat,Gen|Definite=Def|Number=Plur | domnilor | |
| Case=Dat,Gen|Definite=Ind|Number=Sing | Domnului | |
| Case=Voc|Definite=Def|Number=Sing | Doamne | |
| Case=Voc|Definite=Def|Number=Plur | domnilor | |
| Case=Voc|Definite=Ind|Number=Sing | doamne |
Gender seems to be lexical feature of NOUN. 90% lemmas (5851) occur only with one value of Gender.
PRON
27320 PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (27119; 99%), Number=Sing (18307; 67%), Strength=EMPTY (17542; 64%), PronType=Prs (15734; 58%).
PRON tokens may have the following values of Gender:
Fem(7046; 26% of non-emptyGender): o, aceaia, le, aceasta, carea, toate, aceastea, -o, ei, eaMasc(20274; 74% of non-emptyGender): lui, el, -l, -i, carele, ei, l-, carii, i-, toțiEMPTY(37308): să, ce, s-, lor, -i, mă, voi, eu, se, cine
| Paradigm el | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Number=Sing|PronType=Prs | el, elu, iel, l, îl, ei, Еl, Lui, Părinte | ea, ia, -o, O, ei |
| Case=Acc,Nom|Number=Plur|PronType=Prs | ei | |
| Case=Acc|Number=Sing|PronType=Prs|Strength=Strong | el, elu, еl, -l, ei, l- | ia, ea, -o, o, ei |
| Case=Acc|Number=Sing|PronType=Prs|Strength=Weak | -l, l-, l, îl, i-, lu, el, -i, îlu, îi, li-, Il, o | o, -o, o-, ia, -l, l, li- |
| Case=Acc|Number=Plur|PronType=Prs|Strength=Strong | ei, -i, lor, îi | iale, ele, le, eale, ia |
| Case=Acc|Number=Plur|PronType=Prs|Strength=Weak | -i, i-, îi, -l, i, îl, ei, le, l, l-, le-, li | le, le-, -le, li, li-, o, -i, -li |
| Case=Dat,Gen|Number=Sing|PronType=Dem | lui | |
| Case=Dat,Gen|Number=Plur|PronType=Dem | lui | |
| Case=Dat|Number=Sing|PronType=Prs|Strength=Strong | lui, ei, lor | ei |
| Case=Gen|Number=Sing|PronType=Prs | lui, ei, lor, -i | ei, o, iei |
| Case=Gen|Number=Plur|PronType=Prs | lor, lui | |
| Case=Nom|Number=Plur|PronType=Prs | ei, iei, îi, i, -i, I-, ii | ele, le, iale, eale, le- |
PROPN
19968 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (19121; 96%), Case=Acc,Nom (18754; 94%), Definite=Ind (15278; 77%).
PROPN tokens may have the following values of Gender:
Fem(3151; 16% of non-emptyGender): Poartă, Moldova, Muntenească, Evangheliia, Tighine, Cameniță, Leșască, Ungurească, Leşască, MariaMasc(16817; 84% of non-emptyGender): dumnezău, Hristos, Iisus, Pavel, David, Pătru, Ioan, Mihai-, Duca, Dumitraşco-EMPTY(167): tîrgu, târgu, greșală, Dunărea, războiu, boer, iarăș, Catargiul, Chipru, Filimon
| Paradigm Iisus | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def | Iisus | |
| Case=Acc,Nom|Definite=Ind | Iisus, IISUS, Isus | |
| Case=Voc|Definite=Ind | Iisuse |
Gender seems to be lexical feature of PROPN. 95% lemmas (2274) occur only with one value of Gender.
DET
19701 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (19559; 99%), Poss=EMPTY (16545; 84%), Case=Acc,Nom (16233; 82%), Number[psor]=EMPTY (14948; 76%), Number=Sing (14886; 76%).
DET tokens may have the following values of Gender:
Fem(12041; 61% of non-emptyGender): a, o, toată, ta, toate, tot, cea, mea, multe, saMasc(7660; 39% of non-emptyGender): un, al, cel, mieu, tău, cei, său, acel, toți, nostruEMPTY(4157): lui, ce, care, celor, nește, tuturor, niște, nişte, lu, vo
| Paradigm -ul | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def|Number=Sing|PronType=Art | -lea, -le | -a |
| Case=Acc,Nom|Number=Plur|PronType=Dem | lui | |
| Case=Dat,Gen|Definite=Def|Number=Sing|PronType=Art | -lui | |
| Case=Dat,Gen|Number=Sing|PronType=Ind | lui |
ADJ
10019 ADJ tokens (86% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10018; 100%), Case=Acc,Nom (9396; 94%), Definite=Ind (9207; 92%), Number=Sing (7302; 73%).
ADJ tokens may have the following values of Gender:
Fem(4400; 44% of non-emptyGender): bună, svînta, svîntă, bune, frumoasă, mare, sfîntă, grea, curată, plinăMasc(5619; 56% of non-emptyGender): bun, svinte, sfînt, datoriu, mic, rău, omenesc, verde, viu, nouEMPTY(1644): mare, mari, vel, vel-, tare, verde, dulce, rece, dulci, iute
| Paradigm mare | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def|Number=Sing | marele, mareli, marili | marea |
| Case=Acc,Nom|Definite=Def|Number=Plur | marii | |
| Case=Acc,Nom|Definite=Ind|Number=Sing | mare | mare |
| Case=Acc,Nom|Definite=Ind|Number=Plur | mari, mare | |
| Case=Dat,Gen|Definite=Def|Number=Sing | marelui | marei |
| Case=Dat,Gen|Definite=Ind|Number=Sing | mari | |
| Definite=Ind|Number=Plur | mari |
VERB
4616 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (4616; 100%), Person=EMPTY (4616; 100%), Tense=EMPTY (4616; 100%), VerbForm=Part (4615; 100%), Polarity=Pos (4331; 94%), Number=Sing (3325; 72%).
VERB tokens may have the following values of Gender:
Fem(1737; 38% of non-emptyGender): scrisă, dată, scrise, făcută, făcute, adevărată, pusă, adevărate, aleasă, ascunsăMasc(2879; 62% of non-emptyGender): scris, făcut, dat, pus, născut, dus, zis, ales, iubit, legatEMPTY(70507): zise, făcut, face, da, dat, era, zice, veni, luat, avea
| Paradigm zice | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Number=Sing | zisă, zîsă | |
| Number=Sing | zis, dzis | |
| Number=Plur | zise |
NUM
2215 NUM tokens (43% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (2084; 94%), Definite=EMPTY (1461; 66%), Case=EMPTY (1432; 65%), NumType=Card (1271; 57%), Number=Sing (1127; 51%).
NUM tokens may have the following values of Gender:
Fem(1172; 53% of non-emptyGender): doao, treia, mii, două, doa, mie, sute, doo, sută, patraMasc(1043; 47% of non-emptyGender): doi, întîiu, amîndoi, doisprăzeace, întîi, întăiu, dintîiu, un, doilea, dentîiuEMPTY(2958): trei, 2, 3, cinci, patru, 4, 7, 12, 5, 1
| Paradigm doi | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def|Number=Sing|NumType=Ord | doilea, doile, doiele, doili | doa, doao |
| Case=Acc,Nom|Definite=Ind|Number=Sing|NumType=Card | doao, doo, doă, doaă | |
| Case=Acc,Nom|Definite=Ind|Number=Sing|NumType=Ord | doo, doao, DOA | |
| Case=Acc,Nom|Definite=Ind|Number=Plur|NumType=Card | doao, doo, doauă, doă, da, dao, doua, douo | |
| Case=Acc,Nom|Definite=Ind|Number=Plur|NumType=Ord | doo | |
| Definite=Ind|Number=Sing|NumType=Ord | doile | |
| Number=Sing|NumType=Ord | doilea, doile, doili | doaoa, doa, doua, doaua |
| Number=Plur|NumType=Card | doi | două, doao, doa, doo |
AUX
4 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (4; 100%), Number=Sing (4; 100%), Person=EMPTY (4; 100%), Tense=EMPTY (4; 100%).
AUX tokens may have the following values of Gender:
Masc(4; 100% of non-emptyGender): fost, vrutEMPTY(31922): au, va, -i, -au, era, am, a, iaste, vor, fi
ADV
1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (1; 100%), PronType=Int,Rel (1; 100%).
ADV tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): cîtEMPTY(34595): nu, mai, și, cum, n-, cînd, numai, şi, tot, unde
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (13804; 85%),
NOUN –[nmod]–> NOUN (5882; 51%),
NOUN –[amod]–> ADJ (5845; 76%),
NOUN –[conj]–> NOUN (4692; 69%),
PROPN –[nmod]–> NOUN (2857; 95%),
NOUN –[nmod]–> PROPN (2818; 58%),
NOUN –[amod]–> VERB (1171; 95%),
PROPN –[appos]–> NOUN (946; 92%),
PROPN –[nmod]–> PROPN (940; 88%),
PROPN –[conj]–> PROPN (860; 86%).