home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-Nonstandard: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

180626 tokens (32%) have a non-empty value of Gender. 23602 types (74%) occur at least once with a non-empty value of Gender. 10137 lemmas (82%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (96782; 17% instances), PRON (27320; 5% instances), PROPN (19968; 3% instances), DET (19701; 3% instances), ADJ (10019; 2% instances), VERB (4616; 1% instances), NUM (2215; 0% instances), AUX (4; 0% instances), ADV (1; 0% instances).

NOUN

96782 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Acc,Nom (87450; 90%), Number=Sing (71135; 74%), Definite=Ind (52239; 54%).

NOUN tokens may have the following values of Gender:

Paradigm domnMascFem
Case=Acc,Nom|Definite=Def|Number=Singdomnul, domnu, domnu-, Domnulu
Case=Acc,Nom|Definite=Def|Number=Plurdomnii, domnu
Case=Acc,Nom|Definite=Ind|Number=Singdomnu, domn
Case=Acc,Nom|Definite=Ind|Number=Plurdomni, domnu
Case=Dat,Gen|Definite=Def|Number=Singdomnului, Domn, Domnul, Domnunlui, Domuluidomnii
Case=Dat,Gen|Definite=Def|Number=Plurdomnilor
Case=Dat,Gen|Definite=Ind|Number=SingDomnului
Case=Voc|Definite=Def|Number=SingDoamne
Case=Voc|Definite=Def|Number=Plurdomnilor
Case=Voc|Definite=Ind|Number=Singdoamne

Gender seems to be lexical feature of NOUN. 90% lemmas (5851) occur only with one value of Gender.

PRON

27320 PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (27119; 99%), Number=Sing (18307; 67%), Strength=EMPTY (17542; 64%), PronType=Prs (15734; 58%).

PRON tokens may have the following values of Gender:

Paradigm elMascFem
Case=Acc,Nom|Number=Sing|PronType=Prsel, elu, iel, l, îl, ei, Еl, Lui, Părinteea, ia, -o, O, ei
Case=Acc,Nom|Number=Plur|PronType=Prsei
Case=Acc|Number=Sing|PronType=Prs|Strength=Strongel, elu, еl, -l, ei, l-ia, ea, -o, o, ei
Case=Acc|Number=Sing|PronType=Prs|Strength=Weak-l, l-, l, îl, i-, lu, el, -i, îlu, îi, li-, Il, oo, -o, o-, ia, -l, l, li-
Case=Acc|Number=Plur|PronType=Prs|Strength=Strongei, -i, lor, îiiale, ele, le, eale, ia
Case=Acc|Number=Plur|PronType=Prs|Strength=Weak-i, i-, îi, -l, i, îl, ei, le, l, l-, le-, lile, le-, -le, li, li-, o, -i, -li
Case=Dat,Gen|Number=Sing|PronType=Demlui
Case=Dat,Gen|Number=Plur|PronType=Demlui
Case=Dat|Number=Sing|PronType=Prs|Strength=Stronglui, ei, lorei
Case=Gen|Number=Sing|PronType=Prslui, ei, lor, -iei, o, iei
Case=Gen|Number=Plur|PronType=Prslor, lui
Case=Nom|Number=Plur|PronType=Prsei, iei, îi, i, -i, I-, iiele, le, iale, eale, le-

PROPN

19968 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (19121; 96%), Case=Acc,Nom (18754; 94%), Definite=Ind (15278; 77%).

PROPN tokens may have the following values of Gender:

Paradigm IisusMascFem
Case=Acc,Nom|Definite=DefIisus
Case=Acc,Nom|Definite=IndIisus, IISUS, Isus
Case=Voc|Definite=IndIisuse

Gender seems to be lexical feature of PROPN. 95% lemmas (2274) occur only with one value of Gender.

DET

19701 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (19559; 99%), Poss=EMPTY (16545; 84%), Case=Acc,Nom (16233; 82%), Number[psor]=EMPTY (14948; 76%), Number=Sing (14886; 76%).

DET tokens may have the following values of Gender:

Paradigm -ulMascFem
Case=Acc,Nom|Definite=Def|Number=Sing|PronType=Art-lea, -le-a
Case=Acc,Nom|Number=Plur|PronType=Demlui
Case=Dat,Gen|Definite=Def|Number=Sing|PronType=Art-lui
Case=Dat,Gen|Number=Sing|PronType=Indlui

ADJ

10019 ADJ tokens (86% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10018; 100%), Case=Acc,Nom (9396; 94%), Definite=Ind (9207; 92%), Number=Sing (7302; 73%).

ADJ tokens may have the following values of Gender:

Paradigm mareMascFem
Case=Acc,Nom|Definite=Def|Number=Singmarele, mareli, marilimarea
Case=Acc,Nom|Definite=Def|Number=Plurmarii
Case=Acc,Nom|Definite=Ind|Number=Singmaremare
Case=Acc,Nom|Definite=Ind|Number=Plurmari, mare
Case=Dat,Gen|Definite=Def|Number=Singmareluimarei
Case=Dat,Gen|Definite=Ind|Number=Singmari
Definite=Ind|Number=Plurmari

VERB

4616 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (4616; 100%), Person=EMPTY (4616; 100%), Tense=EMPTY (4616; 100%), VerbForm=Part (4615; 100%), Polarity=Pos (4331; 94%), Number=Sing (3325; 72%).

VERB tokens may have the following values of Gender:

Paradigm ziceMascFem
Case=Acc,Nom|Number=Singzisă, zîsă
Number=Singzis, dzis
Number=Plurzise

NUM

2215 NUM tokens (43% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (2084; 94%), Definite=EMPTY (1461; 66%), Case=EMPTY (1432; 65%), NumType=Card (1271; 57%), Number=Sing (1127; 51%).

NUM tokens may have the following values of Gender:

Paradigm doiMascFem
Case=Acc,Nom|Definite=Def|Number=Sing|NumType=Orddoilea, doile, doiele, doilidoa, doao
Case=Acc,Nom|Definite=Ind|Number=Sing|NumType=Carddoao, doo, doă, doaă
Case=Acc,Nom|Definite=Ind|Number=Sing|NumType=Orddoo, doao, DOA
Case=Acc,Nom|Definite=Ind|Number=Plur|NumType=Carddoao, doo, doauă, doă, da, dao, doua, douo
Case=Acc,Nom|Definite=Ind|Number=Plur|NumType=Orddoo
Definite=Ind|Number=Sing|NumType=Orddoile
Number=Sing|NumType=Orddoilea, doile, doilidoaoa, doa, doua, doaua
Number=Plur|NumType=Carddoidouă, doao, doa, doo

AUX

4 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (4; 100%), Number=Sing (4; 100%), Person=EMPTY (4; 100%), Tense=EMPTY (4; 100%).

AUX tokens may have the following values of Gender:

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (1; 100%), PronType=Int,Rel (1; 100%).

ADV tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (13804; 85%), NOUN –[nmod]–> NOUN (5882; 51%), NOUN –[amod]–> ADJ (5845; 76%), NOUN –[conj]–> NOUN (4692; 69%), PROPN –[nmod]–> NOUN (2857; 95%), NOUN –[nmod]–> PROPN (2818; 58%), NOUN –[amod]–> VERB (1171; 95%), PROPN –[appos]–> NOUN (946; 92%), PROPN –[nmod]–> PROPN (940; 88%), PROPN –[conj]–> PROPN (860; 86%).