This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

90308 tokens (41%) have a non-empty value of Gender. 24196 types (77%) occur at least once with a non-empty value of Gender. 12077 lemmas (70%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (52827; 24% instances), ADJ (14474; 7% instances), DET (10401; 5% instances), VERB (7634; 3% instances), PRON (3079; 1% instances), NUM (940; 0% instances), AUX (631; 0% instances), PROPN (322; 0% instances).
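
As a rough illustration of how counts like these can be derived, the sketch below tallies, per UPOS tag, how many tokens carry a Gender feature in CoNLL-U input. The function names (`parse_feats`, `read_tokens`, `gender_coverage`) are hypothetical and not part of any UD tool. The sketch assumes standard 10-column CoNLL-U lines, and the script that generated this page may count differently.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(lines):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        # Comments and blank lines do not have 10 columns; multiword-token
        # ranges (1-2) and empty nodes (1.1) do not have a plain integer ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def gender_coverage(lines):
    """Map each UPOS tag to (tokens with Gender, all tokens)."""
    total = Counter()
    with_gender = Counter()
    for _form, _lemma, upos, feats in read_tokens(lines):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}
```

Called on the lines of the treebank's CoNLL-U files, `gender_coverage` returns one pair per tag, from which shares such as "97% of all NOUN tokens" can be computed.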

NOUN

52827 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (38593; 73%), Case=Acc,Nom (28805; 55%), Definite=Def (27198; 51%).

NOUN tokens may have the following values of Gender:

Paradigm timp                                        Masc      Fem
Case=Acc,Nom|Definite=Def|Number=Sing                timpul    (none)
Case=Acc,Nom|Definite=Def|Number=Sing|Variant=Short  timpu'    (none)
Case=Acc,Nom|Definite=Def|Number=Plur                (none)    timpurile
Case=Dat,Gen|Definite=Def|Number=Sing                timpului  (none)
Definite=Ind|Number=Sing                             timp      (none)
Definite=Ind|Number=Plur                             (none)    timpuri

Gender seems to be a lexical feature of NOUN. 92% of lemmas (7033) occur with only one value of Gender.
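
The "lexical feature" observation can be checked with a small sketch that collects the set of Gender values seen for each lemma of a given UPOS tag and counts the lemmas with exactly one value. The function names are hypothetical, the input is assumed to be standard 10-column CoNLL-U lines, and the page's own counting rules may differ in detail.

```python
from collections import defaultdict

def gender_values_per_lemma(lines, upos="NOUN"):
    """Map each lemma of the given UPOS to the set of Gender values it occurs with."""
    values = defaultdict(set)
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            if pair.startswith("Gender="):
                values[cols[2]].add(pair[len("Gender="):])
    return values

def single_gender_share(values):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)
```

A lemma that is tagged Masc in some tokens and Fem in others ends up with two values in its set and so falls outside the single-value count.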

ADJ

14474 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (14434; 100%), Definite=Ind (13597; 94%), Number=Sing (9616; 66%), Case=EMPTY (8880; 61%).

ADJ tokens may have the following values of Gender:

Paradigm mare                          Masc     Fem
Case=Acc,Nom|Definite=Def|Number=Sing  marele   marea
Case=Acc,Nom|Definite=Def|Number=Plur  marii    marile
Case=Dat,Gen|Definite=Def|Number=Sing  marelui  Marii
Case=Dat,Gen|Definite=Ind|Number=Sing  (none)   mari
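
A paradigm like the one above can be approximated by grouping the word forms of one lemma by their remaining features (the row) and by their Gender value (the column). The sketch below is a simplification under stated assumptions: `paradigm` is a hypothetical helper, the input is standard 10-column CoNLL-U, and every non-Gender feature is kept in the row label, whereas the table on this page evidently shows a reduced set (for example, it omits Degree).

```python
from collections import defaultdict

def paradigm(lines, lemma, upos):
    """Return {row_features: {gender_value: sorted forms}} for one lemma."""
    table = defaultdict(lambda: defaultdict(set))
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[2] != lemma or cols[3] != upos or cols[5] == "_":
            continue
        feats = cols[5].split("|")
        genders = [f.split("=", 1)[1] for f in feats if f.startswith("Gender=")]
        if not genders:
            continue  # tokens without Gender do not belong in the table
        # The row label is the feature string with Gender removed.
        row = "|".join(f for f in feats if not f.startswith("Gender="))
        table[row][genders[0]].add(cols[1])
    return {row: {g: sorted(forms) for g, forms in cells.items()}
            for row, cells in table.items()}
```

Forms are kept exactly as they appear in the treebank, so capitalized variants show up as separate entries.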

DET

10401 DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Position=EMPTY (8973; 86%), Number=Sing (8609; 83%), Person=EMPTY (7680; 74%), Poss=EMPTY (7070; 68%), Case=Acc,Nom (6028; 58%), PronType=Ind (5340; 51%).
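
In lists like this one, a value of EMPTY means that the token has no value for that feature at all. The sketch below shows one way such co-occurrence counts could be computed. `cooccurring_values` and its `threshold` parameter are hypothetical. The cutoff of more than half of the tokens is an assumption, though it matches the percentages shown on this page. The set of features checked for EMPTY is taken from the Gender-bearing tokens of the given tag, which may not match what the page's generator does.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def cooccurring_values(lines, upos, feature="Gender", threshold=0.5):
    """List (name, value, count, share) for values seen with `feature` on `upos` tokens."""
    rows = []
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = parse_feats(cols[5])
        if feature in feats:
            rows.append(feats)
    if not rows:
        return []
    # Every other feature seen on these tokens; a missing one counts as EMPTY.
    names = {name for feats in rows for name in feats} - {feature}
    counts = Counter()
    for feats in rows:
        for name in names:
            counts[(name, feats.get(name, "EMPTY"))] += 1
    total = len(rows)
    return [(name, value, n, n / total)
            for (name, value), n in counts.most_common()
            if n / total > threshold]
```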

DET tokens may have the following values of Gender:

Paradigm un                                        Masc     Fem
Case=Acc,Nom|Number=Sing                           un, -un  o, -o
Case=Acc,Nom|Number=Plur|Person=3|Position=Prenom  unii     unele
Case=Dat,Gen|Number=Sing                           unui     unei

VERB

7634 VERB tokens (33% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7634; 100%), Person=EMPTY (7634; 100%), Tense=EMPTY (7634; 100%), VerbForm=Part (7634; 100%), Number=Sing (5575; 73%).

VERB tokens may have the following values of Gender:

Paradigm avea  Masc    Fem
Number=Sing    avut    avută
Number=Plur    (none)  avute

PRON

3079 PRON tokens (26% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3079; 100%), Person=3 (3063; 99%), Variant=EMPTY (2646; 86%), Number=Sing (2173; 71%), Case=Acc,Nom (1930; 63%), PronType=Prs (1614; 52%).

PRON tokens may have the following values of Gender:

Paradigm el                                       Masc       Fem
Case=Acc,Nom|Number=Sing|Strength=Strong          el         ea
Case=Acc,Nom|Number=Plur|Strength=Strong          ei         ele
Case=Acc|Number=Sing|Strength=Weak                îl         o
Case=Acc|Number=Sing|Strength=Weak|Variant=Short  -l, l-, l  -o
Case=Acc|Number=Plur|Strength=Weak                îi, i      le
Case=Acc|Number=Plur|Strength=Weak|Variant=Short  -i, i-     le-, -le
Case=Dat,Gen|Number=Sing|Strength=Strong          lui        ei
Case=Dat|Number=Plur|Strength=Weak|Variant=Short  -i         (none)

NUM

940 NUM tokens (17% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (892; 95%), Number=Plur (483; 51%), NumType=Ord (472; 50%).

NUM tokens may have the following values of Gender:

Paradigm doi              Masc            Fem
Number=Sing|NumType=Ord   doilea, secund  doua
Number=Plur|NumType=Card  doi             două

AUX

631 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (631; 100%), Number=Sing (631; 100%), Person=EMPTY (631; 100%), Tense=EMPTY (631; 100%), VerbForm=Part (631; 100%).

AUX tokens may have the following values of Gender:

PROPN

322 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (104) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (11682; 95%), NOUN –[nmod]–> NOUN (8995; 54%), NOUN –[det]–> DET (8122; 78%), NOUN –[conj]–> NOUN (2494; 73%), VERB –[nsubj:pass]–> NOUN (1025; 60%), ADJ –[conj]–> ADJ (662; 93%), VERB –[conj]–> VERB (528; 61%), ADJ –[nsubj]–> NOUN (383; 91%), VERB –[obl:agent]–> NOUN (345; 51%), NOUN –[appos]–> NOUN (308; 57%).
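
Agreement counts of this kind can be sketched by walking each sentence's dependency edges and comparing the Gender values of parent and child. The helper names below are hypothetical and the input is assumed to be standard 10-column CoNLL-U. The share is computed over edges where both nodes carry Gender, which is an assumption: the denominator behind the percentages on this page is not stated.

```python
from collections import Counter

def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None

def sentences(lines):
    """Yield one dict per sentence: token ID -> (upos, gender, head, deprel)."""
    sent = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:  # a blank line ends the sentence
            if sent:
                yield sent
            sent = {}
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            sent[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
    if sent:
        yield sent

def gender_agreement(lines):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, edges with Gender on both nodes)."""
    agree = Counter()
    both = Counter()
    for sent in sentences(lines):
        for upos, gender, head, deprel in sent.values():
            parent = sent.get(head)  # None for the root, whose head is "0"
            if parent is None or gender is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if gender == parent[1]:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}
```

Sorting the result by the first element of each pair gives a ranking comparable to the list above.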