This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

7066 tokens (49%) have a non-empty value of Gender. 3320 types (82%) occur at least once with a non-empty value of Gender. 2037 lemmas (75%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (3934; 27% of instances), ADJ (2051; 14% of instances), DET (627; 4% of instances), VERB (316; 2% of instances), PRON (91; 1% of instances), NUM (25; 0% of instances), AUX (22; 0% of instances).
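Per-tag counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. The function names `parse_feats` and `gender_by_upos` and the three-token sample are illustrative assumptions, and the sample is made up rather than taken from the treebank.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_by_upos(conllu_text):
    """Count tokens per UPOS tag, and those with a non-empty Gender."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender

# Hypothetical sample rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "cazul", "caz", "NOUN", "_",
     "Case=Nom|Definite=Def|Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["2", "aortic", "aortic", "ADJ", "_",
     "Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
    ["3", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
])

total, with_gender = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the share of tokens of that tag carrying Gender, which corresponds to the per-tag percentages reported in the sections below.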

NOUN

3934 NOUN tokens (90% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Abbr=EMPTY (3934; 100%), Number=Sing (2867; 73%), Definite=Def (2242; 57%), Case=Nom (2104; 53%).

NOUN tokens may have the following values of Gender:

Paradigm caz (Gender values: Masc, Fem)
Case=Gen|Definite=Def|Number=Plur: cazurilor
Case=Nom|Definite=Def|Number=Sing: cazul
Case=Nom|Definite=Def|Number=Plur: cazurile
Definite=Ind|Number=Sing: caz
Definite=Ind|Number=Plur: cazuri

Gender seems to be a lexical feature of NOUN. 95% of lemmas (1081) occur with only one value of Gender.
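The "lexical feature" test can be sketched as a count of lemmas that are only ever seen with a single Gender value. This is an illustrative sketch, not the page's own computation: the function name `single_gender_share` and the sample tuples are made up, and it assumes that the percentage is taken over lemmas seen at least once with a non-empty Gender.

```python
from collections import defaultdict

def single_gender_share(tokens, upos):
    """tokens: iterable of (lemma, upos, gender), gender is None if absent.

    Returns (n_single, n_lemmas): the number of lemmas of the given tag seen
    with exactly one Gender value, and the number seen with any Gender value.
    """
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    n_single = sum(1 for seen in values.values() if len(seen) == 1)
    return n_single, len(values)

# Hypothetical tokens: "caz" appears with both values, "pacient" with one.
sample = [
    ("caz", "NOUN", "Masc"),
    ("caz", "NOUN", "Fem"),
    ("pacient", "NOUN", "Masc"),
    ("pacient", "NOUN", "Masc"),
    ("aortic", "ADJ", "Fem"),
    ("în", "ADP", None),
]

n_single, n_lemmas = single_gender_share(sample, "NOUN")
print(f"{n_single} of {n_lemmas} NOUN lemmas have a single Gender value")
```

A high share suggests that the value is fixed per lemma rather than chosen by inflection. A lemma such as caz, which the paradigm above lists under both Masc and Fem, falls in the remainder.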

ADJ

2051 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (2047; 100%), Definite=Ind (2035; 99%), Number=Sing (1477; 72%), Case=EMPTY (1064; 52%).

ADJ tokens may have the following values of Gender:

Paradigm aortic (Gender values: Masc, Fem)
Case=Gen|Number=Sing: aortice
Case=Nom|Number=Sing: aortică
Number=Sing: aortic
Number=Plur: aortice

DET

627 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Position=EMPTY (552; 88%), Number=Sing (519; 83%), Person=EMPTY (509; 81%), Case=EMPTY (322; 51%), Poss=Yes (314; 50%), PronType=Prs (314; 50%).

DET tokens may have the following values of Gender:

Paradigm al (Gender values: Masc, Fem)
Number=Sing: al (Masc), a (Fem)
Number=Plur: ai (Masc), ale (Fem)

VERB

316 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (316; 100%), Person=EMPTY (316; 100%), Tense=EMPTY (316; 100%), VerbForm=Part (316; 100%), Number=Sing (214; 68%).

VERB tokens may have the following values of Gender:

Paradigm asocia (Gender values: Masc, Fem)
Number=Sing: asociat (Masc), asociată (Fem)
Number=Plur: asociați (Masc), asociate (Fem)

PRON

91 PRON tokens (23% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (91; 100%), Reflex=EMPTY (91; 100%), Strength=EMPTY (78; 86%), Case=Nom (76; 84%), PronType=Dem (62; 68%), Number=Plur (48; 53%).

PRON tokens may have the following values of Gender:

Paradigm care (Gender values: Masc, Fem)
căruia (Masc), căreia (Fem)

NUM

25 NUM tokens (7% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (23; 92%), Number=Plur (16; 64%), NumType=Ord (14; 56%).

NUM tokens may have the following values of Gender:

Paradigm doi (Gender values: Masc, Fem)
doi (Masc), două (Fem)

Gender seems to be a lexical feature of NUM. 93% of lemmas (13) occur with only one value of Gender.

AUX

22 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (22; 100%), Number=Sing (22; 100%), Person=EMPTY (22; 100%), Tense=EMPTY (22; 100%), VerbForm=Part (22; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (1675; 95%), NOUN –[det]–> DET (435; 71%), NOUN –[conj]–> NOUN (316; 64%), ADJ –[conj]–> ADJ (74; 97%), ADJ –[nsubj]–> NOUN (69; 90%), VERB –[nsubj:pass]–> NOUN (66; 56%), NOUN –[acl]–> ADJ (52; 91%), ADJ –[conj]–> NOUN (21; 72%), NOUN –[amod]–> NOUN (19; 83%), ADJ –[det]–> DET (18; 100%).
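Agreement counts of this kind can be approximated by comparing the Gender of each node with that of its parent. The sketch below is a hedged illustration rather than the procedure used for this page. The function name `gender_agreement`, the dict-based token layout, and the sample sentence are assumptions, and so is the choice to compute the percentage only over parent-child pairs where both nodes carry Gender.

```python
from collections import Counter

def gender_agreement(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, gender.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs that agree in Gender, and all pairs where both nodes have Gender.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None:
            continue
        if child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both

# Hypothetical sentence: a noun with one agreeing and one non-agreeing adjective.
sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
    {"id": 4, "head": 1, "upos": "PUNCT", "deprel": "punct", "gender": None},
]

agree, both = gender_agreement(sentence)
for key, n in both.items():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {n} agree")
```

Summing both Counters over all sentences of a treebank and sorting by the agreement count would yield a ranking in the same shape as the list above.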