This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ro/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Romanian)

This feature is universal. It occurs with 2 different values: Fem, Masc.

90602 tokens (41%) have a non-empty value of Gender. 24276 types (77%) occur at least once with a non-empty value of Gender. 12135 lemmas (70%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (52829; 24% instances), ADJ (14794; 7% instances), DET (10394; 5% instances), VERB (7729; 4% instances), PRON (3174; 1% instances), NUM (901; 0% instances), AUX (475; 0% instances), PROPN (306; 0% instances).

NOUN

52829 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (38507; 73%), Case=Acc,Nom (28798; 55%), Definite=Def (27175; 51%).

NOUN tokens may have the following values of Gender:

Paradigm timpMascFem
Case=Acc,Nom|Definite=Def|Number=Singtimpul
Case=Acc,Nom|Definite=Def|Number=Sing|Variant=Shorttimpu'
Case=Acc,Nom|Definite=Def|Number=Plurtimpurile
Case=Dat,Gen|Definite=Def|Number=Singtimpului
Definite=Ind|Number=Singtimp
Definite=Ind|Number=Plurtimpuri

Gender seems to be lexical feature of NOUN. 92% lemmas (7007) occur only with one value of Gender.

ADJ

14794 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (14758; 100%), Definite=Ind (13906; 94%), Number=Sing (9889; 67%), Case=EMPTY (9064; 61%).

ADJ tokens may have the following values of Gender:

Paradigm mareMascFem
Case=Acc,Nom|Definite=Def|Number=Singmarelemarea
Case=Acc,Nom|Definite=Def|Number=Plurmariimarile
Case=Acc,Nom|Definite=Ind|Number=Singmare
Case=Dat,Gen|Definite=Def|Number=Singmarelui
Case=Dat,Gen|Definite=Ind|Number=Singmari
Definite=Ind|Number=Singmare

DET

10394 DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Position=EMPTY (8962; 86%), Number=Sing (8614; 83%), Person=EMPTY (7701; 74%), Poss=EMPTY (7076; 68%), Case=Acc,Nom (6033; 58%), PronType=Ind (5364; 52%).

DET tokens may have the following values of Gender:

Paradigm unMascFem
Case=Acc,Nom|Number=Singun, -uno, -o
Case=Acc,Nom|Number=Plur|Person=3|Position=Prenomuniiunele
Case=Dat,Gen|Number=Singunuiunei

VERB

7729 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Tense=EMPTY (7729; 100%), Mood=EMPTY (7729; 100%), VerbForm=Part (7729; 100%), Person=EMPTY (7729; 100%), Number=Sing (5707; 74%).

VERB tokens may have the following values of Gender:

Paradigm aveaMascFem
Number=Singavutavută
Number=Pluravute

PRON

3174 PRON tokens (27% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3174; 100%), Person=3 (3154; 99%), Variant=EMPTY (2690; 85%), Number=Sing (2205; 69%), Case=Acc,Nom (1977; 62%), PronType=Prs (1674; 53%).

PRON tokens may have the following values of Gender:

Paradigm elMascFem
Case=Acc,Nom|Number=Sing|Strength=Strongelea
Case=Acc,Nom|Number=Plur|Strength=Strongeiele
Case=Acc|Number=Sing|Strength=Weakîlo
Case=Acc|Number=Sing|Strength=Weak|Variant=Short-l, l-, l-o
Case=Acc|Number=Plur|Strength=Weakîile
Case=Acc|Number=Plur|Strength=Weak|Variant=Short-i, i-le-, -le
Case=Dat,Gen|Number=Sing|Strength=Strongluiei

NUM

901 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (853; 95%), Number=Plur (484; 54%), NumType=Ord (473; 52%).

NUM tokens may have the following values of Gender:

Paradigm doiMascFem
Number=Sing|NumType=Orddoilea, secunddoua
Number=Plur|NumType=Carddoidouă

AUX

475 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=Part (475; 100%), Mood=EMPTY (475; 100%), Tense=EMPTY (475; 100%), Number=Sing (475; 100%), Person=EMPTY (475; 100%).

AUX tokens may have the following values of Gender:

PROPN

306 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (88) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (11509; 96%), NOUN –[nmod]–> NOUN (8860; 54%), NOUN –[det]–> DET (7395; 79%), NOUN –[conj]–> NOUN (2461; 73%), VERB –[nsubjpass]–> NOUN (891; 58%), ADJ –[conj]–> ADJ (650; 94%), NOUN –[amod]–> DET (606; 82%), VERB –[conj]–> VERB (515; 61%), ADJ –[nsubj]–> NOUN (386; 93%), ADJ –[nmod]–> NOUN (377; 52%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]