home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-ArT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

132 tokens (23%) have a non-empty value of Gender. 107 types (36%) occur at least once with a non-empty value of Gender. 83 lemmas (37%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (78; 14% instances), PRON (22; 4% instances), DET (14; 2% instances), ADJ (8; 1% instances), NUM (5; 1% instances), VERB (5; 1% instances).

NOUN

78 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (69; 88%), Case=Acc,Nom (57; 73%), Definite=Def (40; 51%).

NOUN tokens may have the following values of Gender:

Paradigm merMascFem
Case=Acc,Nom|Definite=Def|Number=SingMerlu
Definite=Ind|Number=Plurmeari

Gender seems to be lexical feature of NOUN. 98% lemmas (58) occur only with one value of Gender.

PRON

22 PRON tokens (31% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (22; 100%), Reflex=EMPTY (22; 100%), Number=Sing (19; 86%), PronType=Prs (18; 82%), Variant=EMPTY (15; 68%).

PRON tokens may have the following values of Gender:

Paradigm elMascFem
Case=Acc|Number=Sing|Strength=Strongu
Case=Acc|Number=Sing|Strength=Weaku, O
Case=Acc|Number=Sing|Strength=Weak|Variant=Shortlu, -l-o
Case=Acc|Number=Plur|Strength=Weak|Variant=Shortlo-li
Case=Dat|Number=Sing|Strength=Weak|Variant=Shortĺ-

DET

14 DET tokens (93% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (13; 93%), Case=Acc,Nom (12; 86%), Position=EMPTY (12; 86%), Number=Sing (11; 79%), PronType=Ind (11; 79%), Person=EMPTY (8; 57%).

DET tokens may have the following values of Gender:

Paradigm unMascFem
unnă, Ună

ADJ

8 ADJ tokens (89% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (8; 100%), Definite=Ind (7; 88%), Number=Sing (7; 88%), Case=Acc,Nom (6; 75%).

ADJ tokens may have the following values of Gender:

Paradigm mareMascFem
Definite=Def|Number=Plurmărli
Definite=Ind|Number=Singmari

NUM

5 NUM tokens (83% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc,Nom (5; 100%), NumForm=Word (5; 100%), NumType=Card (5; 100%), Definite=Def (4; 80%), Number=Sing (3; 60%).

NUM tokens may have the following values of Gender:

VERB

5 VERB tokens (4% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (5; 100%), Number=Sing (5; 100%), Person=EMPTY (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Part (5; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (13; 93%), NOUN –[amod]–> ADJ (4; 67%), ADJ –[nsubj]–> NOUN (1; 100%), DET –[fixed]–> NOUN (1; 100%), NOUN –[amod]–> NUM (1; 100%).