home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-ParisStories: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

6634 tokens (16%) have a non-empty value of Gender. 278 types (9%) occur at least once with a non-empty value of Gender. 203 lemmas (9%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: PRON (3175; 7% instances), DET (2415; 6% instances), ADJ (643; 2% instances), VERB (292; 1% instances), AUX (42; 0% instances), ADV (33; 0% instances), PROPN (16; 0% instances), X (10; 0% instances), NUM (6; 0% instances), NOUN (2; 0% instances).

PRON

3175 PRON tokens (49% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (3144; 99%), Number=Sing (3002; 95%), Emph=No (1823; 57%), Case=Nom (1744; 55%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Case=Acc|Emph=No|Number=Sing|Person=3lela
Case=Acc|Emph=No|Number=Single
Case=Dat|Emph=No|Number=Sing|Person=3lui
Case=Nom|Emph=No|ExtPos=ADP|Number=Sing|Person=3il
Case=Nom|Emph=No|ExtPos=VERB|Number=Sing|Person=3il
Case=Nom|Emph=No|Number=Sing|Person=3il, elleelle
Case=Nom|Emph=No|Number=Plur|Person=3ilselles
Emph=No|Number=Sing|Person=3lui, le
Emph=Yes|Number=Sing|Person=3luielle
Emph=Yes|Number=Plur|Person=3elles

DET

2415 DET tokens (70% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2404; 100%), Number[psor]=EMPTY (2156; 89%), Person[psor]=EMPTY (2156; 89%), Poss=EMPTY (2156; 89%), PronType=Art (2047; 85%), Definite=Def (1355; 56%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|Number=Singlela
Definite=Def|Number=Plurles
Definite=Ind|Number=Single

ADJ

643 ADJ tokens (54% of all ADJ tokens) have a non-empty value of Gender.

ADJ tokens may have the following values of Gender:

Paradigm toutMascFem
_tout, toustoute, toutes
PronType=Indtout, toustoute

VERB

292 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (242; 83%), Tense=EMPTY (242; 83%), VerbForm=Part (241; 83%), Person=EMPTY (237; 81%), Number=EMPTY (211; 72%), Voice=Act (170; 58%).

VERB tokens may have the following values of Gender:

Paradigm prendreMascFem
Voice=Actprisprise
Voice=Passprisprise

AUX

42 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (42; 100%), Number=Sing (42; 100%), VerbForm=Part (42; 100%), Person=EMPTY (41; 98%), Tense=Past (37; 88%).

AUX tokens may have the following values of Gender:

ADV

33 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: ExtPos=EMPTY (33; 100%), Polarity=EMPTY (33; 100%).

ADV tokens may have the following values of Gender:

PROPN

16 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

X

10 X tokens (8% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (10; 100%), ExtPos=NOUN (6; 60%).

X tokens may have the following values of Gender:

NUM

6 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm unMascFem
unune

NOUN

2 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Gender.

NOUN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PRON –[reparandum]–> PRON (65; 96%), DET –[reparandum]–> DET (50; 77%), PRON –[amod]–> ADJ (32; 94%), DET –[fixed]–> ADJ (6; 100%), PRON –[acl:relcl]–> ADJ (6; 67%), PRON –[conj]–> PRON (4; 67%), DET –[nsubj]–> PRON (3; 75%), PRON –[appos]–> PRON (2; 100%), ADJ –[appos]–> ADJ (1; 100%), ADJ –[conj]–> NOUN (1; 100%).