This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

80300 tokens (20%) have a non-empty value of Gender. 9257 types (22%) occur at least once with a non-empty value of Gender. 6079 lemmas (19%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags (token count; share of all tokens in the treebank): DET (38619; 10%), ADJ (16953; 4%), VERB (11161; 3%), PRON (9298; 2%), PROPN (3214; 1%), AUX (904; 0%), X (85; 0%), NUM (61; 0%), SYM (5; 0%).
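Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats` and `gender_counts_by_upos` and the three-token sample are hypothetical, and the code assumes the standard 10-column CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Gender."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Hand-made sample for illustration only (not copied from the treebank).
SAMPLE = "\n".join([
    "# text = La première fois",
    "1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t3\tdet\t_\t_",
    "2\tpremière\tpremier\tADJ\t_\tGender=Fem|Number=Sing\t3\tamod\t_\t_",
    "3\tfois\tfois\tNOUN\t_\tNumber=Sing\t0\troot\t_\t_",
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections below; dividing by the sum of `total` gives the share of all tokens.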

DET

38619 DET tokens (63% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (38372; 99%), PronType=Art (34196; 89%), Definite=Def (26384; 68%).
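A co-occurrence ranking like the one above can be sketched as follows. This is an illustration under stated assumptions: the function name `cooccurring_values` and the sample feature dicts are hypothetical, percentages are taken relative to the tokens of the tag that have a non-empty Gender, and absent features (reported as `EMPTY` elsewhere on this page) are simply not counted.

```python
from collections import Counter


def cooccurring_values(feats_list, feature="Gender"):
    """Rank the other feature=value pairs seen on tokens that carry `feature`.

    feats_list: one dict of features per token, all tokens sharing one UPOS tag.
    Returns a list of (feature=value, count, rounded percent) tuples.
    """
    counts = Counter()
    n_with_feature = 0
    for feats in feats_list:
        if feature not in feats:
            continue
        n_with_feature += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    # counts is empty when no token carries the feature, so no division by zero.
    return [(fv, c, round(100 * c / n_with_feature))
            for fv, c in counts.most_common()]


# Hand-made DET tokens for illustration only.
det_tokens = [
    {"Gender": "Fem", "Number": "Sing", "PronType": "Art", "Definite": "Def"},
    {"Gender": "Masc", "Number": "Sing", "PronType": "Art", "Definite": "Ind"},
    {"Gender": "Masc", "Number": "Sing", "PronType": "Dem"},
    {"Number": "Plur", "PronType": "Art"},  # no Gender, so it is ignored
]

ranking = cooccurring_values(det_tokens)
print(ranking[:2])  # [('Number=Sing', 3, 100), ('PronType=Art', 2, 67)]
```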

DET tokens may have the following values of Gender:

Paradigm le    Masc      Fem
ExtPos=ADV     le        la
ExtPos=NOUN    le
ExtPos=PRON    le
               le, L'    la
Typo=Yes       le        la, là

ADJ

16953 ADJ tokens (71% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (11606; 68%).

ADJ tokens may have the following values of Gender:

Paradigm premier        Masc        Fem
Number=Sing             premier     première
Number=Sing|Typo=Yes    premier
Number=Plur             premiers    premières

VERB

11161 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11161; 100%), Person=EMPTY (11161; 100%), Tense=EMPTY (11161; 100%), VerbForm=Part (11161; 100%), Number=Sing (8845; 79%), Voice=Pass (7693; 69%).

VERB tokens may have the following values of Gender:

Paradigm faire                      Masc    Fem
ExtPos=ADJ|Number=Sing|Voice=Pass           faite
Number=Sing|Typo=Yes|Voice=Act      fais
Number=Sing|Typo=Yes|Voice=Pass     fait
Number=Sing|Voice=Act               fait    faite
Number=Sing|Voice=Pass              fait    faite
Number=Plur|Voice=Act               faits
Number=Plur|Voice=Pass              faits   faites

PRON

9298 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9197; 99%), Person=3 (8959; 96%), Number=Sing (7911; 85%), Emph=No (6342; 68%), PronType=Prs (6091; 66%), Case=Nom (5707; 61%).

PRON tokens may have the following values of Gender:

Paradigm lui                              Masc                   Fem
Case=Acc|Emph=No|Number=Sing              le                     la
Case=Dat|Emph=No|Number=Sing              lui
Case=Nom|Emph=No|ExtPos=ADP|Number=Sing   il
Case=Nom|Emph=No|Number=Sing              il, -il                elle, -elle
Case=Nom|Emph=No|Number=Sing|Typo=Yes     il                     elle
Case=Nom|Emph=No|Number=Plur              ils                    elles
Emph=No|Number=Sing                       -t-il, -il, le, lui    -t-elle, -elle, la
Emph=No|Number=Sing|Typo=Yes              t-il, -il, -le, t'il
Emph=Yes|Number=Sing                      lui                    elle
Emph=Yes|Number=Plur                                             elles

PROPN

3214 PROPN tokens (12% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3164; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Afrique    Masc       Fem
                    Afrique    Afrique

Gender seems to be a lexical feature of PROPN: 99% of lemmas (1865) occur with only one value of Gender.
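The "lexical feature" observation rests on a simple per-lemma check, which could be sketched as below. The function name `single_value_share` and the sample lemmas are hypothetical, and the sketch assumes that only lemmas seen at least once with a non-empty Gender enter the denominator.

```python
from collections import defaultdict


def single_value_share(tokens, feature="Gender"):
    """Return (lemmas with exactly one value, their share among lemmas with the feature).

    tokens: iterable of (lemma, feats) pairs, where feats is a dict of features.
    """
    values_by_lemma = defaultdict(set)
    for lemma, feats in tokens:
        if feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, single / len(values_by_lemma)


# Hand-made PROPN tokens for illustration only.
propn_tokens = [
    ("Afrique", {"Gender": "Fem", "Number": "Sing"}),
    ("Afrique", {"Gender": "Masc", "Number": "Sing"}),  # second value, same lemma
    ("France", {"Gender": "Fem", "Number": "Sing"}),
    ("Paris", {"Gender": "Masc", "Number": "Sing"}),
    ("Google", {}),  # no Gender: excluded from the denominator
]

single, share = single_value_share(propn_tokens)
print(single, f"{share:.0%}")  # 2 67%
```

A share close to 100% suggests that the value is a property of the lemma rather than of the inflected form, which is the reading given above for PROPN.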

AUX

904 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (904; 100%), Number=Sing (904; 100%), Person=EMPTY (904; 100%), Tense=EMPTY (904; 100%), VerbForm=Part (904; 100%).

AUX tokens may have the following values of Gender:

Paradigm faire    Masc    Fem
                  fait    faite

X

85 X tokens (3% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (85; 100%), ExtPos=PROPN (43; 51%).

X tokens may have the following values of Gender:

Gender seems to be a lexical feature of X: 100% of lemmas (83) occur with only one value of Gender.

NUM

61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

SYM

5 SYM tokens (1% of all SYM tokens) have a non-empty value of Gender.

The most frequent other feature values with which SYM and Gender co-occurred: Number=Sing (4; 80%), ExtPos=NOUN (3; 60%).

SYM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where the parent and child nodes agree in Gender: PROPN –[det]–> DET (2950; 96%), VERB –[nsubj:pass]–> PRON (684; 70%), VERB –[conj]–> VERB (637; 51%), PROPN –[amod]–> ADJ (335; 82%), ADJ –[det]–> DET (285; 57%), PRON –[amod]–> ADJ (74; 86%), PRON –[acl]–> VERB (55; 50%), ADJ –[nsubj]–> PROPN (40; 65%), PRON –[nsubj]–> PRON (32; 51%), PRON –[conj]–> PRON (22; 51%).
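Agreement figures of this shape can be approximated by comparing each token's Gender with that of its head. The sketch below is illustrative only: the function name `agreement_by_relation`, the token dict layout, and the sample sentence are hypothetical, and it assumes the percentage is the share of agreeing pairs among parent-child pairs in which both nodes have a non-empty Gender, which this page does not state explicitly.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """Count Gender agreement per (parent UPOS, relation, child UPOS).

    sentence: list of dicts with keys id, head, upos, deprel, gender (None if absent).
    Returns (agree, both): agreeing pairs and pairs where both nodes have Gender.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Hand-made tokens for illustration only; token 3 deliberately disagrees.
sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "PROPN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 4, "upos": "DET", "deprel": "det", "gender": "Masc"},
    {"id": 4, "head": 2, "upos": "PROPN", "deprel": "conj", "gender": "Fem"},
]

agree, both = agreement_by_relation(sentence)
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}", agree[key], both[key])
```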