home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

168176 tokens (42%) have a non-empty value of Gender. 23120 types (54%) occur at least once with a non-empty value of Gender. 15916 lemmas (48%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (75175; 19% instances), DET (44453; 11% instances), ADJ (23820; 6% instances), VERB (11161; 3% instances), PRON (9298; 2% instances), PROPN (3214; 1% instances), AUX (904; 0% instances), X (85; 0% instances), NUM (61; 0% instances), SYM (5; 0% instances).

NOUN

75175 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56343; 75%).

NOUN tokens may have the following values of Gender:

Paradigm partieMascFem
Number=Singpartie
Number=Sing|Typo=Yespartipartie
Number=Plurparties

Gender seems to be lexical feature of NOUN. 99% lemmas (9330) occur only with one value of Gender.

DET

44453 DET tokens (73% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (44204; 99%), PronType=Art (39692; 89%), Definite=Def (31882; 72%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
ExtPos=ADVlela
ExtPos=NOUNle
ExtPos=PRONle
le, l'la, l'
Typo=Yeslela, là

ADJ

23820 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17171; 72%).

ADJ tokens may have the following values of Gender:

Paradigm premierMascFem
Number=Singpremierpremière
Number=Sing|Typo=Yespremier
Number=Plurpremierspremières

VERB

11161 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11161; 100%), Person=EMPTY (11161; 100%), Tense=EMPTY (11161; 100%), VerbForm=Part (11161; 100%), Number=Sing (8845; 79%), Voice=Pass (7693; 69%).

VERB tokens may have the following values of Gender:

Paradigm faireMascFem
ExtPos=ADJ|Number=Sing|Voice=Passfaite
Number=Sing|Typo=Yes|Voice=Actfais
Number=Sing|Typo=Yes|Voice=Passfait
Number=Sing|Voice=Actfaitfaite
Number=Sing|Voice=Passfaitfaite
Number=Plur|Voice=Actfaits
Number=Plur|Voice=Passfaitsfaites

PRON

9298 PRON tokens (53% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9197; 99%), Person=3 (8959; 96%), Number=Sing (7911; 85%), Emph=No (6342; 68%), PronType=Prs (6091; 66%), Case=Nom (5707; 61%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Case=Acc|Emph=No|Number=Singlela
Case=Dat|Emph=No|Number=Singlui
Case=Nom|Emph=No|ExtPos=ADP|Number=Singil
Case=Nom|Emph=No|Number=Singil, -ilelle, -elle
Case=Nom|Emph=No|Number=Sing|Typo=Yesilelle
Case=Nom|Emph=No|Number=Plurils, -ilselles
Case=Nom|Emph=No|Number=Plur|Typo=YesElles
Emph=No|Number=Sing-t-il, -il, le, lui-t-elle, -elle, la
Emph=No|Number=Sing|Typo=Yest-il, -il, -le, t'il
Emph=No|Number=Plur-ils-elles, ELLES
Emph=Yes|Number=Singluielle
Emph=Yes|Number=Plureuxelles
Emph=Yes|Number=Plur|Typo=Yes-eux

PROPN

3214 PROPN tokens (12% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3164; 98%).

PROPN tokens may have the following values of Gender:

Paradigm AfriqueMascFem
AfriqueAfrique

Gender seems to be lexical feature of PROPN. 99% lemmas (1865) occur only with one value of Gender.

AUX

904 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (904; 100%), Number=Sing (904; 100%), Person=EMPTY (904; 100%), Tense=EMPTY (904; 100%), VerbForm=Part (904; 100%).

AUX tokens may have the following values of Gender:

Paradigm faireMascFem
faitfaite

X

85 X tokens (3% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (85; 100%), ExtPos=PROPN (43; 51%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (83) occur only with one value of Gender.

NUM

61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (61; 100%).

NUM tokens may have the following values of Gender:

SYM

5 SYM tokens (1% of all SYM tokens) have a non-empty value of Gender.

The most frequent other feature values with which SYM and Gender co-occurred: Number=Sing (4; 80%), ExtPos=NOUN (3; 60%).

SYM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (39084; 73%), NOUN –[amod]–> ADJ (19323; 100%), NOUN –[conj]–> NOUN (3298; 63%), PROPN –[det]–> DET (3000; 98%), NOUN –[acl]–> VERB (2999; 70%), VERB –[nsubj:pass]–> NOUN (1832; 81%), ADJ –[nsubj]–> NOUN (951; 98%), ADJ –[conj]–> ADJ (908; 97%), NOUN –[appos]–> NOUN (896; 62%), VERB –[nsubj:pass]–> PRON (683; 70%).