home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

160900 tokens (40%) have a non-empty value of Gender. 23063 types (54%) occur at least once with a non-empty value of Gender. 15980 lemmas (49%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (74600; 19% instances), DET (37526; 9% instances), ADJ (23667; 6% instances), VERB (11170; 3% instances), PRON (9554; 2% instances), PROPN (3318; 1% instances), AUX (904; 0% instances), X (86; 0% instances), NUM (61; 0% instances), SYM (14; 0% instances).

NOUN

74600 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56170; 75%).

NOUN tokens may have the following values of Gender:

Paradigm partieMascFem
Number=Singpartie
Number=Sing|Typo=Yespartipartie
Number=Plurparties

Gender seems to be lexical feature of NOUN. 99% lemmas (9335) occur only with one value of Gender.

DET

37526 DET tokens (61% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (37281; 99%), PronType=Art (34233; 91%), Definite=Def (26422; 70%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
ExtPos=ADVlela
ExtPos=NOUNle
ExtPos=PRONle
le, L'la
Typo=Yeslela, là

ADJ

23667 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17017; 72%).

ADJ tokens may have the following values of Gender:

Paradigm premierMascFem
Number=Singpremierpremière
Number=Sing|Typo=Yespremier
Number=Plurpremierspremières

VERB

11170 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11170; 100%), Person=EMPTY (11170; 100%), VerbForm=Part (11170; 100%), Tense=Past (11168; 100%), Number=Sing (8853; 79%), Voice=Pass (7701; 69%).

VERB tokens may have the following values of Gender:

Paradigm faireMascFem
ExtPos=ADJ|Number=Sing|Voice=Passfaite
Number=Sing|Typo=Yes|Voice=Actfais
Number=Sing|Typo=Yes|Voice=Passfait
Number=Sing|Voice=Actfaitfaite
Number=Sing|Voice=Passfaitfaite
Number=Plur|Voice=Actfaits
Number=Plur|Voice=Passfaitsfaites

PRON

9554 PRON tokens (53% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9453; 99%), Person=3 (9215; 96%), Number=Sing (8167; 85%), Emph=No (6598; 69%), PronType=Prs (6347; 66%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Emph=No|ExtPos=ADPil
Emph=Noil, le, lui, -t-il, -ilelle, la, -elle, -t-elle
Emph=No|Typo=Yesil, t-il, -il, -le, t'ilelle
Emph=Yesluielle

PROPN

3318 PROPN tokens (12% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3263; 98%).

PROPN tokens may have the following values of Gender:

Paradigm AfriqueMascFem
AfriqueAfrique

Gender seems to be lexical feature of PROPN. 99% lemmas (1933) occur only with one value of Gender.

AUX

904 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (904; 100%), Number=Sing (904; 100%), Person=EMPTY (904; 100%), Tense=Past (904; 100%), VerbForm=Part (904; 100%).

AUX tokens may have the following values of Gender:

Paradigm faireMascFem
faitfaite

X

86 X tokens (3% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (86; 100%), ExtPos=PROPN (44; 51%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (84) occur only with one value of Gender.

NUM

61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (61; 100%).

NUM tokens may have the following values of Gender:

SYM

14 SYM tokens (2% of all SYM tokens) have a non-empty value of Gender.

The most frequent other feature values with which SYM and Gender co-occurred: Number=Sing (13; 93%), ExtPos=NOUN (12; 86%).

SYM tokens may have the following values of Gender:

Paradigm MascFem

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (32266; 61%), NOUN –[amod]–> ADJ (19268; 99%), NOUN –[conj]–> NOUN (3266; 63%), PROPN –[det]–> DET (3048; 96%), NOUN –[acl]–> VERB (2997; 70%), VERB –[nsubj:pass]–> NOUN (1829; 81%), ADJ –[nsubj]–> NOUN (948; 98%), ADJ –[conj]–> ADJ (897; 96%), NOUN –[appos]–> NOUN (866; 63%), VERB –[nsubj:pass]–> PRON (686; 70%).