home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

161000 tokens (40%) have a non-empty value of Gender. 23083 types (54%) occur at least once with a non-empty value of Gender. 15909 lemmas (48%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (74822; 19% instances), DET (37490; 9% instances), ADJ (23687; 6% instances), VERB (11169; 3% instances), PRON (9552; 2% instances), PROPN (3216; 1% instances), AUX (904; 0% instances), X (85; 0% instances), NUM (61; 0% instances), SYM (14; 0% instances).

NOUN

74822 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56333; 75%).

NOUN tokens may have the following values of Gender:

Paradigm partieMascFem
Number=Singpartie
Number=Sing|Typo=Yespartipartie
Number=Plurparties

Gender seems to be lexical feature of NOUN. 99% lemmas (9339) occur only with one value of Gender.

DET

37490 DET tokens (61% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (37245; 99%), PronType=Art (34196; 91%), Definite=Def (26385; 70%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
ExtPos=ADVlela
ExtPos=NOUNle
ExtPos=PRONle
le, L'la
Typo=Yeslela, là

ADJ

23687 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17035; 72%).

ADJ tokens may have the following values of Gender:

Paradigm premierMascFem
Number=Singpremierpremière
Number=Sing|Typo=Yespremier
Number=Plurpremierspremières

VERB

11169 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11169; 100%), Person=EMPTY (11169; 100%), VerbForm=Part (11169; 100%), Tense=Past (11168; 100%), Number=Sing (8853; 79%), Voice=Pass (7701; 69%).

VERB tokens may have the following values of Gender:

Paradigm faireMascFem
ExtPos=ADJ|Number=Sing|Voice=Passfaite
Number=Sing|Typo=Yes|Voice=Actfais
Number=Sing|Typo=Yes|Voice=Passfait
Number=Sing|Voice=Actfaitfaite
Number=Sing|Voice=Passfaitfaite
Number=Plur|Voice=Actfaits
Number=Plur|Voice=Passfaitsfaites

PRON

9552 PRON tokens (53% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9451; 99%), Person=3 (9213; 96%), Number=Sing (8165; 85%), Emph=No (6596; 69%), PronType=Prs (6345; 66%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Emph=No|ExtPos=ADPil
Emph=Noil, le, lui, -t-il, -ilelle, la, -elle, -t-elle
Emph=No|Typo=Yesil, t-il, -il, -le, t'ilelle
Emph=Yesluielle

PROPN

3216 PROPN tokens (12% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3166; 98%).

PROPN tokens may have the following values of Gender:

Paradigm AfriqueMascFem
AfriqueAfrique

Gender seems to be lexical feature of PROPN. 99% lemmas (1867) occur only with one value of Gender.

AUX

904 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (904; 100%), Number=Sing (904; 100%), Person=EMPTY (904; 100%), Tense=Past (904; 100%), VerbForm=Part (904; 100%).

AUX tokens may have the following values of Gender:

Paradigm faireMascFem
faitfaite

X

85 X tokens (3% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (85; 100%), ExtPos=PROPN (43; 51%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (83) occur only with one value of Gender.

NUM

61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (61; 100%).

NUM tokens may have the following values of Gender:

SYM

14 SYM tokens (2% of all SYM tokens) have a non-empty value of Gender.

The most frequent other feature values with which SYM and Gender co-occurred: Number=Sing (13; 93%), ExtPos=NOUN (12; 86%).

SYM tokens may have the following values of Gender:

Paradigm MascFem

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (32358; 61%), NOUN –[amod]–> ADJ (19287; 99%), NOUN –[conj]–> NOUN (3285; 63%), NOUN –[acl]–> VERB (3000; 71%), PROPN –[det]–> DET (2953; 96%), VERB –[nsubj:pass]–> NOUN (1834; 81%), ADJ –[nsubj]–> NOUN (948; 98%), ADJ –[conj]–> ADJ (897; 96%), NOUN –[appos]–> NOUN (879; 62%), VERB –[nsubj:pass]–> PRON (686; 70%).