This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

176095 tokens (44%) have a non-empty value of Gender. 21236 types (50%) occur at least once with a non-empty value of Gender. 14127 lemmas (43%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (74582; 19% of instances), DET (58397; 15% of instances), ADJ (23109; 6% of instances), VERB (11176; 3% of instances), PRON (7920; 2% of instances), AUX (895; 0% of instances), NUM (10; 0% of instances), PROPN (4; 0% of instances), X (2; 0% of instances).
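Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming tab-separated CoNLL-U input in which column 4 is UPOS and column 6 is FEATS; the sample sentence and the names `SAMPLE`, `parse_feats`, and `gender_counts_by_upos` are illustrative and not part of the treebank or its tooling. Whether the page counts exactly the same set of tokens (for example, how multiword tokens are handled) is an assumption.

```python
from collections import Counter

# Tiny hand-written CoNLL-U fragment (illustrative, not taken from UD_French-GSD).
SAMPLE = """\
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tmaison\tmaison\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_
4\tgrande\tgrand\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip multiword ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos, n in with_gender.most_common():
    print(f"{upos}: {n} of {total[upos]} tokens have Gender")
```

Run over a whole treebank file, the first counter would correspond to the per-tag figures listed above, and its sum to the overall token count.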

NOUN

74582 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56049; 75%).
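A co-occurrence count like this one could be computed as sketched below. The FEATS strings in `noun_feats` are made up for illustration, and `cooccurring_values` is a hypothetical helper; it assumes the percentage is taken over tokens that carry the feature in question.

```python
from collections import Counter

# FEATS strings of some NOUN tokens (hypothetical data for illustration).
noun_feats = [
    "Gender=Fem|Number=Sing",
    "Gender=Masc|Number=Sing",
    "Gender=Fem|Number=Plur",
    "Number=Sing",  # no Gender, so it is ignored below
]

def cooccurring_values(feats_list, feature="Gender"):
    """Count other feature=value pairs on tokens that carry `feature`."""
    counts, n = Counter(), 0
    for feats in feats_list:
        pairs = [] if feats == "_" else feats.split("|")
        names = {p.split("=", 1)[0] for p in pairs}
        if feature not in names:
            continue
        n += 1
        counts.update(p for p in pairs if not p.startswith(feature + "="))
    return counts, n

counts, n = cooccurring_values(noun_feats)
for pair, c in counts.most_common():
    print(f"{pair} ({c}; {100 * c // n}%)")
```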

NOUN tokens may have the following values of Gender:

Paradigm partie (Masc, Fem)
Number=Sing  Masc: parti; Fem: partie
Number=Plur  parties

Gender seems to be a lexical feature of NOUN. 98% of lemmas (9281) occur with only one value of Gender.
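The "lexical feature" observation rests on how many lemmas are ever seen with more than one Gender value. A minimal sketch of that check, assuming the input is a list of (lemma, Gender) pairs already extracted from the treebank; the data and the name `single_gender_share` are hypothetical:

```python
from collections import defaultdict

# (lemma, Gender) pairs for NOUN tokens; hypothetical data for illustration.
observations = [
    ("partie", "Fem"), ("partie", "Fem"), ("partie", "Masc"),
    ("maison", "Fem"), ("livre", "Masc"), ("livre", "Masc"),
]

def single_gender_share(pairs):
    """Return (lemmas seen with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, n_lemmas = single_gender_share(observations)
print(f"{single} of {n_lemmas} lemmas occur with only one value of Gender")
```

A share close to 100% suggests the value is a property of the lemma rather than something that varies by inflection.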

DET

58397 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (50330; 86%), Number=Sing (45524; 78%), Definite=Def (40338; 69%).

DET tokens may have the following values of Gender:

Paradigm le (Masc, Fem)
Definite=Def|Number=Sing|PronType=Art  Masc: le, l', l; Fem: la, l', l, Les, le, là
Definite=Def|Number=Plur|PronType=Art  Masc: les, Le; Fem: les, L'
Number=Sing  le

ADJ

23109 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16841; 73%).

ADJ tokens may have the following values of Gender:

Paradigm premier (Masc, Fem)
Number=Sing  Masc: premier, Ier; Fem: première, 1re, 1ère
Number=Plur  Masc: premiers; Fem: premières
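A paradigm like the one above groups the word forms of a single lemma by their remaining features (rows) and by Gender value (columns). The sketch below shows one way to build such a grouping; the token list reuses forms from this paradigm, but the tuples themselves and the function name `paradigm` are illustrative assumptions, not the page generator's actual code.

```python
from collections import defaultdict

# (form, lemma, FEATS) triples; hypothetical tokens for illustration.
tokens = [
    ("premier", "premier", "Gender=Masc|Number=Sing"),
    ("première", "premier", "Gender=Fem|Number=Sing"),
    ("premiers", "premier", "Gender=Masc|Number=Plur"),
    ("premières", "premier", "Gender=Fem|Number=Plur"),
    ("1re", "premier", "Gender=Fem|Number=Sing"),
]

def paradigm(tokens, lemma, feature="Gender"):
    """Map remaining-features row -> feature value -> list of distinct forms."""
    table = defaultdict(lambda: defaultdict(list))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma:
            continue
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        value = pairs.pop(feature, None)
        if value is None:
            continue  # token has no value for the feature
        row = "|".join(f"{k}={v}" for k, v in sorted(pairs.items()))
        if form not in table[row][value]:
            table[row][value].append(form)
    return table

table = paradigm(tokens, "premier")
for row, cells in table.items():
    print(row, "Masc:", ", ".join(cells["Masc"]), "Fem:", ", ".join(cells["Fem"]))
```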

VERB

11176 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11176; 100%), Person=EMPTY (11176; 100%), Tense=Past (11176; 100%), VerbForm=Part (11175; 100%), Number=Sing (8986; 80%).

VERB tokens may have the following values of Gender:

Paradigm faire (Masc, Fem)
Number=Sing  Masc: fait, fais; Fem: faite
Number=Plur  Masc: faits; Fem: faites

PRON

7920 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6844; 86%), Person=3 (6599; 83%), PronType=Prs (5762; 73%).

PRON tokens may have the following values of Gender:

Paradigm il (Masc, Fem)
Number=Sing|Person=2  -Tu
Number=Sing|Person=3  Masc: il, -il, Lui, t-il; Fem: -elle, elle
Number=Sing  Lui
Number=Plur|Person=3  Masc: ils, -ils; Fem: elles, -elles

AUX

895 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (895; 100%), Number=Sing (895; 100%), Person=EMPTY (895; 100%), Tense=Past (895; 100%), VerbForm=Part (895; 100%).

AUX tokens may have the following values of Gender:

Paradigm faire (Masc, Fem)
Masc: fait; Fem: faite

NUM

10 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Gender seems to be a lexical feature of NUM. 100% of lemmas (10) occur with only one value of Gender.

PROPN

4 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

X

2 X tokens (0% of all X tokens) have a non-empty value of Gender.

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child nodes agree in Gender: NOUN –[det]–> DET (52784; 99%), NOUN –[amod]–> ADJ (18725; 99%), NOUN –[conj]–> NOUN (3361; 63%), NOUN –[acl]–> VERB (3025; 68%), VERB –[nsubj:pass]–> NOUN (1521; 96%), ADJ –[conj]–> ADJ (914; 97%), ADJ –[nsubj]–> NOUN (907; 97%), NOUN –[appos]–> NOUN (886; 58%), VERB –[conj]–> VERB (635; 50%), NOUN –[nsubj]–> NOUN (595; 60%).
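Agreement figures of this kind can be sketched as follows. This is an illustration under stated assumptions: a relation instance is counted only when both the parent and the child have a Gender value, and the percentage is the share of those instances in which the two values match. The page does not state its denominator, so that choice, the sample sentence, and the name `gender_agreement` are all hypothetical.

```python
from collections import Counter

# Each token: (id, upos, feats dict, head id, deprel). Hypothetical sentence.
sentence = [
    (1, "DET", {"Gender": "Fem"}, 2, "det"),
    (2, "NOUN", {"Gender": "Fem"}, 0, "root"),
    (3, "ADJ", {"Gender": "Fem"}, 2, "amod"),
    (4, "NOUN", {"Gender": "Masc"}, 2, "conj"),
]

def gender_agreement(tokens):
    """Count (parent UPOS, deprel, child UPOS) triples: agreeing and all."""
    by_id = {tok[0]: tok for tok in tokens}
    agree, both = Counter(), Counter()
    for _, upos, feats, head, deprel in tokens:
        parent = by_id.get(head)  # None for the root (head id 0)
        if parent is None or "Gender" not in feats or "Gender" not in parent[2]:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if feats["Gender"] == parent[2]["Gender"]:
            agree[key] += 1
    return agree, both

agree, both = gender_agreement(sentence)
for (parent_upos, deprel, child_upos), n in both.items():
    hits = agree[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {hits} of {n} agree")
```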