This page pertains to UD version 2.

Treebank Statistics: UD_French: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

176106 tokens (44%) have a non-empty value of Gender. 21036 types (49%) occur at least once with a non-empty value of Gender. 13915 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (74671; 19% of instances), DET (58542; 15% of instances), ADJ (22793; 6% of instances), VERB (11223; 3% of instances), PRON (7974; 2% of instances), AUX (890; 0% of instances), NUM (10; 0% of instances), PROPN (3; 0% of instances).
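
Counts like the ones above can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. The function names `parse_feats` and `gender_counts_by_upos` and the three sample lines are illustrative assumptions, and the sketch assumes that multiword-token ranges and empty nodes are not counted as tokens.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(lines):
    """Return (tokens with Gender, all tokens) as Counters keyed by UPOS tag."""
    with_gender, total = Counter(), Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # assumption: skip multiword ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Hand-written sample lines for illustration; not taken from the treebank.
sample = [
    "1\tla\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tmaison\tmaison\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_",
    "3\tbrûle\tbrûler\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_",
]
with_gender, total = gender_counts_by_upos(sample)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

To run this on real data, pass an open file handle of a `.conllu` file instead of `sample`.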

NOUN

74671 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56091; 75%).
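
The percentage in such a co-occurrence list is relative to the tokens of that tag that carry Gender (here 56091 of 74671 is about 75%). A minimal sketch of that computation follows, assuming tokens have already been read into hypothetical `(upos, feats)` pairs. The function name `cooccurring_values` and the sample data are invented for illustration, and the `EMPTY` pseudo-value that this page reports for absent features is not reproduced.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender"):
    """Rank the other feature=value pairs found on `upos` tokens that carry `feature`."""
    counts = Counter()
    n_with_feature = 0
    prefix = feature + "="
    for tag, feats in tokens:
        if tag != upos or feats == "_":
            continue
        pairs = feats.split("|")
        if not any(p.startswith(prefix) for p in pairs):
            continue  # token has no value for the feature of interest
        n_with_feature += 1
        counts.update(p for p in pairs if not p.startswith(prefix))
    # Percentages are relative to the tokens that carry the feature.
    return [(pair, n, round(100 * n / n_with_feature))
            for pair, n in counts.most_common()]


# Invented sample data, for illustration only.
tokens = [
    ("NOUN", "Gender=Fem|Number=Sing"),
    ("NOUN", "Gender=Masc|Number=Sing"),
    ("NOUN", "Gender=Fem|Number=Plur"),
    ("NOUN", "Number=Sing"),           # no Gender, so it is ignored
    ("DET", "Gender=Fem|Number=Sing"),  # different tag, so it is ignored
]
result = cooccurring_values(tokens, "NOUN")
print(result)
```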

NOUN tokens may have the following values of Gender:

Paradigm fois    Masc    Fem
Number=Sing              fois
Number=Plur      fois    fois

Gender seems to be a lexical feature of NOUN. 98% of lemmas (9229) occur with only one value of Gender.
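
The "lexical feature" test amounts to checking how many lemmas of a tag are ever seen with more than one Gender value. The sketch below shows one way to compute this, assuming tokens are available as hypothetical `(lemma, upos, feats)` triples. The function name `single_gender_share` and the sample data are invented for illustration and do not reflect the treebank's contents.

```python
from collections import defaultdict


def single_gender_share(tokens, upos):
    """Return (lemmas seen with exactly one Gender value, lemmas seen with any Gender)."""
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag != upos:
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "Gender" and value:
                values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Invented sample data, for illustration only.
tokens = [
    ("fois", "NOUN", "Gender=Fem|Number=Sing"),
    ("fois", "NOUN", "Gender=Masc|Number=Plur"),
    ("maison", "NOUN", "Gender=Fem|Number=Sing"),
    ("maison", "NOUN", "Gender=Fem|Number=Plur"),
    ("an", "NOUN", "_"),                      # no Gender, not counted
    ("le", "DET", "Gender=Masc|Number=Sing"),  # different tag, not counted
]
single, total = single_gender_share(tokens, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```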

DET

58542 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (50441; 86%), Number=Sing (45622; 78%), Definite=Def (40396; 69%).

DET tokens may have the following values of Gender:

Paradigm le                 Masc         Fem
Number=Sing                              la
Number=Sing|PronType=Art    le, l', l    la, l', l, Les, là
Number=Plur|PronType=Art    les          les, L

ADJ

22793 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16529; 73%).

ADJ tokens may have the following values of Gender:

Paradigm premier    Masc                        Fem
Number=Sing         premier, 1er, Ier, 1e, 1    première, 1re, 1ère
Number=Plur         premiers                    premières

VERB

11223 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11223; 100%), Person=EMPTY (11223; 100%), Tense=Past (11223; 100%), VerbForm=Part (11223; 100%), Number=Sing (9031; 80%).

VERB tokens may have the following values of Gender:

Paradigm faire    Masc          Fem
Number=Sing       fait, fais    faite
Number=Plur       faits         faites

PRON

7974 PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6893; 86%), Person=3 (6646; 83%), PronType=Prs (5805; 73%).

PRON tokens may have the following values of Gender:

Paradigm il             Masc                  Fem
Number=Sing|Person=2    -Tu
Number=Sing|Person=3    il, -il, t-il, Lui    -elle, elle
Number=Sing             Lui
Number=Plur|Person=3    ils, -ils             elles, -elles

AUX

890 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (890; 100%), Number=Sing (890; 100%), Person=EMPTY (890; 100%), Tense=Past (890; 100%), VerbForm=Part (890; 100%).

AUX tokens may have the following values of Gender:

Paradigm faire    Masc    Fem
                  fait    faite

NUM

10 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Gender seems to be a lexical feature of NUM. 100% of lemmas (10) occur with only one value of Gender.

PROPN

3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (53959; 99%), NOUN –[amod]–> ADJ (18532; 99%), NOUN –[conj]–> NOUN (3391; 63%), NOUN –[acl]–> VERB (2959; 70%), VERB –[nsubj:pass]–> NOUN (1539; 96%), ADJ –[conj]–> ADJ (905; 97%), ADJ –[nsubj]–> NOUN (903; 97%), NOUN –[appos]–> NOUN (888; 58%), NOUN –[nsubj]–> NOUN (587; 61%), ADJ –[obl]–> NOUN (572; 52%).
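
Agreement counts of this kind can be sketched by comparing each node's Gender with that of its parent and grouping by (parent tag, relation, child tag). The code below is an illustration only. The names `gender_of`, `agreement_by_relation` and `row`, and the sample sentences (one with a deliberate mismatch), are invented. It also assumes that the percentage is taken over parent-child pairs in which both nodes carry Gender, which this page does not state explicitly.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_by_relation(conllu_text):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, pairs with Gender on both)."""
    agree, both = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue  # skip comments, multiword ranges and empty nodes
            nodes[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
        for upos, gender, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (head "0")
            if parent is None or gender is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if gender == parent[1]:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


def row(idx, form, lemma, upos, feats, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for the sample)."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"])


# Invented sample; the second sentence contains a deliberate Gender mismatch.
sample = "\n".join([
    row(1, "la", "le", "DET", "Gender=Fem|Number=Sing", 2, "det"),
    row(2, "maison", "maison", "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    row(3, "blanche", "blanc", "ADJ", "Gender=Fem|Number=Sing", 2, "amod"),
    "",
    row(1, "le", "le", "DET", "Gender=Masc|Number=Sing", 2, "det"),
    row(2, "maison", "maison", "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
])
result = agreement_by_relation(sample)
for (parent, deprel, child), (n_agree, n_both) in sorted(result.items()):
    print(f"{parent} -[{deprel}]-> {child}: {n_agree} of {n_both} agree")
```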