home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FQB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

9863 tokens (41%) have a non-empty value of Gender. 2537 types (60%) occur at least once with a non-empty value of Gender. 2172 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (3686; 15% instances), DET (2815; 12% instances), ADJ (1242; 5% instances), PROPN (932; 4% instances), VERB (769; 3% instances), PRON (415; 2% instances), ADP (3; 0% instances), AUX (1; 0% instances).

NOUN

3686 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3048; 83%).

NOUN tokens may have the following values of Gender:

Paradigm présidentMascFem
présidentprésidente

Gender seems to be lexical feature of NOUN. 99% lemmas (1240) occur only with one value of Gender.

DET

2815 DET tokens (73% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2798; 99%), PronType=Art (2249; 80%), Definite=Def (1932; 69%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
ExtPos=PRONle
le, lesla

ADJ

1242 ADJ tokens (82% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1082; 87%).

ADJ tokens may have the following values of Gender:

Paradigm quelMascFem
Number=Singquelquelle
Number=PlurQuelsquelles

PROPN

932 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (920; 99%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (478) occur only with one value of Gender.

VERB

769 VERB tokens (41% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (769; 100%), Person=EMPTY (769; 100%), Tense=Past (769; 100%), VerbForm=Part (769; 100%), Number=Sing (691; 90%), Voice=EMPTY (523; 68%).

VERB tokens may have the following values of Gender:

Paradigm nommerMascFem
nomménommée
Voice=Passnommé

PRON

415 PRON tokens (25% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (387; 93%), PronType=Prs (383; 92%), Number=Sing (376; 91%).

PRON tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Sing-t-il, -il, il-t-elle, -elle, elle
Number=Plur-ils-elles, elles

ADP

3 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

1 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2214; 71%), NOUN –[amod]–> ADJ (634; 71%), ADJ –[nsubj]–> NOUN (499; 95%), VERB –[expl:subj]–> PRON (203; 64%), VERB –[nsubj:pass]–> NOUN (140; 95%), NOUN –[acl]–> VERB (80; 60%), ADJ –[det]–> DET (41; 77%), NOUN –[conj]–> NOUN (21; 54%), NOUN –[nsubj]–> PROPN (15; 56%), NOUN –[flat:name]–> PROPN (7; 58%).