home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FQB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

9828 tokens (41%) have a non-empty value of Gender. 2531 types (60%) occur at least once with a non-empty value of Gender. 2166 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (3674; 15% instances), DET (2801; 12% instances), ADJ (1238; 5% instances), PROPN (932; 4% instances), VERB (768; 3% instances), PRON (411; 2% instances), ADP (3; 0% instances), AUX (1; 0% instances).

NOUN

3674 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3036; 83%).

NOUN tokens may have the following values of Gender:

Paradigm présidentMascFem
présidentprésidente

Gender seems to be lexical feature of NOUN. 99% lemmas (1235) occur only with one value of Gender.

DET

2801 DET tokens (73% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2784; 99%), PronType=Art (2234; 80%), Definite=Def (1919; 69%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
le, lesla

ADJ

1238 ADJ tokens (82% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1078; 87%).

ADJ tokens may have the following values of Gender:

Paradigm quelMascFem
Number=Singquelquelle
Number=PlurQuelsquelles

PROPN

932 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (920; 99%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (478) occur only with one value of Gender.

VERB

768 VERB tokens (41% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (768; 100%), Person=EMPTY (768; 100%), Tense=Past (768; 100%), VerbForm=Part (768; 100%), Number=Sing (690; 90%), Voice=EMPTY (522; 68%).

VERB tokens may have the following values of Gender:

Paradigm nommerMascFem
nomménommée
Voice=Passnommé

PRON

411 PRON tokens (25% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=EMPTY (388; 94%), Person=3 (383; 93%), Number=Sing (372; 91%).

PRON tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Sing-t-il, -il, il-t-elle, -elle, elle
Number=Plur-ils-elles, elles

ADP

3 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

1 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2209; 71%), NOUN –[amod]–> ADJ (631; 71%), ADJ –[nsubj]–> NOUN (499; 95%), VERB –[expl:subj]–> PRON (203; 64%), VERB –[nsubj:pass]–> NOUN (140; 95%), NOUN –[acl]–> VERB (80; 60%), ADJ –[det]–> DET (41; 77%), NOUN –[conj]–> NOUN (21; 54%), NOUN –[nsubj]–> PROPN (15; 56%), NOUN –[flat:name]–> PROPN (7; 58%).