
This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

13528 tokens (19%) have a non-empty value of Gender. 2618 types (28%) occur at least once with a non-empty value of Gender. 1748 lemmas (26%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: DET (5946; 8% of instances), ADJ (3027; 4% of instances), VERB (2206; 3% of instances), PROPN (1429; 2% of instances), PRON (910; 1% of instances), AUX (10; 0% of instances).
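Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is hand-made for illustration, and the helper names `parse_feats`, `read_tokens` and `feature_usage` are hypothetical.

```python
from collections import Counter

# Hand-made CoNLL-U sample for illustration only (not taken from the treebank).
SAMPLE = """\
# text = La petite maison tombe
1\tLa\tle\tDET\t_\tGender=Fem|Number=Sing\t3\tdet\t_\t_
2\tpetite\tpetit\tADJ\t_\tGender=Fem|Number=Sing\t3\tamod\t_\t_
3\tmaison\tmaison\tNOUN\t_\tNumber=Sing\t4\tnsubj\t_\t_
4\ttombe\ttomber\tVERB\t_\tNumber=Sing|Person=3\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_usage(text, feature="Gender"):
    """Return (total token count, Counter of tokens carrying `feature` per UPOS)."""
    total = 0
    per_upos = Counter()
    for _form, _lemma, upos, feats in read_tokens(text):
        total += 1
        if feature in feats:
            per_upos[upos] += 1
    return total, per_upos


total, per_upos = feature_usage(SAMPLE)
for upos, count in per_upos.most_common():
    print(f"{upos} ({count}; {round(100 * count / total)}% of instances)")
```

The percentage printed here is relative to all tokens in the sample, which is an assumption about how the per-tag percentages above are computed.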

DET

5946 DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5898; 99%), PronType=Art (5365; 90%), Definite=Def (4063; 68%).
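A co-occurrence list like the one above could be computed as follows. This is a sketch under stated assumptions: the token list is invented for illustration, `cooccurring_values` is a hypothetical helper, and percentages are taken relative to the tokens of the given tag that carry Gender.

```python
from collections import Counter

# Hypothetical (UPOS, features) pairs; not real treebank data.
TOKENS = [
    ("DET", {"Gender": "Fem", "Number": "Sing", "PronType": "Art", "Definite": "Def"}),
    ("DET", {"Gender": "Masc", "Number": "Sing", "PronType": "Art", "Definite": "Ind"}),
    ("DET", {"Number": "Plur", "PronType": "Art", "Definite": "Def"}),
    ("ADJ", {"Gender": "Fem", "Number": "Sing"}),
]


def cooccurring_values(tokens, upos, feature="Gender"):
    """Count other feature=value pairs on tokens of `upos` that carry `feature`."""
    n = 0
    counts = Counter()
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return n, counts


n, counts = cooccurring_values(TOKENS, "DET")
for pair, c in counts.most_common():
    print(f"{pair} ({c}; {round(100 * c / n)}%)")
```

This sketch only counts features that are present; entries such as `Mood=EMPTY` elsewhere on this page record the absence of a feature and would need an extra pass over the feature inventory.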

DET tokens may have the following values of Gender:

Paradigm le    Masc    Fem
ExtPos=ADV     le      (none)
ExtPos=PRON    le      (none)
(none)         le      la
Typo=Yes       le      (none)

ADJ

3027 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2011; 66%).

ADJ tokens may have the following values of Gender:

Paradigm tout  Masc    Fem
Number=Sing    tout    toute
Number=Plur    tous    toutes

VERB

2206 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2206; 100%), Person=EMPTY (2206; 100%), Tense=EMPTY (2206; 100%), VerbForm=Part (2206; 100%), Number=Sing (1536; 70%), Voice=Pass (1513; 69%).

VERB tokens may have the following values of Gender:

Paradigm devoir         Masc      Fem
Number=Sing|Voice=Act   dû, du    (none)
Number=Plur|Voice=Pass  (none)    dues

PROPN

1429 PROPN tokens (44% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1396; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Jean  Masc    Fem
(none)         Jean    Jean

Gender seems to be a lexical feature of PROPN. 100% of lemmas (434) occur with only one value of Gender.
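The "lexical feature" test can be pictured as checking, per lemma, how many distinct Gender values are attested. A minimal sketch, assuming invented observations and a hypothetical helper `single_value_share`:

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, Gender) observations; not real treebank data.
OBSERVATIONS = [
    ("Jean", "PROPN", "Masc"),
    ("Jean", "PROPN", "Masc"),
    ("Marie", "PROPN", "Fem"),
    ("France", "PROPN", "Fem"),
    ("tout", "ADJ", "Masc"),
    ("tout", "ADJ", "Fem"),
]


def single_value_share(observations, upos):
    """Return (lemma count, share of lemmas attested with exactly one Gender value)."""
    values = defaultdict(set)
    for lemma, tag, gender in observations:
        if tag == upos:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)


lemmas, share = single_value_share(OBSERVATIONS, "PROPN")
print(f"{round(100 * share)}% of lemmas ({lemmas}) occur with only one value of Gender")
```

A share close to 100% suggests the value is fixed per lemma (lexical); a low share, as for the invented ADJ lemma here, suggests inflection.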

PRON

910 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%), Emph=No (626; 69%), Case=Nom (555; 61%).

PRON tokens may have the following values of Gender:

Paradigm lui                                Masc                  Fem
Case=Acc|Emph=No|Number=Sing                (none)                la
Case=Nom|Emph=No|ExtPos=ADP|Number=Sing     il                    (none)
Case=Nom|Emph=No|ExtPos=ADV|Number=Sing     il                    (none)
Case=Nom|Emph=No|Number=Sing                il, -il               elle, -elle
Case=Nom|Emph=No|Number=Plur                ils                   elles
Case=Nom|Emph=No|Number=Plur|Typo=Yes       (none)                elles
Emph=No|Number=Sing                         -il, -t-il, le, lui   -t-elle, la
Emph=Yes|Number=Sing                        lui                   elle

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=EMPTY (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[det]–> DET (225; 57%), VERB –[conj]–> VERB (104; 52%), PROPN –[conj]–> PROPN (79; 57%), VERB –[nsubj:pass]–> PRON (58; 52%), ADJ –[det]–> DET (31; 56%), PRON –[amod]–> ADJ (9; 69%), ADJ –[amod]–> ADJ (5; 71%), ADJ –[obl:mod]–> ADJ (4; 57%), PRON –[nmod]–> PRON (4; 100%), ADJ –[parataxis]–> VERB (2; 67%).
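Agreement counts of this shape can be sketched by comparing the Gender value of each node with that of its parent. The sentence below is invented, `agreement_counts` is a hypothetical helper, and the denominator (relations where both nodes carry Gender) is an assumption, since this page does not state how its percentages are defined.

```python
from collections import Counter

# Hypothetical sentence: (id, UPOS, Gender or None, head id, deprel).
SENTENCE = [
    (1, "DET", "Fem", 2, "det"),
    (2, "PROPN", "Fem", 3, "nsubj:pass"),
    (3, "VERB", "Fem", 0, "root"),
    (4, "VERB", "Masc", 3, "conj"),
]


def agreement_counts(sentence):
    """Count parent-child relations where both nodes have Gender, and where it matches."""
    by_id = {tok[0]: tok for tok in sentence}
    both = Counter()
    agree = Counter()
    for _tok_id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head id 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return both, agree


both, agree = agreement_counts(SENTENCE)
for (parent_upos, deprel, child_upos), n in both.items():
    k = agree[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({k}; {round(100 * k / n)}%)")
```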