
This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

27934 tokens (40%) have a non-empty value of Gender. 6152 types (65%) occur at least once with a non-empty value of Gender. 4403 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags (token count; share of all tokens): NOUN (14522; 21%), DET (5864; 8%), ADJ (2999; 4%), VERB (2195; 3%), PROPN (1433; 2%), PRON (910; 1%), AUX (10; 0%), NUM (1; 0%).
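Per-tag counts of this kind can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the three-token `SAMPLE` fragment and the helper names `parse_feats`, `read_tokens` and `gender_counts_by_upos` are hypothetical, and it assumes that "tokens" means syntactic words, so multiword-token ranges and empty nodes are skipped.

```python
from collections import Counter

# Hypothetical three-token CoNLL-U fragment (not taken from the treebank).
SAMPLE = """\
1\tla\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tpatiente\tpatient\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tdort\tdormir\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield the ten CoNLL-U columns of each syntactic word."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols


def gender_counts_by_upos(text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for cols in read_tokens(text):
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections below; dividing by the sum of `total` gives the share of all tokens.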

NOUN

14522 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (10300; 71%).

NOUN tokens may have the following values of Gender:

Paradigm patient: Masc / Fem
Number=Sing: patient / patiente
Number=Sing|Typo=Yes: patient
Number=Plur: patients / patientes

Gender seems to be a lexical feature of NOUN: 99% of lemmas (2731) occur with only one value of Gender.
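The lexicality check above amounts to counting the lemmas that are attested with exactly one value of the feature. A minimal sketch, assuming the (lemma, Gender) pairs have already been extracted; the `observations` list and the function name `single_value_share` are hypothetical:

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for NOUN tokens.
observations = [
    ("patient", "Masc"),
    ("patient", "Fem"),
    ("maladie", "Fem"),
    ("maladie", "Fem"),
    ("traitement", "Masc"),
]


def single_value_share(observations):
    """Return (lemmas with one value, all lemmas, share of the former)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    share = single / len(values) if values else 0.0
    return single, len(values), share


single, n_lemmas, share = single_value_share(observations)
print(f"{single} of {n_lemmas} lemmas ({share:.0%}) occur with only one value of Gender")
```

A share close to 100% suggests that the value is a property of the lemma rather than of the inflected form.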

DET

5864 DET tokens (56% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5820; 99%), PronType=Art (5358; 91%), Definite=Def (4064; 69%).

DET tokens may have the following values of Gender:

Paradigm le: Masc / Fem
Definite=Def|ExtPos=ADV|PronType=Art: le
Definite=Def|ExtPos=PRON|PronType=Art: le
Definite=Def|PronType=Art: le, l' / la, l'
Definite=Def|PronType=Art|Typo=Yes: le
(no features listed): Le

ADJ

2999 ADJ tokens (68% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1963; 65%).

ADJ tokens may have the following values of Gender:

Paradigm tout: Masc / Fem
Number=Sing: tout / toute
Number=Plur: tous / toutes

VERB

2195 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1536; 70%), Voice=Pass (1476; 67%).
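Co-occurrence lists like the one above can be approximated by tallying, for every token that carries Gender, the value of each other feature. The sketch below is an illustration only: the `verb_tokens` data and the function name `cooccurring_values` are hypothetical, and it assumes that `EMPTY` marks a feature that is attested for the part of speech but absent on the token in question.

```python
from collections import Counter

# Hypothetical FEATS dictionaries for a handful of VERB tokens.
verb_tokens = [
    {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"},
    {"Gender": "Fem", "Number": "Plur", "Tense": "Past", "VerbForm": "Part", "Voice": "Pass"},
    {"Mood": "Ind", "Number": "Sing", "Person": "3", "Tense": "Pres", "VerbForm": "Fin"},
]


def cooccurring_values(tokens, feature="Gender"):
    """Count other feature=value pairs on tokens that carry `feature`."""
    # Every other feature name seen on any token of this part of speech.
    known = {name for feats in tokens for name in feats} - {feature}
    selected = [feats for feats in tokens if feature in feats]
    counts = Counter()
    for feats in selected:
        for name in known:
            # A feature missing on this token is recorded as EMPTY (assumption).
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, len(selected)


counts, n_selected = cooccurring_values(verb_tokens)
for pair, count in counts.most_common():
    print(f"{pair} ({count}; {count / n_selected:.0%})")
```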

VERB tokens may have the following values of Gender:

Paradigm devoir: Masc / Fem
Number=Sing: dû, du
Number=Plur|Voice=Pass: dues

PROPN

1433 PROPN tokens (44% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1399; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Jean: Masc / Fem
(no features listed): Jean / Jean

Gender seems to be a lexical feature of PROPN: 100% of lemmas (436) occur with only one value of Gender.

PRON

910 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%).

PRON tokens may have the following values of Gender:

Paradigm lui: Masc / Fem
ExtPos=ADP: il
ExtPos=ADV: il
(no features listed): il, le, -il, lui, -t-il / elle, la, -elle, -t-elle

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

NUM

1 NUM token (0% of all NUM tokens) has a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (5309; 57%), NOUN –[amod]–> ADJ (2456; 67%), NOUN –[acl]–> VERB (629; 63%), NOUN –[conj]–> NOUN (569; 55%), VERB –[nsubj:pass]–> NOUN (351; 86%), PROPN –[det]–> DET (228; 58%), NOUN –[appos]–> NOUN (132; 55%), VERB –[conj]–> VERB (104; 52%), ADJ –[nsubj]–> NOUN (81; 62%), PROPN –[conj]–> PROPN (79; 56%).
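Agreement counts of this kind can be sketched by comparing the Gender value of each node with that of its head. The example below is a rough illustration under stated assumptions: the `SAMPLE` sentence and the names `gender_of` and `agreement_counts` are hypothetical, the function handles a single sentence, and the denominator is taken to be all instances of the relation, which may not match how the percentages above were computed.

```python
from collections import Counter

# Hypothetical one-sentence CoNLL-U fragment (not taken from the treebank).
SAMPLE = """\
1\tla\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tpatiente\tpatient\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj:pass\t_\t_
3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\taux:pass\t_\t_
4\tguérie\tguérir\tVERB\t_\tGender=Fem|Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass\t0\troot\t_\t_
"""


def gender_of(feats):
    """Return the Gender value in a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(sentence):
    """Count (head UPOS, deprel, child UPOS) triples and those agreeing in Gender."""
    rows = [line.split("\t") for line in sentence.splitlines()
            if line and not line.startswith("#")]
    # Keep syntactic words only; range and empty-node IDs are not plain integers.
    words = {row[0]: row for row in rows if row[0].isdigit()}
    agree, total = Counter(), Counter()
    for row in words.values():
        head = words.get(row[6])
        if head is None:  # the root has head 0, which is not a word
            continue
        key = (head[3], row[7], row[3])
        total[key] += 1
        child_gender = gender_of(row[5])
        if child_gender is not None and child_gender == gender_of(head[5]):
            agree[key] += 1
    return agree, total


agree, total = agreement_counts(SAMPLE)
for key, count in total.items():
    print(key, agree[key], count)
```

For a whole treebank file, the text would first be split into sentences on blank lines and the two counters summed across sentences.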