home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

27497 tokens (39%) have a non-empty value of Gender. 6025 types (64%) occur at least once with a non-empty value of Gender. 4319 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (14331; 20% instances), DET (5927; 8% instances), ADJ (2781; 4% instances), VERB (2195; 3% instances), PROPN (1438; 2% instances), PRON (813; 1% instances), AUX (10; 0% instances), NUM (2; 0% instances).

NOUN

14331 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (10124; 71%).

NOUN tokens may have the following values of Gender:

Paradigm patientMascFem
Number=Singpatientpatiente
Number=Plurpatientspatientes

Gender seems to be lexical feature of NOUN. 99% lemmas (2734) occur only with one value of Gender.

DET

5927 DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5883; 99%), PronType=Art (5421; 91%), Definite=Def (4127; 70%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|PronType=Artle, les, l'la, l'
Le

ADJ

2781 ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1810; 65%).

ADJ tokens may have the following values of Gender:

Paradigm toutMascFem
Number=Singtouttoute
Number=Plurtoustoutes

VERB

2195 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1537; 70%), Voice=EMPTY (1450; 66%).

VERB tokens may have the following values of Gender:

Paradigm devoirMascFem
Number=Singdû, du
Number=Plurdues

PROPN

1438 PROPN tokens (43% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1405; 98%).

PROPN tokens may have the following values of Gender:

Paradigm JeanMascFem
JeanJean

Gender seems to be lexical feature of PROPN. 100% lemmas (438) occur only with one value of Gender.

PRON

813 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (813; 100%), PronType=EMPTY (694; 85%), Person=3 (659; 81%), Number=Sing (647; 80%).

PRON tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Singil, -il, On, -onelle, -elle
Number=Plurils, -ilselles

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

NUM

2 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (2; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (5178; 57%), NOUN –[amod]–> ADJ (2214; 61%), NOUN –[acl]–> VERB (621; 62%), NOUN –[conj]–> NOUN (560; 55%), VERB –[nsubj:pass]–> NOUN (338; 90%), PROPN –[det]–> DET (231; 58%), NOUN –[appos]–> NOUN (133; 55%), VERB –[conj]–> VERB (100; 51%), ADJ –[nsubj]–> NOUN (81; 61%), VERB –[nsubj:pass]–> PRON (80; 59%).