
This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

27470 tokens (39%) have a non-empty value of Gender. 6021 types (64%) occur at least once with a non-empty value of Gender. 4330 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (14146; 20% instances), DET (5927; 8% instances), ADJ (2774; 4% instances), VERB (2195; 3% instances), PROPN (1589; 2% instances), PRON (820; 1% instances), AUX (10; 0% instances), ADP (7; 0% instances), NUM (2; 0% instances).
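Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the sample sentence is hand-written for illustration, the helper names `parse_feats` and `gender_counts` are hypothetical, and skipping multiword-token ranges and empty nodes is an assumption about how tokens are counted.

```python
from collections import Counter

# Tiny hand-written CoNLL-U sample (illustrative, not taken from the treebank).
SAMPLE = """\
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tpatiente\tpatient\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tdort\tdormir\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""

def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts(conllu_text, feature="Gender"):
    """Return (tokens with the feature, total tokens, per-UPOS counts)."""
    total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Assumption: multiword token ranges (1-2) and empty nodes (1.1) are skipped.
        if "-" in cols[0] or "." in cols[0]:
            continue
        total += 1
        if feature in parse_feats(cols[5]):  # column 6 is FEATS
            by_upos[cols[3]] += 1            # column 4 is UPOS
    return sum(by_upos.values()), total, by_upos

with_gender, total, by_upos = gender_counts(SAMPLE)
print(f"{with_gender} of {total} tokens ({100 * with_gender / total:.0f}%) have Gender")
for upos, count in by_upos.most_common():
    print(f"{upos} ({count}; {100 * count / total:.0f}% instances)")
```

In this sketch the per-tag percentages are taken relative to all tokens, which is consistent with the figures above (the nine per-tag counts sum to 27470).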

NOUN

14146 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (9947; 70%).

NOUN tokens may have the following values of Gender:

Paradigm patient    Masc        Fem
Number=Sing         patient     patiente
Number=Plur         patients    patientes
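
A paradigm table like the one above could be assembled by grouping the word forms of one lemma by their Number and Gender values. This is only a sketch under that assumption; the function name `paradigm` and the token list are hypothetical, with forms copied from the table.

```python
from collections import defaultdict

# (form, Number, Gender) for tokens of one lemma; forms mirror the table above.
TOKENS = [
    ("patient", "Sing", "Masc"),
    ("patiente", "Sing", "Fem"),
    ("patients", "Plur", "Masc"),
    ("patientes", "Plur", "Fem"),
    ("patient", "Sing", "Masc"),  # repeated forms are listed only once
]

def paradigm(tokens):
    """Map each row label to {gender: sorted list of distinct forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, number, gender in tokens:
        table[f"Number={number}"][gender].add(form)
    return {row: {g: sorted(forms) for g, forms in cells.items()}
            for row, cells in table.items()}

rows = paradigm(TOKENS)
for row in ("Number=Sing", "Number=Plur"):
    masc = ", ".join(rows[row].get("Masc", []))
    fem = ", ".join(rows[row].get("Fem", []))
    print(row, masc, fem, sep="\t")
```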

Gender seems to be a lexical feature of NOUN. 99% of lemmas (2721) occur with only one value of Gender.
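
The "lexical feature" diagnosis rests on how many lemmas are ever seen with more than one Gender value. A minimal sketch of that check follows; the observation list is made-up illustration data and `single_value_share` is a hypothetical helper name.

```python
from collections import defaultdict

# (lemma, Gender) pairs for NOUN tokens; hypothetical data for illustration.
OBSERVATIONS = [
    ("patient", "Masc"), ("patient", "Fem"),
    ("maison", "Fem"), ("maison", "Fem"),
    ("jour", "Masc"),
    ("ville", "Fem"),
]

def single_value_share(observations):
    """Return (lemmas seen with exactly one value, all lemmas seen)."""
    values_by_lemma = defaultdict(set)
    for lemma, gender in observations:
        values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)

single, lemmas = single_value_share(OBSERVATIONS)
print(f"{100 * single / lemmas:.0f}% of lemmas ({single}) occur with only one value")
```

A lemma such as patient, which has both a Masc and a Fem form, is what keeps the share below 100%.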

DET

5927 DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5883; 99%), PronType=Art (5421; 91%), Definite=Def (4127; 70%).

DET tokens may have the following values of Gender:

Paradigm le                  Masc           Fem
Definite=Def|PronType=Art    le, les, l'    la, l'
                             Le

ADJ

2774 ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1810; 65%).

ADJ tokens may have the following values of Gender:

Paradigm tout    Masc    Fem
Number=Sing      tout    toute
Number=Plur      tous    toutes

VERB

2195 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1537; 70%), Voice=EMPTY (1451; 66%).
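
The EMPTY entries above appear to stand for a feature that is absent on the token. Under that assumption, a co-occurrence list of this shape could be computed roughly as follows; the token feature sets are hypothetical and `cooccurrence` is an illustrative name, not part of any UD tool.

```python
from collections import Counter

# Feature dicts of hypothetical VERB tokens.
VERB_TOKENS = [
    {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"},
    {"Gender": "Fem", "Number": "Plur", "Tense": "Past", "VerbForm": "Part",
     "Voice": "Pass"},
    {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"},
    {"Mood": "Ind", "Number": "Sing", "Person": "3", "Tense": "Pres",
     "VerbForm": "Fin"},
]

def cooccurrence(tokens, feature="Gender"):
    """Count feature=value pairs on tokens that carry `feature`."""
    # Every feature name seen on any token of this tag, except the target.
    names = {name for feats in tokens for name in feats} - {feature}
    marked = [feats for feats in tokens if feature in feats]
    counts = Counter()
    for feats in marked:
        for name in names:
            # Assumption: an absent feature is reported as the value EMPTY.
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, len(marked)

counts, n = cooccurrence(VERB_TOKENS)
for pair, count in counts.most_common():
    print(f"{pair} ({count}; {round(100 * count / n)}%)")
```

On this reading, Mood=EMPTY and Person=EMPTY at 100% simply reflect that participles, the only VERB forms marked for Gender here, carry neither feature.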

VERB tokens may have the following values of Gender:

Paradigm devoir    Masc      Fem
Number=Sing        dû, du
Number=Plur                  dues

PROPN

1589 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1554; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Jean    Masc    Fem
                 Jean    Jean

Gender seems to be a lexical feature of PROPN. 100% of lemmas (471) occur with only one value of Gender.

PRON

820 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (820; 100%), PronType=EMPTY (701; 85%), Person=3 (659; 80%), Number=Sing (647; 79%).

PRON tokens may have the following values of Gender:

Paradigm il    Masc                 Fem
Number=Sing    il, -il, On, -on     elle, -elle
Number=Plur    ils, -ils            elles

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

ADP

7 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Paradigm à    Masc    Fem
              à       à

NUM

2 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (2; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where the parent and child nodes agree in Gender: NOUN –[det]–> DET (5108; 56%), NOUN –[amod]–> ADJ (2172; 61%), NOUN –[acl]–> VERB (618; 62%), NOUN –[conj]–> NOUN (558; 56%), PROPN –[det]–> DET (372; 65%), VERB –[nsubj:pass]–> NOUN (337; 90%), NOUN –[appos]–> NOUN (132; 54%), VERB –[conj]–> VERB (100; 51%), ADJ –[nsubj]–> NOUN (81; 61%), PROPN –[conj]–> PROPN (80; 52%).
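
Agreement counts of this kind can be sketched by walking each dependency edge and comparing the Gender of the child with that of its parent. The example below is an illustration only: the sentence is hypothetical, `agreement_stats` is an assumed helper name, and taking the percentage relative to all edges of the same pattern is an assumption, since the page does not state its denominator.

```python
from collections import Counter

# Tokens as (id, upos, gender or None, head id, deprel); hypothetical sentence.
SENTENCE = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 0, "root"),
    (3, "ADJ", "Fem", 2, "amod"),
    (4, "ADJ", None, 2, "amod"),
]

def agreement_stats(sentence):
    """Return (agreeing edges, all edges), keyed by 'PARENT -[rel]-> CHILD'."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _, upos, gender, head, deprel in sentence:
        if head == 0:
            continue  # the root has no parent node
        parent_upos, parent_gender = by_id[head][1], by_id[head][2]
        key = f"{parent_upos} -[{deprel}]-> {upos}"
        total[key] += 1
        # Agreement requires both nodes to carry Gender with the same value.
        if gender is not None and gender == parent_gender:
            agree[key] += 1
    return agree, total

agree, total = agreement_stats(SENTENCE)
for key, count in agree.most_common():
    print(f"{key} ({count}; {round(100 * count / total[key])}%)")
```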