home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

27471 tokens (39%) have a non-empty value of Gender. 6022 types (64%) occur at least once with a non-empty value of Gender. 4330 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (14147; 20% instances), DET (5927; 8% instances), ADJ (2781; 4% instances), VERB (2195; 3% instances), PROPN (1589; 2% instances), PRON (813; 1% instances), AUX (10; 0% instances), ADP (7; 0% instances), NUM (2; 0% instances).

NOUN

14147 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (9948; 70%).

NOUN tokens may have the following values of Gender:

Paradigm patientMascFem
Number=Singpatientpatiente
Number=Plurpatientspatientes

Gender seems to be lexical feature of NOUN. 99% lemmas (2721) occur only with one value of Gender.

DET

5927 DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5883; 99%), PronType=Art (5421; 91%), Definite=Def (4127; 70%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|PronType=Artle, les, l'la, l'
Le

ADJ

2781 ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1810; 65%).

ADJ tokens may have the following values of Gender:

Paradigm toutMascFem
Number=Singtouttoute
Number=Plurtoustoutes

VERB

2195 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1537; 70%), Voice=EMPTY (1450; 66%).

VERB tokens may have the following values of Gender:

Paradigm devoirMascFem
Number=Singdû, du
Number=Plurdues

PROPN

1589 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1554; 98%).

PROPN tokens may have the following values of Gender:

Paradigm JeanMascFem
JeanJean

Gender seems to be lexical feature of PROPN. 100% lemmas (471) occur only with one value of Gender.

PRON

813 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (813; 100%), PronType=EMPTY (694; 85%), Person=3 (659; 81%), Number=Sing (647; 80%).

PRON tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Singil, -il, On, -onelle, -elle
Number=Plurils, -ilselles

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

ADP

7 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Paradigm àMascFem
àà

NUM

2 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (2; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (5107; 56%), NOUN –[amod]–> ADJ (2172; 61%), NOUN –[acl]–> VERB (618; 62%), NOUN –[conj]–> NOUN (558; 56%), PROPN –[det]–> DET (372; 65%), VERB –[nsubj:pass]–> NOUN (338; 90%), NOUN –[appos]–> NOUN (132; 54%), VERB –[conj]–> VERB (100; 51%), ADJ –[nsubj]–> NOUN (81; 61%), PROPN –[conj]–> PROPN (80; 52%).