home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

30618 tokens (43%) have a non-empty value of Gender. 6700 types (71%) occur at least once with a non-empty value of Gender. 4801 lemmas (71%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (14575; 21% instances), DET (7102; 10% instances), ADJ (4387; 6% instances), VERB (2206; 3% instances), PROPN (1425; 2% instances), PRON (910; 1% instances), AUX (10; 0% instances), NUM (3; 0% instances).

NOUN

14575 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (10390; 71%).

NOUN tokens may have the following values of Gender:

Paradigm patientMascFem
Number=Singpatientpatiente
Number=Sing|Typo=Yespatient
Number=Plurpatientspatientes

Gender seems to be lexical feature of NOUN. 99% lemmas (2775) occur only with one value of Gender.

DET

7102 DET tokens (68% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (7054; 99%), PronType=Art (6418; 90%), Definite=Def (5152; 73%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
ExtPos=ADVle
ExtPos=PRONle
le, l'la, l'
Typo=Yesle

ADJ

4387 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2932; 67%).

ADJ tokens may have the following values of Gender:

Paradigm autreMascFem
_AUTRE(S)
Number=Singautreautre
Number=Plurautresautres

VERB

2206 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2206; 100%), Person=EMPTY (2206; 100%), Tense=EMPTY (2206; 100%), VerbForm=Part (2206; 100%), Number=Sing (1536; 70%), Voice=Pass (1513; 69%).

VERB tokens may have the following values of Gender:

Paradigm devoirMascFem
Number=Sing|Voice=Actdû, du
Number=Plur|Voice=Passdues

PROPN

1425 PROPN tokens (44% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1392; 98%).

PROPN tokens may have the following values of Gender:

Paradigm JeanMascFem
JeanJean

Gender seems to be lexical feature of PROPN. 100% lemmas (434) occur only with one value of Gender.

PRON

910 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%), Emph=No (626; 69%), Case=Nom (555; 61%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Case=Acc|Emph=No|Number=Singlela
Case=Nom|Emph=No|ExtPos=ADP|Number=Singil
Case=Nom|Emph=No|ExtPos=ADV|Number=Singil
Case=Nom|Emph=No|Number=Singil, -ilelle, -elle
Case=Nom|Emph=No|Number=Plurils, -ilselles
Case=Nom|Emph=No|Number=Plur|Typo=Yeselles
Emph=No|Number=Sing-il, -t-il, le, lui-t-elle, la
Emph=No|Number=Plur-ils
Emph=Yes|Number=Singluielle
Emph=Yes|Number=Plureux

AUX

10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=EMPTY (10; 100%), VerbForm=Part (10; 100%).

AUX tokens may have the following values of Gender:

NUM

3 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (3; 100%), Number=Sing (3; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (6490; 69%), NOUN –[amod]–> ADJ (3641; 99%), NOUN –[acl]–> VERB (631; 63%), NOUN –[conj]–> NOUN (572; 56%), VERB –[nsubj:pass]–> NOUN (349; 86%), PROPN –[det]–> DET (270; 69%), NOUN –[appos]–> NOUN (139; 58%), ADJ –[nsubj]–> NOUN (130; 100%), ADJ –[conj]–> ADJ (118; 98%), VERB –[conj]–> VERB (104; 52%).