
This page pertains to UD version 2.

Treebank Statistics: UD_French-PUD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

12689 tokens (51%) have a non-empty value of Gender. 4404 types (74%) occur at least once with a non-empty value of Gender. 3541 lemmas (77%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags (count; share of all tokens in the treebank): NOUN (4672; 19%), DET (3872; 16%), ADJ (1618; 7%), PROPN (970; 4%), VERB (838; 3%), PRON (588; 2%), AUX (99; 0%), ADP (26; 0%), NUM (4; 0%), SCONJ (1; 0%), X (1; 0%).
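Counts of this kind can be reproduced from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is invented, the helper names (`parse_feats`, `read_tokens`, `gender_by_upos`) are hypothetical, and skipping multiword-token ranges and empty nodes is an assumption about what counts as a token.

```python
from collections import Counter

# Tiny hand-written CoNLL-U sample (hypothetical, not taken from UD_French-PUD).
SAMPLE = """\
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tmaison\tmaison\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_
4\tnouvelle\tnouveau\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(text):
    """Yield (form, lemma, upos, feats) for every syntactic word in CoNLL-U text."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def gender_by_upos(text):
    """Return (total token count, Counter of Gender-bearing tokens per UPOS tag)."""
    total = 0
    with_gender = Counter()
    for _form, _lemma, upos, feats in read_tokens(text):
        total += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender

total, with_gender = gender_by_upos(SAMPLE)
n = sum(with_gender.values())
print(f"{n} tokens ({round(100 * n / total)}%) have a non-empty value of Gender.")
for upos, count in with_gender.most_common():
    print(f"{upos} ({count}; {round(100 * count / total)}% of all tokens)")
```

To run it on a real file, replace `SAMPLE` with the file's contents, for example `open(path, encoding="utf-8").read()`.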

NOUN

4672 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3369; 72%).

NOUN tokens may have the following values of Gender:

Paradigm sud
  Masc: sud
  Fem: sud

Gender seems to be a lexical feature of NOUN: 99% of lemmas (1828) occur with only one value of Gender.
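The "lexical feature" test amounts to asking how many lemmas of a given part of speech are ever seen with more than one Gender value. A minimal sketch follows, using invented observations; the function name `single_gender_share` and the input layout are hypothetical, and the page's own computation may differ in detail.

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, Gender) observations; in practice these would be
# read from the LEMMA, UPOS and FEATS columns of the treebank's CoNLL-U file.
TOKENS = [
    ("maison", "NOUN", "Fem"),
    ("maison", "NOUN", "Fem"),
    ("sud", "NOUN", "Masc"),
    ("sud", "NOUN", "Fem"),
    ("livre", "NOUN", "Masc"),
    ("nouveau", "ADJ", "Masc"),
    ("nouveau", "ADJ", "Fem"),
]

def single_gender_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, total = single_gender_share(TOKENS, "NOUN")
print(f"{round(100 * single / total)}% of NOUN lemmas ({single} of {total}) "
      "occur with only one value of Gender.")
```

A share close to 100% suggests that Gender is a property of the lemma, while a low share, as expected for adjectives or determiners, suggests that it is inflectional.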

DET

3872 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3446; 89%), Number=Sing (2862; 74%), Definite=Def (2781; 72%).

DET tokens may have the following values of Gender:

Paradigm le
  Number=Sing
    Masc: le, l', l’, les, l‘
    Fem: la, l', l’, l‘
  Number=Plur
    Masc: les, le
    Fem: les

ADJ

1618 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1085; 67%).

ADJ tokens may have the following values of Gender:

Paradigm nouveau
  Number=Sing
    Masc: nouveau, nouvel
    Fem: nouvelle
  Number=Plur
    Masc: nouveaux
    Fem: nouvelles
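A paradigm such as the one for nouveau groups the observed word forms of one lemma by Gender and by the remaining features (here Number). The sketch below shows one way to assemble such a grouping. The input tuples are invented for illustration and the function name `paradigm` is hypothetical; it is not the generator used for this page.

```python
from collections import defaultdict

# Hypothetical (form, lemma, feats) observations for illustration only.
TOKENS = [
    ("nouveau", "nouveau", {"Gender": "Masc", "Number": "Sing"}),
    ("nouvel", "nouveau", {"Gender": "Masc", "Number": "Sing"}),
    ("nouvelle", "nouveau", {"Gender": "Fem", "Number": "Sing"}),
    ("nouveaux", "nouveau", {"Gender": "Masc", "Number": "Plur"}),
    ("nouvelles", "nouveau", {"Gender": "Fem", "Number": "Plur"}),
    ("nouvelle", "nouveau", {"Gender": "Fem", "Number": "Sing"}),  # repeated form
]

def paradigm(tokens, lemma, feature="Gender"):
    """Map each combination of the other features to {feature value: [forms]}."""
    table = defaultdict(lambda: defaultdict(list))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # Row label: all remaining features in Name=Value|Name=Value notation.
        row = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        cell = table[row][feats[feature]]
        if form not in cell:  # list each distinct form once per cell
            cell.append(form)
    return table

table = paradigm(TOKENS, "nouveau")
for row, cells in table.items():
    print(row)
    for value, forms in cells.items():
        print(f"  {value}: {', '.join(forms)}")
```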

PROPN

970 PROPN tokens (76% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (933; 96%).

PROPN tokens may have the following values of Gender:

Paradigm Trump
  Masc: Trump
  Fem: Trump

Gender seems to be a lexical feature of PROPN: 99% of lemmas (640) occur with only one value of Gender.

VERB

838 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (838; 100%), Person=EMPTY (838; 100%), Tense=Past (838; 100%), VerbForm=Part (838; 100%), Number=Sing (692; 83%).

VERB tokens may have the following values of Gender:

Paradigm faire
  Number=Sing
    Masc: fait
    Fem: faite
  Number=Plur
    Fem: faites

PRON

588 PRON tokens (55% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (542; 92%), Person=3 (523; 89%), Number=Sing (449; 76%).

PRON tokens may have the following values of Gender:

Paradigm il
  Number=Sing|Person=1
    je, J’, j'
  Number=Sing|Person=3
    Masc: il, -il, -t-il
    Fem: elle, -elle
  Number=Plur|Person=1
    nous
  Number=Plur|Person=3
    Masc: ils, -ils
    Fem: elles

AUX

99 AUX tokens (10% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (99; 100%), Number=Sing (99; 100%), Person=EMPTY (99; 100%), Tense=Past (99; 100%), VerbForm=Part (99; 100%).

AUX tokens may have the following values of Gender:

ADP

26 ADP tokens (1% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Paradigm dont
  Number=Sing
    Masc: dont
    Fem: dont
  Number=Plur
    Masc: dont
    Fem: dont

NUM

4 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

SCONJ

1 SCONJ token (0% of all SCONJ tokens) has a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

X

1 X token (1% of all X tokens) has a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (1; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (3457; 100%), NOUN –[amod]–> ADJ (1353; 100%), NOUN –[nmod]–> NOUN (673; 53%), PROPN –[det]–> DET (244; 96%), PROPN –[flat:name]–> PROPN (189; 96%), NOUN –[conj]–> NOUN (153; 60%), VERB –[nsubj:pass]–> NOUN (121; 96%), NOUN –[acl]–> VERB (104; 71%), NOUN –[appos]–> PROPN (74; 62%), VERB –[conj]–> VERB (53; 54%).
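Agreement counts like these can be obtained by walking the dependency tree and comparing the Gender value of each node with that of its parent. The following is a rough sketch with an invented sentence; the function name `gender_agreement` is hypothetical. Percentages are taken relative to the pairs in which both nodes carry Gender, which is an assumption about how the figures above are derived.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, gender, head, deprel); gender is None when absent.
SENTENCE = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 0, "root"),
    (3, "ADJ", "Fem", 2, "amod"),
    (4, "ADP", None, 6, "case"),
    (5, "DET", "Masc", 6, "det"),
    (6, "NOUN", "Masc", 2, "nmod"),
]

def gender_agreement(sentence):
    """Count parent-child pairs that both carry Gender, and those that agree."""
    by_id = {tok[0]: tok for tok in sentence}
    both = Counter()   # pairs where parent and child both have Gender
    agree = Counter()  # subset of those pairs with identical Gender values
    for _id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head == 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return both, agree

both, agree = gender_agreement(SENTENCE)
for key, n in agree.most_common():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n}; {round(100 * n / both[key])}%)")
```

In the sample, the nmod pair (a feminine head noun with a masculine dependent noun) is counted in `both` but not in `agree`. This is consistent with the comparatively low agreement rate reported for NOUN –[nmod]–> NOUN, where the two nouns have independent lexical genders.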