home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

176155 tokens (44%) have a non-empty value of Gender. 21048 types (49%) occur at least once with a non-empty value of Gender. 13918 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (74689; 19% instances), DET (58536; 15% instances), ADJ (22818; 6% instances), VERB (11235; 3% instances), PRON (7972; 2% instances), AUX (892; 0% instances), NUM (10; 0% instances), PROPN (3; 0% instances).

NOUN

74689 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56106; 75%).

NOUN tokens may have the following values of Gender:

Paradigm partieMascFem
Number=Singpartipartie
Number=Plurparties

Gender seems to be lexical feature of NOUN. 98% lemmas (9230) occur only with one value of Gender.

DET

58536 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (50429; 86%), Number=Sing (45600; 78%), Definite=Def (40379; 69%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Number=Single, l', lla, l', l, Les, là
Number=Plurlesles, L'

ADJ

22818 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16550; 73%).

ADJ tokens may have the following values of Gender:

Paradigm premierMascFem
Number=Singpremier, 1er, Ier, 1e, 1première, 1re, 1ère
Number=Plurpremierspremières

VERB

11235 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11235; 100%), Person=EMPTY (11235; 100%), Tense=Past (11235; 100%), VerbForm=Part (11235; 100%), Number=Sing (9032; 80%).

VERB tokens may have the following values of Gender:

Paradigm faireMascFem
Number=Singfait, faisfaite
Number=Plurfaitsfaites

PRON

7972 PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6891; 86%), Person=3 (6646; 83%), PronType=Prs (5805; 73%).

PRON tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Sing|Person=2-Tu
Number=Sing|Person=3il, -il, Lui, t-il-elle, elle
Number=SingLui
Number=Plur|Person=3ils, -ilselles, -elles

AUX

892 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (892; 100%), Number=Sing (892; 100%), Person=EMPTY (892; 100%), Tense=Past (892; 100%), VerbForm=Part (892; 100%).

AUX tokens may have the following values of Gender:

Paradigm faireMascFem
faitfaite

NUM

10 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Gender seems to be lexical feature of NUM. 100% lemmas (10) occur only with one value of Gender.

PROPN

3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (53878; 99%), NOUN –[amod]–> ADJ (18536; 99%), NOUN –[conj]–> NOUN (3390; 63%), NOUN –[acl]–> VERB (2965; 70%), VERB –[nsubj:pass]–> NOUN (1540; 96%), ADJ –[conj]–> ADJ (906; 97%), ADJ –[nsubj]–> NOUN (906; 97%), NOUN –[appos]–> NOUN (887; 58%), NOUN –[nsubj]–> NOUN (586; 61%), VERB –[nsubj:pass]–> PRON (549; 76%).