home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Rhapsodie: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

13316 tokens (30%) have a non-empty value of Gender. 2784 types (61%) occur at least once with a non-empty value of Gender. 2273 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (5228; 12% instances), DET (3308; 7% instances), PRON (2423; 5% instances), ADJ (1561; 4% instances), VERB (704; 2% instances), PROPN (48; 0% instances), AUX (44; 0% instances).

NOUN

5228 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4000; 77%).

NOUN tokens may have the following values of Gender:

Paradigm foisMascFem
Number=Singfois
Number=Plurfoisfois

Gender seems to be lexical feature of NOUN. 99% lemmas (1531) occur only with one value of Gender.

DET

3308 DET tokens (74% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3294; 100%), PronType=Art (2901; 88%), Definite=Def (2135; 65%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
le, l'la, l'

PRON

2423 PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (2374; 98%), Number=Sing (2224; 92%), Case=EMPTY (1335; 55%), Emph=EMPTY (1299; 54%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
Case=Acc|Emph=No|Number=Singlela
Case=Nom|Emph=No|ExtPos=ADP|Number=Singil
Case=Nom|Emph=No|Number=Singil, -ilelle
Case=Nom|Emph=No|Number=Plurilselles
Emph=No|Number=Single
Emph=Yes|Number=Singluielle
Emph=Yes|Number=Plureux
Number=Sing-il, le, -t-il
Number=Plur-ils

ADJ

1561 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1243; 80%).

ADJ tokens may have the following values of Gender:

Paradigm toutMascFem
Number=Singtouttoute
Number=Plurtoustoutes

VERB

704 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (704; 100%), Person=EMPTY (704; 100%), Tense=EMPTY (704; 100%), VerbForm=Part (704; 100%), Number=Sing (603; 86%), Voice=Act (446; 63%).

VERB tokens may have the following values of Gender:

Paradigm allerMascFem
Voice=Actallée
Voice=Passallé

PROPN

48 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (15) occur only with one value of Gender.

AUX

44 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (44; 100%), Number=Sing (44; 100%), Person=EMPTY (44; 100%), Tense=Past (44; 100%), VerbForm=Part (44; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2819; 75%), NOUN –[amod]–> ADJ (934; 100%), NOUN –[conj]–> NOUN (153; 60%), ADJ –[nsubj]–> PRON (147; 72%), NOUN –[reparandum]–> NOUN (93; 79%), DET –[reparandum]–> DET (88; 81%), DET –[fixed]–> NOUN (81; 100%), NOUN –[appos]–> NOUN (56; 79%), ADJ –[nsubj]–> NOUN (45; 100%), ADJ –[conj]–> ADJ (42; 100%).