
This page pertains to UD version 2.

Treebank Statistics: UD_French-ParisStories: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

11929 tokens (28%) have a non-empty value of Gender. 1946 types (59%) occur at least once with a non-empty value of Gender. 1650 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (4289; 10% instances), PRON (3247; 8% instances), DET (2276; 5% instances), VERB (1155; 3% instances), ADJ (869; 2% instances), AUX (38; 0% instances), ADV (33; 0% instances), PROPN (16; 0% instances), NUM (6; 0% instances).
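The per-tag counts above can be reproduced in spirit with a short script. The following is a minimal sketch, assuming the treebank is read as standard CoNLL-U text (ten tab-separated columns, features in column 6); the sample rows and the helper names `parse_feats` and `gender_counts` are hypothetical and are not the script that generated this page.

```python
from collections import Counter

# Hand-made rows in CoNLL-U column order (ID, FORM, LEMMA, UPOS, XPOS, FEATS,
# HEAD, DEPREL, DEPS, MISC); illustrative only, not taken from the treebank.
ROWS = [
    ("1", "la", "le", "DET", "_", "Gender=Fem|Number=Sing", "2", "det", "_", "_"),
    ("2", "chose", "chose", "NOUN", "_", "Gender=Fem|Number=Sing", "4", "nsubj", "_", "_"),
    ("3", "est", "être", "AUX", "_", "Number=Sing|Person=3", "4", "cop", "_", "_"),
    ("4", "belle", "beau", "ADJ", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_counts(SAMPLE)
n_all = sum(total.values())
for upos, n in with_gender.most_common():
    # First percentage: share of all tokens; second: share within the tag.
    print(f"{upos}: {n}; {n / n_all:.0%} of tokens, {n / total[upos]:.0%} of {upos}")
```

The two percentages correspond to the two kinds of figures on this page: the share of all tokens (as in the summary paragraph) and the share within one part-of-speech tag (as in the per-tag sections).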

NOUN

4289 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3434; 80%).

NOUN tokens may have the following values of Gender:

Paradigm truc    Masc     Fem
Number=Sing      truc     truc
Number=Plur      trucs

Gender seems to be a lexical feature of NOUN: 98% of lemmas (1093) occur with only one value of Gender.
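The lexical-feature test amounts to counting, for each lemma, how many distinct Gender values it takes. A minimal sketch follows, assuming tokens have already been reduced to (lemma, UPOS, Gender) triples; the sample data and the function name `single_valued_share` are hypothetical.

```python
from collections import defaultdict


def single_valued_share(tokens, upos):
    """Count lemmas of one UPOS that occur with exactly one Gender value.

    tokens: iterable of (lemma, upos, gender) triples; gender is None when
    the token has no Gender feature.
    Returns (lemmas with a single value, lemmas with any value).
    """
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Illustrative triples only; not taken from the treebank.
TOKENS = [
    ("truc", "NOUN", "Masc"),
    ("truc", "NOUN", "Fem"),
    ("chose", "NOUN", "Fem"),
    ("chose", "NOUN", "Fem"),
    ("jour", "NOUN", "Masc"),
    ("tout", "ADJ", "Fem"),
]

single, total = single_valued_share(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A share close to 100%, as reported for NOUN, suggests the value is fixed per lemma rather than inflectional; a lemma such as truc that appears under both values then stands out for review.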

PRON

3247 PRON tokens (50% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (3204; 99%), Number=Sing (3081; 95%).

PRON tokens may have the following values of Gender:

Paradigm lui             Masc                    Fem
ExtPos=ADP|Person=3      il
ExtPos=VERB|Person=3     il
Person=3                 il, lui, le, elle, l'   elle, la
(none)                   le

DET

2276 DET tokens (65% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2264; 99%), Number[psor]=EMPTY (2160; 95%), Person[psor]=EMPTY (2160; 95%), Poss=EMPTY (2160; 95%), PronType=Art (2043; 90%), Definite=Def (1358; 60%).
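Co-occurrence figures of this kind can be sketched as follows. This assumes that EMPTY means the feature is absent on the token, and that the features considered are all those seen on any token with the same tag; both are assumptions about how the page was generated, and `cooccurring_values` is a hypothetical name.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender"):
    """Count other feature values on tokens of `upos` that carry `feature`.

    tokens: iterable of (upos, feats) pairs, where feats is a dict.
    A feature missing from a token is counted as 'EMPTY' (assumed meaning).
    Returns (Counter of 'Name=Value' strings, number of tokens with `feature`).
    """
    selected = [feats for tag, feats in tokens if tag == upos]
    # Every feature name seen with this tag, except the one under study.
    names = {name for feats in selected for name in feats} - {feature}
    with_feature = [feats for feats in selected if feature in feats]
    counts = Counter()
    for feats in with_feature:
        for name in names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, len(with_feature)


# Illustrative tokens only; not taken from the treebank.
TOKENS = [
    ("DET", {"Gender": "Masc", "Number": "Sing", "Definite": "Def", "PronType": "Art"}),
    ("DET", {"Gender": "Fem", "Number": "Sing", "Definite": "Ind", "PronType": "Art"}),
    ("DET", {"Gender": "Masc", "Number": "Sing", "Poss": "Yes", "PronType": "Prs"}),
    ("DET", {"Number": "Plur", "Definite": "Def", "PronType": "Art"}),
]

counts, n = cooccurring_values(TOKENS, "DET")
for value, count in counts.most_common():
    print(f"{value} ({count}; {count / n:.0%})")
```

Under this reading, a line such as Poss=EMPTY (2160; 95%) says that 95% of the Gender-bearing DET tokens are not possessive determiners.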

DET tokens may have the following values of Gender:

Paradigm le                 Masc     Fem
Definite=Def|Number=Sing    le, l'   la, l'
Definite=Def|Number=Plur    les
Definite=Ind|Number=Sing    le

VERB

1155 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (1104; 96%), Mood=EMPTY (1101; 95%), Tense=Past (1099; 95%), Person=EMPTY (1097; 95%), Number=Sing (1084; 94%).

VERB tokens may have the following values of Gender:

Paradigm avoir    Masc    Fem
(none)            eu      eue

ADJ

869 ADJ tokens (72% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (662; 76%).

ADJ tokens may have the following values of Gender:

Paradigm tout                Masc    Fem
Number=Sing                  tout    toute
Number=Sing|PronType=Ind     tout    toute
Number=Plur                  tous    toutes
Number=Plur|PronType=Ind     tous

AUX

38 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (38; 100%), VerbForm=Part (38; 100%), Mood=EMPTY (37; 97%), Person=EMPTY (37; 97%), Tense=Past (37; 97%).

AUX tokens may have the following values of Gender:

ADV

33 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: ExtPos=EMPTY (33; 100%).

ADV tokens may have the following values of Gender:

PROPN

16 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

NUM

6 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (4; 67%).

NUM tokens may have the following values of Gender:

Paradigm un    Masc    Fem
(none)         un      une

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1966; 66%), NOUN –[amod]–> ADJ (440; 75%), ADJ –[nsubj]–> PRON (149; 55%), NOUN –[nsubj]–> PRON (128; 51%), DET –[fixed]–> NOUN (77; 96%), NOUN –[conj]–> NOUN (68; 61%), PRON –[reparandum]–> PRON (64; 91%), NOUN –[reparandum]–> NOUN (57; 77%), DET –[reparandum]–> DET (46; 79%), ADJ –[obl:mod]–> NOUN (32; 53%).
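Agreement counts like these can be sketched by walking each dependency edge and comparing the Gender of parent and child. The sketch below assumes that the percentage is the share of all instances of a relation type in which both nodes carry the same non-empty Gender; that reading is not confirmed by the page itself, and `gender_agreement` and the sample sentences are hypothetical.

```python
from collections import Counter


def gender_agreement(sentences):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    sentences: list of sentences, each a list of token dicts with keys
    'id', 'head', 'upos', 'deprel' and optionally 'gender'.
    Returns {relation: (agreeing edges, all edges)}.
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:  # head 0: the root has no parent token
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            g_parent, g_child = parent.get("gender"), child.get("gender")
            # Agreement requires a non-empty value on both sides.
            if g_parent is not None and g_parent == g_child:
                agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Illustrative sentences only; not taken from the treebank.
SENTENCES = [
    [
        {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "gender": "Fem"},
        {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
        {"id": 3, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    ],
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Masc"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    ],
]

for (parent, rel, child), (n_agree, n_all) in gender_agreement(SENTENCES).items():
    print(f"{parent} -[{rel}]-> {child} ({n_agree}; {n_agree / n_all:.0%})")
```

Edges where either node lacks Gender count against agreement under this assumption, which would explain why a relation such as NOUN -[det]-> DET stays well below 100%.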