home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-ParTUT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

11401 tokens (40%) have a non-empty value of Gender. 2604 types (63%) occur at least once with a non-empty value of Gender. 1905 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (6057; 21% instances), DET (2768; 10% instances), ADJ (1309; 5% instances), VERB (741; 3% instances), PRON (455; 2% instances), AUX (70; 0% instances), ADP (1; 0% instances).

NOUN

6057 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4334; 72%).

NOUN tokens may have the following values of Gender:

Paradigm œuvreMascFem
Number=Singoeuvreoeuvre, œuvre
Number=Pluroeuvres

Gender seems to be lexical feature of NOUN. 97% lemmas (1215) occur only with one value of Gender.

DET

2768 DET tokens (58% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2426; 88%), PronType=Art (2126; 77%), Definite=Def (1487; 54%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|ExtPos=ADV|Number=Single
Definite=Def|ExtPos=PRON|Number=Single
Definite=Def|Number=Singlela
Definite=Def|Number=Plurlesles
Number=Singlela
Number=Plurles

ADJ

1309 ADJ tokens (71% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (818; 62%).

ADJ tokens may have the following values of Gender:

Paradigm présentMascFem
présentprésente

VERB

741 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (741; 100%), Mood=EMPTY (740; 100%), VerbForm=Part (740; 100%), Tense=Past (739; 100%), Number=Sing (466; 63%).

VERB tokens may have the following values of Gender:

Paradigm direMascFem
Number=Singditdite
Number=Plurdites

PRON

455 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (360; 79%), Number=Sing (341; 75%), PronType=Prs (267; 59%).

PRON tokens may have the following values of Gender:

Paradigm luiMascFem
ExtPos=ADP|Number=Singil
Number=Singil, le, -ilelle, la
Number=Plurils, euxelles

AUX

70 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (70; 100%), Number=Sing (70; 100%), Person=EMPTY (70; 100%), Tense=Past (70; 100%), VerbForm=Part (70; 100%).

AUX tokens may have the following values of Gender:

ADP

1 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2639; 59%), NOUN –[amod]–> ADJ (1063; 73%), NOUN –[nmod]–> NOUN (1020; 50%), NOUN –[conj]–> NOUN (276; 54%), NOUN –[acl]–> VERB (233; 52%), VERB –[nsubj:pass]–> NOUN (123; 78%), NOUN –[compound]–> NOUN (60; 91%), ADJ –[conj]–> ADJ (42; 55%), ADJ –[nsubj]–> NOUN (40; 63%), NOUN –[nsubj]–> NOUN (24; 53%).