home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-ParTUT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

11334 tokens (40%) have a non-empty value of Gender. 2591 types (63%) occur at least once with a non-empty value of Gender. 1944 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (6002; 21% instances), DET (2790; 10% instances), ADJ (1279; 4% instances), VERB (746; 3% instances), PRON (447; 2% instances), AUX (70; 0% instances).

NOUN

6002 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4161; 69%).

NOUN tokens may have the following values of Gender:

Paradigm oeuvreMascFem
Number=Singoeuvreoeuvre
Number=Pluroeuvres

Gender seems to be lexical feature of NOUN. 95% lemmas (1196) occur only with one value of Gender.

DET

2790 DET tokens (58% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2441; 87%), PronType=Art (2097; 75%), Definite=Def (1488; 53%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|Number=Singlela
Definite=Def|Number=Plurles
Number=Singlela
Number=Plurles

ADJ

1279 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (809; 63%).

ADJ tokens may have the following values of Gender:

Paradigm présentMascFem
présentprésente

VERB

746 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (746; 100%), Mood=EMPTY (744; 100%), VerbForm=Part (744; 100%), Tense=Past (743; 100%), Number=Sing (469; 63%).

VERB tokens may have the following values of Gender:

Paradigm direMascFem
Number=Singditdite
Number=Plurdites

PRON

447 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (350; 78%), Number=Sing (337; 75%), PronType=Prs (310; 69%).

PRON tokens may have the following values of Gender:

Paradigm leMascFem
lela

AUX

70 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (70; 100%), Number=Sing (70; 100%), Person=EMPTY (70; 100%), Tense=Past (70; 100%), VerbForm=Part (70; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2547; 57%), NOUN –[nmod]–> NOUN (1011; 50%), NOUN –[amod]–> ADJ (968; 68%), NOUN –[conj]–> NOUN (276; 55%), NOUN –[acl]–> VERB (233; 51%), VERB –[nsubj:pass]–> NOUN (120; 77%), NOUN –[compound]–> NOUN (64; 91%), ADJ –[conj]–> ADJ (42; 55%), ADJ –[nsubj]–> NOUN (33; 57%), NOUN –[nsubj]–> NOUN (26; 51%).