home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-ParTUT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

11369 tokens (40%) have a non-empty value of Gender. 2602 types (63%) occur at least once with a non-empty value of Gender. 1915 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (6047; 21% instances), DET (2788; 10% instances), ADJ (1272; 4% instances), VERB (745; 3% instances), PRON (447; 2% instances), AUX (70; 0% instances).

NOUN

6047 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4310; 71%).

NOUN tokens may have the following values of Gender:

Paradigm œuvreMascFem
Number=Singoeuvreoeuvre, œuvre
Number=Pluroeuvres

Gender seems to be lexical feature of NOUN. 95% lemmas (1189) occur only with one value of Gender.

DET

2788 DET tokens (58% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2439; 87%), PronType=Art (2096; 75%), Definite=Def (1487; 53%).

DET tokens may have the following values of Gender:

Paradigm leMascFem
Definite=Def|Number=Singlela
Definite=Def|Number=Plurles
Number=Singlela
Number=Plurles

ADJ

1272 ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (803; 63%).

ADJ tokens may have the following values of Gender:

Paradigm présentMascFem
présentprésente

VERB

745 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (745; 100%), Mood=EMPTY (743; 100%), VerbForm=Part (743; 100%), Tense=Past (742; 100%), Number=Sing (469; 63%).

VERB tokens may have the following values of Gender:

Paradigm direMascFem
Number=Singditdite
Number=Plurdites

PRON

447 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (350; 78%), Number=Sing (337; 75%), PronType=Prs (310; 69%).

PRON tokens may have the following values of Gender:

Paradigm leMascFem
lela

AUX

70 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (70; 100%), Number=Sing (70; 100%), Person=EMPTY (70; 100%), Tense=Past (70; 100%), VerbForm=Part (70; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2575; 57%), NOUN –[nmod]–> NOUN (1032; 51%), NOUN –[amod]–> ADJ (977; 69%), NOUN –[conj]–> NOUN (280; 56%), NOUN –[acl]–> VERB (233; 51%), VERB –[nsubj:pass]–> NOUN (120; 76%), NOUN –[compound]–> NOUN (60; 91%), ADJ –[conj]–> ADJ (42; 55%), ADJ –[nsubj]–> NOUN (36; 58%), NOUN –[nsubj]–> NOUN (25; 53%).