home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

293967 tokens (51%) have a non-empty value of Gender. 1370 types (74%) occur at least once with a non-empty value of Gender. 1189 lemmas (75%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (115471; 20% instances), DET (77215; 13% instances), ADJ (34368; 6% instances), PRON (21812; 4% instances), PROPN (17832; 3% instances), VERB (14796; 3% instances), NUM (11279; 2% instances), AUX (1104; 0% instances), ADP (89; 0% instances), PUNCT (1; 0% instances).

NOUN

115471 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (76662; 66%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 100% lemmas (454) occur only with one value of Gender.

DET

77215 DET tokens (90% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (68136; 88%), Number=Sing (61188; 79%), Definite=Def (54610; 71%).

DET tokens may have the following values of Gender:

ADJ

34368 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (22349; 65%).

ADJ tokens may have the following values of Gender:

PRON

21812 PRON tokens (95% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (20342; 93%), Reflex=EMPTY (18133; 83%), Number=Sing (15759; 72%), PronType=EMPTY (12913; 59%).

PRON tokens may have the following values of Gender:

PROPN

17832 PROPN tokens (82% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (17588; 99%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 97% lemmas (336) occur only with one value of Gender.

VERB

14796 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (14796; 100%), Person=EMPTY (14796; 100%), Tense=Past (14796; 100%), VerbForm=Part (14795; 100%), Number=Sing (11141; 75%).

VERB tokens may have the following values of Gender:

NUM

11279 NUM tokens (63% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (11261; 100%), Number=Plur (5863; 52%).

NUM tokens may have the following values of Gender:

AUX

1104 AUX tokens (9% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1104; 100%), Person=EMPTY (1104; 100%), Tense=Past (1104; 100%), VerbForm=Part (1104; 100%), Number=Sing (1103; 100%).

AUX tokens may have the following values of Gender:

ADP

89 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (65142; 90%), NOUN –[amod]–> ADJ (22841; 98%), NOUN –[nmod]–> NOUN (17148; 52%), NOUN –[nummod]–> NUM (7737; 90%), PROPN –[det]–> DET (3979; 79%), NOUN –[acl]–> VERB (3905; 67%), NOUN –[conj]–> NOUN (3525; 60%), NOUN –[fixed]–> ADJ (2885; 68%), NOUN –[flat:name]–> PROPN (2318; 95%), PROPN –[flat:name]–> PROPN (1372; 93%).