home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

This is a layered feature with the following layers: Gender, Gender[psor].

2242 tokens (22%) have a non-empty value of Gender. 1167 types (48%) occur at least once with a non-empty value of Gender. 857 lemmas (50%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (1990; 20% instances), PROPN (107; 1% instances), PRON (66; 1% instances), VERB (41; 0% instances), NUM (38; 0% instances).

NOUN

1990 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1456; 73%).

NOUN tokens may have the following values of Gender:

Paradigm mignonMascFem
Number=Singmignonvignonez
Number=Plurmignoned, vignoned

Gender seems to be lexical feature of NOUN. 99% lemmas (808) occur only with one value of Gender.

PROPN

107 PROPN tokens (35% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (107; 100%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (31) occur only with one value of Gender.

PRON

66 PRON tokens (27% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (66; 100%), PronType=Prs (58; 88%), Person=3 (57; 86%), Case=Acc (51; 77%).

PRON tokens may have the following values of Gender:

Paradigm indirectMascFem
_, añ, nañ, nnañi, _, nni, zi, ni

VERB

41 VERB tokens (2% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (41; 100%), Number=Sing (41; 100%), Person=3 (41; 100%), VerbForm=Fin (41; 100%), Tense=Pres (34; 83%).

VERB tokens may have the following values of Gender:

Paradigm kaoutMascFem
Tense=Futhe do
Tense=Pasten doahe devoa
Tense=Presen deushe deus

NUM

38 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (38; 100%).

NUM tokens may have the following values of Gender:

Paradigm daouMascFem
daou, zaoudiv

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod:gen]–> NOUN (159; 54%), NOUN –[conj]–> NOUN (77; 68%), NOUN –[nmod]–> NOUN (61; 53%), NOUN –[nsubj]–> NOUN (21; 64%), NOUN –[compound]–> NOUN (12; 71%), NOUN –[appos]–> NOUN (11; 58%), NOUN –[nsubj]–> PROPN (9; 60%), PROPN –[appos]–> NOUN (4; 80%), NOUN –[dep]–> NOUN (1; 100%), NOUN –[flat:name]–> NOUN (1; 100%).