Treebank Statistics: UD_Breton-KEB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
This is a layered feature with the following layers: Gender, Gender[psor].
2236 tokens (22%) have a non-empty value of Gender.
1163 types (48%) occur at least once with a non-empty value of Gender.
857 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (1984; 20% instances), PROPN (107; 1% instances), PRON (66; 1% instances), AUX (39; 0% instances), NUM (38; 0% instances), VERB (2; 0% instances).
NOUN
1984 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1452; 73%).
NOUN tokens may have the following values of Gender:
Fem(610; 31% of non-emptyGender): yezh, vro, rannvro, bro, gevredigezh, gêr, plac’h, amzer, stad, weturMasc(1374; 69% of non-emptyGender): dud, levr, den, ti, brezhoneg, labour, vugale, istor, tud, traoù
| Paradigm mignon | Masc | Fem |
|---|---|---|
| Number=Sing | mignon | vignonez |
| Number=Plur | mignoned, vignoned |
Gender seems to be lexical feature of NOUN. 99% lemmas (808) occur only with one value of Gender.
PROPN
107 PROPN tokens (35% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (107; 100%).
PROPN tokens may have the following values of Gender:
Fem(27; 25% of non-emptyGender): Lenaig, Mari, Mona, Morwenna, Anna, Janed, Nolwenn, StéphanieMasc(80; 75% of non-emptyGender): Yann, Yannig, Divi, Fañch, Lan, Nevenoe, Iañ, Loeiz, Ber, EricEMPTY(200): Breizh, Pariz, Frañs, Kembre, Naoned, Europa, Karaez, Roazhon, Brezhoneg, Diwan
Gender seems to be lexical feature of PROPN. 100% lemmas (31) occur only with one value of Gender.
PRON
66 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (66; 100%), PronType=Prs (58; 88%), Person=3 (57; 86%), Case=Acc (51; 77%).
PRON tokens may have the following values of Gender:
Fem(37; 56% of non-emptyGender): i, hi, anezhi, -hi, he, zi, Honnezh, eben, hounnezh, niMasc(29; 44% of non-emptyGender): añ, Hemañ, anezhañ, E, Hennezh, egile, nañ, nnañEMPTY(173): me, in, o, oc’h, piv, holl, hini, re, a, int
| Paradigm indirect | Masc | Fem |
|---|---|---|
| añ, nañ, nnañ | i, zi, ni |
AUX
39 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (39; 100%), Number=Sing (39; 100%), Person=3 (39; 100%), VerbForm=Fin (39; 100%), Tense=Pres (33; 85%).
AUX tokens may have the following values of Gender:
Fem(11; 28% of non-emptyGender): he deus, he devoa, he doMasc(28; 72% of non-emptyGender): en deus, en doaEMPTY(1290): a, e, eo, oa, vo, zo, bet, o, en, vez
| Paradigm kaout | Masc | Fem |
|---|---|---|
| Tense=Fut | he do | |
| Tense=Past | en doa | he devoa |
| Tense=Pres | en deus | he deus |
NUM
38 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (38; 100%).
NUM tokens may have the following values of Gender:
Fem(12; 32% of non-emptyGender): div, peder, teirMasc(26; 68% of non-emptyGender): daou, tri, dri, pevar, zaouEMPTY(195): unan, 2007, 4, 000, 1950, 20, 3, 30, eil, 10
| Paradigm daou | Masc | Fem |
|---|---|---|
| daou, zaou | div |
VERB
2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (2; 100%), Number=Sing (2; 100%), Person=3 (2; 100%), VerbForm=Fin (2; 100%).
VERB tokens may have the following values of Gender:
Fem(2; 100% of non-emptyGender): he deus, he devoaEMPTY(1095): kinniget, dont, ober, graet, gwelet, gouestlet, lennet, chom, aozet, kavet
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod:gen]–> NOUN (159; 54%),
NOUN –[conj]–> NOUN (77; 68%),
NOUN –[nmod]–> NOUN (61; 53%),
NOUN –[nsubj]–> NOUN (21; 64%),
NOUN –[compound]–> NOUN (12; 71%),
NOUN –[appos]–> NOUN (11; 58%),
NOUN –[nsubj]–> PROPN (9; 60%),
PROPN –[appos]–> NOUN (4; 80%),
NOUN –[dep]–> NOUN (1; 100%),
NOUN –[flat:name]–> NOUN (1; 100%).