Treebank Statistics: UD_Breton-KEB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
This is a layered feature with the following layers: Gender, Gender[psor].
2236 tokens (22%) have a non-empty value of Gender.
1163 types (48%) occur at least once with a non-empty value of Gender.
857 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (1984; 20% instances), PROPN (107; 1% instances), PRON (66; 1% instances), AUX (39; 0% instances), NUM (38; 0% instances), VERB (2; 0% instances).
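The per-tag counts above can be reproduced in outline from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is hypothetical, and it assumes that only syntactic words are counted (multiword-token ranges and empty nodes are skipped). Because the FEATS key is matched exactly, the layer Gender[psor] is not counted as Gender.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U fragment, for illustration only
# (not copied from UD_Breton-KEB).
SAMPLE = """\
1\tAr\tan\tDET\t_\tPronType=Art\t2\tdet\t_\t_
2\tyezh\tyezh\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
3\tYann\tYann\tPROPN\t_\tGender=Masc|Number=Sing\t2\tnmod\t_\t_
4\tBreizh\tBreizh\tPROPN\t_\tNumber=Sing\t2\tnmod\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_by_upos(conllu_text, feature="Gender"):
    """Count, per UPOS tag, tokens with a value of `feature` and all tokens."""
    with_feat, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if feature in feats:
            with_feat[upos] += 1
    return with_feat, total

with_feat, total = feature_by_upos(SAMPLE)
for upos in total:
    share = 100 * with_feat[upos] / total[upos]
    print(f"{upos}: {with_feat[upos]} of {total[upos]} ({share:.0f}%)")
```

To run it on a real treebank, the text of a `.conllu` file would be passed in place of `SAMPLE`.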
NOUN
1984 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1452; 73%).
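A co-occurrence figure of this kind can be sketched as follows. The FEATS strings are hypothetical, and the percentage is assumed to be taken over the tokens of the given tag that carry a value of Gender; the page itself does not state the denominator.

```python
from collections import Counter

# Hypothetical FEATS strings of NOUN tokens, for illustration only.
noun_feats = [
    "Gender=Fem|Number=Sing",
    "Gender=Masc|Number=Sing",
    "Gender=Masc|Number=Plur",
    "Number=Sing",  # no Gender value, so it is ignored below
]

def cooccurring_values(feats_list, feature="Gender"):
    """Count other feature=value pairs on tokens that have `feature`."""
    counts = Counter()
    n_with_feature = 0
    for feats in feats_list:
        pairs = [] if feats == "_" else feats.split("|")
        names = {p.split("=", 1)[0] for p in pairs}
        if feature not in names:
            continue
        n_with_feature += 1
        counts.update(p for p in pairs if p.split("=", 1)[0] != feature)
    return counts, n_with_feature

counts, n = cooccurring_values(noun_feats)
for value, c in counts.most_common():
    print(f"{value} ({c}; {100 * c / n:.0f}%)")
```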
NOUN tokens may have the following values of Gender:
Fem (610; 31% of non-empty Gender): yezh, vro, rannvro, bro, gevredigezh, gêr, plac’h, amzer, stad, wetur
Masc (1374; 69% of non-empty Gender): dud, levr, den, ti, brezhoneg, labour, vugale, istor, tud, traoù
Paradigm mignon | Masc | Fem |
---|---|---|
Number=Sing | mignon | vignonez |
Number=Plur | mignoned, vignoned | |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (808) occur with only one value of Gender.
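The "lexical feature" judgment rests on how many lemmas are seen with a single value of Gender. A minimal sketch of that check, using hypothetical (lemma, Gender) observations rather than the treebank itself:

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for NOUN tokens; in practice these
# would come from the LEMMA and FEATS columns of a CoNLL-U file.
observations = [
    ("yezh", "Fem"), ("yezh", "Fem"),
    ("levr", "Masc"),
    ("mignon", "Masc"), ("mignon", "Fem"),  # a lemma seen with both values
]

def single_valued_lemmas(pairs):
    """Return (lemmas seen with exactly one value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, total = single_valued_lemmas(observations)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that the value is a property of the lemma rather than something that varies by inflection.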
PROPN
107 PROPN tokens (35% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (107; 100%).
PROPN tokens may have the following values of Gender:
Fem (27; 25% of non-empty Gender): Lenaig, Mari, Mona, Morwenna, Anna, Janed, Nolwenn, Stéphanie
Masc (80; 75% of non-empty Gender): Yann, Yannig, Divi, Fañch, Lan, Nevenoe, Iañ, Loeiz, Ber, Eric
EMPTY (200): Breizh, Pariz, Frañs, Kembre, Naoned, Europa, Karaez, Roazhon, Brezhoneg, Diwan
Gender seems to be a lexical feature of PROPN. 100% of lemmas (31) occur with only one value of Gender.
PRON
66 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (66; 100%), PronType=Prs (58; 88%), Person=3 (57; 86%), Case=Acc (51; 77%).
PRON tokens may have the following values of Gender:
Fem (37; 56% of non-empty Gender): i, hi, anezhi, -hi, he, zi, Honnezh, eben, hounnezh, ni
Masc (29; 44% of non-empty Gender): añ, Hemañ, anezhañ, E, Hennezh, egile, nañ, nnañ
EMPTY (173): me, in, o, oc’h, piv, holl, hini, re, a, int
Paradigm indirect | Masc | Fem |
---|---|---|
| | añ, nañ, nnañ | i, zi, ni |
AUX
39 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (39; 100%), Number=Sing (39; 100%), Person=3 (39; 100%), VerbForm=Fin (39; 100%), Tense=Pres (33; 85%).
AUX tokens may have the following values of Gender:
Fem (11; 28% of non-empty Gender): he deus, he devoa, he do
Masc (28; 72% of non-empty Gender): en deus, en doa
EMPTY (1290): a, e, eo, oa, vo, zo, bet, o, en, vez
Paradigm kaout | Masc | Fem |
---|---|---|
Tense=Fut | | he do |
Tense=Past | en doa | he devoa |
Tense=Pres | en deus | he deus |
NUM
38 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (38; 100%).
NUM tokens may have the following values of Gender:
Fem (12; 32% of non-empty Gender): div, peder, teir
Masc (26; 68% of non-empty Gender): daou, tri, dri, pevar, zaou
EMPTY (195): unan, 2007, 4, 000, 1950, 20, 3, 30, eil, 10
Paradigm daou | Masc | Fem |
---|---|---|
| | daou, zaou | div |
VERB
2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (2; 100%), Number=Sing (2; 100%), Person=3 (2; 100%), VerbForm=Fin (2; 100%).
VERB tokens may have the following values of Gender:
Fem (2; 100% of non-empty Gender): he deus, he devoa
EMPTY (1095): kinniget, dont, ober, graet, gwelet, gouestlet, lennet, chom, aozet, kavet
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod:gen]–> NOUN (159; 54%),
NOUN –[conj]–> NOUN (77; 68%),
NOUN –[nmod]–> NOUN (61; 53%),
NOUN –[nsubj]–> NOUN (21; 64%),
NOUN –[compound]–> NOUN (12; 71%),
NOUN –[appos]–> NOUN (11; 58%),
NOUN –[nsubj]–> PROPN (9; 60%),
PROPN –[appos]–> NOUN (4; 80%),
NOUN –[dep]–> NOUN (1; 100%),
NOUN –[flat:name]–> NOUN (1; 100%).
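An agreement table like the one above can be approximated with the sketch below. The edges are hypothetical, and the percentage is assumed to be the share of agreeing pairs among parent-child pairs of that type in which both nodes have a value of Gender; the page does not spell out its denominator.

```python
from collections import Counter

# Hypothetical dependency edges, for illustration only:
# (parent UPOS, parent Gender, relation, child UPOS, child Gender).
# None marks a node without a Gender value.
edges = [
    ("NOUN", "Fem", "nmod:gen", "NOUN", "Fem"),
    ("NOUN", "Masc", "nmod:gen", "NOUN", "Fem"),
    ("NOUN", "Masc", "conj", "NOUN", "Masc"),
    ("NOUN", "Masc", "nsubj", "PROPN", None),
]

def agreement_stats(edges):
    """Map (parent UPOS, relation, child UPOS) to (agreeing count, percent)."""
    agree, both = Counter(), Counter()
    for p_upos, p_gender, rel, c_upos, c_gender in edges:
        if p_gender is None or c_gender is None:
            continue  # assumption: only pairs where both nodes have Gender
        key = (p_upos, rel, c_upos)
        both[key] += 1
        if p_gender == c_gender:
            agree[key] += 1
    return {key: (agree[key], 100 * agree[key] / both[key]) for key in both}

stats = agreement_stats(edges)
for (p_upos, rel, c_upos), (count, pct) in sorted(
    stats.items(), key=lambda item: -item[1][0]
):
    print(f"{p_upos} -[{rel}]-> {c_upos} ({count}; {pct:.0f}%)")
```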