Treebank Statistics: UD_Welsh-CCG: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
14954 tokens (27%) have a non-empty value of Gender.
4936 types (66%) occur at least once with a non-empty value of Gender.
3241 lemmas (66%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (11779; 22% instances), PROPN (1448; 3% instances), PRON (1247; 2% instances), ADJ (380; 1% instances), NUM (100; 0% instances).
NOUN
11779 NOUN tokens (70% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (11771; 100%), Number=Sing (9326; 79%), Mutation=EMPTY (8661; 74%).
NOUN tokens may have the following values of Gender:
Fem(3276; 28% of non-emptyGender): iaith, Gymraeg, ysgol, Eisteddfod, rhan, llywodraeth, ystod, ardal, addysg, wythnosMasc(8503; 72% of non-emptyGender): ôl, nifer, gwaith, gyfer, mwyn, cyngor, rhaid, angen, byd, misEMPTY(4957): bod, cael, fod, gael, mynd, dod, wneud, ddod, fynd, gwneud
| Paradigm iaith | Masc | Fem |
|---|---|---|
| Mutation=AM|Number=Sing | hiaith | hiaith |
| Mutation=AM|Number=Plur | hieithoedd | |
| Number=Sing | Iaith | iaith |
| Number=Plur | ieithoedd |
PROPN
1448 PROPN tokens (68% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1417; 98%), Mutation=EMPTY (1212; 84%).
PROPN tokens may have the following values of Gender:
Fem(483; 33% of non-emptyGender): Cymru, Nghymru, Gymru, Wyddfa, Gwynedd, DU, Ffrainc, Siân, Nghaernarfon, LoegrMasc(965; 67% of non-emptyGender): Eryri, Gwynedd, Môn, Bangor, Dafydd, UE, BBC, Dewi, Ariannin, LlanberisEMPTY(675): Bangor, Jones, Aberystwyth, Iwerddon, Ewrop, Caerdydd, Lloegr, John, Prydain, Alban
| Paradigm Bangor | Masc | Fem |
|---|---|---|
| Mutation=NM | Mangor | |
| Mutation=SM | Fangor | Fangor |
| Bangor |
Gender seems to be lexical feature of PROPN. 93% lemmas (633) occur only with one value of Gender.
PRON
1247 PRON tokens (34% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1134; 91%), PronType=Prs (1112; 89%), Person=3 (1105; 89%), Poss=EMPTY (898; 72%).
PRON tokens may have the following values of Gender:
Fem(458; 37% of non-emptyGender): hi, ei, hon, ‘i, ‘w, honno, ‘u, hunain, Rhain, hithauMasc(789; 63% of non-emptyGender): ei, e, ‘i, hwn, o, ‘w, fo, hwnnw, ef, feEMPTY(2369): i, eu, ni, chi, a, ein, nhw, hyn, fy, ti
| Paradigm ef | Masc | Fem |
|---|---|---|
| Poss=Yes|PronType=Prs | ei, 'i, 'w, fe, 'u, ef | |
| PronType=Emp | yntau | |
| PronType=Prs | ei, o, e, 'w, 'i, fo, ef, fe, i | 'w |
ADJ
380 ADJ tokens (10% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (367; 97%), Number=Sing (349; 92%), Mutation=EMPTY (258; 68%).
ADJ tokens may have the following values of Gender:
Fem(55; 14% of non-emptyGender): leol, fechan, ariannol, drydedd, werdd, Chernyweg, Genedlaethol, Gymraeg, Saesneg, WenMasc(325; 86% of non-emptyGender): unrhyw, bach, Ewropeaidd, arbennig, gyflym, blynyddol, brif, ddiweddar, eang, academaiddEMPTY(3408): newydd, Cymraeg, Gymraeg, bob, mwy, lleol, genedlaethol, eraill, mawr, pob
| Paradigm da | Masc | Fem |
|---|---|---|
| Degree=Pos|Mutation=SM | well | |
| Degree=Pos | gwell | |
| Degree=Cmp | gwell | |
| Degree=Equ | cystal |
Gender seems to be lexical feature of ADJ. 93% lemmas (177) occur only with one value of Gender.
NUM
100 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (100; 100%), NumForm=Word (98; 98%), Mutation=EMPTY (53; 53%).
NUM tokens may have the following values of Gender:
Fem(52; 52% of non-emptyGender): ddwy, tair, pedair, dwy, dair, thair, bedair, bedwar, dyw, phedairMasc(48; 48% of non-emptyGender): ddau, dau, tri, dri, bedwar, pedwar, 4, 52, bymtheg, dairEMPTY(627): un, chwe, 4, 100, 2019, 7, 10, 200, 2015, 2020
| Paradigm dau | Masc | Fem |
|---|---|---|
| Mutation=SM | ddau | ddwy |
| dau | dwy, dyw |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[conj]–> NOUN (474; 58%),
NOUN –[det]–> PRON (87; 53%),
NOUN –[appos]–> NOUN (71; 62%),
PROPN –[conj]–> PROPN (49; 51%),
PRON –[compound:redup]–> PRON (35; 100%),
PROPN –[nmod]–> NOUN (18; 60%),
NOUN –[fixed]–> NOUN (16; 100%),
NOUN –[acl:relcl]–> NOUN (14; 54%),
NOUN –[obl]–> NOUN (14; 56%),
NOUN –[amod]–> NOUN (12; 67%).