Treebank Statistics: UD_Welsh-CCG: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
13818 tokens (27%) have a non-empty value of Gender
.
4676 types (65%) occur at least once with a non-empty value of Gender
.
3099 lemmas (66%) occur at least once with a non-empty value of Gender
.
The feature is used with 5 part-of-speech tags: NOUN (10865; 21% instances), PROPN (1326; 3% instances), PRON (1166; 2% instances), ADJ (372; 1% instances), NUM (89; 0% instances).
NOUN
10865 NOUN tokens (70% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: VerbForm=EMPTY (10857; 100%), Number=Sing (8588; 79%), Mutation=EMPTY (7997; 74%).
NOUN
tokens may have the following values of Gender
:
Fem
(3026; 28% of non-emptyGender
): iaith, Gymraeg, ysgol, rhan, Eisteddfod, ardal, wythnos, ystod, addysg, llywodraethMasc
(7839; 72% of non-emptyGender
): ôl, nifer, gwaith, gyfer, cyngor, mwyn, rhaid, mis, angen, bydEMPTY
(4612): bod, cael, fod, gael, mynd, dod, wneud, ddod, fynd, gwneud
Paradigm iaith | Masc | Fem |
---|---|---|
Mutation=AM|Number=Sing | hiaith | hiaith |
Mutation=AM|Number=Plur | hieithoedd | |
Number=Sing | Iaith | iaith |
Number=Plur | ieithoedd |
PROPN
1326 PROPN tokens (67% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (1302; 98%), Mutation=EMPTY (1110; 84%).
PROPN
tokens may have the following values of Gender
:
Fem
(443; 33% of non-emptyGender
): Cymru, Nghymru, Gymru, Wyddfa, Gwynedd, DU, Ffrainc, Nghaernarfon, Siân, LoegrMasc
(883; 67% of non-emptyGender
): Eryri, Gwynedd, Môn, Bangor, UE, BBC, Dafydd, Dewi, Llanberis, ThomasEMPTY
(642): Bangor, Aberystwyth, Jones, Iwerddon, Ewrop, Caerdydd, Lloegr, John, Prydain, Alban
Paradigm Bangor | Masc | Fem |
---|---|---|
Mutation=NM | Mangor | |
Mutation=SM | Fangor | Fangor |
Bangor |
Gender
seems to be lexical feature of PROPN
. 93% lemmas (594) occur only with one value of Gender
.
PRON
1166 PRON tokens (34% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (1061; 91%), PronType=Prs (1041; 89%), Person=3 (1032; 89%), Poss=EMPTY (837; 72%).
PRON
tokens may have the following values of Gender
:
Fem
(359; 31% of non-emptyGender
): hi, ei, hon, ‘i, ‘w, honno, ‘u, hunain, Rhain, hunaMasc
(807; 69% of non-emptyGender
): ei, e, ‘i, hwn, o, ‘w, fo, hwnnw, ef, feEMPTY
(2215): i, eu, ni, chi, a, ein, hyn, nhw, fy, ti
Paradigm hwy | Masc | Fem |
---|---|---|
Number=Sing | 'w | 'w |
Number=Plur|Poss=Yes | eu |
ADJ
372 ADJ tokens (11% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=Pos (360; 97%), Number=Sing (344; 92%), Mutation=EMPTY (255; 69%).
ADJ
tokens may have the following values of Gender
:
Fem
(53; 14% of non-emptyGender
): leol, fechan, ariannol, drydedd, werdd, Chernyweg, Gymraeg, Saesneg, Wen, bedwareddMasc
(319; 86% of non-emptyGender
): unrhyw, bach, Ewropeaidd, arbennig, gyflym, blynyddol, brif, ddiweddar, eang, academaiddEMPTY
(3123): Cymraeg, newydd, Gymraeg, bob, mwy, lleol, eraill, pob, arall, fawr
Paradigm da | Masc | Fem |
---|---|---|
Degree=Pos|Mutation=SM | well | |
Degree=Pos | gwell | |
Degree=Cmp | gwell | |
Degree=Equ | cystal |
Gender
seems to be lexical feature of ADJ
. 93% lemmas (174) occur only with one value of Gender
.
NUM
89 NUM tokens (13% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (89; 100%), NumForm=Word (87; 98%).
NUM
tokens may have the following values of Gender
:
Fem
(45; 51% of non-emptyGender
): ddwy, tair, pedair, dwy, dair, thair, bedair, dyw, phedairMasc
(44; 49% of non-emptyGender
): ddau, dau, tri, dri, bedwar, pedwar, 4, 52, bymtheg, dairEMPTY
(579): un, chwe, 4, 2019, 2020, 50, 500, 7, 10, 100
Paradigm dau | Masc | Fem |
---|---|---|
Mutation=SM | ddau | ddwy |
dau | dwy, dyw |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[nmod]–> NOUN (1906; 57%),
NOUN –[conj]–> NOUN (435; 58%),
NOUN –[det]–> PRON (82; 53%),
NOUN –[appos]–> NOUN (66; 62%),
NOUN –[appos]–> PROPN (31; 51%),
PRON –[compound:redup]–> PRON (31; 100%),
PROPN –[appos]–> NOUN (16; 55%),
PROPN –[nmod]–> NOUN (16; 57%),
NOUN –[fixed]–> NOUN (15; 100%),
NOUN –[acl:relcl]–> NOUN (14; 54%).