Treebank Statistics: UD_Welsh-CCG: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
13234 tokens (27%) have a non-empty value of Gender
.
4571 types (65%) occur at least once with a non-empty value of Gender
.
3040 lemmas (66%) occur at least once with a non-empty value of Gender
.
The feature is used with 5 part-of-speech tags: NOUN (10360; 21% instances), PROPN (1280; 3% instances), PRON (1134; 2% instances), ADJ (372; 1% instances), NUM (88; 0% instances).
NOUN
10360 NOUN tokens (70% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: VerbForm=EMPTY (10352; 100%), Number=Sing (8196; 79%), Mutation=EMPTY (7607; 73%).
NOUN
tokens may have the following values of Gender
:
Fem
(2912; 28% of non-emptyGender
): iaith, Gymraeg, ysgol, rhan, Eisteddfod, ardal, wythnos, llywodraeth, ystod, addysgMasc
(7448; 72% of non-emptyGender
): ôl, nifer, gwaith, gyfer, cyngor, mwyn, rhaid, angen, beth, bydEMPTY
(4408): bod, cael, fod, gael, mynd, dod, wneud, ddod, fynd, gwneud
Paradigm iaith | Masc | Fem |
---|---|---|
Mutation=AM|Number=Sing | hiaith | hiaith |
Mutation=AM|Number=Plur | hieithoedd | |
Number=Sing | Iaith | iaith |
Number=Plur | ieithoedd |
PROPN
1280 PROPN tokens (67% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (1257; 98%), Mutation=EMPTY (1072; 84%).
PROPN
tokens may have the following values of Gender
:
Fem
(424; 33% of non-emptyGender
): Cymru, Nghymru, Gymru, Wyddfa, Gwynedd, Ffrainc, DU, Nghaernarfon, Siân, LoegrMasc
(856; 67% of non-emptyGender
): Eryri, Gwynedd, Môn, UE, Bangor, BBC, Dafydd, Dewi, Thomas, BlaenauEMPTY
(639): Bangor, Aberystwyth, Jones, Iwerddon, Ewrop, Caerdydd, Lloegr, John, Prydain, Alban
Paradigm Bangor | Masc | Fem |
---|---|---|
Mutation=NM | Mangor | |
Mutation=SM | Fangor | Fangor |
Bangor |
Gender
seems to be lexical feature of PROPN
. 93% lemmas (577) occur only with one value of Gender
.
PRON
1134 PRON tokens (35% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (1035; 91%), PronType=Prs (1015; 90%), Person=3 (1007; 89%), Poss=EMPTY (809; 71%).
PRON
tokens may have the following values of Gender
:
Fem
(355; 31% of non-emptyGender
): hi, ei, hon, ‘i, ‘w, honno, ‘u, hunain, Rhain, hunaMasc
(779; 69% of non-emptyGender
): ei, e, ‘i, hwn, o, ‘w, fo, hwnnw, ef, feEMPTY
(2123): i, eu, ni, chi, a, ein, nhw, hyn, fy, ti
Paradigm hwy | Masc | Fem |
---|---|---|
Number=Sing | 'w | 'w |
Number=Plur|Poss=Yes | eu |
ADJ
372 ADJ tokens (11% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=Pos (360; 97%), Number=Sing (344; 92%), Mutation=EMPTY (255; 69%).
ADJ
tokens may have the following values of Gender
:
Fem
(53; 14% of non-emptyGender
): leol, fechan, ariannol, drydedd, werdd, Chernyweg, Gymraeg, Saesneg, Wen, bedwareddMasc
(319; 86% of non-emptyGender
): unrhyw, bach, Ewropeaidd, arbennig, gyflym, blynyddol, brif, ddiweddar, eang, academaiddEMPTY
(2950): newydd, Cymraeg, Gymraeg, bob, mwy, lleol, eraill, arall, pob, fawr
Paradigm da | Masc | Fem |
---|---|---|
Degree=Pos|Mutation=SM | well | |
Degree=Pos | gwell | |
Degree=Cmp | gwell | |
Degree=Equ | cystal |
Gender
seems to be lexical feature of ADJ
. 93% lemmas (174) occur only with one value of Gender
.
NUM
88 NUM tokens (14% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (88; 100%), NumForm=Word (86; 98%), Mutation=EMPTY (48; 55%).
NUM
tokens may have the following values of Gender
:
Fem
(44; 50% of non-emptyGender
): tair, ddwy, pedair, dwy, dair, thair, bedair, dyw, phedairMasc
(44; 50% of non-emptyGender
): ddau, dau, tri, dri, bedwar, pedwar, 4, 52, bymtheg, dairEMPTY
(551): un, 4, chwe, 2019, 2020, 50, 500, 7, 100, 11
Paradigm dau | Masc | Fem |
---|---|---|
Mutation=SM | ddau | ddwy |
dau | dwy, dyw |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[nmod]–> NOUN (1797; 56%),
NOUN –[conj]–> NOUN (405; 58%),
NOUN –[det]–> PRON (80; 54%),
NOUN –[appos]–> NOUN (65; 62%),
PRON –[compound:redup]–> PRON (31; 100%),
PROPN –[appos]–> NOUN (15; 54%),
PROPN –[nmod]–> NOUN (15; 56%),
NOUN –[acl:relcl]–> NOUN (14; 58%),
NOUN –[fixed]–> NOUN (14; 100%),
NOUN –[amod]–> NOUN (8; 62%).