
This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

13818 tokens (27%) have a non-empty value of Gender. 4676 types (65%) occur at least once with a non-empty value of Gender. 3099 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags; the percentages are shares of all tokens in the treebank: NOUN (10865; 21%), PROPN (1326; 3%), PRON (1166; 2%), ADJ (372; 1%), NUM (89; 0%).
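Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the helper names `parse_feats` and `gender_counts_by_upos` as well as the three-token sample are invented for illustration.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Count, per UPOS tag, the tokens that carry a non-empty Gender value."""
    with_gender = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[cols[3]] += 1
    return with_gender, total


# Toy input (hypothetical annotation, not taken from the treebank).
SAMPLE = "\n".join([
    "# text = toy example",
    "\t".join(["1", "iaith", "iaith", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"]),
    "\t".join(["2", "Bangor", "Bangor", "PROPN", "_", "Number=Sing", "1", "nmod", "_", "_"]),
    "\t".join(["3", "dau", "dau", "NUM", "_", "Gender=Masc|NumType=Card", "1", "nummod", "_", "_"]),
])

counts, total = gender_counts_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos}: {n} ({n / total:.0%} of all tokens)")
```

To run this on real data, read the treebank's `.conllu` file into a string and pass it to `gender_counts_by_upos` in place of `SAMPLE`.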

NOUN

10865 NOUN tokens (70% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (10857; 100%), Number=Sing (8588; 79%), Mutation=EMPTY (7997; 74%).

NOUN tokens may have the following values of Gender:

Paradigm iaith (columns: Masc, Fem)
Mutation=AM|Number=Sing: hiaith (Masc), hiaith (Fem)
Mutation=AM|Number=Plur: hieithoedd
Number=Sing: Iaith (Masc), iaith (Fem)
Number=Plur: ieithoedd

PROPN

1326 PROPN tokens (67% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1302; 98%), Mutation=EMPTY (1110; 84%).

PROPN tokens may have the following values of Gender:

Paradigm Bangor (columns: Masc, Fem)
Mutation=NM: Mangor
Mutation=SM: Fangor (Masc), Fangor (Fem)
(no features listed): Bangor

Gender seems to be a lexical feature of PROPN: 93% of lemmas (594) occur with only one value of Gender.
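The "lexical feature" check can be sketched as follows. This is an illustration under assumptions rather than the page's actual method: it assumes the percentage is taken over lemmas of the given tag that occur at least once with a Gender value, and both the function name `single_gender_lemma_share` and the toy rows are hypothetical.

```python
from collections import defaultdict


def single_gender_lemma_share(rows, upos):
    """Return (count, share) of lemmas of `upos` seen with exactly one Gender value.

    rows: iterable of (lemma, upos, gender) tuples; gender is None when absent.
    """
    values = defaultdict(set)
    for lemma, tag, gender in rows:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Toy rows (hypothetical annotation, not taken from the treebank).
ROWS = [
    ("Bangor", "PROPN", "Masc"),
    ("Bangor", "PROPN", "Fem"),
    ("Cymru", "PROPN", "Fem"),
    ("Caerdydd", "PROPN", "Fem"),
    ("Caerdydd", "PROPN", None),   # tokens without Gender are ignored
    ("iaith", "NOUN", "Fem"),      # other tags are ignored
]

count, share = single_gender_lemma_share(ROWS, "PROPN")
print(f"{share:.0%} of lemmas ({count}) occur with only one value of Gender")
```

A share close to 100% suggests that the value is fixed per lemma rather than chosen by inflection; the same function applies to any other tag by changing the `upos` argument.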

PRON

1166 PRON tokens (34% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1061; 91%), PronType=Prs (1041; 89%), Person=3 (1032; 89%), Poss=EMPTY (837; 72%).

PRON tokens may have the following values of Gender:

Paradigm hwy (columns: Masc, Fem)
Number=Sing: 'w (Masc), 'w (Fem)
Number=Plur|Poss=Yes: eu

ADJ

372 ADJ tokens (11% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (360; 97%), Number=Sing (344; 92%), Mutation=EMPTY (255; 69%).

ADJ tokens may have the following values of Gender:

Paradigm da (columns: Masc, Fem)
Degree=Pos|Mutation=SM: well
Degree=Pos: gwell
Degree=Cmp: gwell
Degree=Equ: cystal

Gender seems to be a lexical feature of ADJ: 93% of lemmas (174) occur with only one value of Gender.

NUM

89 NUM tokens (13% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (89; 100%), NumForm=Word (87; 98%).

NUM tokens may have the following values of Gender:

Paradigm dau (columns: Masc, Fem)
Mutation=SM: ddau (Masc), ddwy (Fem)
(no features listed): dau (Masc); dwy, dyw (Fem)

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (1906; 57%), NOUN –[conj]–> NOUN (435; 58%), NOUN –[det]–> PRON (82; 53%), NOUN –[appos]–> NOUN (66; 62%), NOUN –[appos]–> PROPN (31; 51%), PRON –[compound:redup]–> PRON (31; 100%), PROPN –[appos]–> NOUN (16; 55%), PROPN –[nmod]–> NOUN (16; 57%), NOUN –[fixed]–> NOUN (15; 100%), NOUN –[acl:relcl]–> NOUN (14; 54%).
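Agreement counts like those above can be approximated with a short pass over the dependency edges. The sketch below rests on assumptions: it treats the percentage as the share of agreeing edges among edges of the same type where both parent and child carry a Gender value, which may differ from how this page defines it. The function name `gender_agreement`, the token dictionaries, and the toy sentence are all hypothetical.

```python
from collections import Counter


def gender_agreement(tokens):
    """Count parent-child edges that agree in Gender, keyed by relation type.

    tokens: list of dicts for one sentence with keys
            id, head, upos, deprel, gender (gender is None when absent).
    Returns (agree, seen): Counters keyed by (parent UPOS, deprel, child UPOS).
    """
    by_id = {t["id"]: t for t in tokens}
    agree, seen = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # root token (head 0) has no parent node
        if child["gender"] is None or parent["gender"] is None:
            continue  # agreement is only defined when both sides have Gender
        key = (parent["upos"], child["deprel"], child["upos"])
        seen[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, seen


# Toy sentence (hypothetical annotation, not taken from the treebank).
SENTENCE = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 2, "head": 1, "upos": "NOUN", "deprel": "nmod", "gender": "Fem"},
    {"id": 3, "head": 1, "upos": "NOUN", "deprel": "nmod", "gender": "Masc"},
    {"id": 4, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": None},
]

agree, seen = gender_agreement(SENTENCE)
for (parent, rel, child), n in agree.most_common(10):
    print(f"{parent} -[{rel}]-> {child} ({n}; {n / seen[(parent, rel, child)]:.0%})")
```

For a whole treebank, the two counters from each sentence can be summed with `Counter.update` before printing the most frequent relations.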