
This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

14954 tokens (27%) have a non-empty value of Gender. 4936 types (66%) occur at least once with a non-empty value of Gender. 3241 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (11779; 22% of instances), PROPN (1448; 3% of instances), PRON (1247; 2% of instances), ADJ (380; 1% of instances), NUM (100; 0% of instances).
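Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentences are hand-made rather than taken from UD_Welsh-CCG, and the helper names `parse_feats`, `read_tokens` and `gender_stats` are hypothetical. It assumes that a "type" is a distinct word form and that multiword-token ranges and empty nodes are skipped.

```python
from collections import Counter

# Hand-made CoNLL-U sample for illustration only (not from UD_Welsh-CCG).
SAMPLE = """\
# text = yr iaith dda
1\tyr\ty\tDET\t_\t_\t2\tdet\t_\t_
2\tiaith\tiaith\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
3\tdda\tda\tADJ\t_\tDegree=Pos|Mutation=SM\t2\tamod\t_\t_

# text = dau gi
1\tdau\tdau\tNUM\t_\tGender=Masc|NumType=Card\t2\tnummod\t_\t_
2\tgi\tci\tNOUN\t_\tGender=Masc|Mutation=SM|Number=Sing\t0\troot\t_\t_
"""


def parse_feats(column):
    """Turn a FEATS column such as 'Gender=Fem|Number=Sing' into a dict."""
    if column == "_":
        return {}
    return dict(pair.split("=", 1) for pair in column.split("|"))


def read_tokens(text):
    """Yield one dict per syntactic word, skipping comments and blank lines."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if not cols[0].isdigit():
            continue
        yield {
            "form": cols[1],
            "lemma": cols[2],
            "upos": cols[3],
            "feats": parse_feats(cols[5]),
        }


def gender_stats(tokens, feature="Gender"):
    """Return (marked, total) pairs for tokens, types and lemmas, plus UPOS counts."""
    tokens = list(tokens)
    marked = [t for t in tokens if feature in t["feats"]]
    return {
        "tokens": (len(marked), len(tokens)),
        "types": (len({t["form"] for t in marked}), len({t["form"] for t in tokens})),
        "lemmas": (len({t["lemma"] for t in marked}), len({t["lemma"] for t in tokens})),
        "by_upos": Counter(t["upos"] for t in marked),
    }


stats = gender_stats(read_tokens(SAMPLE))
print(stats)
```

Each pair holds the number of items carrying the feature and the overall total, so a percentage is the first element divided by the second.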

NOUN

11779 NOUN tokens (70% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (11771; 100%), Number=Sing (9326; 79%), Mutation=EMPTY (8661; 74%).
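The EMPTY entries above count tokens that have Gender but lack the other feature. A rough sketch of such a co-occurrence count follows. It assumes that the candidate "other" features are all feature names seen on the same part-of-speech tag, which may differ from how this page was actually generated. The function name `cooccurring_values` and the sample tokens are hypothetical.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender"):
    """Count other feature values (or EMPTY) on tokens of `upos` that have `feature`."""
    same_pos = [feats for pos, feats in tokens if pos == upos]
    # Assumption: candidate features are those observed anywhere on this UPOS.
    other_names = {name for feats in same_pos for name in feats} - {feature}
    counts = Counter()
    for feats in same_pos:
        if feature not in feats:
            continue
        for name in other_names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts


# Made-up (UPOS, FEATS) pairs for illustration only.
tokens = [
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Gender": "Masc", "Number": "Sing", "Mutation": "SM"}),
    ("NOUN", {"Gender": "Fem", "Number": "Plur"}),
    ("NOUN", {"VerbForm": "Vnoun"}),
    ("ADJ", {"Degree": "Pos"}),
]
counts = cooccurring_values(tokens, "NOUN")
for value, n in counts.most_common():
    print(value, n)
```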

NOUN tokens may have the following values of Gender:

Paradigm iaith    Masc    Fem
Mutation=AM|Number=Sing    hiaith    hiaith
Mutation=AM|Number=Plur    hieithoedd
Number=Sing    Iaith    iaith
Number=Plur    ieithoedd

PROPN

1448 PROPN tokens (68% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1417; 98%), Mutation=EMPTY (1212; 84%).

PROPN tokens may have the following values of Gender:

Paradigm Bangor    Masc    Fem
Mutation=NM    Mangor
Mutation=SM    Fangor    Fangor
Bangor

Gender seems to be a lexical feature of PROPN. 93% of lemmas (633) occur with only one value of Gender.
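The "lexical feature" observation rests on how many lemmas are seen with a single value. One way to check this is sketched below, under the assumption that only lemmas with at least one Gender-marked token are counted. The function name `single_value_lemmas` is hypothetical and the (lemma, value) pairs are made up for illustration.

```python
from collections import defaultdict


def single_value_lemmas(pairs):
    """Return (lemmas with exactly one value, all lemmas seen with a value)."""
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Made-up (lemma, Gender) observations, one per token.
pairs = [
    ("Bangor", "Masc"),
    ("Bangor", "Fem"),
    ("Cymru", "Fem"),
    ("Cymru", "Fem"),
    ("Caerdydd", "Fem"),
]
single, total = single_value_lemmas(pairs)
print(f"{single} of {total} lemmas ({100 * single / total:.0f}%) have one value")
```

A high share suggests that the value belongs to the lemma rather than varying by inflection.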

PRON

1247 PRON tokens (34% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1134; 91%), PronType=Prs (1112; 89%), Person=3 (1105; 89%), Poss=EMPTY (898; 72%).

PRON tokens may have the following values of Gender:

Paradigm ef    Masc    Fem
Poss=Yes|PronType=Prs    ei, 'i, 'w, fe, 'u, ef
PronType=Emp    yntau
PronType=Prs    ei, o, e, 'w, 'i, fo, ef, fe, i'w

ADJ

380 ADJ tokens (10% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (367; 97%), Number=Sing (349; 92%), Mutation=EMPTY (258; 68%).

ADJ tokens may have the following values of Gender:

Paradigm da    Masc    Fem
Degree=Pos|Mutation=SM    well
Degree=Pos    gwell
Degree=Cmp    gwell
Degree=Equ    cystal

Gender seems to be a lexical feature of ADJ. 93% of lemmas (177) occur with only one value of Gender.

NUM

100 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (100; 100%), NumForm=Word (98; 98%), Mutation=EMPTY (53; 53%).

NUM tokens may have the following values of Gender:

Paradigm dau    Masc    Fem
Mutation=SM    ddau    ddwy
dau    dwy, dyw

Relations with Agreement in Gender

The 10 most frequent relations where parent and child nodes agree in Gender: NOUN –[conj]–> NOUN (474; 58%), NOUN –[det]–> PRON (87; 53%), NOUN –[appos]–> NOUN (71; 62%), PROPN –[conj]–> PROPN (49; 51%), PRON –[compound:redup]–> PRON (35; 100%), PROPN –[nmod]–> NOUN (18; 60%), NOUN –[fixed]–> NOUN (16; 100%), NOUN –[acl:relcl]–> NOUN (14; 54%), NOUN –[obl]–> NOUN (14; 56%), NOUN –[amod]–> NOUN (12; 67%).
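Agreement figures like these can be approximated by walking each dependency edge and comparing the feature on both ends. The sketch below assumes that the percentage is the share of agreeing pairs among pairs where both parent and child carry Gender. That reading is an assumption, not something this page states. The function name `agreement_counts` and the sample sentence are hypothetical.

```python
from collections import Counter


def agreement_counts(sentence, feature="Gender"):
    """Count parent-child pairs that agree in `feature`, keyed by relation pattern."""
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0 and no parent token
            continue
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue  # only pairs where both nodes have the feature are compared
        key = f'{parent["upos"]} -[{child["deprel"]}]-> {child["upos"]}'
        both[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return agree, both


# Made-up sentence: one NOUN head with two conjuncts and an adjective.
sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 1, "upos": "NOUN", "deprel": "conj", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 1, "upos": "NOUN", "deprel": "conj", "feats": {"Gender": "Masc"}},
    {"id": 4, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {}},
]
agree, both = agreement_counts(sentence)
for key, n in agree.most_common():
    print(f"{key} ({n}; {100 * n / both[key]:.0f}%)")
```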