This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

13234 tokens (27%) have a non-empty value of Gender. 4571 types (65%) occur at least once with a non-empty value of Gender. 3040 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (10360; 21% of all tokens), PROPN (1280; 3%), PRON (1134; 2%), ADJ (372; 1%), NUM (88; under 1%).
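Counts of this kind could be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout, the names `parse_feats` and `gender_counts_by_upos` are hypothetical, and the three-token sample is invented for illustration rather than taken from UD_Welsh-CCG.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (Counter of UPOS -> tokens with Gender, total token count)."""
    counts = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separator or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):
            counts[cols[3]] += 1  # column 4 is UPOS, column 6 is FEATS
    return counts, total

# Invented sample (assumption): annotations are illustrative only.
SAMPLE_ROWS = [
    ("1", "iaith", "iaith", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"),
    ("2", "dda", "da", "ADJ", "_", "Degree=Pos|Mutation=SM", "1", "amod", "_", "_"),
    ("3", "Bangor", "Bangor", "PROPN", "_", "Gender=Masc|Number=Sing", "1", "nmod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

counts, total = gender_counts_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos}: {n} ({n / total:.0%} of all tokens)")
```

Dividing by the total token count rather than by the number of Gender-bearing tokens matches the per-tag percentages above, which sum to the overall 27%.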

NOUN

10360 NOUN tokens (70% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (10352; 100%), Number=Sing (8196; 79%), Mutation=EMPTY (7607; 73%).

NOUN tokens may have the following values of Gender:

Paradigm iaith (values: Masc, Fem)
Mutation=AM|Number=Sing: hiaith (Masc); hiaith (Fem)
Mutation=AM|Number=Plur: hieithoedd
Number=Sing: Iaith (Masc); iaith (Fem)
Number=Plur: ieithoedd

PROPN

1280 PROPN tokens (67% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1257; 98%), Mutation=EMPTY (1072; 84%).

PROPN tokens may have the following values of Gender:

Paradigm Bangor (values: Masc, Fem)
Mutation=NM: Mangor
Mutation=SM: Fangor (Masc); Fangor (Fem)
(no features): Bangor

Gender seems to be a lexical feature of PROPN: 93% of lemmas (577) occur with only one value of Gender.
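One plausible reading of this lexical-feature check is sketched below: group the observed Gender values by lemma and report the share of lemmas that were seen with exactly one value. The function name `single_value_lemma_share` and the sample pairs are hypothetical, and the actual statistics script may apply additional conditions.

```python
from collections import defaultdict

def single_value_lemma_share(pairs):
    """pairs: iterable of (lemma, gender) for tokens with a non-empty Gender.

    Returns (lemmas with exactly one Gender value, all lemmas, share).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    share = single / len(values) if values else 0.0
    return single, len(values), share

# Invented sample (assumption): not real counts from the treebank.
SAMPLE_PAIRS = [
    ("Bangor", "Masc"),
    ("Bangor", "Masc"),
    ("Cymru", "Fem"),
    ("Caerdydd", "Masc"),
    ("Caerdydd", "Fem"),
]

single, total, share = single_value_lemma_share(SAMPLE_PAIRS)
print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value of Gender")
```

A high share suggests that Gender is fixed per lemma rather than varying by inflection.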

PRON

1134 PRON tokens (35% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1035; 91%), PronType=Prs (1015; 90%), Person=3 (1007; 89%), Poss=EMPTY (809; 71%).

PRON tokens may have the following values of Gender:

Paradigm hwy (values: Masc, Fem)
Number=Sing: 'w (Masc); 'w (Fem)
Number=Plur|Poss=Yes: eu

ADJ

372 ADJ tokens (11% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (360; 97%), Number=Sing (344; 92%), Mutation=EMPTY (255; 69%).

ADJ tokens may have the following values of Gender:

Paradigm da (values: Masc, Fem)
Degree=Pos|Mutation=SM: well
Degree=Pos: gwell
Degree=Cmp: gwell
Degree=Equ: cystal

Gender seems to be a lexical feature of ADJ: 93% of lemmas (174) occur with only one value of Gender.

NUM

88 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (88; 100%), NumForm=Word (86; 98%), Mutation=EMPTY (48; 55%).

NUM tokens may have the following values of Gender:

Paradigm dau (values: Masc, Fem)
Mutation=SM: ddau (Masc); ddwy (Fem)
(no features): dau (Masc); dwy, dyw (Fem)

Relations with Agreement in Gender

The 10 most frequent relations where the parent and child nodes agree in Gender: NOUN –[nmod]–> NOUN (1797; 56%), NOUN –[conj]–> NOUN (405; 58%), NOUN –[det]–> PRON (80; 54%), NOUN –[appos]–> NOUN (65; 62%), PRON –[compound:redup]–> PRON (31; 100%), PROPN –[appos]–> NOUN (15; 54%), PROPN –[nmod]–> NOUN (15; 56%), NOUN –[acl:relcl]–> NOUN (14; 58%), NOUN –[fixed]–> NOUN (14; 100%), NOUN –[amod]–> NOUN (8; 62%).
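Agreement figures like these could be tallied as sketched below. The sketch rests on an assumption about what the percentages mean: that each is the share of parent-child pairs of that type, with Gender set on both nodes, whose values match. The function name `gender_agreement`, the dictionary keys, and the sample sentence are all hypothetical.

```python
from collections import Counter

def gender_agreement(sentence):
    """sentence: list of dicts with keys id, upos, head, deprel, gender.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs that agree in Gender, and all pairs where both nodes have Gender.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for tok in sentence:
        parent = by_id.get(tok["head"])  # None for the root (head 0)
        if parent is None or tok["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], tok["deprel"], tok["upos"])
        both[key] += 1
        if tok["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, both

# Invented sample sentence (assumption): not taken from the treebank.
SAMPLE_SENTENCE = [
    {"id": 1, "upos": "NOUN", "head": 0, "deprel": "root", "gender": "Fem"},
    {"id": 2, "upos": "NOUN", "head": 1, "deprel": "nmod", "gender": "Fem"},
    {"id": 3, "upos": "NOUN", "head": 1, "deprel": "conj", "gender": "Masc"},
    {"id": 4, "upos": "ADJ", "head": 3, "deprel": "amod", "gender": None},
]

agree, both = gender_agreement(SAMPLE_SENTENCE)
for (parent, rel, child), n in both.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n} agree")
```

Summing both counters over every sentence in the treebank and sorting by the agreeing count would give a ranking of the same shape as the list above.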