home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

25779 tokens (29%) have a non-empty value of Gender. 4438 types (58%) occur at least once with a non-empty value of Gender. 3066 lemmas (55%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (13369; 15% instances), DET (5041; 6% instances), PRON (4712; 5% instances), ADJ (1445; 2% instances), PROPN (1212; 1% instances).

NOUN

13369 NOUN tokens (72% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (13369; 100%), Number=Sing (11353; 85%).

NOUN tokens may have the following values of Gender:

Paradigm bliadhnaMascFem
Case=Dat|Number=Singbhliadhna, bliadhna, bhliadhn', bliadhn’
Case=Dat|Number=Plurbliadhnaichean, bliadhnachan
Case=Gen|Form=Emp|Number=Singbliadhna-sa
Case=Gen|Number=Singbliadhna, bhliadhna, bliadhn', bliadhn’
Case=Gen|Number=Plurbhliadhnaicheanbhliadhnaichean, bliadhnaichean, bliadhnachan, bhliadhnachan, bliadhna
Case=Nom|CleftType=Nom|Number=Singbhliadhna, bliadhn', bliadhna
Case=Nom|Number=Singbliadhna, bhliadhna, bliadhn', bhliadhn'
Case=Nom|Number=Plurbliadhnaichean, bhliadhnaichean

Gender seems to be lexical feature of NOUN. 96% lemmas (2408) occur only with one value of Gender.

DET

5041 DET tokens (76% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (4547; 90%), Poss=EMPTY (4547; 90%), PronType=Art (4547; 90%), Definite=Def (4492; 89%), Number=Sing (4115; 82%), Case=EMPTY (3807; 76%).

DET tokens may have the following values of Gender:

Paradigm anMascFem
Case=Gen|Definite=Def|Number=Singan, a’, a', am, nana, an, a', a’
Case=Gen|Definite=Def|Number=Sing|Typo=Yesam, naa', a’, an
Case=Gen|Definite=Def|Number=Dualan
Case=Gen|Definite=Def|Number=Plurnan, na, namnan, nam, na
Case=Gen|Definite=Def|Number=Plur|Typo=Yesnana
Definite=Def|Number=Singan, a’, am, 'n, a', 'm, ‘n, ’n, naman, a’, a', 'n, ‘n, am, a, 'm, ‘m
Definite=Def|Number=Sing|Typo=Yesa', an, a’'n
Definite=Def|Number=Plurnana
Number=Singam, anan, a’, a'
Number=Sing|Typo=Yesa’
Number=Plurnana

PRON

4712 PRON tokens (49% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4712; 100%), Person=3 (4712; 100%), PronType=Prs (4711; 100%).

PRON tokens may have the following values of Gender:

Paradigm iMascFem
CleftType=Nomi
CleftType=Obl|Form=Empise
Form=Empise
isei, h-i

ADJ

1445 ADJ tokens (41% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1113; 77%), Case=Nom (788; 55%).

ADJ tokens may have the following values of Gender:

Paradigm eileMascFem
Case=Dat|Number=Singeileeile
Case=Dat|Number=Plureileeile
Case=Gen|Number=Singeileeile
Case=Gen|Number=Plureileeile
Case=Nom|Number=Singeile, eil', eil’eile
Case=Nom|Number=Plureileeile

PROPN

1212 PROPN tokens (28% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: NounType=Prs (971; 80%), Case=Nom (635; 52%).

PROPN tokens may have the following values of Gender:

Paradigm [Name]MascFem
Case=Dat|CleftType=Obl[Name]
Case=Dat[Name], dh’[Name][Name]
Case=Gen[Name][Name]
Case=Nom|CleftType=Nom[Name]
Case=Nom[Name][Name]
Case=Voc[Name][Name]

Gender seems to be lexical feature of PROPN. 100% lemmas (242) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4100; 85%), NOUN –[amod]–> ADJ (1273; 66%), NOUN –[conj]–> NOUN (295; 51%), NOUN –[appos]–> PROPN (70; 69%), PROPN –[amod]–> ADJ (67; 88%), NOUN –[appos]–> NOUN (56; 64%), PROPN –[nmod]–> PROPN (45; 52%), PROPN –[conj]–> PROPN (38; 68%), PROPN –[appos]–> NOUN (36; 75%), NOUN –[compound]–> NOUN (19; 100%).