home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

25773 tokens (29%) have a non-empty value of Gender. 4438 types (58%) occur at least once with a non-empty value of Gender. 3070 lemmas (55%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (13612; 15% instances), DET (5045; 6% instances), PRON (4710; 5% instances), ADJ (1441; 2% instances), PROPN (965; 1% instances).

NOUN

13612 NOUN tokens (72% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (13612; 100%), Number=Sing (11526; 85%).

NOUN tokens may have the following values of Gender:

Paradigm dèanMascFem
Case=Datnì, dèanamh
Case=Gen
Case=Nom

Gender seems to be lexical feature of NOUN. 96% lemmas (2441) occur only with one value of Gender.

DET

5045 DET tokens (77% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (4542; 90%), Poss=EMPTY (4542; 90%), Definite=Def (4487; 89%), PronType=Art (4487; 89%), Number=Sing (4119; 82%), Case=EMPTY (3812; 76%).

DET tokens may have the following values of Gender:

Paradigm anMascFem
Case=Gen|Definite=Def|Number=Sing|PronType=Artan, a’, a', am, nana, an, a', a’
Case=Gen|Definite=Def|Number=Dual|PronType=Artan
Case=Gen|Definite=Def|Number=Plur|PronType=Artnan, na, namnan, na, nam
Definite=Def|Number=Sing|PronType=Artan, a’, am, 'n, a', 'm, ‘n, ’n, naman, a’, a', 'n, ‘n, am, a, 'm, ‘m
Definite=Def|Number=Plur|PronType=Artnana
Number=Singam, an, a’an, a’, a'
Number=Plurnana

PRON

4710 PRON tokens (49% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4710; 100%), Person=3 (4710; 100%).

PRON tokens may have the following values of Gender:

Paradigm iMascFem
Form=Empise
isei, h-i

ADJ

1441 ADJ tokens (42% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1110; 77%), Case=Nom (785; 54%).

ADJ tokens may have the following values of Gender:

Paradigm eileMascFem
Case=Dat|Number=Singeileeile
Case=Dat|Number=Plureileeile
Case=Gen|Number=Singeileeile
Case=Gen|Number=Plureileeile
Case=Nom|Number=Singeile, eil', eil’eile
Case=Nom|Number=Plureileeile

PROPN

965 PROPN tokens (23% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (570; 59%).

PROPN tokens may have the following values of Gender:

Paradigm [Name]MascFem
Case=Dat[Name], dh’[Name][Name]
Case=Gen[Name][Name]
Case=Nom[Name][Name]
Case=Voc[Name][Name]

Gender seems to be lexical feature of PROPN. 100% lemmas (202) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4243; 85%), NOUN –[amod]–> ADJ (1274; 67%), NOUN –[conj]–> NOUN (300; 51%), NOUN –[appos]–> NOUN (61; 59%), NOUN –[appos]–> PROPN (55; 60%), PROPN –[amod]–> ADJ (48; 96%), PROPN –[appos]–> NOUN (29; 71%), PROPN –[conj]–> PROPN (28; 65%), NOUN –[compound]–> NOUN (19; 100%), ADJ –[conj]–> ADJ (16; 84%).