Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
25850 tokens (29%) have a non-empty value of Gender.
4437 types (58%) occur at least once with a non-empty value of Gender.
3054 lemmas (55%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (13369; 15% of all tokens), DET (5112; 6% of all tokens), PRON (4712; 5% of all tokens), ADJ (1445; 2% of all tokens), PROPN (1212; 1% of all tokens).
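The token, type, and lemma counts above can be approximated from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that produced this page: the names `SAMPLE`, `read_tokens`, and `feature_coverage` are hypothetical, the sample sentences are hand-made toy data rather than treebank content, and it assumes that "types" means distinct word forms compared case-sensitively.

```python
from collections import defaultdict

# Toy CoNLL-U fragment, hand-made for illustration (not taken from the treebank).
SAMPLE = """\
1\tan\tan\tDET\t_\tDefinite=Def|Gender=Masc|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tduine\tduine\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_
3\tmòr\tmòr\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tamod\t_\t_

1\tbha\tbi\tVERB\t_\tTense=Past\t0\troot\t_\t_
2\ti\ti\tPRON\t_\tGender=Fem|Number=Sing|Person=3|PronType=Prs\t1\tnsubj\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield one dict per syntactic word, skipping comments and blank lines."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield {
            "form": cols[1],
            "lemma": cols[2],
            "upos": cols[3],
            "feats": parse_feats(cols[5]),
        }


def feature_coverage(tokens, feature="Gender"):
    """Count tokens, types, and lemmas that carry a non-empty value of `feature`."""
    tokens = list(tokens)
    with_feat = [t for t in tokens if feature in t["feats"]]
    by_upos = defaultdict(int)
    for t in with_feat:
        by_upos[t["upos"]] += 1
    return {
        "tokens": len(tokens),
        "tokens_with": len(with_feat),
        "types_with": len({t["form"] for t in with_feat}),
        "lemmas_with": len({t["lemma"] for t in with_feat}),
        "by_upos": dict(by_upos),
    }


stats = feature_coverage(read_tokens(SAMPLE))
print(stats)
```

Dividing `tokens_with` by `tokens` gives the kind of percentage reported above (29% for this treebank); the per-tag counts in `by_upos` correspond to the part-of-speech breakdown.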
NOUN
13369 NOUN tokens (72% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (13369; 100%), Number=Sing (11353; 85%).
NOUN tokens may have the following values of Gender:
Fem (4753; 36% of non-empty Gender): bliadhna, buille, bhliadhna, obair, cuid, tè, aghaidh, dòigh, leithid, uair
Masc (8616; 64% of non-empty Gender): duine, fear, fhios, taobh, rud, daoine, latha, àite, taigh, leth
EMPTY (5293): bhith, dol, ràdh, chur, ais, dhèanamh, feuchainn, thoirt, tighinn, cur
| Paradigm bliadhna | Masc | Fem |
|---|---|---|
| Case=Dat\|Number=Sing | bhliadhna, bliadhna, bhliadhn', bliadhn’ | |
| Case=Dat\|Number=Plur | bliadhnaichean, bliadhnachan | |
| Case=Gen\|Form=Emp\|Number=Sing | bliadhna-sa | |
| Case=Gen\|Number=Sing | bliadhna, bhliadhna, bliadhn', bliadhn’ | |
| Case=Gen\|Number=Plur | bhliadhnaichean | bhliadhnaichean, bliadhnaichean, bliadhnachan, bhliadhnachan, bliadhna |
| Case=Nom\|CleftType=Nom\|Number=Sing | bhliadhna, bliadhn', bliadhna | |
| Case=Nom\|Number=Sing | bliadhna, bhliadhna, bliadhn', bhliadhn' | |
| Case=Nom\|Number=Plur | bliadhnaichean, bhliadhnaichean | |
Gender seems to be a lexical feature of NOUN: 96% of lemmas (2401) occur with only one value of Gender.
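The "lexical feature" judgement rests on how many lemmas are seen with a single value only. A minimal sketch of that check follows, assuming the observations are available as (lemma, value) pairs; the function name `single_value_share` and the toy data are hypothetical and do not reflect actual treebank counts.

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations, one pair per NOUN token.
OBSERVATIONS = [
    ("bliadhna", "Fem"),
    ("bliadhna", "Fem"),
    ("bliadhna", "Masc"),
    ("duine", "Masc"),
    ("duine", "Masc"),
    ("obair", "Fem"),
    ("taigh", "Masc"),
]


def single_value_share(observations):
    """Return (single-valued lemmas, all lemmas, share) for a feature."""
    values = defaultdict(set)
    for lemma, value in observations:
        values[lemma].add(value)
    if not values:
        return 0, 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), single / len(values)


single, total, share = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas ({share:.0%}) occur with only one value")
```

A share close to 100% suggests that the value is fixed per lemma (lexical) rather than chosen by inflection or agreement.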
DET
5112 DET tokens (77% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=Def (4618; 90%), Person=EMPTY (4618; 90%), Poss=EMPTY (4618; 90%), PronType=Art (4618; 90%), Number=Sing (4180; 82%), Case=EMPTY (3808; 74%).
DET tokens may have the following values of Gender:
Fem (1778; 35% of non-empty Gender): na, an, a’, a, a’, ‘n, nan, nam, ‘n, am
Masc (3334; 65% of non-empty Gender): an, na, a’, a, am, a’, nan, ‘n, nam, ‘m
EMPTY (1515): an, sin, seo, a, h-uile, am, mo, a’, na, do
| Paradigm an | Masc | Fem |
|---|---|---|
| Case=Gen\|Number=Sing | an, a’, a', am, na | na, an, a', a’ |
| Case=Gen\|Number=Sing\|Typo=Yes | am, na | a', a’, an |
| Case=Gen\|Number=Dual | an | |
| Case=Gen\|Number=Plur | nan, na, nam | nan, nam, na |
| Case=Gen\|Number=Plur\|Typo=Yes | na | na |
| Number=Sing | an, a’, am, 'n, a', 'm, ‘n, ’n, nam | an, a’, a', 'n, ‘n, am, a, 'm, ‘m |
| Number=Sing\|Typo=Yes | a’, a', an | 'n |
| Number=Plur | na | na |
PRON
4712 PRON tokens (49% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4712; 100%), Person=3 (4712; 100%), PronType=Prs (4712; 100%).
PRON tokens may have the following values of Gender:
Fem (994; 21% of non-empty Gender): i, a, ise, h-i, h-ì
Masc (3718; 79% of non-empty Gender): e, esan, a, h-e, ise, è, mise, sinne
EMPTY (4937): iad, mi, thu, sin, sinn, fhèin, seo, dè, sibh, mise
| Paradigm i | Masc | Fem |
|---|---|---|
| CleftType=Nom | i | |
| CleftType=Obl\|Form=Emp | ise | |
| Form=Emp | ise | |
| | ise | i, h-i |
ADJ
1445 ADJ tokens (41% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1113; 77%), Case=Nom (788; 55%).
ADJ tokens may have the following values of Gender:
Fem (504; 35% of non-empty Gender): eile, mhòr, ùr, àrd, shaor, mhath, bheag, mhór, beaga, Buidhe
Masc (941; 65% of non-empty Gender): eile, beag, ùr, mòr, math, mór, òg, dubh, ghoirid, ùra
EMPTY (2077): bith, sam, a, cinnteach, math, faisg, seann, thall, fhearr, droch
| Paradigm eile | Masc | Fem |
|---|---|---|
| Case=Dat\|Number=Sing | eile | eile |
| Case=Dat\|Number=Plur | eile | eile |
| Case=Gen\|Number=Sing | eile | eile |
| Case=Gen\|Number=Plur | eile | eile |
| Case=Nom\|Number=Sing | eile, eil', eil’ | eile |
| Case=Nom\|Number=Plur | eile | eile |
PROPN
1212 PROPN tokens (28% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: NounType=Prs (971; 80%), Case=Nom (635; 52%).
PROPN tokens may have the following values of Gender:
Fem (334; 28% of non-empty Gender): Gàidhlig, Ghàidhlig, [Name], Màiri, Gaidhealtachd, Anna, Ghaidhealtachd, Gàidhealtachd, Mairearad, Inis
Masc (878; 72% of non-empty Gender): [Name], Iain, Dòmhnall, Tormod, Mhurchaidh, Alasdair, Aonghais, Garaidh, Labhruinn, lain
EMPTY (3103): [Placename], Alba, [Name], Yugoslavia, MacLeish, Malpas, Uibhist, h-Alba, Aitken, Dalgleish
| Paradigm [Name] | Masc | Fem |
|---|---|---|
| Case=Dat\|CleftType=Obl | [Name] | |
| Case=Dat | [Name], dh’[Name] | [Name] |
| Case=Gen | [Name] | [Name] |
| Case=Nom\|CleftType=Nom | [Name] | |
| Case=Nom | [Name] | [Name] |
| Case=Voc | [Name] | [Name] |
Gender seems to be a lexical feature of PROPN: 100% of lemmas (242) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (4100; 85%),
NOUN –[amod]–> ADJ (1273; 66%),
NOUN –[conj]–> NOUN (295; 51%),
NOUN –[appos]–> PROPN (70; 69%),
PROPN –[amod]–> ADJ (67; 88%),
NOUN –[appos]–> NOUN (56; 64%),
PROPN –[nmod]–> PROPN (45; 52%),
PROPN –[conj]–> PROPN (38; 68%),
PROPN –[appos]–> NOUN (36; 75%),
NOUN –[compound]–> NOUN (19; 100%).
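An agreement table of this shape can be sketched as follows. This is an illustration only, under the assumption that the percentage is the share of agreeing pairs among all parent-child pairs of that relation type in which both nodes carry a Gender value; the page itself does not state the denominator. The edge tuples and the function name `agreement_table` are hypothetical.

```python
from collections import Counter

# Hypothetical dependency edges:
# (parent UPOS, relation, child UPOS, parent Gender, child Gender); None = no Gender.
EDGES = [
    ("NOUN", "det", "DET", "Masc", "Masc"),
    ("NOUN", "det", "DET", "Fem", "Fem"),
    ("NOUN", "det", "DET", "Fem", "Masc"),
    ("NOUN", "amod", "ADJ", "Fem", "Fem"),
    ("NOUN", "amod", "ADJ", "Masc", None),
    ("VERB", "nsubj", "PRON", None, "Fem"),
]


def agreement_table(edges):
    """Rank relation types by the number of pairs whose Gender values match."""
    both = Counter()   # pairs where parent and child both have a value
    agree = Counter()  # subset of those pairs where the values are equal
    for p_upos, deprel, c_upos, p_val, c_val in edges:
        if p_val is None or c_val is None:
            continue
        key = (p_upos, deprel, c_upos)
        both[key] += 1
        if p_val == c_val:
            agree[key] += 1
    rows = [(key, agree[key], agree[key] / both[key]) for key in both if agree[key]]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


rows = agreement_table(EDGES)
for (parent, rel, child), count, share in rows:
    print(f"{parent} -[{rel}]-> {child} ({count}; {share:.0%})")
```

Edges where either node lacks Gender are ignored here, so a relation such as VERB to PRON never enters the table; a different choice of denominator would change the percentages but not the counts.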