Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: Features: Case
This feature is universal.
It occurs with 4 different values: Dat, Gen, Nom, Voc.
17912 tokens (20%) have a non-empty value of Case.
4618 types (61%) occur at least once with a non-empty value of Case.
3257 lemmas (59%) occur at least once with a non-empty value of Case.
The feature is used with 6 part-of-speech tags: NOUN (13391; 15% instances), PROPN (1721; 2% instances), ADJ (1445; 2% instances), DET (1350; 1% instances), PART (4; 0% instances), NUM (1; 0% instances).
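A breakdown like the one above could be reproduced from a CoNLL-U file along the following lines. This is only a minimal sketch: the function names `parse_feats` and `case_by_upos` are hypothetical, the sample tokens are hand-written for illustration rather than taken from ARCOSG, and it is an assumption that multiword-token ranges and empty nodes are skipped.

```python
from collections import Counter, defaultdict

# Toy CoNLL-U fragment, hand-written for illustration (not taken from ARCOSG).
SAMPLE = """\
# text = an duine eile
1\tan\tan\tDET\t_\tDefinite=Def|PronType=Art\t2\tdet\t_\t_
2\tduine\tduine\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_
3\teile\teile\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tamod\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS column such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def case_by_upos(conllu_text):
    """Count Case values per UPOS tag; tokens without Case count as EMPTY."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # assumption: skip ranges like 1-2 and empty nodes like 1.1
        feats = parse_feats(cols[5])
        counts[cols[3]][feats.get("Case", "EMPTY")] += 1
    return counts


counts = case_by_upos(SAMPLE)
for upos, values in counts.items():
    print(upos, dict(values))
```

Dividing each non-empty count by the total number of tokens for that tag would give a share comparable to the per-tag percentages reported in the sections below.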
NOUN
13391 NOUN tokens (72% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: VerbForm=EMPTY (13391; 100%), Number=Sing (11353; 85%), Gender=Masc (8616; 64%).
NOUN tokens may have the following values of Case:
Dat (4755; 36% of non-empty Case): taobh, àite, àm, aghaidh, leth, thaobh, duine, dòigh, ceann, bhliadhna
Gen (2463; 18% of non-empty Case): bliadhna, pàirce, latha, obrach, taighe, dùthcha, dìon, pàrlamaid, airgid, dhaoine
Nom (6144; 46% of non-empty Case): fhios, fear, duine, rud, daoine, ball, latha, buille, bliadhna, taobh
Voc (29; 0% of non-empty Case): dhuine, ‘ille, Rìgh, ghràidh, ‘illean, bhalaich, ghràidhein, ‘ill’, bheadragain, bhròinein
EMPTY (5271): bhith, dol, ràdh, chur, ais, dhèanamh, feuchainn, thoirt, tighinn, cur
| Paradigm duine | Nom | Dat | Gen | Voc |
|---|---|---|---|---|
| CleftType=Nom\|Number=Sing | duine, dhuine | | | |
| CleftType=Nom\|Number=Plur | daoin', daoine | | | |
| Number=Sing | duine, duin', duin’, dhuine | duine, dhuine, dhuin', duin' | duine | dhuine |
| Number=Plur | daoine, duine, daoin', daoin’ | daoine, dhaoine, dhuine | dhaoine, daoine, dhaoin', duin' | |
PROPN
1721 PROPN tokens (40% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: NounType=Prs (1039; 60%), Gender=Masc (878; 51%).
PROPN tokens may have the following values of Case:
Dat (209; 12% of non-empty Case): Ghàidhlig, [Name], Dòmhnall, Ghaidhealtachd, Iain, Gàidhlig, Tearaich, Dhòmhnall, Garaidh, Labhruinn
Gen (785; 46% of non-empty Case): [Name], Gàidhlig, h-Alba, Alba, Gaidhealtachd, Iain, Yugoslavia, [Placename], Dhùn, Astràilia
Nom (635; 37% of non-empty Case): [Name], Iain, Dòmhnall, Màiri, Tormod, Alasdair, Anna, Ghàidhlig, Eachann, Garaidh
Voc (92; 5% of non-empty Case): [Name], Mhurchaidh, Aonghais, Iain, Raghnaill, Dhòmhnaill, Anna, Choinnich, Sheonaidh, Ann
EMPTY (2594): [Placename], [Name], Alba, MacLeish, Yugoslavia, Malpas, Aitken, Dalgleish, Johnson, Uibhist
| Paradigm [Name] | Nom | Dat | Gen | Voc |
|---|---|---|---|---|
| CleftType=Nom\|Gender=Masc | [Name] | | | |
| CleftType=Obl\|Gender=Masc | [Name] | | | |
| Gender=Masc | [Name] | [Name], dh’[Name] | [Name] | [Name] |
| Gender=Fem | [Name] | [Name] | [Name] | [Name] |
| [Name] |
ADJ
1445 ADJ tokens (41% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (1112; 77%), Gender=Masc (941; 65%).
ADJ tokens may have the following values of Case:
Dat (435; 30% of non-empty Case): eile, ùr, ghoirid, ùra, dubh, mór, móra, Albannach, Eòrpach, annasach
Gen (218; 15% of non-empty Case): eile, Ghlais, àrd, mhòir, Buidhe, Bhàin, ùr, Ruaidh, bhig, dùthchail
Nom (788; 55% of non-empty Case): eile, ùr, beag, mhòr, mòr, math, shaor, àrd, òg, mòra
Voc (4; 0% of non-empty Case): dhuibh, bhochd, òig
EMPTY (2077): bith, sam, a, cinnteach, math, faisg, seann, thall, fhearr, droch
| Paradigm eile | Nom | Dat | Gen |
|---|---|---|---|
| Gender=Masc\|Number=Sing | eile, eil', eil’ | eile | eile |
| Gender=Masc\|Number=Plur | eile | eile | eile |
| Gender=Fem\|Number=Sing | eile | eile | eile |
| Gender=Fem\|Number=Plur | eile | eile | eile |
DET
1350 DET tokens (20% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Definite=Def (1350; 100%), Person=EMPTY (1350; 100%), Poss=EMPTY (1350; 100%), PronType=Art (1350; 100%), Number=Sing (1066; 79%), Gender=Masc (766; 57%).
DET tokens may have the following values of Case:
Gen (1350; 100% of non-empty Case): na, an, a’, nan, a’, nam, am, a
EMPTY (5277): an, na, a’, a, am, a’, sin, seo, ‘n, h-uile
PART
4 PART tokens (0% of all PART tokens) have a non-empty value of Case.
The most frequent other feature values with which PART and Case co-occurred: Polarity=EMPTY (4; 100%), PronType=EMPTY (4; 100%), PartType=Pat (3; 75%).
PART tokens may have the following values of Case:
Gen (4; 100% of non-empty Case): ‘ic, Mac, Mhic
EMPTY (8794): a, a’, gu, ag, cha, nach, air, gun, chan, an
NUM
1 NUM token (0% of all NUM tokens) has a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Digit (1; 100%), NumType=Card (1; 100%).
NUM tokens may have the following values of Case:
Gen (1; 100% of non-empty Case): 1630an
EMPTY (1084): aon, dà, deug, trì, dhà, fhichead, fichead, ceithir, chiad, cheud
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (1274; 66%),
NOUN –[conj]–> NOUN (433; 74%),
PROPN –[det]–> DET (282; 73%),
PROPN –[amod]–> ADJ (66; 72%),
PROPN –[conj]–> PROPN (53; 78%),
NOUN –[appos]–> NOUN (46; 53%),
PROPN –[appos]–> NOUN (25; 52%),
NOUN –[compound]–> NOUN (19; 100%),
ADJ –[conj]–> ADJ (16; 84%),
NOUN –[reparandum]–> NOUN (11; 73%).
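
Agreement counts of this kind could be gathered roughly as sketched below. The sketch rests on assumptions: the function name `case_agreement` is hypothetical, the tokens are hand-written toy data rather than ARCOSG sentences, and the denominator is taken to be the pairs in which both parent and child carry a non-empty Case, which the page itself does not state.

```python
from collections import Counter

# Hand-written toy sentence as (id, form, upos, feats, head, deprel) tuples;
# the feature values are illustrative, not attested treebank annotations.
SAMPLE_SENT = [
    (1, "duine", "NOUN", {"Case": "Nom"}, 0, "root"),
    (2, "eile", "ADJ", {"Case": "Nom"}, 1, "amod"),
    (3, "mòr", "ADJ", {"Case": "Dat"}, 1, "amod"),
]


def case_agreement(sentence):
    """Count parent/child pairs that agree in Case, keyed by relation pattern."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _tid, _form, upos, feats, head, deprel in sentence:
        parent = by_id.get(head)
        if parent is None:
            continue  # the root has no parent token
        parent_case = parent[3].get("Case")
        child_case = feats.get("Case")
        if parent_case is None or child_case is None:
            continue  # assumption: only pairs where both sides have Case count
        key = (parent[2], deprel, upos)
        both[key] += 1
        if parent_case == child_case:
            agree[key] += 1
    return agree, both


agree, both = case_agreement(SAMPLE_SENT)
for key, total in both.items():
    parent_upos, deprel, child_upos = key
    share = 100 * agree[key] / total
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {share:.0f}%)")
```

Summing the two counters over every sentence in the treebank and sorting by the agreement count would yield a ranked list in the same shape as the one above.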