Treebank Statistics: UD_North_Sami-Giella: Features: Case
This feature is universal.
It occurs with 8 different values: Abe, Acc, Com, Ess, Gen, Ill, Loc, Nom.
10832 tokens (40%) have a non-empty value of Case.
5109 types (67%) occur at least once with a non-empty value of Case.
2932 lemmas (67%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (6374; 24% instances), PRON (2610; 10% instances), PROPN (875; 3% instances), ADJ (515; 2% instances), NUM (344; 1% instances), VERB (111; 0% instances), AUX (3; 0% instances).
NOUN
6374 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (4311; 68%).
NOUN tokens may have the following values of Case:
Acc(1286; 20% of non-emptyCase): sámegiela, veahki, bierggu, biktasiid, mánáid, reivve, girjji, girjjiid, gáfe, bargguCom(247; 4% of non-emptyCase): biillain, mánáiguin, mánáin, vugiin, beatnagiiddisguin, beatnagiin, biillaiguin, bissuin, boazodoaluin, borramušainEss(167; 3% of non-emptyCase): oahpaheaddjin, lassin, veahkkin, ovdamearkan, vuođđun, buohccedivššárin, nuorran, Eurohpameašttirin, bassin, buohccinGen(1301; 20% of non-emptyCase): sámi, jagi, beaivvi, áigge, olbmo, sámegiela, máná, áiggi, skuvlla, sámiidIll(513; 8% of non-emptyCase): mánáide, skuvlii, gávpogii, meahccái, sámegillii, internáhttii, mollii, bargui, heajaide, siidiiLoc(814; 13% of non-emptyCase): skuvllas, internáhtas, guovllus, viesus, oasis, oktavuođas, olbmuin, barggus, goađis, gávpogisNom(2046; 32% of non-emptyCase): olbmot, mánát, eadni, gánda, olmmoš, stállu, oahppit, mánná, nieida, oahpaheaddjiEMPTY(45): M., dearvvašvuođa-, A., Mr., giella-, skuvla-, Bivdo-, Gieldda-, IL, J.
| Paradigm olmmoš | Nom | Acc | Gen | Loc | Ess | Com | Ill |
|---|---|---|---|---|---|---|---|
| _ | olmmožin | ||||||
| Number=Sing | olmmoš | olbmo | olbmo | olbmos | olbmui | ||
| Number=Plur | olbmot | olbmuid | olbmuid | olbmuin | olbmuiguin | olbmuide | |
| Number=Plur|Number[psor]=Plur|Person[psor]=3 | olbmuideaset |
PRON
2610 PRON tokens (92% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (1624; 62%), PronType=Prs (1537; 59%).
PRON tokens may have the following values of Case:
Acc(356; 14% of non-emptyCase): dan, maid, su, iežas, daid, dán, iežaset, du, mu, maidegeCom(53; 2% of non-emptyCase): dainna, daiguin, dáinna, iežainis, nuppiin, suinna, duinna, iežainan, iežaineaskka, maiguinEss(4; 0% of non-emptyCase): danin, dákkárin, iehčaneameGen(410; 16% of non-emptyCase): mu, dan, dán, min, su, iežas, sin, du, daid, iežasetIll(170; 7% of non-emptyCase): munnje, dasa, sutnje, dutnje, sidjiide, alccesis, dán, midjiide, dan, earáideLoc(239; 9% of non-emptyCase): mus, dus, das, mis, sis, sus, dán, dan, mas, dainNom(1378; 53% of non-emptyCase): son, mun, mii, dat, sii, don, dát, soai, moai, geatEMPTY(237): buot, juohke, eará, dakkár, muhtun, makkár, muhtin, unnán, dákkár, seamma
| Paradigm dat | Nom | Acc | Gen | Loc | Ess | Com | Ill |
|---|---|---|---|---|---|---|---|
| Number=Sing | dat | dan, dange | dan | das, dan, dasnai | dainna | dasa, dan | |
| Number=Plur | dat, Dathan | daid | daid | dain | daiguin, daid | daidda | |
| danin |
PROPN
875 PROPN tokens (88% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (865; 99%).
PROPN tokens may have the following values of Case:
Acc(43; 5% of non-emptyCase): Sarvva, Liná, Divvuma, Máhte, Sámedikki, Antonsen, Beckham, Buolbmága, Busi, EfraimaCom(14; 2% of non-emptyCase): Sámedikkiin, Birehiin, Hanseniin, Iŋggáin, Juffáin, Máhte-Iŋggáin, Márehiin, Nilut_Cupain, Rihtáin, RiibmagállásiinEss(5; 1% of non-emptyCase): Gállábárdnin, Jesusin, Mihkkalažžan, Márehažžan, SmierrunGen(223; 25% of non-emptyCase): Norgga, Sámi, Finnmárkku, Kárášjoga, Romssa, Sámedikki, Ipmila, Guovdageainnu, Deanu, RuoŧaIll(63; 7% of non-emptyCase): Kárášjohkii, Sápmái, Ellii, Finnmárkkuopmodahkii, Gáivutnii, Hámmárfestii, Trosterudii, Aarbortii, Abbai, ArniiLoc(147; 17% of non-emptyCase): Kárášjogas, Guovdageainnus, Finnmárkkus, Deanus, Gáivuonas, Norggas, Romssas, Máhtes, Olmmáivákkis, OslosNom(380; 43% of non-emptyCase): Gállá, Máret, Máhtte, Liná, Ánde, Sámediggi, Ánne, Biret, Ipmil, FinnmárkkuopmodatEMPTY(117): Nils, Stuorra, Aslak, Biret, Jack, Lene, Per, Anders, Johan, Margrethe
| Paradigm Sámediggi | Nom | Acc | Gen | Loc | Com | Ill |
|---|---|---|---|---|---|---|
| Sámediggi | Sámedikki | Sámedikki, Sámedikke | Sámedikkis | Sámedikkiin | Sámediggái |
ADJ
515 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=EMPTY (437; 85%), Number=Sing (329; 64%).
ADJ tokens may have the following values of Case:
Acc(18; 3% of non-emptyCase): buori, buriid, ollu, Goalmmáda, baháid, buhtismeahttumiid, doloža, guoskevačča, sullasačča, suohttasiiddiskaCom(3; 1% of non-emptyCase): buriinEss(59; 11% of non-emptyCase): duhtavažžan, nubbin, seavdnjadin, nuorran, ruoksadin, bassin, bivnnuhin, boarisin, buhtisin, buhtismeahttuminGen(20; 4% of non-emptyCase): nuppi, jagáš, buoremusaid, buori, 7-jahkásačča, buriid, doloža, parlamentáralaččaid, ráhkkásisIll(1; 0% of non-emptyCase): sullásaččaideLoc(8; 2% of non-emptyCase): nuppi, Nuorabuin, Nuoramusain, doložis, ráhkkásisttánNom(406; 79% of non-emptyCase): buorre, váttis, vejolaš, veara, buorit, boaris, dehálaš, suohtas, divrras, duohtaEMPTY(861): ollu, ođđa, vuosttaš, stuora, stuorra, eanaš, olles, buoremus, amas, dálá
| Paradigm buorre | Nom | Acc | Gen | Ess | Com |
|---|---|---|---|---|---|
| _ | buorrin | ||||
| Degree=Cmp | buorebun | ||||
| Degree=Cmp|Number=Sing | buoret | ||||
| Degree=Sup | buoremussan | ||||
| Degree=Sup|Number=Sing | buoremus | ||||
| Degree=Sup|Number=Plur | buoremusaid | ||||
| Number=Sing | buorre | buori | buori | buriin | |
| Number=Plur | buorit | buriid | buriid |
NUM
344 NUM tokens (99% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (344; 100%), Number=Sing (327; 95%).
NUM tokens may have the following values of Case:
Acc(79; 23% of non-emptyCase): guokte, moadde, golbma, máŋga, ovtta, vihtta, Galliid, guhtta, njeallje, 1300Com(15; 4% of non-emptyCase): ovttain, golmmain, guvttiin, viđain, galliin, golmmaiguin, čuđiinEss(2; 1% of non-emptyCase): guoktin, oktanGen(61; 18% of non-emptyCase): golmma, viđa, máŋgga, ovtta, 12, guovtti, 1.8.2001, moatti, 05.01.00, 12.03.2010Ill(23; 7% of non-emptyCase): golmma, beannot, guovtti, čuohtái, moatti, máŋgga, njealji, ovtta, golmmaideLoc(47; 14% of non-emptyCase): ovtta, guovtti, 1982:s, 1995:s, golmmain, máŋgga, 1834:s, 1877:s, 1898:s, 1899:sNom(117; 34% of non-emptyCase): okta, guokte, golbma, máŋga, njeallje, vihtta, moadde, 1971, 2005, 50EMPTY(4): guoktenuppelot, vihttalot
| Paradigm guokte | Nom | Acc | Gen | Loc | Ess | Com | Ill |
|---|---|---|---|---|---|---|---|
| Number=Sing | guokte | guokte | guovtti, guovtte | guovtti | guvttiin | guovtti | |
| Number=Plur | guovttit | guvttiid | |||||
| guoktin |
VERB
111 VERB tokens (3% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (111; 100%), Mood=EMPTY (111; 100%), Number=EMPTY (111; 100%), Person=EMPTY (111; 100%), Tense=EMPTY (111; 100%), VerbForm=Ger (111; 100%).
VERB tokens may have the following values of Case:
Abe(11; 10% of non-emptyCase): beroškeahttá, eahpitkeahttá, logakeahttá, bážikeahttá, dieđikeahttá, mávssekeahtesEss(64; 58% of non-emptyCase): boahtimin, fárremin, leamen, čierastallame, bargame, bargamin, bassaladdame, bassame, boahtime, oađđiminGen(24; 22% of non-emptyCase): vácci, čuoigga, gudnejahttin, ráhkistan, Mearkkašan, Suga, bora, fuopmášan, namahan, njágaLoc(12; 11% of non-emptyCase): goargŋumis, juhkamis, bargamis, borgguheames, botkemis, deaivvadeamis, gođđimis, guldaleames, jáhkkimis, vuostáváldimisEMPTY(4199): lea, leat, lei, ledje, bođii, boahtá, manai, vuolgit, ožžon, dieđe
| Paradigm bargat | Loc | Ess |
|---|---|---|
| bargamis | bargame, bargamin |
Case seems to be lexical feature of VERB. 93% lemmas (67) occur only with one value of Case.
AUX
3 AUX tokens (0% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Mood=EMPTY (3; 100%), Number=EMPTY (3; 100%), Person=EMPTY (3; 100%), Polarity=EMPTY (3; 100%), Tense=EMPTY (3; 100%), VerbForm=Ger (3; 100%).
AUX tokens may have the following values of Case:
Ess(3; 100% of non-emptyCase): leamen, áigume, áiguminEMPTY(1991): lea, leat, ii, lei, eai, ledje, galgá, lean, sáhttá, in
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[conj]–> NOUN (324; 96%),
NOUN –[det]–> PRON (168; 93%),
ADJ –[nsubj]–> NOUN (118; 98%),
ADJ –[nsubj]–> PRON (76; 100%),
NOUN –[nsubj]–> PRON (74; 74%),
NOUN –[nsubj]–> NOUN (65; 76%),
PROPN –[conj]–> PROPN (42; 84%),
NOUN –[amod]–> NUM (31; 97%),
ADJ –[conj]–> ADJ (13; 93%),
NOUN –[appos]–> NOUN (13; 76%).