Treebank Statistics: UD_Icelandic-GC: Features: Case
This feature is universal.
It occurs with 4 different values: Acc, Dat, Gen, Nom.
59365 tokens (60%) have a non-empty value of Case.
18484 types (92%) occur at least once with a non-empty value of Case.
12100 lemmas (88%) occur at least once with a non-empty value of Case.
The feature is used with 10 part-of-speech tags: NOUN (20374; 20% instances), ADP (11735; 12% instances), PRON (7760; 8% instances), VERB (5565; 6% instances), ADJ (5406; 5% instances), PROPN (5158; 5% instances), NUM (1353; 1% instances), AUX (986; 1% instances), ADV (983; 1% instances), DET (45; 0% instances).
NOUN
20374 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Definite=EMPTY (15590; 77%), Number=Sing (14204; 70%).
NOUN tokens may have the following values of Case:
Acc(5841; 29% of non-emptyCase): málið, áhrif, þátt, stig, tíma, sæti, fólk, mál, stað, vegDat(6040; 30% of non-emptyCase): landinu, sæti, samtali, máli, leiknum, sögn, Íslandi, fólki, stað, leikGen(3199; 16% of non-emptyCase): ára, landsins, kvenna, Íslands, manns, árs, barna, fólks, félagsins, málsinsNom(5294; 26% of non-emptyCase): fólk, maður, menn, liðið, formaður, forseti, stjórnvöld, konur, framkvæmdastjóri, fyrirtækiðEMPTY(190): mars, 2017, apríl, ágúst, desember, febrúar, júlí, júní, nóvember, október
| Paradigm ár | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | ársins | |||
| Definite=Def|Gender=Fem|Number=Plur | árarnar | |||
| Definite=Def|Gender=Neut|Number=Sing | árið | árið | árinu | ársins |
| Definite=Def|Gender=Neut|Number=Plur | árin | árin | árunum | áranna |
| Gender=Masc|Number=Sing | ári | árs, ára | ||
| Gender=Masc|Number=Plur | ára | |||
| Gender=Fem|Number=Sing | ár | |||
| Gender=Fem|Number=Plur | árar | |||
| Gender=Neut | ár | |||
| Gender=Neut|Number=Sing | ár | ár | ári, árum | árs |
| Gender=Neut|Number=Plur | ár | ár | árum | ára |
ADP
11735 ADP tokens (95% of all ADP tokens) have a non-empty value of Case.
ADP tokens may have the following values of Case:
Acc(4099; 35% of non-emptyCase): í, um, á, við, fyrir, með, eftir, yfir, undir, gegnumDat(6631; 57% of non-emptyCase): í, á, af, með, að, frá, úr, fyrir, hjá, eftirGen(979; 8% of non-emptyCase): til, vegna, milli, á, án, meðal, innan, auk, utan, fyrirNom(26; 0% of non-emptyCase): sem, á, og, sbr., tilEMPTY(676): til, um, á, fyrir, í, við, eftir, með, af, að
| Paradigm á | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| á | á | á, a | á |
PRON
7760 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (5464; 70%), Person=EMPTY (4076; 53%), PronType=Prs (4030; 52%).
PRON tokens may have the following values of Case:
Acc(1534; 20% of non-emptyCase): það, sig, hann, þetta, þá, sína, mig, þann, hvað, alltDat(1812; 23% of non-emptyCase): því, sér, þeim, mér, þessu, sínum, þessum, honum, henni, okkurGen(802; 10% of non-emptyCase): þess, þeirra, hans, okkar, hennar, þessa, allra, hvers, annarra, sinnarNom(3612; 47% of non-emptyCase): það, hann, ég, við, hún, þetta, þeir, þau, þessi, þúEMPTY(42): við, þess, hvað, annað, hann, sér, honum, Þetta, því, Hún
| Paradigm sá | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| _ | þess | |||
| Gender=Masc|Number=Sing | sá, þess, þessi | þann | þeim | þess |
| Gender=Masc|Number=Sing|Person=3|PronType=Prs | sá | |||
| Gender=Masc|Number=Plur | þeir | þá | þeim | þeirra |
| Gender=Masc|Number=Plur|Person=3|PronType=Prs | þeir | þeim | þeirra | |
| Gender=Fem|Number=Sing | sú | þá, það | þeirri | þeirrar |
| Gender=Fem|Number=Plur | þær | þær | þeim | þeirra |
| Gender=Neut|Number=Sing | það | það | því | þess |
| Gender=Neut|Number=Sing|Person=3|PronType=Prs | það | það | því | þess |
| Gender=Neut|Number=Plur | þau | þau | þeim | þeirra |
| Gender=Neut|Number=Plur|Person=3|PronType=Prs | þau | þau | þeirra |
VERB
5565 VERB tokens (43% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Voice=Act (5161; 93%), VerbForm=EMPTY (3174; 57%), Mood=Ind (2803; 50%).
VERB tokens may have the following values of Case:
Acc(3748; 67% of non-emptyCase): segir, hafa, sagði, gera, fá, sjá, segja, taka, fékk, hefurDat(1084; 19% of non-emptyCase): finnst, ná, koma, halda, náði, kom, tengjast, fylgja, sinna, komiðGen(127; 2% of non-emptyCase): er, krafist, krefjast, krefst, leita, njóta, nýtur, notið, geta, gætaNom(606; 11% of non-emptyCase): er, var, eru, verið, sé, verður, verði, voru, varð, verðaEMPTY(7275): er, segir, var, eru, kemur, fara, verið, koma, verður, sagði
| Paradigm vera | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Mood=Ind|Number=Sing|Person=1|Tense=Past | var | |||
| Mood=Ind|Number=Sing|Person=1|Tense=Pres | er | |||
| Mood=Ind|Number=Sing|Person=3|Tense=Past | var, voru | var | var | |
| Mood=Ind|Number=Sing|Person=3|Tense=Pres | er | er | er | er |
| Mood=Ind|Number=Plur|Person=3|Tense=Past | voru | voru | ||
| Mood=Ind|Number=Plur|Person=3|Tense=Pres | eru | eru | eru | |
| Mood=Sub|Number=Sing|Person=1|Tense=Past | væri | |||
| Mood=Sub|Number=Sing|Person=1|Tense=Pres | sé | |||
| Mood=Sub|Number=Sing|Person=3|Tense=Past | væri | væri | ||
| Mood=Sub|Number=Sing|Person=3|Tense=Pres | sé | sé | ||
| Mood=Sub|Number=Plur|Person=3|Tense=Past | væru | |||
| Mood=Sub|Number=Plur|Person=3|Tense=Pres | séu | |||
| VerbForm=Inf | vera | vera | vera | |
| VerbForm=Sup | verið | verið |
ADJ
5406 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=EMPTY (4210; 78%), Number=Sing (3779; 70%).
ADJ tokens may have the following values of Case:
Acc(1423; 26% of non-emptyCase): síðustu, fyrstu, meiri, næsta, næstu, góða, mikla, nýja, fyrsta, mikiðDat(1184; 22% of non-emptyCase): síðasta, síðustu, næstu, fyrstu, miklu, mörgum, góðu, miklum, nýju, nýjumGen(387; 7% of non-emptyCase): Sameinuðu, margra, íslenskra, síðustu, fyrsta, íslenska, íslensku, aldraðra, bandarískra, breskaNom(2412; 45% of non-emptyCase): hægt, ljóst, mikið, mikil, fleiri, gott, margir, mikill, erfitt, mikilvægtEMPTY(45): hægt, spennandi, Suðlæg, Svört, aðgerðalaus, beyglaður, brotinn, eldri, flestum, gert
| Paradigm mikill | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Degree=Pos|Gender=Masc|Number=Sing | mikill | mikla | miklum | mikils |
| Degree=Pos|Gender=Fem|Number=Sing | mikla | |||
| Degree=Pos|Gender=Fem|Number=Plur | mikil | miklum | ||
| Degree=Pos|Gender=Neut|Number=Sing | mikið | mikið, mikla | miklu | |
| Degree=Pos|Gender=Neut|Number=Plur | mikil | |||
| Degree=Cmp|Gender=Masc|Number=Sing | meiri | meiri | meiri | |
| Degree=Cmp|Gender=Fem|Number=Sing | meiri | meiri | meira | |
| Degree=Cmp|Gender=Fem|Number=Plur | meiri | meiri | ||
| Degree=Cmp|Gender=Neut|Number=Sing | meira | meira | meira | |
| Degree=Cmp|Gender=Neut|Number=Plur | meiri | meiri | meiri | |
| Degree=Sup|Gender=Masc|Number=Sing | mestan | mestum | ||
| Degree=Sup|Gender=Masc|Number=Plur | mestir | |||
| Degree=Sup|Gender=Fem|Number=Sing | mest, mesta | mesta | mestu | |
| Degree=Sup|Gender=Fem|Number=Plur | Mestar | mestar | ||
| Degree=Sup|Gender=Neut|Number=Sing | mesta, mest | mest | mestu, mesta | |
| Degree=Sup|Gender=Neut|Number=Plur | mestu | |||
| Gender=Masc|Number=Sing | mikill, mikli | mikinn, mikla | miklum | mikils, mikla |
| Gender=Masc|Number=Plur | miklir | miklum, miklu | ||
| Gender=Fem|Number=Sing | mikil, mikla | mikla | mikilli | mikillar |
| Gender=Fem|Number=Plur | miklar | miklar | miklum | mikilla |
| Gender=Neut|Number=Sing | mikið | mikið | miklu, meiru | mikils |
| Gender=Neut|Number=Plur | mikil | mikil, miklu | miklum |
PROPN
5158 PROPN tokens (84% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=EMPTY (2595; 50%).
PROPN tokens may have the following values of Case:
Acc(528; 10% of non-emptyCase): Trump, Ísland, Jón, Audi, Clinton, James, Reykjavík, Singapúr, Ólaf, AlbertaDat(1132; 22% of non-emptyCase): Íslandi, Bandaríkjunum, Reykjavík, Morgunblaðinu, Akureyri, Danmörku, London, Noregi, Evrópu, HMGen(1048; 20% of non-emptyCase): Íslands, Bandaríkjanna, Reykjavíkur, 2, Alþingis, stöðvar, Sjálfstæðisflokksins, RÚV, Trump, TrumpsNom(2450; 47% of non-emptyCase): Trump, þór, Jón, Guðmundur, Ísland, Katrín, Sigurður, Ólafur, Bjarni, BjörnEMPTY(967): Icelandair, United, New, air, Facebook, Group, Manchester, WOW, York, Post
| Paradigm Ísland | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Ísland | Ísland | Íslandi | Íslands, Ísland |
NUM
1353 NUM tokens (72% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: Number=Plur (1159; 86%), NumType=EMPTY (765; 57%).
NUM tokens may have the following values of Case:
Acc(448; 33% of non-emptyCase): tvo, tvö, fimm, eitt, tíu, 10, þrjú, milljónir, prósent, tværDat(284; 21% of non-emptyCase): tveimur, þremur, fimm, fjórum, sex, átta, sjö, tíu, einu, milljónumGen(273; 20% of non-emptyCase): tveggja, þriggja, fjögurra, fimm, sex, átta, 100, 16, 18, 6Nom(348; 26% of non-emptyCase): tveir, prósent, þrír, einn, fimm, þúsund, tvær, tvö, tíu, sexEMPTY(537): 2017, 2016, 0, 2, 2012, 2014, 1, prósent, 2013, 3
| Paradigm tveir | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Gender=Masc | tveir | tvo | tveimur | tveggja |
| Gender=Fem | tvær | tvær | tveimur | tveggja |
| Gender=Fem|NumType=Card | tveggja | |||
| Gender=Neut | tvö | tvö | tveimur | tveggja |
AUX
986 AUX tokens (24% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Voice=Act (983; 100%), VerbForm=EMPTY (861; 87%), Person=3 (824; 84%), Mood=Ind (716; 73%), Number=Sing (699; 71%), Tense=Pres (623; 63%).
AUX tokens may have the following values of Case:
Acc(22; 2% of non-emptyCase): hafi, hefur, hafa, hefði, hefðu, muni, fengið, hafði, munum, máDat(29; 3% of non-emptyCase): er, var, hefur, sé, hafi, hafði, myndu, mætti, skuli, veriðGen(16; 2% of non-emptyCase): er, var, sé, eru, vera, veriðNom(919; 93% of non-emptyCase): er, var, eru, verið, sé, vera, væri, voru, séu, væruEMPTY(3187): er, hefur, var, hafi, hafa, verið, eru, sé, hefði, má
| Paradigm vera | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Mood=Imp|Number=Plur|Voice=Act | Verið | |||
| Mood=Ind|Number=Sing|Person=1|Tense=Past|Voice=Act | var | |||
| Mood=Ind|Number=Sing|Person=1|Tense=Pres|Voice=Act | er | |||
| Mood=Ind|Number=Sing|Person=3|Tense=Past|Voice=Act | var | var | var | |
| Mood=Ind|Number=Sing|Person=3|Tense=Pres|Voice=Act | er | er | er | |
| Mood=Ind|Number=Plur|Person=1|Tense=Past|Voice=Act | vorum | |||
| Mood=Ind|Number=Plur|Person=1|Tense=Pres|Voice=Act | erum | |||
| Mood=Ind|Number=Plur|Person=3|Tense=Past|Voice=Act | voru | |||
| Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act | eru, erum | eru | ||
| Mood=Ind|Person=3|Tense=Pres|Voice=Act | er | |||
| Mood=Sub|Number=Sing|Person=1|Tense=Past|Voice=Act | væri | |||
| Mood=Sub|Number=Sing|Person=1|Tense=Pres|Voice=Act | sé | |||
| Mood=Sub|Number=Sing|Person=2|Tense=Past|Voice=Act | værir | |||
| Mood=Sub|Number=Sing|Person=3|Tense=Past|Voice=Act | væri | |||
| Mood=Sub|Number=Sing|Person=3|Tense=Pres|Voice=Act | sé, væri | sé | sé | |
| Mood=Sub|Number=Plur|Person=1|Tense=Past|Voice=Act | værum | |||
| Mood=Sub|Number=Plur|Person=1|Tense=Pres|Voice=Act | séum | |||
| Mood=Sub|Number=Plur|Person=3|Tense=Past|Voice=Act | væru | |||
| Mood=Sub|Number=Plur|Person=3|Tense=Pres|Voice=Act | séu, sé | |||
| Number=Sing|VerbForm=Part | verið | |||
| Number=Plur|VerbForm=Part | verið | |||
| VerbForm=Inf | vera | |||
| VerbForm=Inf|Voice=Act | vera | vera | vera | |
| VerbForm=Sup|Voice=Act | verið | verið | verið | verið |
| Voice=Act | verið |
ADV
983 ADV tokens (10% of all ADV tokens) have a non-empty value of Case.
ADV tokens may have the following values of Case:
Acc(668; 68% of non-emptyCase): dag, ár, daga, fyrra, sumar, daginn, lok, viku, á, íDat(288; 29% of non-emptyCase): ári, árum, á, byrjun, því, frá, viku, mánuði, mánuðum, mínútuGen(12; 1% of non-emptyCase): til, ára, daga, framtíðar, metra, morguns, neins, sólarhringsNom(15; 2% of non-emptyCase): allt, það, Föstudaginn, ein, einn, erfitt, kvöld, mínútur, síður, vikaEMPTY(9316): ekki, þar, þá, í, fram, á, svo, upp, til, út
| Paradigm ár | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Definite=Def|Gender=Neut|Number=Sing | árið | árinu | ||
| Definite=Def|Gender=Neut|Number=Plur | árin | árunum | ||
| Gender=Fem|Number=Sing | ár | |||
| Gender=Neut|Number=Sing | ár | ári | ||
| Gender=Neut|Number=Plur | ár | árum | ára |
DET
45 DET tokens (100% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Definite=Def (45; 100%), Number=Sing (36; 80%).
DET tokens may have the following values of Case:
Acc(13; 29% of non-emptyCase): hið, hin, hina, WOW-ið, hinar, volume-iðDat(10; 22% of non-emptyCase): hinum, hinu, hinniGen(1; 2% of non-emptyCase): hinnarNom(21; 47% of non-emptyCase): hinn, hin, hið, Hinir
| Paradigm hinn | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Gender=Masc|Number=Sing | hinn | hinum | ||
| Gender=Masc|Number=Plur | Hinir | |||
| Gender=Fem|Number=Sing | hin | hina | hinni | hinnar |
| Gender=Fem|Number=Plur | hinar | hinum | ||
| Gender=Neut|Number=Sing | hið | hið | hinu | |
| Gender=Neut|Number=Plur | hin | hin | hinum |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[case]–> ADP (7816; 97%),
NOUN –[amod]–> ADJ (3305; 97%),
VERB –[obj]–> NOUN (2713; 90%),
NOUN –[nmod]–> PRON (1749; 97%),
PROPN –[flat]–> PROPN (1189; 100%),
PROPN –[case]–> ADP (1091; 84%),
PRON –[case]–> ADP (895; 98%),
NOUN –[conj]–> NOUN (860; 92%),
ADV –[case]–> ADP (708; 81%),
NOUN –[nummod]–> NUM (682; 84%).