Treebank Statistics: UD_Icelandic-GC: Features: Case
This feature is universal.
It occurs with 4 different values: Acc, Dat, Gen, Nom.
59365 tokens (60%) have a non-empty value of Case.
18484 types (92%) occur at least once with a non-empty value of Case.
12100 lemmas (88%) occur at least once with a non-empty value of Case.
The feature is used with 10 part-of-speech tags: NOUN (20374; 20% instances), ADP (11735; 12% instances), PRON (7760; 8% instances), VERB (5565; 6% instances), ADJ (5406; 5% instances), PROPN (5158; 5% instances), NUM (1353; 1% instances), AUX (986; 1% instances), ADV (983; 1% instances), DET (45; 0% instances).
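Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `case_statistics` and the toy rows are hypothetical, types are assumed to be case-sensitive word forms, and the annotation in the sample is illustrative only. The per-tag percentages above appear to be shares of all tokens in the treebank, which is an inference from the figures rather than something the page states.

```python
from collections import Counter

# Toy rows (ID, FORM, LEMMA, UPOS, FEATS). Illustrative annotation only,
# not copied from UD_Icelandic-GC.
ROWS = [
    ("1", "Fólk", "fólk", "NOUN", "Case=Nom|Gender=Neut|Number=Sing"),
    ("2", "er", "vera", "AUX", "Mood=Ind|Number=Sing|Person=3|Tense=Pres"),
    ("3", "ekki", "ekki", "ADV", "_"),
    ("4", "í", "í", "ADP", "Case=Dat"),
    ("5", "landinu", "land", "NOUN", "Case=Dat|Definite=Def|Gender=Neut|Number=Sing"),
    ("6", ".", ".", "PUNCT", "_"),
]
# Build 10-column CoNLL-U lines: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, upos, "_", feats, "_", "_", "_", "_"])
    for i, form, lemma, upos, feats in ROWS
)


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def case_statistics(conllu_text):
    """Count tokens, types and lemmas that carry a non-empty Case value."""
    tokens = tokens_with_case = 0
    types_with_case, lemmas_with_case = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2), empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        tokens += 1
        if "Case" in parse_feats(cols[5]):
            tokens_with_case += 1
            types_with_case.add(form)
            lemmas_with_case.add(lemma)
            by_upos[upos] += 1
    return {
        "tokens": tokens,
        "tokens_with_case": tokens_with_case,
        "types_with_case": len(types_with_case),
        "lemmas_with_case": len(lemmas_with_case),
        "by_upos": by_upos,
    }


stats = case_statistics(SAMPLE)
for upos, count in stats["by_upos"].most_common():
    # Percentage relative to all tokens (assumed denominator).
    print(f"{upos} ({count}; {round(100 * count / stats['tokens'])}% instances)")
```

On a real treebank the same function would be called once per `.conllu` file, with the counters summed across files.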
NOUN
20374 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Definite=EMPTY (15590; 77%), Number=Sing (14204; 70%).
NOUN tokens may have the following values of Case:
Acc (5841; 29% of non-empty Case): málið, áhrif, þátt, stig, tíma, sæti, fólk, mál, stað, veg
Dat (6040; 30% of non-empty Case): landinu, sæti, samtali, máli, leiknum, sögn, Íslandi, fólki, stað, leik
Gen (3199; 16% of non-empty Case): ára, landsins, kvenna, Íslands, manns, árs, barna, fólks, félagsins, málsins
Nom (5294; 26% of non-empty Case): fólk, maður, menn, liðið, formaður, forseti, stjórnvöld, konur, framkvæmdastjóri, fyrirtækið
EMPTY (190): mars, 2017, apríl, ágúst, desember, febrúar, júlí, júní, nóvember, október
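The percentages in this list are shares of the NOUN tokens that have a non-empty Case (for example, 5841 of 20374 is about 29%), and EMPTY is reported as a bare count. A small sketch of that computation follows; the function name `case_distribution` and the token counts in the toy data are hypothetical, and only the word forms are taken from the list above.

```python
from collections import Counter, defaultdict

# Toy tokens as (form, upos, feats); the frequencies are invented.
TOKENS = [
    ("málið", "NOUN", {"Case": "Acc"}),
    ("málið", "NOUN", {"Case": "Acc"}),
    ("landinu", "NOUN", {"Case": "Dat"}),
    ("fólk", "NOUN", {"Case": "Nom"}),
    ("mars", "NOUN", {}),
    ("í", "ADP", {"Case": "Dat"}),
]


def case_distribution(tokens, upos, top_n=10):
    """Map each Case value to (count, percent of non-empty Case, top forms)."""
    forms_by_value = defaultdict(Counter)
    for form, tok_upos, feats in tokens:
        if tok_upos == upos:
            forms_by_value[feats.get("Case", "EMPTY")][form] += 1
    non_empty = sum(
        sum(forms.values())
        for value, forms in forms_by_value.items()
        if value != "EMPTY"
    )
    report = {}
    for value, forms in forms_by_value.items():
        count = sum(forms.values())
        # EMPTY gets no percentage, matching the layout of the list above.
        if value == "EMPTY" or non_empty == 0:
            share = None
        else:
            share = round(100 * count / non_empty)
        report[value] = (count, share, [f for f, _ in forms.most_common(top_n)])
    return report


report = case_distribution(TOKENS, "NOUN")
```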
Paradigm ár | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Definite=Def|Gender=Masc|Number=Sing | ársins | |||
Definite=Def|Gender=Fem|Number=Plur | árarnar | |||
Definite=Def|Gender=Neut|Number=Sing | árið | árið | árinu | ársins |
Definite=Def|Gender=Neut|Number=Plur | árin | árin | árunum | áranna |
Gender=Masc|Number=Sing | ári | árs, ára | ||
Gender=Masc|Number=Plur | ára | |||
Gender=Fem|Number=Sing | ár | |||
Gender=Fem|Number=Plur | árar | |||
Gender=Neut | ár | |||
Gender=Neut|Number=Sing | ár | ár | ári, árum | árs |
Gender=Neut|Number=Plur | ár | ár | árum | ára |
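A paradigm table like the one above groups the attested forms of one lemma by their remaining features (rows) and by Case (columns). The sketch below shows one way to build such a grouping; the function name `paradigm` is hypothetical, and the toy tokens reuse a few forms of ár from the table with invented feature bundles. It is an illustration under those assumptions, not the generator behind this page.

```python
from collections import defaultdict

CASES = ("Nom", "Acc", "Dat", "Gen")

# Toy tokens as (form, lemma, upos, feats).
TOKENS = [
    ("ár", "ár", "NOUN", {"Case": "Nom", "Gender": "Neut", "Number": "Sing"}),
    ("ári", "ár", "NOUN", {"Case": "Dat", "Gender": "Neut", "Number": "Sing"}),
    ("ári", "ár", "NOUN", {"Case": "Dat", "Gender": "Neut", "Number": "Sing"}),
    ("árs", "ár", "NOUN", {"Case": "Gen", "Gender": "Neut", "Number": "Sing"}),
    ("árinu", "ár", "NOUN",
     {"Case": "Dat", "Definite": "Def", "Gender": "Neut", "Number": "Sing"}),
    # Same lemma under another tag: must not end up in the NOUN paradigm.
    ("ár", "ár", "ADV", {"Case": "Acc", "Gender": "Neut", "Number": "Sing"}),
]


def paradigm(tokens, lemma, upos):
    """Return {row label: {Case value: [distinct forms]}} for one lemma/UPOS."""
    table = defaultdict(lambda: {case: [] for case in CASES})
    for form, tok_lemma, tok_upos, feats in tokens:
        case = feats.get("Case")
        if tok_lemma != lemma or tok_upos != upos or case not in CASES:
            continue
        # Row label: all features except Case, sorted by name; "_" if none.
        row = "|".join(
            f"{name}={value}"
            for name, value in sorted(feats.items())
            if name != "Case"
        ) or "_"
        if form not in table[row][case]:
            table[row][case].append(form)
    return dict(table)


for row, cells in paradigm(TOKENS, "ár", "NOUN").items():
    print(row, "|", " | ".join(", ".join(cells[c]) for c in CASES))
```

Keeping an explicit entry for every Case value, even when no form is attested, makes it unambiguous which column a form belongs to.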
ADP
11735 ADP tokens (95% of all ADP tokens) have a non-empty value of Case.
ADP tokens may have the following values of Case:
Acc (4099; 35% of non-empty Case): í, um, á, við, fyrir, með, eftir, yfir, undir, gegnum
Dat (6631; 57% of non-empty Case): í, á, af, með, að, frá, úr, fyrir, hjá, eftir
Gen (979; 8% of non-empty Case): til, vegna, milli, á, án, meðal, innan, auk, utan, fyrir
Nom (26; 0% of non-empty Case): sem, á, og, sbr., til
EMPTY (676): til, um, á, fyrir, í, við, eftir, með, af, að
Paradigm á | Nom | Acc | Dat | Gen |
---|---|---|---|---|
á | á | á, a | á |
PRON
7760 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (5464; 70%), Person=EMPTY (4076; 53%), PronType=Prs (4030; 52%).
PRON tokens may have the following values of Case:
Acc (1534; 20% of non-empty Case): það, sig, hann, þetta, þá, sína, mig, þann, hvað, allt
Dat (1812; 23% of non-empty Case): því, sér, þeim, mér, þessu, sínum, þessum, honum, henni, okkur
Gen (802; 10% of non-empty Case): þess, þeirra, hans, okkar, hennar, þessa, allra, hvers, annarra, sinnar
Nom (3612; 47% of non-empty Case): það, hann, ég, við, hún, þetta, þeir, þau, þessi, þú
EMPTY (42): við, þess, hvað, annað, hann, sér, honum, Þetta, því, Hún
Paradigm sá | Nom | Acc | Dat | Gen |
---|---|---|---|---|
_ | þess | |||
Gender=Masc|Number=Sing | sá, þess, þessi | þann | þeim | þess |
Gender=Masc|Number=Sing|Person=3|PronType=Prs | sá | |||
Gender=Masc|Number=Plur | þeir | þá | þeim | þeirra |
Gender=Masc|Number=Plur|Person=3|PronType=Prs | þeir | þeim | þeirra | |
Gender=Fem|Number=Sing | sú | þá, það | þeirri | þeirrar |
Gender=Fem|Number=Plur | þær | þær | þeim | þeirra |
Gender=Neut|Number=Sing | það | það | því | þess |
Gender=Neut|Number=Sing|Person=3|PronType=Prs | það | það | því | þess |
Gender=Neut|Number=Plur | þau | þau | þeim | þeirra |
Gender=Neut|Number=Plur|Person=3|PronType=Prs | þau | þau | þeirra |
VERB
5565 VERB tokens (43% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Voice=Act (5161; 93%), VerbForm=EMPTY (3174; 57%), Mood=Ind (2803; 50%).
VERB tokens may have the following values of Case:
Acc (3748; 67% of non-empty Case): segir, hafa, sagði, gera, fá, sjá, segja, taka, fékk, hefur
Dat (1084; 19% of non-empty Case): finnst, ná, koma, halda, náði, kom, tengjast, fylgja, sinna, komið
Gen (127; 2% of non-empty Case): er, krafist, krefjast, krefst, leita, njóta, nýtur, notið, geta, gæta
Nom (606; 11% of non-empty Case): er, var, eru, verið, sé, verður, verði, voru, varð, verða
EMPTY (7275): er, segir, var, eru, kemur, fara, verið, koma, verður, sagði
Paradigm vera | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Mood=Ind|Number=Sing|Person=1|Tense=Past | var | |||
Mood=Ind|Number=Sing|Person=1|Tense=Pres | er | |||
Mood=Ind|Number=Sing|Person=3|Tense=Past | var, voru | var | var | |
Mood=Ind|Number=Sing|Person=3|Tense=Pres | er | er | er | er |
Mood=Ind|Number=Plur|Person=3|Tense=Past | voru | voru | ||
Mood=Ind|Number=Plur|Person=3|Tense=Pres | eru | eru | eru | |
Mood=Sub|Number=Sing|Person=1|Tense=Past | væri | |||
Mood=Sub|Number=Sing|Person=1|Tense=Pres | sé | |||
Mood=Sub|Number=Sing|Person=3|Tense=Past | væri | væri | ||
Mood=Sub|Number=Sing|Person=3|Tense=Pres | sé | sé | ||
Mood=Sub|Number=Plur|Person=3|Tense=Past | væru | |||
Mood=Sub|Number=Plur|Person=3|Tense=Pres | séu | |||
VerbForm=Inf | vera | vera | vera | |
VerbForm=Sup | verið | verið |
ADJ
5406 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=EMPTY (4210; 78%), Number=Sing (3779; 70%).
ADJ tokens may have the following values of Case:
Acc (1423; 26% of non-empty Case): síðustu, fyrstu, meiri, næsta, næstu, góða, mikla, nýja, fyrsta, mikið
Dat (1184; 22% of non-empty Case): síðasta, síðustu, næstu, fyrstu, miklu, mörgum, góðu, miklum, nýju, nýjum
Gen (387; 7% of non-empty Case): Sameinuðu, margra, íslenskra, síðustu, fyrsta, íslenska, íslensku, aldraðra, bandarískra, breska
Nom (2412; 45% of non-empty Case): hægt, ljóst, mikið, mikil, fleiri, gott, margir, mikill, erfitt, mikilvægt
EMPTY (45): hægt, spennandi, Suðlæg, Svört, aðgerðalaus, beyglaður, brotinn, eldri, flestum, gert
Paradigm mikill | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Degree=Pos|Gender=Masc|Number=Sing | mikill | mikla | miklum | mikils |
Degree=Pos|Gender=Fem|Number=Sing | mikla | |||
Degree=Pos|Gender=Fem|Number=Plur | mikil | miklum | ||
Degree=Pos|Gender=Neut|Number=Sing | mikið | mikið, mikla | miklu | |
Degree=Pos|Gender=Neut|Number=Plur | mikil | |||
Degree=Cmp|Gender=Masc|Number=Sing | meiri | meiri | meiri | |
Degree=Cmp|Gender=Fem|Number=Sing | meiri | meiri | meira | |
Degree=Cmp|Gender=Fem|Number=Plur | meiri | meiri | ||
Degree=Cmp|Gender=Neut|Number=Sing | meira | meira | meira | |
Degree=Cmp|Gender=Neut|Number=Plur | meiri | meiri | meiri | |
Degree=Sup|Gender=Masc|Number=Sing | mestan | mestum | ||
Degree=Sup|Gender=Masc|Number=Plur | mestir | |||
Degree=Sup|Gender=Fem|Number=Sing | mest, mesta | mesta | mestu | |
Degree=Sup|Gender=Fem|Number=Plur | Mestar | mestar | ||
Degree=Sup|Gender=Neut|Number=Sing | mesta, mest | mest | mestu, mesta | |
Degree=Sup|Gender=Neut|Number=Plur | mestu | |||
Gender=Masc|Number=Sing | mikill, mikli | mikinn, mikla | miklum | mikils, mikla |
Gender=Masc|Number=Plur | miklir | miklum, miklu | ||
Gender=Fem|Number=Sing | mikil, mikla | mikla | mikilli | mikillar |
Gender=Fem|Number=Plur | miklar | miklar | miklum | mikilla |
Gender=Neut|Number=Sing | mikið | mikið | miklu, meiru | mikils |
Gender=Neut|Number=Plur | mikil | mikil, miklu | miklum |
PROPN
5158 PROPN tokens (84% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=EMPTY (2595; 50%).
PROPN tokens may have the following values of Case:
Acc (528; 10% of non-empty Case): Trump, Ísland, Jón, Audi, Clinton, James, Reykjavík, Singapúr, Ólaf, Alberta
Dat (1132; 22% of non-empty Case): Íslandi, Bandaríkjunum, Reykjavík, Morgunblaðinu, Akureyri, Danmörku, London, Noregi, Evrópu, HM
Gen (1048; 20% of non-empty Case): Íslands, Bandaríkjanna, Reykjavíkur, 2, Alþingis, stöðvar, Sjálfstæðisflokksins, RÚV, Trump, Trumps
Nom (2450; 47% of non-empty Case): Trump, þór, Jón, Guðmundur, Ísland, Katrín, Sigurður, Ólafur, Bjarni, Björn
EMPTY (967): Icelandair, United, New, air, Facebook, Group, Manchester, WOW, York, Post
Paradigm Ísland | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Ísland | Ísland | Íslandi | Íslands, Ísland |
NUM
1353 NUM tokens (72% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: Number=Plur (1159; 86%), NumType=EMPTY (765; 57%).
NUM tokens may have the following values of Case:
Acc (448; 33% of non-empty Case): tvo, tvö, fimm, eitt, tíu, 10, þrjú, milljónir, prósent, tvær
Dat (284; 21% of non-empty Case): tveimur, þremur, fimm, fjórum, sex, átta, sjö, tíu, einu, milljónum
Gen (273; 20% of non-empty Case): tveggja, þriggja, fjögurra, fimm, sex, átta, 100, 16, 18, 6
Nom (348; 26% of non-empty Case): tveir, prósent, þrír, einn, fimm, þúsund, tvær, tvö, tíu, sex
EMPTY (537): 2017, 2016, 0, 2, 2012, 2014, 1, prósent, 2013, 3
Paradigm tveir | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Gender=Masc | tveir | tvo | tveimur | tveggja |
Gender=Fem | tvær | tvær | tveimur | tveggja |
Gender=Fem|NumType=Card | tveggja | |||
Gender=Neut | tvö | tvö | tveimur | tveggja |
AUX
986 AUX tokens (24% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Voice=Act (983; 100%), VerbForm=EMPTY (861; 87%), Person=3 (824; 84%), Mood=Ind (716; 73%), Number=Sing (699; 71%), Tense=Pres (623; 63%).
AUX tokens may have the following values of Case:
Acc (22; 2% of non-empty Case): hafi, hefur, hafa, hefði, hefðu, muni, fengið, hafði, munum, má
Dat (29; 3% of non-empty Case): er, var, hefur, sé, hafi, hafði, myndu, mætti, skuli, verið
Gen (16; 2% of non-empty Case): er, var, sé, eru, vera, verið
Nom (919; 93% of non-empty Case): er, var, eru, verið, sé, vera, væri, voru, séu, væru
EMPTY (3187): er, hefur, var, hafi, hafa, verið, eru, sé, hefði, má
Paradigm vera | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Mood=Imp|Number=Plur|Voice=Act | Verið | |||
Mood=Ind|Number=Sing|Person=1|Tense=Past|Voice=Act | var | |||
Mood=Ind|Number=Sing|Person=1|Tense=Pres|Voice=Act | er | |||
Mood=Ind|Number=Sing|Person=3|Tense=Past|Voice=Act | var | var | var | |
Mood=Ind|Number=Sing|Person=3|Tense=Pres|Voice=Act | er | er | er | |
Mood=Ind|Number=Plur|Person=1|Tense=Past|Voice=Act | vorum | |||
Mood=Ind|Number=Plur|Person=1|Tense=Pres|Voice=Act | erum | |||
Mood=Ind|Number=Plur|Person=3|Tense=Past|Voice=Act | voru | |||
Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act | eru, erum | eru | ||
Mood=Ind|Person=3|Tense=Pres|Voice=Act | er | |||
Mood=Sub|Number=Sing|Person=1|Tense=Past|Voice=Act | væri | |||
Mood=Sub|Number=Sing|Person=1|Tense=Pres|Voice=Act | sé | |||
Mood=Sub|Number=Sing|Person=2|Tense=Past|Voice=Act | værir | |||
Mood=Sub|Number=Sing|Person=3|Tense=Past|Voice=Act | væri | |||
Mood=Sub|Number=Sing|Person=3|Tense=Pres|Voice=Act | sé, væri | sé | sé | |
Mood=Sub|Number=Plur|Person=1|Tense=Past|Voice=Act | værum | |||
Mood=Sub|Number=Plur|Person=1|Tense=Pres|Voice=Act | séum | |||
Mood=Sub|Number=Plur|Person=3|Tense=Past|Voice=Act | væru | |||
Mood=Sub|Number=Plur|Person=3|Tense=Pres|Voice=Act | séu, sé | |||
Number=Sing|VerbForm=Part | verið | |||
Number=Plur|VerbForm=Part | verið | |||
VerbForm=Inf | vera | |||
VerbForm=Inf|Voice=Act | vera | vera | vera | |
VerbForm=Sup|Voice=Act | verið | verið | verið | verið |
Voice=Act | verið |
ADV
983 ADV tokens (10% of all ADV tokens) have a non-empty value of Case.
ADV tokens may have the following values of Case:
Acc (668; 68% of non-empty Case): dag, ár, daga, fyrra, sumar, daginn, lok, viku, á, í
Dat (288; 29% of non-empty Case): ári, árum, á, byrjun, því, frá, viku, mánuði, mánuðum, mínútu
Gen (12; 1% of non-empty Case): til, ára, daga, framtíðar, metra, morguns, neins, sólarhrings
Nom (15; 2% of non-empty Case): allt, það, Föstudaginn, ein, einn, erfitt, kvöld, mínútur, síður, vika
EMPTY (9316): ekki, þar, þá, í, fram, á, svo, upp, til, út
Paradigm ár | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Definite=Def|Gender=Neut|Number=Sing | árið | árinu | ||
Definite=Def|Gender=Neut|Number=Plur | árin | árunum | ||
Gender=Fem|Number=Sing | ár | |||
Gender=Neut|Number=Sing | ár | ári | ||
Gender=Neut|Number=Plur | ár | árum | ára |
DET
45 DET tokens (100% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Definite=Def (45; 100%), Number=Sing (36; 80%).
DET tokens may have the following values of Case:
Acc (13; 29% of non-empty Case): hið, hin, hina, WOW-ið, hinar, volume-ið
Dat (10; 22% of non-empty Case): hinum, hinu, hinni
Gen (1; 2% of non-empty Case): hinnar
Nom (21; 47% of non-empty Case): hinn, hin, hið, Hinir
Paradigm hinn | Nom | Acc | Dat | Gen |
---|---|---|---|---|
Gender=Masc|Number=Sing | hinn | hinum | ||
Gender=Masc|Number=Plur | Hinir | |||
Gender=Fem|Number=Sing | hin | hina | hinni | hinnar |
Gender=Fem|Number=Plur | hinar | hinum | ||
Gender=Neut|Number=Sing | hið | hið | hinu | |
Gender=Neut|Number=Plur | hin | hin | hinum |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[case]–> ADP (7816; 97%),
NOUN –[amod]–> ADJ (3305; 97%),
VERB –[obj]–> NOUN (2713; 90%),
NOUN –[nmod]–> PRON (1749; 97%),
PROPN –[flat]–> PROPN (1189; 100%),
PROPN –[case]–> ADP (1091; 84%),
PRON –[case]–> ADP (895; 98%),
NOUN –[conj]–> NOUN (860; 92%),
ADV –[case]–> ADP (708; 81%),
NOUN –[nummod]–> NUM (682; 84%).
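Agreement figures of this kind can be approximated by walking each dependency edge and comparing the Case of parent and child. The sketch below assumes that the percentage is the share of agreeing pairs among pairs in which both nodes carry Case; the page does not state the denominator, so that is an assumption. The function name `case_agreement` and the toy sentence with its annotation are hypothetical.

```python
from collections import Counter

# Toy sentence as (ID, FORM, UPOS, FEATS, HEAD, DEPREL); illustrative only.
SENTENCE = [
    (1, "Fólk", "NOUN", {"Case": "Nom"}, 5, "nsubj"),
    (2, "er", "AUX", {}, 5, "cop"),
    (3, "ekki", "ADV", {}, 5, "advmod"),
    (4, "í", "ADP", {"Case": "Dat"}, 5, "case"),
    (5, "landinu", "NOUN", {"Case": "Dat"}, 0, "root"),
    (6, ".", "PUNCT", {}, 5, "punct"),
]


def case_agreement(sentences):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, percent)."""
    agree, both = Counter(), Counter()
    for sentence in sentences:
        by_id = {token[0]: token for token in sentence}
        for _id, _form, upos, feats, head, deprel in sentence:
            parent = by_id.get(head)  # None for the root (HEAD = 0)
            if parent is None or "Case" not in feats or "Case" not in parent[3]:
                continue
            key = (parent[2], deprel, upos)
            both[key] += 1
            if feats["Case"] == parent[3]["Case"]:
                agree[key] += 1
    # Assumed denominator: pairs where both parent and child have Case.
    return {key: (agree[key], round(100 * agree[key] / n)) for key, n in both.items()}


result = case_agreement([SENTENCE])
for (parent_upos, deprel, child_upos), (count, pct) in result.items():
    print(f"{parent_upos} –[{deprel}]–> {child_upos} ({count}; {pct}%)")
```

In the toy sentence the preposition agrees with its noun in Dat, while the subject (Nom) and the predicate noun (Dat) do not agree, so only the `case` relation counts as agreement.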