Treebank Statistics: UD_Icelandic-GC: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
39478 tokens (40%) have a non-empty value of Gender.
16804 types (84%) occur at least once with a non-empty value of Gender.
11319 lemmas (83%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (20376; 20% instances), PRON (6328; 6% instances), ADJ (5405; 5% instances), PROPN (4964; 5% instances), NUM (1336; 1% instances), ADV (933; 1% instances), VERB (91; 0% instances), DET (45; 0% instances).
NOUN
20376 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Definite=EMPTY (15591; 77%), Number=Sing (14204; 70%).
NOUN tokens may have the following values of Gender:
Fem(6779; 33% of non-emptyGender): leið, ákvörðun, upplýsingar, konur, stjórn, sögn, þjónustu, frétt, tilkynningu, lögregluMasc(6621; 32% of non-emptyGender): maður, stað, tíma, menn, leik, þátt, sigur, formaður, forsætisráðherra, hlutiNeut(6976; 34% of non-emptyGender): ára, fólk, sæti, málið, mál, áhrif, liðið, landsins, ráð, ogEMPTY(188): mars, 2017, apríl, ágúst, desember, febrúar, júlí, júní, nóvember, október
| Paradigm ár | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | ár | ||
| Case=Acc|Definite=Def|Number=Sing | árið | ||
| Case=Acc|Definite=Def|Number=Plur | árarnar | árin | |
| Case=Acc|Number=Sing | ár | ||
| Case=Acc|Number=Plur | ára | árar | ár |
| Case=Dat|Definite=Def|Number=Sing | árinu | ||
| Case=Dat|Definite=Def|Number=Plur | árunum | ||
| Case=Dat|Number=Sing | ári | ár | ári, árum |
| Case=Dat|Number=Plur | árum | ||
| Case=Gen|Definite=Def|Number=Sing | ársins | ársins | |
| Case=Gen|Definite=Def|Number=Plur | áranna | ||
| Case=Gen|Number=Sing | árs, ára | árs | |
| Case=Gen|Number=Plur | ára | ||
| Case=Nom|Definite=Def|Number=Sing | árið | ||
| Case=Nom|Definite=Def|Number=Plur | árin | ||
| Case=Nom|Number=Sing | ár | ||
| Case=Nom|Number=Plur | ár |
Gender seems to be lexical feature of NOUN. 98% lemmas (6892) occur only with one value of Gender.
PRON
6328 PRON tokens (81% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4808; 76%), Person=EMPTY (3757; 59%), PronType=EMPTY (3715; 59%).
PRON tokens may have the following values of Gender:
Fem(1215; 19% of non-emptyGender): hún, þær, hennar, henni, sinni, sína, sú, þeim, þá, hanaMasc(2037; 32% of non-emptyGender): hann, þeir, hans, þeim, þeirra, sínum, honum, allir, þess, þannNeut(3076; 49% of non-emptyGender): það, því, þetta, þess, þau, þessu, allt, hvað, þeirra, þeimEMPTY(1474): ég, við, sér, sig, mér, okkur, okkar, mig, þú, þér
| Paradigm sá | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | þann | þá, það | það |
| Case=Acc|Number=Sing|Person=3|PronType=Prs | það | ||
| Case=Acc|Number=Plur | þá | þær | þau |
| Case=Acc|Number=Plur|Person=3|PronType=Prs | þau | ||
| Case=Dat|Number=Sing | þeim | þeirri | því |
| Case=Dat|Number=Sing|Person=3|PronType=Prs | því | ||
| Case=Dat|Number=Plur | þeim | þeim | þeim |
| Case=Dat|Number=Plur|Person=3|PronType=Prs | þeim | ||
| Case=Gen|Number=Sing | þess | þeirrar | þess |
| Case=Gen|Number=Sing|Person=3|PronType=Prs | þess | ||
| Case=Gen|Number=Plur | þeirra | þeirra | þeirra |
| Case=Gen|Number=Plur|Person=3|PronType=Prs | þeirra | þeirra | |
| Case=Nom|Number=Sing | sá, þess, þessi | sú | það |
| Case=Nom|Number=Sing|Person=3|PronType=Prs | sá | það | |
| Case=Nom|Number=Plur | þeir | þær | þau |
| Case=Nom|Number=Plur|Person=3|PronType=Prs | þeir | þau |
ADJ
5405 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (4207; 78%), Number=Sing (3779; 70%).
ADJ tokens may have the following values of Gender:
Fem(1521; 28% of non-emptyGender): síðustu, fyrstu, mikil, næstu, Sameinuðu, meiri, frekari, mikla, ný, góðarMasc(1730; 32% of non-emptyGender): margir, síðustu, fleiri, fyrri, mikill, fyrrverandi, fyrstu, fyrsti, seinni, fyrstaNeut(2154; 40% of non-emptyGender): hægt, mikið, ljóst, gott, síðasta, meira, miklu, íslenska, erfitt, fyrstaEMPTY(46): hægt, sammála, spennandi, Suðlæg, Svört, aðgerðalaus, beyglaður, brotinn, eftirfarandi, eldri
| Paradigm mikill | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Degree=Pos|Number=Sing | mikla | mikla | mikið, mikla |
| Case=Acc|Degree=Pos|Number=Plur | mikil | ||
| Case=Acc|Degree=Cmp|Number=Sing | meiri | meiri | meira |
| Case=Acc|Degree=Cmp|Number=Plur | meiri | meiri | |
| Case=Acc|Degree=Sup|Number=Sing | mestan | mesta | mest |
| Case=Acc|Degree=Sup|Number=Plur | mestar | mestu | |
| Case=Acc|Number=Sing | mikinn, mikla | mikla | mikið |
| Case=Acc|Number=Plur | miklar | mikil, miklu | |
| Case=Dat|Degree=Pos|Number=Sing | miklum | miklu | |
| Case=Dat|Degree=Pos|Number=Plur | miklum | ||
| Case=Dat|Degree=Cmp|Number=Sing | meiri | meira | |
| Case=Dat|Degree=Cmp|Number=Plur | meiri | ||
| Case=Dat|Degree=Sup|Number=Sing | mestum | mestu | mestu, mesta |
| Case=Dat|Number=Sing | miklum | mikilli | miklu, meiru |
| Case=Dat|Number=Plur | miklum, miklu | miklum | miklum |
| Case=Gen|Degree=Pos|Number=Sing | mikils | ||
| Case=Gen|Degree=Cmp|Number=Sing | meira | ||
| Case=Gen|Number=Sing | mikils, mikla | mikillar | mikils |
| Case=Gen|Number=Plur | mikilla | ||
| Case=Nom|Degree=Pos|Number=Sing | mikill | mikið | |
| Case=Nom|Degree=Pos|Number=Plur | mikil | ||
| Case=Nom|Degree=Cmp|Number=Sing | meiri | meiri | meira |
| Case=Nom|Degree=Cmp|Number=Plur | meiri | meiri | |
| Case=Nom|Degree=Sup|Number=Sing | mest, mesta | mesta, mest | |
| Case=Nom|Degree=Sup|Number=Plur | mestir | Mestar | |
| Case=Nom|Number=Sing | mikill, mikli | mikil, mikla | mikið |
| Case=Nom|Number=Plur | miklir | miklar | mikil |
PROPN
4964 PROPN tokens (81% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem(1423; 29% of non-emptyGender): Reykjavík, Katrín, Reykjavíkur, Evrópu, 2, Akureyri, Sigríður, Anna, Danmörku, ErlaMasc(2577; 52% of non-emptyGender): Trump, Jón, þór, Guðmundur, Sigurður, Ólafur, Bjarni, Björn, Davíð, DonaldNeut(964; 19% of non-emptyGender): Íslands, Íslandi, Ísland, Bandaríkjunum, Bandaríkjanna, Morgunblaðinu, RÚV, Alþingi, Alþingis, ESBEMPTY(1161): Icelandair, Arsenal, Facebook, New, United, WOW, York, air, City, Group
| Paradigm Trump | Masc | Fem |
|---|---|---|
| Case=Acc | Trump | Trump |
| Case=Acc|Number=Sing | Trump | |
| Case=Dat | Trump | |
| Case=Gen | Trump, Trumps | |
| Case=Gen|Number=Sing | Trump | |
| Case=Nom | Trump | |
| Case=Nom|Number=Sing | Trump |
Gender seems to be lexical feature of PROPN. 98% lemmas (2595) occur only with one value of Gender.
NUM
1336 NUM tokens (71% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (1149; 86%), NumType=EMPTY (752; 56%).
NUM tokens may have the following values of Gender:
Fem(293; 22% of non-emptyGender): tvær, milljónir, tveggja, tveimur, þremur, fjórar, tíu, fimm, milljónum, prósentMasc(420; 31% of non-emptyGender): tveir, tvo, sjö, einn, átta, þrír, fimm, tveggja, tveimur, tíuNeut(623; 47% of non-emptyGender): fimm, tvö, prósent, þrjú, eitt, sex, þúsund, tíu, tveggja, fjögurEMPTY(554): 2017, 2016, 0, 2, 2012, 2014, 1, prósent, 2013, 3
| Paradigm tveir | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | tvo | tvær | tvö |
| Case=Acc|NumType=Card | tveggja | ||
| Case=Dat | tveimur | tveimur | tveimur |
| Case=Gen | tveggja | tveggja | tveggja |
| Case=Nom | tveir | tvær | tvö |
ADV
933 ADV tokens (9% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Fem(176; 19% of non-emptyGender): viku, byrjun, nótt, mínútu, vikur, mínútur, vikum, helgi, leið, helginaMasc(409; 44% of non-emptyGender): dag, mánuði, daga, daginn, hálfleik, tíma, mánuðum, dagana, laugardag, laugardaginnNeut(348; 37% of non-emptyGender): ár, ári, árum, fyrra, sumar, lok, því, árið, kvöld, þaðEMPTY(9366): ekki, þar, þá, í, á, fram, svo, upp, til, út
| Paradigm ár | Fem | Neut |
|---|---|---|
| Case=Acc|Definite=Def|Number=Sing | árið | |
| Case=Acc|Definite=Def|Number=Plur | árin | |
| Case=Acc|Number=Sing | ár | |
| Case=Acc|Number=Plur | ár | |
| Case=Dat|Definite=Def|Number=Sing | árinu | |
| Case=Dat|Definite=Def|Number=Plur | árunum | |
| Case=Dat|Number=Sing | ári | |
| Case=Dat|Number=Plur | árum | |
| Case=Gen|Number=Plur | ára | |
| Case=Nom|Number=Sing | ár |
Gender seems to be lexical feature of ADV. 97% lemmas (163) occur only with one value of Gender.
VERB
91 VERB tokens (1% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (91; 100%), Person=EMPTY (91; 100%), Voice=EMPTY (91; 100%), Tense=EMPTY (90; 99%), VerbForm=Part (89; 98%), Case=Nom (87; 96%), Number=Sing (71; 78%).
VERB tokens may have the following values of Gender:
Fem(19; 21% of non-emptyGender): kynnt, sögð, afhenta, birt, búin, farin, flutta, gerð, hafin, hafnarMasc(29; 32% of non-emptyGender): hafinn, orðinn, handteknir, bundinn, byggður, búsettur, eigandi, endurgreiddir, gagnrýndir, handteknaNeut(43; 47% of non-emptyGender): búið, farið, gert, kveðið, stefnt, talið, þekkt, Fjallað, Skrifað, SýntEMPTY(12749): er, segir, var, eru, sagði, hafa, kemur, gera, verið, koma
| Paradigm segja | Fem | Neut |
|---|---|---|
| sögð | sagt |
DET
45 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=Def (45; 100%), Number=Sing (36; 80%).
DET tokens may have the following values of Gender:
Fem(11; 24% of non-emptyGender): hin, hina, hinar, hinnar, hinni, hinumMasc(13; 29% of non-emptyGender): hinn, hinum, HinirNeut(21; 47% of non-emptyGender): hið, hin, hinu, WOW-ið, hinum, volume-ið
| Paradigm hinn | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | hina | hið | |
| Case=Acc|Number=Plur | hinar | hin | |
| Case=Dat|Number=Sing | hinum | hinni | hinu |
| Case=Dat|Number=Plur | hinum | hinum | |
| Case=Gen|Number=Sing | hinnar | ||
| Case=Nom|Number=Sing | hinn | hin | hið |
| Case=Nom|Number=Plur | Hinir | hin |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (3260; 96%),
NOUN –[nmod]–> PRON (1713; 95%),
PROPN –[flat]–> PROPN (1140; 100%),
NOUN –[nummod]–> NUM (690; 85%),
NOUN –[flat]–> NOUN (324; 100%),
ADJ –[nsubj]–> NOUN (317; 95%),
PROPN –[obl]–> NOUN (219; 52%),
ADV –[amod]–> ADJ (209; 80%),
ADJ –[nsubj]–> PRON (184; 75%),
PROPN –[conj]–> PROPN (157; 57%).