Treebank Statistics: UD_Icelandic-Modern: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
27963 tokens (35%) have a non-empty value of Gender.
7725 types (76%) occur at least once with a non-empty value of Gender.
4348 lemmas (74%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: NOUN (12915; 16% instances), PRON (4843; 6% instances), ADJ (3583; 4% instances), DET (3495; 4% instances), PROPN (1994; 2% instances), VERB (740; 1% instances), NUM (224; 0% instances), ADV (131; 0% instances), AUX (36; 0% instances), X (2; 0% instances).
NOUN
12915 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Definite=Ind (10004; 77%), Number=Sing (9176; 71%).
NOUN tokens may have the following values of Gender:
Fem(3870; 30% of non-emptyGender): leið, raun, ræðu, ríkisstjórn, umræðu, klukkan, upplýsingar, veru, vinnu, aðgerðirMasc(4407; 34% of non-emptyGender): forseti, menn, þingmaður, ráðherra, tíma, herra, dag, stað, vegar, þingmanniNeut(4638; 36% of non-emptyGender): mál, fólk, máli, málið, ár, ára, ári, sæti, dæmis, áriðEMPTY(729): m, Frú, móti,, gr., nefnd, kr., stundum, k., allsherjar-
| Paradigm lið | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Number=Sing | liðið | ||
| Case=Acc|Definite=Ind|Number=Sing | lið | ||
| Case=Acc|Definite=Ind|Number=Plur | lið | lið | |
| Case=Dat|Definite=Def|Number=Sing | liðinu | ||
| Case=Dat|Definite=Def|Number=Plur | liðunum | ||
| Case=Dat|Definite=Ind|Number=Sing | lið | liði | |
| Case=Dat|Definite=Ind|Number=Plur | liðum | ||
| Case=Gen|Definite=Def|Number=Sing | liðsins | ||
| Case=Gen|Definite=Def|Number=Plur | liðanna | ||
| Case=Gen|Definite=Ind|Number=Sing | liðs | ||
| Case=Gen|Definite=Ind|Number=Plur | liða | ||
| Case=Nom|Definite=Def|Number=Sing | Liðin | liðið | |
| Case=Nom|Definite=Def|Number=Plur | liðin | ||
| Case=Nom|Definite=Ind|Number=Sing | lið | ||
| Case=Nom|Definite=Ind|Number=Plur | lið | ||
| Case=Nom|Number=Sing|VerbForm=Part|Voice=Act | liðið |
Gender seems to be lexical feature of NOUN. 99% lemmas (2701) occur only with one value of Gender.
PRON
4843 PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=EMPTY (4843; 100%), Number=Sing (4266; 88%), PronType=Prs (4136; 85%).
PRON tokens may have the following values of Gender:
Fem(565; 12% of non-emptyGender): hún, þær, hana, sér, sinni, henni, hennar, minni, mín, aðrarMasc(811; 17% of non-emptyGender): hann, þeir, sér, sig, hans, sínum, honum, annars, öðrum, þeimNeut(3467; 72% of non-emptyGender): það, því, þess, hvað, þau, annað, sér, hverju, sig, annarsEMPTY(2891): ég, við, mér, okkur, mig, því, okkar, maður, annars, það
| Paradigm það | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing|PronType=Dem | það | ||
| Case=Acc|Number=Sing|PronType=Prs | það | ||
| Case=Acc|Number=Plur|PronType=Dem | þau | ||
| Case=Acc|Number=Plur|PronType=Prs | þau | ||
| Case=Dat|Number=Sing|PronType=Dem | því | ||
| Case=Dat|Number=Sing|PronType=Prs | því | ||
| Case=Dat|Number=Plur|PronType=Dem | þeim | þeim | |
| Case=Dat|Number=Plur|PronType=Prs | þeim | þeim | þeim |
| Case=Gen|Number=Sing|PronType=Dem | þess | þess | |
| Case=Gen|Number=Sing|PronType=Prs | þess | ||
| Case=Gen|Number=Plur|PronType=Prs | þeirra | þeirra | |
| Case=Nom|Number=Sing | það | ||
| Case=Nom|Number=Sing|PronType=Dem | það | ||
| Case=Nom|Number=Sing|PronType=Prs | það | ||
| Case=Nom|Number=Plur|PronType=Dem | þau | ||
| Case=Nom|Number=Plur|PronType=Prs | þau |
ADJ
3583 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (2894; 81%), Number=Sing (2801; 78%), Definite=Ind (2273; 63%), Case=Nom (1892; 53%).
ADJ tokens may have the following values of Gender:
Fem(848; 24% of non-emptyGender): góð, fyrri, síðustu, næstu, betri, fyrstu, ánægð, mikla, sammála, góðaMasc(995; 28% of non-emptyGender): virðulegi, sammála, minnsta, besta, minni, nýjan, vinstri, fatlaðra, síðustu, vissNeut(1740; 49% of non-emptyGender): hægt, gott, rétt, miklu, fyrsta, mikilvægt, sjálfsögðu, ljóst, síðasta, erfittEMPTY(737): hv., hæstv., sama, 2., 1., 3., 5., 8., 9., m.
| Paradigm háttvirtur | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Ind|Number=Sing | háttvirtan | ||
| Case=Acc|Definite=Ind|Number=Plur | háttvirt | ||
| Case=Dat|Definite=Ind|Number=Plur | háttvirtum | ||
| Case=Gen|Definite=Ind|Number=Sing | háttvirts | ||
| Case=Nom|Definite=Def|Number=Sing | hv. | ||
| Case=Nom|Definite=Ind|Number=Sing | háttvirtur, hv. | ||
| Case=Nom|Definite=Ind|Number=Plur | háttvirtar |
DET
3495 DET tokens (94% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (3119; 89%), Degree=EMPTY (3119; 89%), Number=Sing (2573; 74%), PronType=Dem (1841; 53%).
DET tokens may have the following values of Gender:
Fem(681; 19% of non-emptyGender): þá, þessa, þessari, sú, þessar, þeirri, þær, þessi, þeim, hvaðaMasc(841; 24% of non-emptyGender): þeim, allir, meiri, þann, hins, einhvern, þeir, alla, sá, enginnNeut(1973; 56% of non-emptyGender): þetta, það, þessu, allt, eitthvað, ekkert, því, þessi, þau, þeimEMPTY(205): meira, eitt, mikið, 1, einn, ein, svolítið, einu, þetta, einum
| Paradigm þessi | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | þennan | þessa | þetta |
| Case=Acc|Number=Plur | þessa | þessar | þessi |
| Case=Dat|Number=Sing | þessum | þessari | þessu |
| Case=Dat|Number=Plur | þessum | þessum | þessum |
| Case=Gen|Number=Sing | þessa | þessarar | þessa |
| Case=Gen|Number=Plur | þessara | þessara | þessara |
| Case=Nom|Number=Sing | þessi | þessi | þetta |
| Case=Nom|Number=Plur | þessir | þessar | þessi |
PROPN
1994 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1755; 88%), Definite=Ind (1701; 85%).
PROPN tokens may have the following values of Gender:
Fem(512; 26% of non-emptyGender): Hrafnhildur, Bryndís, Evrópu, Chusovitina, Rún, Danmörku, Lúthersdóttir, Brasilíu, Grótta, ParísMasc(1054; 53% of non-emptyGender): Ólympíuleikunum, Blöndal, Íslendingar, Ólympíuleikum, Þór, Jón, Pétur, Arnar, Forseti, ValurNeut(428; 21% of non-emptyGender): Íslands, Ríó, Ísland, Alþingi, Íslandi, Frakklandi, Alþingis, Evrópusambandinu, Evrópusambandið, EvrópumótinuEMPTY(749): þm., RÚV, EM, London, H., HM, Collins, KSÍ, United, KR
| Paradigm hrafnhildur | Masc | Fem |
|---|---|---|
| Case=Acc | Hrafnhildi | Hrafnhildi |
| Case=Dat | Hrafnhildi | Hrafnhildi, Hrafnhildur |
| Case=Gen | Hrafnhildar | |
| Case=Nom | Hrafnhildur |
Gender seems to be lexical feature of PROPN. 98% lemmas (620) occur only with one value of Gender.
VERB
740 VERB tokens (8% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (740; 100%), Person=EMPTY (740; 100%), Tense=EMPTY (740; 100%), VerbForm=Part (731; 99%), Voice=Act (723; 98%), Number=Sing (596; 81%).
VERB tokens may have the following values of Gender:
Fem(116; 16% of non-emptyGender): orðin, farin, komin, tekin, teknar, samþykkt, settar, skráð, felld, gerðarMasc(136; 18% of non-emptyGender): kominn, settir, sýndur, farinn, haldnir, komnir, orðinn, valinn, fluttur, gefinnNeut(488; 66% of non-emptyGender): gert, farið, keppt, sagt, tekið, haldið, komið, sett, miðað, lagtEMPTY(8556): fara, gera, hringir, held, koma, taka, þakka, kemur, á, segja
| Paradigm koma | Masc | Fem | Neut |
|---|---|---|---|
| Degree=Pos|Number=Plur | komandi | ||
| Number=Sing|VerbForm=Part|Voice=Act | kominn | komin | komið |
| Number=Plur|VerbForm=Part|Voice=Act | komnir | komnar | komin |
NUM
224 NUM tokens (21% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (223; 100%), Number=Plur (217; 97%).
NUM tokens may have the following values of Gender:
Fem(40; 18% of non-emptyGender): tvær, þrjár, fimm, þúsund, sjö, þremur, níu, sex, tveggja, fjögurraMasc(73; 33% of non-emptyGender): þrír, fjóra, tveimur, átta, fimm, sex, tvo, fjórir, þrjá, fjórumNeut(111; 50% of non-emptyGender): tvö, tveimur, fjögur, þrjú, fjórum, tíu, tveggja, þriggja, þremur, fjögurraEMPTY(824): 100, 2, 200, 0, 50, 2012, 3, 16, 18, 20
| Paradigm tveir | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | tvo | tvær | tvö |
| Case=Dat | tveimur, tveim | tveimur | tveimur |
| Case=Gen | tveggja | tveggja | |
| Case=Nom | tveir | tvær | tvö |
ADV
131 ADV tokens (2% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Degree=Pos (96; 73%).
ADV tokens may have the following values of Gender:
Fem(28; 21% of non-emptyGender): svona, fleiri, þannig, mikla, fallega, gríðarlega, kvitt, margar, meiri, mikilMasc(24; 18% of non-emptyGender): svona, meiri, mikinn, margir, eins, fleiri, hugsanlega, marga, miklu, sammálaNeut(79; 60% of non-emptyGender): rétt, svona, meira, mikið, mikil, mörgum, skýrt, ekkert, ytra, öðruvísiEMPTY(6828): ekki, þá, svo, hér, bara, eins, þar, nú, þannig, mjög
| Paradigm svona | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | svona | svona | |
| Case=Acc|Number=Plur | svona | svona | |
| Case=Dat|Number=Sing | svona | svona | |
| Case=Dat|Number=Plur | svona | svona | svona |
| Case=Nom|Number=Sing | svona | svona | |
| Case=Nom|Number=Plur | svona |
AUX
36 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (36; 100%), Number=Sing (36; 100%), Person=EMPTY (36; 100%), Tense=EMPTY (36; 100%), VerbForm=Part (36; 100%), Voice=Act (36; 100%).
AUX tokens may have the following values of Gender:
Neut(36; 100% of non-emptyGender): verið, haftEMPTY(5268): er, var, eru, sé, verið, hefur, hafa, vera, hafi, væri
X
2 X tokens (2% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (2; 100%).
X tokens may have the following values of Gender:
Masc(1; 50% of non-emptyGender): final-fourNeut(1; 50% of non-emptyGender): nýafstöðuEMPTY(88): Molde, 2016, Eidur, FK, að, i, se, your, 22, 3
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (1829; 79%),
NOUN –[det]–> DET (1172; 94%),
NOUN –[amod]–> DET (633; 95%),
NOUN –[conj]–> NOUN (330; 52%),
NOUN –[nmod:poss]–> PRON (276; 67%),
PROPN –[flat:name]–> PROPN (246; 65%),
ADJ –[nsubj]–> PRON (188; 54%),
NOUN –[det]–> PRON (137; 98%),
ADJ –[nsubj]–> NOUN (123; 87%),
ADJ –[conj]–> NOUN (118; 86%).