home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-Modern: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

27967 tokens (35%) have a non-empty value of Gender. 7728 types (76%) occur at least once with a non-empty value of Gender. 4349 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (12915; 16% instances), PRON (4843; 6% instances), ADJ (3580; 4% instances), DET (3497; 4% instances), PROPN (1994; 2% instances), VERB (744; 1% instances), NUM (224; 0% instances), ADV (131; 0% instances), AUX (36; 0% instances), X (3; 0% instances).

NOUN

12915 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Definite=Ind (10003; 77%), Number=Sing (9176; 71%).

NOUN tokens may have the following values of Gender:

Paradigm liðMascFemNeut
Case=Acc|Definite=Def|Number=Singliðið
Case=Acc|Definite=Ind|Number=Singlið
Case=Acc|Definite=Ind|Number=Plurliðlið
Case=Dat|Definite=Def|Number=Singliðinu
Case=Dat|Definite=Def|Number=Plurliðunum
Case=Dat|Definite=Ind|Number=Singliðliði
Case=Dat|Definite=Ind|Number=Plurliðum
Case=Gen|Definite=Def|Number=Singliðsins
Case=Gen|Definite=Def|Number=Plurliðanna
Case=Gen|Definite=Ind|Number=Singliðs
Case=Gen|Definite=Ind|Number=Plurliða
Case=Nom|Definite=Def|Number=SingLiðinliðið
Case=Nom|Definite=Def|Number=Plurliðin
Case=Nom|Definite=Ind|Number=Singlið
Case=Nom|Definite=Ind|Number=Plurlið
Case=Nom|Number=Sing|VerbForm=Part|Voice=Actliðið

Gender seems to be lexical feature of NOUN. 99% lemmas (2701) occur only with one value of Gender.

PRON

4843 PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=EMPTY (4843; 100%), Number=Sing (4266; 88%), PronType=Prs (4136; 85%).

PRON tokens may have the following values of Gender:

Paradigm þaðMascFemNeut
Case=Acc|Number=Sing|PronType=Demþað
Case=Acc|Number=Sing|PronType=Prsþað
Case=Acc|Number=Plur|PronType=Demþau
Case=Acc|Number=Plur|PronType=Prsþau
Case=Dat|Number=Sing|PronType=Demþví
Case=Dat|Number=Sing|PronType=Prsþví
Case=Dat|Number=Plur|PronType=Demþeimþeim
Case=Dat|Number=Plur|PronType=Prsþeimþeimþeim
Case=Gen|Number=Sing|PronType=Demþessþess
Case=Gen|Number=Sing|PronType=Prsþess
Case=Gen|Number=Plur|PronType=Prsþeirraþeirra
Case=Nom|Number=Singþað
Case=Nom|Number=Sing|PronType=Demþað
Case=Nom|Number=Sing|PronType=Prsþað
Case=Nom|Number=Plur|PronType=Demþau
Case=Nom|Number=Plur|PronType=Prsþau

ADJ

3580 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (2892; 81%), Number=Sing (2798; 78%), Definite=Ind (2272; 63%), Case=Nom (1891; 53%).

ADJ tokens may have the following values of Gender:

Paradigm háttvirturMascFemNeut
Case=Acc|Definite=Ind|Number=Singháttvirtan
Case=Acc|Definite=Ind|Number=Plurháttvirt
Case=Dat|Definite=Ind|Number=Plurháttvirtum
Case=Gen|Definite=Ind|Number=Singháttvirts
Case=Nom|Definite=Def|Number=Singhv.
Case=Nom|Definite=Ind|Number=Singháttvirtur, hv.
Case=Nom|Definite=Ind|Number=Plurháttvirtar

DET

3497 DET tokens (94% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (3119; 89%), Degree=EMPTY (3119; 89%), Number=Sing (2575; 74%), PronType=Dem (1841; 53%).

DET tokens may have the following values of Gender:

Paradigm þessiMascFemNeut
Case=Acc|Number=Singþennanþessaþetta
Case=Acc|Number=Plurþessaþessarþessi
Case=Dat|Number=Singþessumþessariþessu
Case=Dat|Number=Plurþessumþessumþessum
Case=Gen|Number=Singþessaþessararþessa
Case=Gen|Number=Plurþessaraþessaraþessara
Case=Nom|Number=Singþessiþessiþetta
Case=Nom|Number=Plurþessirþessarþessi

PROPN

1994 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1755; 88%), Definite=Ind (1701; 85%).

PROPN tokens may have the following values of Gender:

Paradigm hrafnhildurMascFem
Case=AccHrafnhildiHrafnhildi
Case=DatHrafnhildiHrafnhildi, Hrafnhildur
Case=GenHrafnhildar
Case=NomHrafnhildur

Gender seems to be lexical feature of PROPN. 98% lemmas (620) occur only with one value of Gender.

VERB

744 VERB tokens (8% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (744; 100%), Person=EMPTY (744; 100%), Tense=EMPTY (744; 100%), VerbForm=Part (734; 99%), Voice=Act (724; 97%), Number=Sing (599; 81%).

VERB tokens may have the following values of Gender:

Paradigm komaMascFemNeut
Degree=Pos|Number=Plurkomandi
Number=Sing|VerbForm=Part|Voice=Actkominnkominkomið
Number=Sing|VerbForm=Part|Voice=Midkomist
Number=Plur|VerbForm=Part|Voice=Actkomnirkomnarkomin

NUM

224 NUM tokens (21% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (223; 100%), Number=Plur (217; 97%).

NUM tokens may have the following values of Gender:

Paradigm tveirMascFemNeut
Case=Acctvotværtvö
Case=Dattveimur, tveimtveimurtveimur
Case=Gentveggjatveggja
Case=Nomtveirtværtvö

ADV

131 ADV tokens (2% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Degree=Pos (96; 73%).

ADV tokens may have the following values of Gender:

Paradigm svonaMascFemNeut
Case=Acc|Number=Singsvonasvona
Case=Acc|Number=Plursvonasvona
Case=Dat|Number=Singsvonasvona
Case=Dat|Number=Plursvonasvonasvona
Case=Nom|Number=Singsvonasvona
Case=Nom|Number=Plursvona

AUX

36 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (36; 100%), Number=Sing (36; 100%), Person=EMPTY (36; 100%), Tense=EMPTY (36; 100%), VerbForm=Part (36; 100%), Voice=Act (36; 100%).

AUX tokens may have the following values of Gender:

X

3 X tokens (3% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (3; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (1828; 79%), NOUN –[det]–> DET (1172; 94%), NOUN –[amod]–> DET (633; 95%), NOUN –[conj]–> NOUN (331; 53%), NOUN –[nmod:poss]–> PRON (276; 67%), PROPN –[flat:name]–> PROPN (246; 65%), ADJ –[nsubj]–> PRON (188; 54%), NOUN –[det]–> PRON (136; 98%), ADJ –[nsubj]–> NOUN (123; 87%), ADJ –[conj]–> NOUN (118; 86%).