home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

95531 tokens (31%) have a non-empty value of Gender. 22119 types (68%) occur at least once with a non-empty value of Gender. 14991 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (56327; 18% instances), ADJ (13740; 4% instances), PRON (11798; 4% instances), DET (10811; 3% instances), PROPN (2689; 1% instances), NUM (164; 0% instances), ADV (1; 0% instances), VERB (1; 0% instances).

NOUN

56327 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40172; 71%), Definite=Ind (36324; 64%).

NOUN tokens may have the following values of Gender:

Paradigm rådMascFemNeut
_rådråd
Case=Gen|Definite=Def|Number=Singrådets
Definite=Def|Number=Singrådet
Definite=Ind|Number=Singråd
Definite=Ind|Number=Plurråd

Gender seems to be lexical feature of NOUN. 94% lemmas (11544) occur only with one value of Gender.

ADJ

13740 ADJ tokens (51% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (13738; 100%), Number=Sing (13738; 100%), Degree=Pos (12885; 94%).

ADJ tokens may have the following values of Gender:

Paradigm storFem,MascMascNeut
storstorstort

PRON

11798 PRON tokens (52% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (11798; 100%), PronType=Prs (11401; 97%), Person=3 (10250; 87%), Animacy=EMPTY (8655; 73%), Case=EMPTY (8655; 73%).

PRON tokens may have the following values of Gender:

Paradigm sinMascFemNeut
sinsisitt

DET

10811 DET tokens (75% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (10811; 100%), PronType=Art (6182; 57%).

DET tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Genens
eneiet, at, er, ett

PROPN

2689 PROPN tokens (15% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (364) occur only with one value of Gender.

NUM

164 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (164; 100%), Number=Sing (164; 100%).

NUM tokens may have the following values of Gender:

Paradigm halvannenMascFemNeut
halvannenhalvannenhalvannet

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

VERB

1 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (1; 100%), Tense=Pres (1; 100%), VerbForm=Fin,Part (1; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (9423; 77%), NOUN –[det]–> PRON (1490; 73%), ADJ –[expl]–> PRON (551; 90%), ADJ –[conj]–> ADJ (473; 81%), DET –[nmod]–> NOUN (123; 66%), PRON –[expl]–> PRON (48; 52%), PRON –[amod]–> ADJ (33; 52%), DET –[conj]–> DET (27; 90%), PRON –[det]–> DET (24; 51%), PRON –[conj]–> PRON (10; 56%).