home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

89456 tokens (29%) have a non-empty value of Gender. 20726 types (64%) occur at least once with a non-empty value of Gender. 13975 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (56308; 18% instances), PRON (11798; 4% instances), DET (10811; 3% instances), ADJ (7685; 2% instances), PROPN (2689; 1% instances), NUM (164; 0% instances), ADV (1; 0% instances).

NOUN

56308 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40153; 71%), Definite=Ind (36305; 64%).

NOUN tokens may have the following values of Gender:

Paradigm rådMascFemNeut
_rådråd
Case=Gen|Definite=Def|Number=Singrådets
Definite=Def|Number=Singrådet
Definite=Ind|Number=Singråd
Definite=Ind|Number=Plurråd

Gender seems to be lexical feature of NOUN. 94% lemmas (11544) occur only with one value of Gender.

PRON

11798 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (11798; 100%), PronType=Prs (11387; 97%), Person=3 (10250; 87%), Animacy=EMPTY (8655; 73%), Case=EMPTY (8655; 73%).

PRON tokens may have the following values of Gender:

Paradigm sinMascFemNeut
sinsisitt

DET

10811 DET tokens (75% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (10811; 100%), PronType=Art (6182; 57%).

DET tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Genens
eneiet, at, er, ett

ADJ

7685 ADJ tokens (29% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7685; 100%), Definite=Ind (7684; 100%), Degree=Pos (7391; 96%).

ADJ tokens may have the following values of Gender:

Paradigm storMascNeut
storstort

Gender seems to be lexical feature of ADJ. 100% lemmas (1366) occur only with one value of Gender.

PROPN

2689 PROPN tokens (15% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (364) occur only with one value of Gender.

NUM

164 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (164; 100%), Number=Sing (164; 100%).

NUM tokens may have the following values of Gender:

Paradigm halvannenMascFemNeut
halvannenhalvannenhalvannet

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (9413; 77%), NOUN –[nmod]–> PRON (1546; 64%), ADJ –[expl]–> PRON (552; 90%), ADJ –[conj]–> ADJ (209; 78%), DET –[nmod]–> NOUN (121; 66%), NOUN –[acl]–> NOUN (67; 55%), PRON –[expl]–> PRON (48; 52%), DET –[conj]–> DET (27; 90%), PRON –[acl:relcl]–> ADJ (25; 71%), PRON –[det]–> DET (24; 51%).