Treebank Statistics: UD_Norwegian-Bokmaal: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
Some words have combined values of the feature; 1 combination has been observed: Fem|Masc.
89456 tokens (29%) have a non-empty value of Gender.
20726 types (64%) occur at least once with a non-empty value of Gender.
13975 lemmas (60%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (56308; 18% instances), PRON (11798; 4% instances), DET (10811; 3% instances), ADJ (7685; 2% instances), PROPN (2689; 1% instances), NUM (164; 0% instances), ADV (1; 0% instances).
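Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `gender_counts_by_upos` and the three-token sample are hypothetical, and only the standard CoNLL-U column order (UPOS in column 4, FEATS in column 6) is assumed.

```python
from collections import Counter

# Tiny hand-made CoNLL-U fragment (hypothetical, not copied from the treebank).
SAMPLE = """\
1\tEn\ten\tDET\t_\tGender=Masc|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tdag\tdag\tNOUN\t_\tDefinite=Ind|Gender=Masc|Number=Sing\t0\troot\t_\t_
3\togså\togså\tADV\t_\t_\t2\tadvmod\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with a Gender value per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing each per-tag count by the total number of tokens in the treebank would give the "% instances" figures, assuming that is how they are defined here.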
NOUN
56308 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40153; 71%), Definite=Ind (36305; 64%).
NOUN tokens may have the following values of Gender:
Fem (7728; 14% of non-empty Gender): tid, kirke, kroner, kvinner, støtte, hjelp, uker, side, mor, endringer
Masc (31718; 56% of non-empty Gender): dag, prosent, gang, verden, del, grunn, saken, ganger, ting, millioner
Neut (16862; 30% of non-empty Gender): år, folk, land, barn, landet, mennesker, livet, spørsmål, forhold, tillegg
EMPTY (944): §, går, fjor, tros-, islam, vare, bord, dr., kong, stede
| Paradigm råd | Masc | Fem | Neut |
|---|---|---|---|
| _ | råd | råd | |
| Case=Gen\|Definite=Def\|Number=Sing | rådets | | |
| Definite=Def\|Number=Sing | rådet | | |
| Definite=Ind\|Number=Sing | råd | | |
| Definite=Ind\|Number=Plur | råd | | |
Gender seems to be a lexical feature of NOUN. 94% of lemmas (11544) occur with only one value of Gender.
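The "lexical feature" observation rests on how many lemmas are attested with a single Gender value. A minimal sketch of that check follows; the function name `single_value_share` and the observation list are hypothetical, and the exact criterion used to produce this page is an assumption.

```python
from collections import defaultdict

def single_value_share(observations):
    """observations: (lemma, gender) pairs for tokens with a non-empty Gender.

    Returns (lemmas seen with exactly one Gender value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Illustrative observations only; they are not counts from the treebank.
OBSERVATIONS = [
    ("dag", "Masc"), ("dag", "Masc"),
    ("år", "Neut"),
    ("tid", "Fem"),
    ("råd", "Masc"), ("råd", "Fem"),
]

single, total = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas ({single / total:.0%}) occur with only one value")
```

A share close to 100% suggests the value belongs to the lemma rather than to the individual word form, which is what "lexical" means here.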
PRON
11798 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (11798; 100%), PronType=Prs (11387; 97%), Person=3 (10250; 87%), Animacy=EMPTY (8655; 73%), Case=EMPTY (8655; 73%).
PRON tokens may have the following values of Gender:
Fem (1034; 9% of non-empty Gender): hun, henne, vår, hans, deres, si, hennes, di, mi
Fem,Masc (548; 5% of non-empty Gender): den, noen, denne, ingen, enhver, der
Masc (3161; 27% of non-empty Gender): han, sin, ham, min, hans, vår, din, deres, hennes
Neut (7055; 60% of non-empty Gender): det, dette, noe, sitt, alt, mitt, vårt, hans, hennes, ditt
EMPTY (14164): som, jeg, vi, de, seg, du, man, meg, hva, oss
| Paradigm sin | Masc | Fem | Neut |
|---|---|---|---|
| | sin | si | sitt |
DET
10811 DET tokens (75% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (10811; 100%), PronType=Art (6182; 57%).
DET tokens may have the following values of Gender:
Fem (360; 3% of non-empty Gender): den, ei, noen, all, denne, hver, egen, annen, enhver, hvilken
Masc (6554; 61% of non-empty Gender): en, den, denne, ingen, annen, hver, egen, slik, noen, all
Neut (3897; 36% of non-empty Gender): et, det, noe, annet, dette, hvert, eget, alt, slikt, hvilket
EMPTY (3569): de, andre, alle, noen, selv, disse, samme, slike, neste, egne
| Paradigm en | Masc | Fem | Neut |
|---|---|---|---|
| Case=Gen | ens | | |
| | en | ei | et, at, er, ett |
ADJ
7685 ADJ tokens (29% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7685; 100%), Definite=Ind (7684; 100%), Degree=Pos (7391; 96%).
ADJ tokens may have the following values of Gender:
Masc (3; 0% of non-empty Gender): antiautoritære, stor, straffet
Neut (7682; 100% of non-empty Gender): mye, helt, godt, litt, langt, samtidig, veldig, mulig, svært, lite
EMPTY (19124): mer, mange, flere, norske, første, store, nye, hele, siste, stor
| Paradigm stor | Masc | Neut |
|---|---|---|
| | stor | stort |
Gender seems to be a lexical feature of ADJ. 100% of lemmas (1366) occur with only one value of Gender.
PROPN
2689 PROPN tokens (15% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (523; 19% of non-empty Gender): Kristin, Marit, Hanne, Hanna, Märtha, Gro, Ingrid, Maria, Marie, Anne
Masc (1931; 72% of non-empty Gender): Jan, Espen, Martin, Olav, Erik, Øyvind, Per, Kjell, Aftenposten, Sverre
Neut (235; 9% of non-empty Gender): Stortinget, Dagbladet, Fremskrittspartiet, Senterpartiet, Stortingets, Sørlandet, Internett, Barentshavet, Norden, Vestlandet
EMPTY (15571): Norge, Obama, Regjeringen, Oslo, USA, Den, Svalbard, Mayen, Cathrine, Bertelsen
Gender seems to be a lexical feature of PROPN. 100% of lemmas (364) occur with only one value of Gender.
NUM
164 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (164; 100%), Number=Sing (164; 100%).
NUM tokens may have the following values of Gender:
Fem (5; 3% of non-empty Gender): halvannen, annenhver
Masc (66; 40% of non-empty Gender): én, halvannen, annenhver, Èn
Neut (93; 57% of non-empty Gender): ett, halvannet, mangt, annethvert
EMPTY (3798): to, tre, fire, eneste, 2, fem, ti, 20, seks, 1
| Paradigm halvannen | Masc | Fem | Neut |
|---|---|---|---|
| | halvannen | halvannen | halvannet |
ADV
1 ADV token (0% of all ADV tokens) has a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): Jo
EMPTY (9970): også, så, nå, bare, her, da, selv, hvor, nok, jo
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (9413; 77%),
NOUN –[nmod]–> PRON (1546; 64%),
ADJ –[expl]–> PRON (552; 90%),
ADJ –[conj]–> ADJ (209; 78%),
DET –[nmod]–> NOUN (121; 66%),
NOUN –[acl]–> NOUN (67; 55%),
PRON –[expl]–> PRON (48; 52%),
DET –[conj]–> DET (27; 90%),
PRON –[acl:relcl]–> ADJ (25; 71%),
PRON –[det]–> DET (24; 51%).
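
Agreement counts of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure behind the figures above: it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, and the `Token` tuple, the function name `agreement_by_relation`, and the three-token sentence are hypothetical.

```python
from collections import Counter, namedtuple

Token = namedtuple("Token", "id upos gender head deprel")

# Hypothetical sentence; the Gender mismatch on token 3 is invented for the example.
SENTENCE = [
    Token(1, "DET", "Masc", 2, "det"),
    Token(2, "NOUN", "Masc", 0, "root"),
    Token(3, "PRON", "Neut", 2, "nmod"),
]

def agreement_by_relation(sentence):
    """Count parent-child pairs per (parent UPOS, deprel, child UPOS).

    Returns (pairs that agree in Gender, pairs where both nodes have Gender).
    """
    by_id = {tok.id: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child.head)  # None for the root, whose head is 0
        if parent is None or parent.gender is None or child.gender is None:
            continue
        key = (parent.upos, child.deprel, child.upos)
        both[key] += 1
        if parent.gender == child.gender:
            agree[key] += 1
    return agree, both

agree, both = agreement_by_relation(SENTENCE)
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]} agree")
```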