Treebank Statistics: UD_Norwegian-Bokmaal: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc
.
95531 tokens (31%) have a non-empty value of Gender
.
22119 types (68%) occur at least once with a non-empty value of Gender
.
14991 lemmas (65%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (56327; 18% instances), ADJ (13740; 4% instances), PRON (11798; 4% instances), DET (10811; 3% instances), PROPN (2689; 1% instances), NUM (164; 0% instances), ADV (1; 0% instances), VERB (1; 0% instances).
NOUN
56327 NOUN tokens (98% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (40172; 71%), Definite=Ind (36324; 64%).
NOUN
tokens may have the following values of Gender
:
Fem
(7728; 14% of non-emptyGender
): tid, kirke, kroner, kvinner, støtte, hjelp, uker, side, mor, endringerFem,Masc
(1; 0% of non-emptyGender
): SportssjefMasc
(31736; 56% of non-emptyGender
): dag, prosent, gang, verden, del, grunn, saken, ganger, ting, millionerNeut
(16862; 30% of non-emptyGender
): år, folk, land, barn, landet, mennesker, livet, spørsmål, forhold, tilleggEMPTY
(926): §, går, fjor, tros-, islam, vare, bord, dr., stede, nr
Paradigm råd | Masc | Fem | Neut |
---|---|---|---|
_ | råd | råd | |
Case=Gen|Definite=Def|Number=Sing | rådets | ||
Definite=Def|Number=Sing | rådet | ||
Definite=Ind|Number=Sing | råd | ||
Definite=Ind|Number=Plur | råd |
Gender
seems to be lexical feature of NOUN
. 94% lemmas (11544) occur only with one value of Gender
.
ADJ
13740 ADJ tokens (51% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Definite=Ind (13738; 100%), Number=Sing (13738; 100%), Degree=Pos (12885; 94%).
ADJ
tokens may have the following values of Gender
:
Fem,Masc
(6055; 44% of non-emptyGender
): stor, ny, god, norsk, liten, politisk, klar, full, sterk, myeMasc
(3; 0% of non-emptyGender
): antiautoritære, stor, straffetNeut
(7682; 56% of non-emptyGender
): mye, helt, godt, litt, langt, samtidig, veldig, mulig, svært, liteEMPTY
(13070): mer, mange, flere, norske, første, store, nye, hele, siste, tidligere
Paradigm stor | Fem,Masc | Masc | Neut |
---|---|---|---|
stor | stor | stort |
PRON
11798 PRON tokens (52% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (11798; 100%), PronType=Prs (11401; 97%), Person=3 (10250; 87%), Animacy=EMPTY (8655; 73%), Case=EMPTY (8655; 73%).
PRON
tokens may have the following values of Gender
:
Fem
(1034; 9% of non-emptyGender
): hun, henne, vår, hans, deres, si, hennes, di, miFem,Masc
(548; 5% of non-emptyGender
): den, noen, denne, ingen, enhver, derMasc
(3161; 27% of non-emptyGender
): han, sin, ham, min, hans, vår, din, deres, hennesNeut
(7055; 60% of non-emptyGender
): det, dette, noe, sitt, alt, mitt, vårt, hans, hennes, dittEMPTY
(10847): jeg, vi, de, seg, du, man, meg, hva, oss, dem
Paradigm sin | Masc | Fem | Neut |
---|---|---|---|
sin | si | sitt |
DET
10811 DET tokens (75% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (10811; 100%), PronType=Art (6182; 57%).
DET
tokens may have the following values of Gender
:
Fem
(360; 3% of non-emptyGender
): den, ei, noen, all, denne, hver, egen, annen, enhver, hvilkenMasc
(6554; 61% of non-emptyGender
): en, den, denne, ingen, annen, hver, egen, slik, noen, allNeut
(3897; 36% of non-emptyGender
): et, det, noe, annet, dette, hvert, eget, alt, slikt, hvilketEMPTY
(3585): de, andre, alle, noen, selv, disse, samme, slike, neste, egne
Paradigm en | Masc | Fem | Neut |
---|---|---|---|
Case=Gen | ens | ||
en | ei | et, at, er, ett |
PROPN
2689 PROPN tokens (15% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(523; 19% of non-emptyGender
): Kristin, Marit, Hanne, Hanna, Märtha, Gro, Ingrid, Maria, Marie, AnneMasc
(1931; 72% of non-emptyGender
): Jan, Espen, Martin, Olav, Erik, Øyvind, Per, Kjell, Aftenposten, SverreNeut
(235; 9% of non-emptyGender
): Stortinget, Dagbladet, Fremskrittspartiet, Senterpartiet, Stortingets, Sørlandet, Internett, Barentshavet, Norden, VestlandetEMPTY
(15571): Norge, Obama, Regjeringen, Oslo, USA, Den, Svalbard, Mayen, Cathrine, Bertelsen
Gender
seems to be lexical feature of PROPN
. 100% lemmas (364) occur only with one value of Gender
.
NUM
164 NUM tokens (4% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (164; 100%), Number=Sing (164; 100%).
NUM
tokens may have the following values of Gender
:
Fem
(5; 3% of non-emptyGender
): halvannen, annenhverMasc
(66; 40% of non-emptyGender
): én, halvannen, annenhver, ÈnNeut
(93; 57% of non-emptyGender
): ett, halvannet, mangt, annethvertEMPTY
(3781): to, tre, fire, eneste, 2, fem, ti, 20, seks, 1
Paradigm halvannen | Masc | Fem | Neut |
---|---|---|---|
halvannen | halvannen | halvannet |
ADV
1 ADV tokens (0% of all ADV
tokens) have a non-empty value of Gender
.
ADV
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): JoEMPTY
(12697): også, så, nå, bare, opp, ut, her, da, selv, hvor
VERB
1 VERB tokens (0% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=Ind (1; 100%), Tense=Pres (1; 100%), VerbForm=Fin,Part (1; 100%).
VERB
tokens may have the following values of Gender
:
Fem,Masc
(1; 100% of non-emptyGender
): overrasketEMPTY
(33350): har, sier, er, blir, kommer, går, ha, få, bli, ta
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (9423; 77%),
NOUN –[det]–> PRON (1490; 73%),
ADJ –[expl]–> PRON (551; 90%),
ADJ –[conj]–> ADJ (473; 81%),
DET –[nmod]–> NOUN (123; 66%),
PRON –[expl]–> PRON (48; 52%),
PRON –[amod]–> ADJ (33; 52%),
DET –[conj]–> DET (27; 90%),
PRON –[det]–> DET (24; 51%),
PRON –[conj]–> PRON (10; 56%).