Treebank Statistics: UD_Norwegian-Bokmaal: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
Some words have combined values of the feature; 1 combination has been observed: Fem|Masc.
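In CoNLL-U files, the features in the FEATS column are separated by vertical bars and multiple values of one feature are separated by commas, so a combined value would appear as `Gender=Fem,Masc`. The following is a minimal sketch of how such a string could be split; the function name `parse_feats` is hypothetical and not part of any treebank tooling.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into {feature name: [values]}.

    Hypothetical helper for illustration only.
    """
    if feats == "_":  # "_" marks an empty FEATS column
        return {}
    parsed = {}
    for pair in feats.split("|"):        # features are separated by "|"
        name, _, values = pair.partition("=")
        parsed[name] = values.split(",")  # multiple values are separated by ","
    return parsed


example = parse_feats("Gender=Fem,Masc|Number=Sing")
print(example["Gender"])
```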
95531 tokens (31%) have a non-empty value of Gender.
22119 types (68%) occur at least once with a non-empty value of Gender.
14989 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (56327; 18% instances), ADJ (13740; 4% instances), PRON (11798; 4% instances), DET (10811; 3% instances), PROPN (2689; 1% instances), NUM (164; 0% instances), ADV (1; 0% instances), VERB (1; 0% instances).
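Counts like the ones above can be approximated directly from a CoNLL-U file. The sketch below is one possible way to do it, not the script that produced this page: the sample sentence, the helper names `row`, `has_gender` and `gender_coverage`, and the decision to skip multiword-token ranges and empty nodes are all assumptions made for illustration.

```python
from collections import Counter


def row(i, form, lemma, upos, feats, head, deprel):
    """Build one tab-separated CoNLL-U token line (XPOS, DEPS, MISC left as '_')."""
    return "\t".join([str(i), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"])


# Hypothetical sample sentence; the annotation is illustrative, not copied from the treebank.
SAMPLE = "\n".join([
    "# text = Hun leser en bok .",
    row(1, "Hun", "hun", "PRON", "Gender=Fem|Number=Sing|PronType=Prs", 2, "nsubj"),
    row(2, "leser", "lese", "VERB", "Mood=Ind|Tense=Pres|VerbForm=Fin", 0, "root"),
    row(3, "en", "en", "DET", "Gender=Masc|Number=Sing|PronType=Art", 4, "det"),
    row(4, "bok", "bok", "NOUN", "Definite=Ind|Gender=Masc|Number=Sing", 2, "obj"),
    row(5, ".", ".", "PUNCT", "_", 2, "punct"),
])


def has_gender(feats):
    """Return True if the FEATS string contains a Gender feature."""
    return any(pair.startswith("Gender=") for pair in feats.split("|"))


def gender_coverage(conllu_text):
    """Count tokens with Gender, all tokens, and tokens with Gender per UPOS tag."""
    total = 0
    with_gender = 0
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence boundaries and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: ignore multiword ranges (1-2) and empty nodes (1.1)
        total += 1
        if has_gender(cols[5]):  # column 6 is FEATS
            with_gender += 1
            per_upos[cols[3]] += 1  # column 4 is UPOS
    return with_gender, total, per_upos


with_gender, total, per_upos = gender_coverage(SAMPLE)
print(f"{with_gender} of {total} tokens have a non-empty value of Gender")
print(per_upos.most_common())
```

Type and lemma counts could be obtained the same way by collecting the FORM or LEMMA column into a set instead of incrementing a counter.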
NOUN
56327 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40172; 71%), Definite=Ind (36324; 64%).
NOUN tokens may have the following values of Gender:
Com (1; 0% of non-empty Gender): Sportssjef
Fem (7728; 14% of non-empty Gender): tid, kirke, kroner, kvinner, støtte, hjelp, uker, side, mor, endringer
Masc (31736; 56% of non-empty Gender): dag, prosent, gang, verden, del, grunn, saken, ganger, ting, millioner
Neut (16862; 30% of non-empty Gender): år, folk, land, barn, landet, mennesker, livet, spørsmål, forhold, tillegg
EMPTY (926): §, går, fjor, tros-, islam, vare, bord, dr., stede, nr
| Paradigm råd | Masc | Fem | Neut |
|---|---|---|---|
| _ | råd | råd | |
| Case=Gen\|Definite=Def\|Number=Sing | rådets | | |
| Definite=Def\|Number=Sing | rådet | | |
| Definite=Ind\|Number=Sing | råd | | |
| Definite=Ind\|Number=Plur | råd | | |
Gender seems to be a lexical feature of NOUN: 94% of lemmas (11544) occur with only one value of Gender.
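The "lexical feature" test can be sketched as follows: collect the set of Gender values seen with each lemma of a given part of speech, then count the lemmas whose set has exactly one member. This is an illustration under assumptions, not the page's actual computation: the names `gender_value` and `single_gender_share` and the token list are hypothetical, lemmas that never carry Gender are left out of the denominator, and a combined value such as `Fem,Masc` is treated as one distinct value.

```python
from collections import defaultdict


def gender_value(feats):
    """Return the raw Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def single_gender_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        value = gender_value(feats)
        if tag == upos and value is not None:
            values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Hypothetical (lemma, UPOS, FEATS) triples; not real treebank counts.
TOKENS = [
    ("råd", "NOUN", "Definite=Ind|Gender=Neut|Number=Sing"),
    ("råd", "NOUN", "Definite=Ind|Gender=Masc|Number=Sing"),
    ("dag", "NOUN", "Definite=Ind|Gender=Masc|Number=Sing"),
    ("bok", "NOUN", "Definite=Ind|Gender=Fem|Number=Sing"),
    ("år", "NOUN", "Definite=Ind|Gender=Neut|Number=Sing"),
    ("fjor", "NOUN", "_"),
    ("stor", "ADJ", "Definite=Ind|Gender=Neut|Number=Sing"),
]

single, total = single_gender_share(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```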
ADJ
13740 ADJ tokens (51% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (13738; 100%), Number=Sing (13738; 100%), Degree=Pos (12885; 94%).
ADJ tokens may have the following values of Gender:
Com (6055; 44% of non-empty Gender): stor, ny, god, norsk, liten, politisk, klar, full, sterk, mye
Masc (3; 0% of non-empty Gender): antiautoritære, stor, straffet
Neut (7682; 56% of non-empty Gender): mye, helt, godt, litt, langt, samtidig, veldig, mulig, svært, lite
EMPTY (13070): mer, mange, flere, norske, første, store, nye, hele, siste, tidligere
| Paradigm stor | Masc | Neut | Com |
|---|---|---|---|
| | stor | stort | stor |
PRON
11798 PRON tokens (52% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (11798; 100%), PronType=Prs (11401; 97%), Person=3 (10250; 87%), Animacy=EMPTY (8655; 73%), Case=EMPTY (8655; 73%).
PRON tokens may have the following values of Gender:
Fem (1034; 9% of non-empty Gender): hun, henne, vår, hans, deres, si, hennes, di, mi
Fem,Masc (548; 5% of non-empty Gender): den, noen, denne, ingen, enhver, der
Masc (3161; 27% of non-empty Gender): han, sin, ham, min, hans, vår, din, deres, hennes
Neut (7055; 60% of non-empty Gender): det, dette, noe, sitt, alt, mitt, vårt, hans, hennes, ditt
EMPTY (10847): jeg, vi, de, seg, du, man, meg, hva, oss, dem
| Paradigm sin | Masc | Fem | Neut |
|---|---|---|---|
| | sin | si | sitt |
DET
10811 DET tokens (75% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (10811; 100%), PronType=Art (8769; 81%).
DET tokens may have the following values of Gender:
Fem (360; 3% of non-empty Gender): den, ei, noen, all, denne, hver, egen, annen, enhver, hvilken
Masc (6554; 61% of non-empty Gender): en, den, denne, ingen, annen, hver, egen, slik, noen, all
Neut (3897; 36% of non-empty Gender): et, det, noe, annet, dette, hvert, eget, alt, slikt, hvilket
EMPTY (3585): de, andre, alle, noen, selv, disse, samme, slike, neste, egne
| Paradigm en | Masc | Fem | Neut |
|---|---|---|---|
| Case=Gen | ens | | |
| | en | ei | et, at, er, ett |
PROPN
2689 PROPN tokens (15% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (523; 19% of non-empty Gender): Kristin, Marit, Hanne, Hanna, Märtha, Gro, Ingrid, Maria, Marie, Anne
Masc (1931; 72% of non-empty Gender): Jan, Espen, Martin, Olav, Erik, Øyvind, Per, Kjell, Aftenposten, Sverre
Neut (235; 9% of non-empty Gender): Stortinget, Dagbladet, Fremskrittspartiet, Senterpartiet, Stortingets, Sørlandet, Internett, Barentshavet, Norden, Vestlandet
EMPTY (15540): Norge, Obama, Regjeringen, Oslo, USA, Den, Svalbard, Mayen, Cathrine, Bertelsen
Gender seems to be a lexical feature of PROPN: 100% of lemmas (364) occur with only one value of Gender.
NUM
164 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (164; 100%), Number=Sing (164; 100%).
NUM tokens may have the following values of Gender:
Fem (5; 3% of non-empty Gender): halvannen, annenhver
Masc (66; 40% of non-empty Gender): én, halvannen, annenhver, Èn
Neut (93; 57% of non-empty Gender): ett, halvannet, mangt, annethvert
EMPTY (3781): to, tre, fire, eneste, 2, fem, ti, 20, seks, 1
| Paradigm halvannen | Masc | Fem | Neut |
|---|---|---|---|
| | halvannen | halvannen | halvannet |
ADV
1 ADV token (0% of all ADV tokens) has a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): Jo
EMPTY (12697): også, så, nå, bare, opp, ut, her, da, selv, hvor
VERB
1 VERB token (0% of all VERB tokens) has a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (1; 100%), Tense=Pres (1; 100%), VerbForm=Fin,Part (1; 100%).
VERB tokens may have the following values of Gender:
Com (1; 100% of non-empty Gender): overrasket
EMPTY (33350): har, sier, er, blir, kommer, går, ha, få, bli, ta
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (9423; 77%),
NOUN –[nmod:poss]–> PRON (1488; 74%),
ADJ –[expl]–> PRON (551; 90%),
ADJ –[conj]–> ADJ (473; 81%),
DET –[nmod]–> NOUN (123; 66%),
PRON –[expl]–> PRON (48; 52%),
PRON –[amod]–> ADJ (32; 51%),
DET –[conj]–> DET (27; 90%),
PRON –[det]–> DET (24; 51%),
PRON –[conj]–> PRON (10; 56%).
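Agreement figures of this kind could be gathered by walking every dependency edge and comparing the Gender of the parent and the child. The sketch below assumes that the percentage is the share of agreeing edges among those edges of the same type where both nodes carry Gender; the page does not state this, so treat it as a guess. The sample sentences and the names `row`, `gender_of`, `sentences` and `agreement_counts` are hypothetical, and Gender values are compared as whole strings, so `Fem,Masc` would not count as agreeing with `Masc`.

```python
from collections import Counter


def row(i, form, lemma, upos, feats, head, deprel):
    """Build one tab-separated CoNLL-U token line (XPOS, DEPS, MISC left as '_')."""
    return "\t".join([str(i), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"])


# Two hypothetical sentences separated by a blank line; annotation is illustrative only.
SAMPLE = "\n".join([
    row(1, "et", "en", "DET", "Gender=Neut|Number=Sing", 3, "det"),
    row(2, "stort", "stor", "ADJ", "Gender=Neut|Number=Sing", 3, "amod"),
    row(3, "hus", "hus", "NOUN", "Gender=Neut|Number=Sing", 0, "root"),
    "",
    row(1, "en", "en", "DET", "Gender=Masc|Number=Sing", 2, "det"),
    row(2, "bok", "bok", "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
])


def gender_of(feats):
    """Return the raw Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def sentences(conllu_text):
    """Yield each sentence as a list of column lists, skipping comments and non-word lines."""
    block = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if block:
                yield block
                block = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # ignore multiword ranges and empty nodes
                block.append(cols)
    if block:
        yield block


def agreement_counts(conllu_text):
    """Count edges where both nodes have Gender, and how many of those agree."""
    total, agree = Counter(), Counter()
    for sent in sentences(conllu_text):
        by_id = {cols[0]: cols for cols in sent}
        for cols in sent:
            parent = by_id.get(cols[6])  # column 7 is HEAD; "0" (root) has no parent
            if parent is None:
                continue
            child_gender, parent_gender = gender_of(cols[5]), gender_of(parent[5])
            if child_gender is None or parent_gender is None:
                continue
            key = (parent[3], cols[7], cols[3])  # (parent UPOS, relation, child UPOS)
            total[key] += 1
            if child_gender == parent_gender:
                agree[key] += 1
    return total, agree


total, agree = agreement_counts(SAMPLE)
for (parent_upos, deprel, child_upos), n in total.most_common():
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[(parent_upos, deprel, child_upos)]} of {n} agree")
```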