Gender
: gender
Gender is a lexical feature of nouns and an inflectional feature of other parts of speech (adjectives, determiners) that mark agreement with nouns.
Com
: common gender
Swedish does not distinguish masculine/feminine most of the time but does distinguish neuter vs. non-neuter (neutrum/utrum). The non-neuter is called common gender.
Examples
- [sv] en ogift mor “an unmarried mother”
- [sv] anmälan blir ifylld “the report is filled in”
Neut
: neuter gender
Neuter gender covers nouns and other parts of speech that are grammatically neither masculine nor feminine.
Examples
- [sv] fullt bostadstillägg “full housing supplement”
- [sv] det första kravet “the first requirement”
Masc
: masculine gender
Some adjectives, when referring specifically to a male person, take a masculine form in Swedish.
Examples
- [sv] den arbetslöse “the unemployed (man)”
- [sv] den brittiske författaren “the British author”
Treebank Statistics (UD_Swedish)
This feature is universal.
It occurs with 3 different values: Com, Masc, Neut.
33612 tokens (35%) have a non-empty value of Gender.
10232 types (68%) occur at least once with a non-empty value of Gender.
6848 lemmas (67%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: sv-pos/NOUN (22564; 23% instances), sv-pos/DET (4082; 4% instances), sv-pos/PRON (3706; 4% instances), sv-pos/ADJ (3141; 3% instances), sv-pos/NUM (92; 0% instances), sv-pos/VERB (27; 0% instances).
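Counts of this kind can be reproduced from a CoNLL-U file by reading the UPOS and FEATS columns. The following is a minimal sketch, not the script that produced the figures above: the three-token sample and its annotations are hypothetical, and `parse_feats` and `gender_counts` are illustrative helper names.

```python
from collections import Counter

# Hypothetical three-token sample in CoNLL-U format (10 tab-separated columns).
# The annotations are illustrative and not copied from the treebank.
SAMPLE = "\n".join([
    "# text = det första kravet",
    "1\tdet\tden\tDET\t_\tDefinite=Def|Gender=Neut|Number=Sing\t3\tdet\t_\t_",
    "2\tförsta\tförsta\tADJ\t_\tCase=Nom|Definite=Def|Degree=Pos\t3\tamod\t_\t_",
    "3\tkravet\tkrav\tNOUN\t_\tCase=Nom|Definite=Def|Gender=Neut|Number=Sing\t0\troot\t_\t_",
])


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Neut' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts(conllu_text):
    """Count (UPOS, Gender) pairs; tokens without Gender count as 'EMPTY'."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        gender = parse_feats(cols[5]).get("Gender", "EMPTY")
        counts[(cols[3], gender)] += 1
    return counts


counts = gender_counts(SAMPLE)
for (upos, gender), n in sorted(counts.items()):
    print(upos, gender, n)
```

Dividing each count by the total number of tokens, or by the number of tokens of the given part of speech, would give percentages comparable to those reported on this page.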
NOUN
22564 sv-pos/NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (21443; 95%), Number=Sing (15356; 68%), Definite=Ind (14738; 65%).
NOUN tokens may have the following values of Gender:
- Com (15734; 70% of non-empty Gender): del, procent, människor, tid, familjen, kvinnor, man, dag, miljoner, fråga
- Neut (6830; 30% of non-empty Gender): år, barn, äktenskapet, barnen, sätt, samhället, arbete, fall, äktenskap, barnet
- EMPTY (420): kr, %, dr, s., kap., proc, KPI, milj, mån, sid
Paradigm äktenskap | Neut | Com |
---|---|---|
Case=Gen\|Definite=Def\|Number=Sing | äktenskapets | äktenskapens |
Case=Gen\|Definite=Ind\|Number=Sing | äktenskaps | |
Case=Gen\|Definite=Ind\|Number=Plur | äktenskaps | |
Case=Nom\|Definite=Def\|Number=Sing | äktenskapet | äktenskapen |
Case=Nom\|Definite=Ind\|Number=Sing | äktenskap | |
Case=Nom\|Definite=Ind\|Number=Plur | äktenskap | |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (5834) occur with only one value of Gender.
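The lexical-feature check amounts to grouping tokens by lemma and asking how many lemmas are seen with exactly one Gender value. A minimal sketch follows, assuming the (lemma, Gender) pairs have already been extracted from the treebank; the observations listed here are hypothetical and `single_value_share` is an illustrative name.

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for NOUN tokens.
OBSERVATIONS = [
    ("äktenskap", "Neut"),
    ("äktenskap", "Com"),
    ("barn", "Neut"),
    ("barn", "Neut"),
    ("tid", "Com"),
    ("fråga", "Com"),
]


def single_value_share(observations):
    """Return (count, share) of lemmas seen with exactly one Gender value."""
    values_by_lemma = defaultdict(set)
    for lemma, gender in observations:
        values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, single / len(values_by_lemma)


single, share = single_value_share(OBSERVATIONS)
print(f"{share:.0%} of lemmas ({single}) occur with only one value of Gender")
```

A lemma such as äktenskap, which appears with both Neut and Com in the paradigm above, is what keeps the share below 100%.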
DET
4082 sv-pos/DET tokens (72% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4082; 100%), Poss=EMPTY (3717; 91%), Definite=Ind (2324; 57%).
DET tokens may have the following values of Gender:
- Com (2814; 69% of non-empty Gender): en, den, sin, denna, någon, vår, ingen, vilken, din, var
- Neut (1268; 31% of non-empty Gender): ett, det, detta, sitt, något, vårt, allt, ditt, inget, vilket
- EMPTY (1574): de, alla, varje, dessa, samma, sina, deras, några, våra, vilka
Paradigm en | Neut | Com |
---|---|---|
Definite=Def | det | den |
Definite=Ind | ett | en |
PRON
3706 sv-pos/PRON tokens (62% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: PronType=EMPTY (3526; 95%), Number=Sing (3288; 89%), Definite=Def (2577; 70%), Case=EMPTY (2026; 55%).
PRON tokens may have the following values of Gender:
- Com (2100; 57% of non-empty Gender): man, vi, den, du, han, jag, oss, hon, en, dig
- Masc (2; 0% of non-empty Gender): denne
- Neut (1604; 43% of non-empty Gender): det, detta, vad, något, allt, vilket, mycket, annat, detsamma, ett
- EMPTY (2269): som, de, sig, dem, andra, många, alla, varandra, dessa, vilka
Paradigm denna | Masc | Neut | Com |
---|---|---|---|
 | denne | detta | denna |
ADJ
3141 sv-pos/ADJ tokens (37% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (3136; 100%), Case=Nom (3130; 100%), Definite=Ind (3030; 96%), VerbForm=EMPTY (2578; 82%), Tense=EMPTY (2578; 82%), Degree=Pos (2573; 82%).
ADJ tokens may have the following values of Gender:
- Com (1978; 63% of non-empty Gender): stor, annan, själv, sådan, viss, egen, ny, hög, kristen, social
- Masc (46; 1% of non-empty Gender): enskilde, andre, sjuke, ene, enskildes, lille, skattskyldige, unge, amerikanske, andres
- Neut (1117; 36% of non-empty Gender): annat, svårt, nytt, möjligt, sådant, viktigt, eget, socialt, stort, övrigt
- EMPTY (5357): olika, andra, nya, många, stora, större, första, vissa, hela, flera
Paradigm stor | Masc | Neut | Com |
---|---|---|---|
Definite=Def\|Degree=Sup | störste | | |
Definite=Ind\|Degree=Pos | | stort | stor |
NUM
92 sv-pos/NUM tokens (5% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (92; 100%).
NUM tokens may have the following values of Gender:
- Com (60; 65% of non-empty Gender): en
- Neut (32; 35% of non-empty Gender): ett
- EMPTY (1649): två, tre, 1, 20, 2, 1970, 3, 10, 1971, 7
Paradigm en | Neut | Com |
---|---|---|
 | ett | en |
VERB
27 sv-pos/VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=Past (27; 100%), Voice=EMPTY (27; 100%), VerbForm=Part (27; 100%), Mood=EMPTY (27; 100%).
VERB tokens may have the following values of Gender:
- Com (21; 78% of non-empty Gender): vald, vänd, hörselskadad, accepterad, förstärkt, förändrad, ifylld, komplicerad, likställd, lämnad
- Neut (6; 22% of non-empty Gender): förbjudet, opåverkat, reglerat, sysselsatt, tillgodosett, upplagt
- EMPTY (9942): har, finns, är, blir, få, får, ha, gäller, ger, går
Gender seems to be a lexical feature of VERB. 100% of lemmas (22) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3459; 76%),
NOUN –[nmod]–> NOUN (1688; 53%),
NOUN –[conj]–> NOUN (1318; 68%),
NOUN –[nmod:poss]–> NOUN (552; 60%),
NOUN –[nmod:poss]–> DET (339; 51%),
NOUN –[appos]–> NOUN (183; 64%),
NOUN –[nsubj]–> NOUN (165; 56%),
ADJ –[conj]–> ADJ (125; 80%),
ADJ –[expl]–> PRON (112; 93%),
ADJ –[nsubj]–> PRON (93; 52%).
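
Agreement counts like these can be gathered by walking each dependency edge and comparing the Gender values of parent and child. The sketch below is one possible way to do that under stated assumptions: the token tuples are a hypothetical encoding of the example phrase en ogift mor, `agreement_by_relation` is an illustrative name, and the denominator (edges where both nodes carry a Gender value) is an assumption, since the basis of the percentages above is not stated here.

```python
from collections import Counter

# Hypothetical tokens of one sentence: (id, head, upos, deprel, gender or None).
TOKENS = [
    (1, 3, "DET", "det", "Com"),    # en
    (2, 3, "ADJ", "amod", "Com"),   # ogift
    (3, 0, "NOUN", "root", "Com"),  # mor
]


def agreement_by_relation(tokens):
    """Count edges where both nodes have Gender, and those where it matches.

    Keys are (parent UPOS, deprel, child UPOS) triples.
    """
    by_id = {tok[0]: tok for tok in tokens}
    both = Counter()
    agree = Counter()
    for _tok_id, head, upos, deprel, gender in tokens:
        parent = by_id.get(head)  # None for the root, whose head is 0
        if parent is None or gender is None or parent[4] is None:
            continue
        key = (parent[2], deprel, upos)
        both[key] += 1
        if gender == parent[4]:
            agree[key] += 1
    return agree, both


agree, both = agreement_by_relation(TOKENS)
for key in sorted(both):
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {agree[key] / both[key]:.0%})")
```

Token ids restart in every sentence, so a full treebank would need to be processed one sentence at a time.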