Treebank Statistics: UD_Swedish-Talbanken: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
33303 tokens (34%) have a non-empty value of Gender.
10151 types (67%) occur at least once with a non-empty value of Gender.
6823 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags (percentages are relative to all tokens in the treebank): NOUN (22559; 23%), PRON (3973; 4%), DET (3712; 4%), ADJ (2940; 3%), NUM (92; 0%), VERB (27; 0%).
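Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the function names and the three-token `SAMPLE` are illustrative only.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Com' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Toy sample (hypothetical annotation, not taken from the treebank).
SAMPLE = "\n".join([
    "# text = ett barn finns",
    "1\tett\ten\tDET\t_\tDefinite=Ind|Gender=Neut|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tbarn\tbarn\tNOUN\t_\tCase=Nom|Definite=Ind|Gender=Neut|Number=Sing\t3\tnsubj\t_\t_",
    "3\tfinns\tfinnas\tVERB\t_\tMood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act\t0\troot\t_\t_",
    "",
])

if __name__ == "__main__":
    with_gender, total = gender_by_upos(SAMPLE)
    for upos in sorted(total):
        print(upos, with_gender[upos], total[upos])
```

Dividing each `with_gender` count by the matching `total` count gives a per-tag share comparable to the "% of all NOUN tokens" figures in the sections below.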
NOUN
22559 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (21440; 95%), Number=Sing (15353; 68%), Definite=Ind (14734; 65%).
NOUN tokens may have the following values of Gender:
Com (15734; 70% of non-empty Gender): del, procent, människor, tid, familjen, kvinnor, man, dag, miljoner, fråga
Fem (1; 0% of non-empty Gender): nuptiam
Masc (1; 0% of non-empty Gender): consensus
Neut (6823; 30% of non-empty Gender): år, barn, äktenskapet, barnen, sätt, samhället, arbete, fall, äktenskap, barnet
EMPTY (436): kr, %, dr, s., kap., proc, KPI, milj, mån, kl
| Paradigm äktenskap | Neut | Com |
|---|---|---|
| Case=Gen\|Definite=Def\|Number=Sing | äktenskapets | äktenskapens |
| Case=Gen\|Definite=Ind\|Number=Sing | äktenskaps | |
| Case=Gen\|Definite=Ind\|Number=Plur | äktenskaps | |
| Case=Nom\|Definite=Def\|Number=Sing | äktenskapet | äktenskapen |
| Case=Nom\|Definite=Ind\|Number=Sing | äktenskap | |
| Case=Nom\|Definite=Ind\|Number=Plur | äktenskap | |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (5874) occur with only one value of Gender.
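The "lexical feature" test can be sketched as follows: collect the set of Gender values seen for each lemma and report how many lemmas have exactly one. This is an assumption about how such a share might be computed, not the page generator's actual code; `single_valued_share` and the toy pairs are hypothetical, with `äktenskap` given both values as in the paradigm table above.

```python
from collections import defaultdict


def single_valued_share(pairs):
    """pairs: iterable of (lemma, gender) tuples; gender is None when empty.

    Returns (number of lemmas with exactly one Gender value,
             that number divided by all lemmas that have any Gender value).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens without a Gender value
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Toy input: one lemma with two values, two lemmas with one, one without Gender.
TOY_PAIRS = [
    ("äktenskap", "Neut"),
    ("äktenskap", "Com"),
    ("barn", "Neut"),
    ("barn", "Neut"),
    ("dag", "Com"),
    ("kr", None),
]

if __name__ == "__main__":
    count, share = single_valued_share(TOY_PAIRS)
    print(count, round(share, 2))
```

A share close to 1 suggests that the feature is fixed per lemma (lexical) and does not vary by inflection.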
PRON
3973 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (3607; 91%), Number=Sing (3555; 89%), Definite=Def (2946; 74%), PronType=Prs (2848; 72%), Case=EMPTY (2292; 58%).
PRON tokens may have the following values of Gender:
Com (2347; 59% of non-empty Gender): man, vi, den, du, sin, han, jag, oss, hon, en
Neut (1626; 41% of non-empty Gender): det, detta, sitt, något, vad, vårt, allt, vilket, annat, ditt
EMPTY (2773): som, de, sig, dem, sina, vad, deras, våra, andra, många
| Paradigm den | Neut | Com |
|---|---|---|
| ExtPos=ADV\|PronType=Prs | det | |
| PronType=Dem | det | den |
| PronType=Prs | det | den |
DET
3712 DET tokens (76% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3712; 100%), PronType=Art (3209; 86%), Definite=Ind (2319; 62%).
DET tokens may have the following values of Gender:
Com (2577; 69% of non-empty Gender): en, den, denna, någon, ingen, vilken, var, all, varannan, nån
Neut (1135; 31% of non-empty Gender): ett, det, detta, något, allt, inget, vilket, vart, vartannat
EMPTY (1181): de, alla, varje, dessa, några, vilka, båda, ena, inga, bägge
| Paradigm en | Neut | Com |
|---|---|---|
| Definite=Def | den | |
| Definite=Ind | ett | en |
ADJ
2940 ADJ tokens (34% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (2935; 100%), Degree=Pos (2930; 100%), Number=Sing (2929; 100%), Definite=Ind (2893; 98%), Tense=EMPTY (2511; 85%), VerbForm=EMPTY (2511; 85%).
ADJ tokens may have the following values of Gender:
Com (1920; 65% of non-empty Gender): stor, annan, själv, sådan, viss, egen, ny, hög, kristen, social
Neut (1020; 35% of non-empty Gender): annat, svårt, nytt, möjligt, sådant, viktigt, eget, socialt, stort, övrigt
EMPTY (5613): olika, andra, nya, många, stora, samma, större, vissa, första, hela
| Paradigm stor | Neut | Com |
|---|---|---|
| Definite=Def\|Degree=Sup | störste | |
| Definite=Ind\|Degree=Pos\|Number=Sing | stort | stor |
NUM
92 NUM tokens (5% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (92; 100%), NumType=Card (92; 100%).
NUM tokens may have the following values of Gender:
Com (60; 65% of non-empty Gender): en
Neut (32; 35% of non-empty Gender): ett
EMPTY (1648): två, tre, 1, 20, 2, 1970, 3, 10, 1971, 7
| Paradigm en | Neut | Com |
|---|---|---|
| | ett | en |
VERB
27 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (27; 100%), Voice=Pass (27; 100%), Tense=Past (20; 74%), VerbForm=Part (20; 74%).
VERB tokens may have the following values of Gender:
Com (21; 78% of non-empty Gender): vald, vänd, hörselskadad, accepterad, förstärkt, förändrad, ifylld, komplicerad, likställd, lämnad
Neut (6; 22% of non-empty Gender): förbjudet, opåverkat, reglerat, sysselsatt, tillgodosett, upplagt
EMPTY (9763): har, finns, blir, få, får, ha, gäller, behöver, ger, går
Gender seems to be a lexical feature of VERB: 100% of lemmas (22) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3477; 77%),
NOUN –[nmod]–> NOUN (1733; 54%),
NOUN –[conj]–> NOUN (1399; 67%),
NOUN –[nmod:poss]–> NOUN (552; 60%),
NOUN –[nmod:poss]–> PRON (352; 51%),
NOUN –[appos]–> NOUN (173; 64%),
NOUN –[nsubj]–> NOUN (151; 56%),
NOUN –[obl]–> NOUN (121; 51%),
ADJ –[conj]–> ADJ (106; 70%),
ADJ –[expl]–> PRON (105; 85%).
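
Agreement counts like those above can be approximated by walking each dependency edge and comparing the Gender values of parent and child. The sketch below is an assumption about the method, not the generator's code: it counts only edges where both nodes carry a Gender value, which may differ from the denominator behind the percentages on this page. The function names and the toy sample, including its deliberate mismatch, are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict ('_' means no features)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_sentences(conllu_text):
    """Yield each sentence as a dict: token id -> (upos, feats, head, deprel)."""
    sentence = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        sentence[int(cols[0])] = (
            cols[3], parse_feats(cols[5]), int(cols[6]), cols[7]
        )
    if sentence:
        yield sentence


def gender_agreement(conllu_text):
    """Return (agreeing edges, edges where both nodes have Gender),
    each keyed by (parent UPOS, relation, child UPOS)."""
    agree = Counter()
    both = Counter()
    for sentence in read_sentences(conllu_text):
        for upos, feats, head, deprel in sentence.values():
            if head not in sentence:
                continue  # root (head 0) or head outside the sentence
            parent_upos, parent_feats, _, _ = sentence[head]
            child_gender = feats.get("Gender")
            parent_gender = parent_feats.get("Gender")
            if child_gender is None or parent_gender is None:
                continue
            key = (parent_upos, deprel, upos)
            both[key] += 1
            if child_gender == parent_gender:
                agree[key] += 1
    return agree, both


# Toy sample: the second sentence is an artificial Gender mismatch.
SAMPLE = "\n".join([
    "1\tett\ten\tDET\t_\tGender=Neut|Number=Sing\t2\tdet\t_\t_",
    "2\tbarn\tbarn\tNOUN\t_\tGender=Neut|Number=Sing\t0\troot\t_\t_",
    "",
    "1\ten\ten\tDET\t_\tGender=Com|Number=Sing\t2\tdet\t_\t_",
    "2\tbarn\tbarn\tNOUN\t_\tGender=Neut|Number=Sing\t0\troot\t_\t_",
    "",
])

if __name__ == "__main__":
    agree, both = gender_agreement(SAMPLE)
    for key in both:
        print(key, agree[key], both[key])
```

Keys follow the same parent, relation, child order as the list above, so `("NOUN", "det", "DET")` corresponds to NOUN –[det]–> DET.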