Treebank Statistics: UD_Swedish-LinES: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
33932 tokens (33%) have a non-empty value of Gender.
9298 types (60%) occur at least once with a non-empty value of Gender.
6520 lemmas (60%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (17665; 17% of all tokens), PRON (8847; 9% of all tokens), DET (4528; 4% of all tokens), ADJ (2840; 3% of all tokens), PROPN (34; 0% of all tokens), NUM (9; 0% of all tokens), VERB (6; 0% of all tokens), X (3; 0% of all tokens).
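Counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6); the function names and the four-token sample sentence are invented for illustration and are not taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Com' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_coverage(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Gender."""
    total = Counter()
    with_gender = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender


# Hypothetical toy sentence, not an actual treebank sentence.
SAMPLE = "\n".join([
    "# text = ett barn såg mannen",
    "1\tett\ten\tDET\t_\tDefinite=Ind|Gender=Neut|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tbarn\tbarn\tNOUN\t_\tCase=Nom|Definite=Ind|Gender=Neut|Number=Sing\t3\tnsubj\t_\t_",
    "3\tsåg\tse\tVERB\t_\tMood=Ind|Tense=Past|VerbForm=Fin|Voice=Act\t0\troot\t_\t_",
    "4\tmannen\tman\tNOUN\t_\tCase=Nom|Definite=Def|Gender=Com|Number=Sing\t3\tobj\t_\t_",
])

total, with_gender = gender_coverage(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

Run over a full treebank file, the per-tag counts would correspond to the first number in each parenthesis above; the percentages then follow by dividing by the overall token count.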
NOUN
17665 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (17114; 97%), Number=Sing (12907; 73%), Definite=Ind (11539; 65%).
NOUN tokens may have the following values of Gender:
Com (12357; 70% of non-empty Gender): gång, far, man, sidan, del, väg, tiden, mor, människor, fråga
Neut (5308; 30% of non-empty Gender): sätt, år, fält, barn, data, ögon, liv, ansikte, exempel, huvudet
EMPTY (175): går, Language, Web, tack, vänster, Components, Server, Engine, Stylesheet, dansande
| Paradigm man | Neut | Com |
|---|---|---|
| Case=Gen\|Definite=Def\|Number=Sing | mannens | |
| Case=Gen\|Definite=Ind\|Number=Sing | mans | |
| Case=Nom\|Definite=Def\|Number=Sing | mannen | |
| Case=Nom\|Definite=Def\|Number=Plur | männen | männen |
| Case=Nom\|Definite=Ind\|Number=Sing | man | |
| Case=Nom\|Definite=Ind\|Number=Plur | män | män, man |
Gender seems to be a lexical feature of NOUN: 97% of lemmas (5252) occur with only one value of Gender.
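The "lexical feature" check amounts to asking how many lemmas are ever seen with more than one Gender value. A minimal sketch of that computation, assuming the input is a list of (lemma, Gender) pairs already extracted from the treebank; the function name and the toy pairs are hypothetical:

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens with an empty Gender
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy data for illustration only; "man" is listed with both values,
# mirroring the paradigm table above.
toy_pairs = [
    ("man", "Com"), ("man", "Neut"),
    ("barn", "Neut"),
    ("väg", "Com"), ("väg", "Com"),
    ("tack", None),
]

single, total = single_value_share(toy_pairs)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100%, as reported for NOUN, suggests that the value is a property of the lemma rather than of the individual word form.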
PRON
8847 PRON tokens (71% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (8416; 95%), Poss=EMPTY (8076; 91%), Definite=Def (7716; 87%), PronType=Prs (7634; 86%), Case=Nom (4526; 51%).
PRON tokens may have the following values of Gender:
Com (6366; 72% of non-empty Gender): han, jag, du, vi, hon, mig, honom, man, den, sin
Neut (2481; 28% of non-empty Gender): det, vad, sitt, detta, allt, något, ingenting, mitt, vilket, någonting
EMPTY (3688): som, sig, de, hans, dem, sina, hennes, deras, mina, alla
| Paradigm den | Neut | Com |
|---|---|---|
| Case=Nom\|Definite=Def\|PronType=Prs | den | |
| Definite=Def\|ExtPos=ADV\|PronType=Prs | Det | |
| Definite=Def\|PronType=Art | det | |
| Definite=Def\|PronType=Dem | Det | den |
| Definite=Def\|PronType=Prs | det | den |
| Definite=Ind\|PronType=Prs | det | |
DET
4528 DET tokens (85% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4528; 100%), PronType=Art (3970; 88%), Definite=Ind (3177; 70%).
DET tokens may have the following values of Gender:
Com (3038; 67% of non-empty Gender): en, den, någon, denna, ingen, all, varenda, vilken, denne, nån
Neut (1490; 33% of non-empty Gender): ett, det, något, detta, inget, allt, vilket, nåt, intet, vartenda
EMPTY (795): de, alla, några, varje, dessa, ena, inga, båda, vilka, dom
| Paradigm en | Neut | Com |
|---|---|---|
| | ett | en |
ADJ
2840 ADJ tokens (40% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2838; 100%), Case=Nom (2832; 100%), Definite=Ind (2804; 99%), Degree=Pos (2796; 98%), Tense=EMPTY (2561; 90%), VerbForm=EMPTY (2551; 90%).
ADJ tokens may have the following values of Gender:
Com (1874; 66% of non-empty Gender): själv, stor, annan, egen, liten, lång, vit, sådan, gammal, ung
Neut (966; 34% of non-empty Gender): annat, stort, eget, nytt, svårt, litet, möjligt, rött, taget, klart
EMPTY (4315): andra, hela, samma, första, många, enda, flera, stora, nya, vita
| Paradigm annan | Neut | Com |
|---|---|---|
| | annat | annan |
PROPN
34 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (32; 94%), Number=Sing (32; 94%).
PROPN tokens may have the following values of Gender:
Com (30; 88% of non-empty Gender): Stella, Athena, Paniotis, Ryan, Alice, Jove, Aten, Dior, Hefaistos, Lutyens
Neut (4; 12% of non-empty Gender): Cunards, Jung, Kolonakitorget, Venezuela
EMPTY (2938): Harry, Quinn, Stillman, Bray, Auster, Access, Microsoft, Weasley, Ron, Dobby
Gender seems to be a lexical feature of PROPN: 100% of lemmas (20) occur with only one value of Gender.
NUM
9 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Com (4; 44% of non-empty Gender): en
Neut (5; 56% of non-empty Gender): ett, Beckett
EMPTY (525): två, tre, en, fem, sex, fyra, tio, 1, 2, 2000
| Paradigm en | Neut | Com |
|---|---|---|
| Definite=Ind | ett | en |
| NumType=Card | Ett | |
VERB
6 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6; 100%), Voice=EMPTY (5; 83%), Tense=Past (4; 67%), VerbForm=Part (4; 67%).
VERB tokens may have the following values of Gender:
Com (4; 67% of non-empty Gender): förhörd, slagen, uppdelad, uppslukad
Neut (2; 33% of non-empty Gender): genomkorsat, sitt
EMPTY (12409): sa, hade, kom, såg, gick, har, ta, göra, se, tog
X
3 X tokens (12% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Case=Nom (3; 100%), Definite=Ind (3; 100%), Number=Sing (3; 100%).
X tokens may have the following values of Gender:
Neut (3; 100% of non-empty Gender): alium, coniunctis, internum
EMPTY (23): W3C, TSQL, foie, maris, stella, .adp, .lpk, .mdb, .odc, .udl
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (4183; 85%),
NOUN –[nmod]–> NOUN (1455; 59%),
NOUN –[conj]–> NOUN (826; 64%),
NOUN –[nmod:poss]–> NOUN (271; 56%),
ADJ –[nsubj]–> PRON (208; 67%),
ADJ –[conj]–> ADJ (158; 72%),
NOUN –[nsubj]–> NOUN (109; 72%),
NOUN –[appos]–> NOUN (95; 71%),
ADJ –[expl]–> PRON (92; 81%),
PRON –[nmod]–> NOUN (71; 56%).
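Agreement counts of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure used to generate the page: each token is represented as a dict with hypothetical keys `id`, `head`, `upos`, `deprel`, and `gender`, and the denominator is assumed to be the number of parent-child pairs in which both nodes carry a Gender value, which the page itself does not specify.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """Count parent-child pairs that agree in Gender, keyed by
    (parent UPOS, relation, child UPOS).

    Returns (agree, both): pairs with equal Gender, and pairs where
    both nodes have a non-empty Gender (the assumed denominator).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree = Counter()
    both = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root token has head 0 and no parent node
            continue
        if not (parent["gender"] and child["gender"]):
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Hypothetical toy sentence "ett barn såg mannen", not taken from the treebank.
toy_sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Neut"},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "gender": "Neut"},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "gender": None},
    {"id": 4, "head": 3, "upos": "NOUN", "deprel": "obj", "gender": "Com"},
]

agree, both = agreement_by_relation(toy_sentence)
for (parent_upos, deprel, child_upos), n in agree.items():
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n} of {both[(parent_upos, deprel, child_upos)]})")
```

Summing the two counters over all sentences and sorting by the agreeing count would give a ranking in the same shape as the list above.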