Treebank Statistics: UD_Swedish-SweLL: Features: Gender
This feature is universal.
It occurs with 3 different values: Com, Fem, Neut.
3627 tokens (33%) have a non-empty value of Gender.
1162 types (52%) occur at least once with a non-empty value of Gender.
817 lemmas (51%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (1798; 17% of instances), PRON (1168; 11% of instances), ADJ (326; 3% of instances), DET (320; 3% of instances), PROPN (7; 0% of instances), NUM (5; 0% of instances), VERB (3; 0% of instances).
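Counts of this kind can be recomputed from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page. It assumes the standard ten-column CoNLL-U layout (UPOS in column 4, FEATS in column 6). The helper names `parse_feats` and `gender_counts_by_upos` and the three-token sample are hypothetical and serve only as illustration.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Com' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Count tokens per UPOS tag and, among them, tokens with a non-empty Gender."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender


# Toy sample invented for illustration; not taken from the treebank.
SAMPLE = "\n".join([
    "# text = ett land",
    "1\tett\ten\tDET\t_\tDefinite=Ind|Gender=Neut|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tland\tland\tNOUN\t_\tCase=Nom|Definite=Ind|Gender=Neut|Number=Sing\t0\troot\t_\t_",
    "",
    "# text = de",
    "1\tde\tde\tPRON\t_\tCase=Nom|Definite=Def|Number=Plur|PronType=Prs\t0\troot\t_\t_",
])

total, with_gender = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Applied to the full treebank file, the two counters would give, for each tag, the number of tokens with Gender and the total number of tokens of that tag.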
NOUN
1798 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (1766; 98%), Definite=Ind (1314; 73%), Number=Sing (1219; 68%).
NOUN tokens may have the following values of Gender:
Com (1268; 71% of non-empty Gender): kläder, människor, pengar, plats, saker, tid, kärlek, lycka, familj, kläderna
Fem (1; 0% of non-empty Gender): temperature
Neut (529; 29% of non-empty Gender): språk, barn, sätt, år, land, samhället, språket, jobb, exempel, liv
EMPTY (20): hand, 200m, C, Forsknings, Jeans, crush, ex, hit, kilo, kr
| Paradigm land | Neut | Com |
|---|---|---|
| Case=Gen\|Definite=Def\|Number=Sing | landets | |
| Case=Nom\|Definite=Def\|Number=Sing | landet | |
| Case=Nom\|Definite=Def\|Number=Plur | länderna | |
| Case=Nom\|Definite=Def\|Number=Plur\|Typo=Yes | Landerna | |
| Case=Nom\|Definite=Ind\|Number=Sing | land | |
| Case=Nom\|Definite=Ind\|Number=Sing\|Typo=Yes | länd | |
| Case=Nom\|Definite=Ind\|Number=Plur | länder, land |
Gender seems to be a lexical feature of NOUN: 97% of lemmas (622) occur with only one value of Gender.
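The lexical-feature check amounts to collecting, for each lemma, the set of Gender values it occurs with, and counting the lemmas whose set has exactly one member. A minimal sketch, assuming the input is already reduced to (lemma, Gender) pairs for NOUN tokens with a non-empty Gender; the function name `single_gender_share` and the toy pairs are hypothetical.

```python
from collections import defaultdict


def single_gender_share(tokens):
    """tokens: iterable of (lemma, gender) pairs with non-empty gender.

    Returns (lemmas seen with exactly one Gender value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, gender in tokens:
        values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)


# Toy pairs invented for illustration; these are not counts from the treebank.
pairs = [
    ("land", "Neut"), ("land", "Neut"), ("land", "Com"),
    ("språk", "Neut"),
    ("tid", "Com"), ("tid", "Com"),
]
single, total = single_gender_share(pairs)
print(f"{single} of {total} lemmas ({100 * single / total:.0f}%) have one Gender value")
```

A lemma such as land, which the paradigm table lists under both Neut and Com, falls outside the single-value group.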
PRON
1168 PRON tokens (75% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (1062; 91%), Number=Sing (1045; 89%), Definite=Def (957; 82%), PronType=Prs (953; 82%), Case=Nom (617; 53%).
PRON tokens may have the following values of Gender:
Com (812; 70% of non-empty Gender): jag, man, vi, mig, du, min, oss, han, hon, sin
Neut (356; 30% of non-empty Gender): det, vad, mitt, sitt, vilket, detta, allt, ditt, någonting, vårt
EMPTY (394): som, sig, de, sina, mina, andra, dem, deras, varandra, alla
| Paradigm jag | Neut | Com |
|---|---|---|
| Case=Acc | | mig |
| Case=Nom | | jag |
| Poss=Yes | mitt | min |
ADJ
326 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (326; 100%), Number=Sing (324; 99%), Definite=Ind (323; 99%), Degree=Pos (322; 99%).
ADJ tokens may have the following values of Gender:
Com (206; 63% of non-empty Gender): själv, stor, viktig, annan, glad, lång, ny, gammal, lycklig, rik
Neut (120; 37% of non-empty Gender): viktigt, svårt, nytt, annat, dyrt, allmänmänskligt, eget, gammalt, gott, jämfört
EMPTY (551): olika, många, bra, andra, nya, bättre, första, bästa, flesta, viktigaste
| Paradigm viktig | Neut | Com |
|---|---|---|
| | viktigt | viktig |
DET
320 DET tokens (80% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (320; 100%), PronType=Art (289; 90%), Definite=Ind (224; 70%).
DET tokens may have the following values of Gender:
Com (216; 68% of non-empty Gender): en, den, vilken, ingen, denna, all, någon
Neut (104; 33% of non-empty Gender): ett, det, detta, inget, allt, et, något
EMPTY (79): de, varje, alla, några, dessa, vilka, inga, dem, no, varj
| Paradigm en | Neut | Com |
|---|---|---|
| | ett | en |
| Typo=Yes | et | |
PROPN
7 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (7; 100%).
PROPN tokens may have the following values of Gender:
Com (2; 29% of non-empty Gender): Haga, Segerstad
Neut (5; 71% of non-empty Gender): Linsbiblioteket, Mongoliet, Bungahjulet
EMPTY (169): sverige, Bagdad, Finland, Sund, Anna, Caracas, Danmark, Haga, Karin, Paris
NUM
5 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (4; 80%), NumType=Card (4; 80%).
NUM tokens may have the following values of Gender:
Com (1; 20% of non-empty Gender): en
Neut (4; 80% of non-empty Gender): ett, en
EMPTY (50): två, 18, 1, tre, fyra, 2, 25, 4, 50, 1-12
| Paradigm en | Neut | Com |
|---|---|---|
| Case=Nom\|NumType=Card | ett, en | |
| | | en |
VERB
3 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3; 100%), Tense=Past (3; 100%), VerbForm=Part (3; 100%), Voice=Pass (3; 100%).
VERB tokens may have the following values of Gender:
Com (3; 100% of non-empty Gender): dömd, fylled, utsatt
EMPTY (1457): har, tycker, finns, ha, kommer, lära, behöver, blir, ta, bor
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (270; 74%),
NOUN –[nmod:poss]–> PRON (94; 54%),
NOUN –[conj]–> NOUN (88; 68%),
NOUN –[nmod]–> NOUN (74; 51%),
ADJ –[nsubj]–> PRON (37; 51%),
ADJ –[expl]–> PRON (27; 68%),
NOUN –[nmod:poss]–> NOUN (15; 56%),
NOUN –[obl]–> NOUN (10; 63%),
NOUN –[compound]–> NOUN (9; 60%),
NOUN –[conj]–> PRON (4; 57%).
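
Agreement counts like these can be gathered by pairing every token with its head and comparing their Gender values. The sketch below is a hedged illustration, not the procedure behind this page. It assumes the percentage is the share of agreeing pairs among pairs where both nodes carry Gender, a denominator this page does not state. The function names, the token tuple layout, and the toy sentences are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict ('_' means no features)."""
    return {} if feats == "_" else dict(p.split("=", 1) for p in feats.split("|"))


def gender_agreement(sentences):
    """sentences: list of sentences, each a list of (id, upos, feats, head, deprel).

    Returns {(parent_upos, deprel, child_upos): (agreeing, comparable)}, where
    'comparable' counts pairs in which both nodes have a Gender value
    (an assumed denominator; see the note above the code).
    """
    agree, comparable = Counter(), Counter()
    for sent in sentences:
        by_id = {tok[0]: tok for tok in sent}
        for _tid, upos, feats, head, deprel in sent:
            parent = by_id.get(head)
            if parent is None:  # the root (head 0) has no parent token
                continue
            child_gender = parse_feats(feats).get("Gender")
            parent_gender = parse_feats(parent[2]).get("Gender")
            if child_gender is None or parent_gender is None:
                continue
            key = (parent[1], deprel, upos)
            comparable[key] += 1
            if child_gender == parent_gender:
                agree[key] += 1
    return {key: (agree[key], n) for key, n in comparable.items()}


# Toy sentences invented for illustration; not taken from the treebank.
sample = [
    [
        (1, "DET", "Definite=Ind|Gender=Neut|Number=Sing|PronType=Art", 2, "det"),
        (2, "NOUN", "Case=Nom|Definite=Ind|Gender=Neut|Number=Sing", 0, "root"),
    ],
    [
        (1, "DET", "Definite=Ind|Gender=Com|Number=Sing|PronType=Art", 2, "det"),
        (2, "NOUN", "Case=Nom|Definite=Ind|Gender=Neut|Number=Sing", 0, "root"),
    ],
]
for (parent, rel, child), (a, n) in gender_agreement(sample).items():
    print(f"{parent} -[{rel}]-> {child} ({a}; {100 * a / n:.0f}%)")
```

The second toy sentence pairs a Com determiner with a Neut noun, so it counts as comparable but not agreeing.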