Treebank Statistics: UD_Swedish-SweLL: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
2878 tokens (33%) have a non-empty value of Gender.
1017 types (51%) occur at least once with a non-empty value of Gender.
780 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (1455; 17% instances), PRON (913; 11% instances), ADJ (255; 3% instances), DET (242; 3% instances), NUM (5; 0% instances), PROPN (5; 0% instances), VERB (3; 0% instances).
NOUN
1455 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (1426; 98%), Definite=Ind (1068; 73%), Number=Sing (992; 68%).
NOUN tokens may have the following values of Gender:
Com(1021; 70% of non-emptyGender): människor, kläder, pengar, plats, familj, kärlek, saker, tid, världen, bokenNeut(434; 30% of non-emptyGender): språk, barn, sätt, år, språket, land, samhället, liv, jobb, exempelEMPTY(13): hand, 200m, Forsknings, Jeans, ex, hit, kilo, refunds, seafood, sommras
| Paradigm land | Neut | Com |
|---|---|---|
| Case=Gen|Definite=Def|Number=Sing | landets | |
| Case=Nom|Definite=Def|Number=Sing | landet | |
| Case=Nom|Definite=Def|Number=Plur | länderna | |
| Case=Nom|Definite=Def|Number=Plur|Typo=Yes | Landerna | |
| Case=Nom|Definite=Ind|Number=Sing | land | |
| Case=Nom|Definite=Ind|Number=Plur | länder, land |
Gender seems to be lexical feature of NOUN. 98% lemmas (606) occur only with one value of Gender.
PRON
913 PRON tokens (75% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (823; 90%), Number=Sing (812; 89%), Definite=Def (764; 84%), PronType=Prs (760; 83%), Case=Nom (472; 52%).
PRON tokens may have the following values of Gender:
Com(631; 69% of non-emptyGender): jag, man, vi, mig, du, min, oss, han, sin, honNeut(282; 31% of non-emptyGender): det, vad, mitt, sitt, vilket, detta, allt, ditt, någonting, annatEMPTY(299): som, de, sig, sina, mina, andra, dem, alla, varandra, deras
| Paradigm jag | Neut | Com |
|---|---|---|
| Case=Acc | mig | |
| Case=Nom | jag | |
| Poss=Yes | mitt | min |
ADJ
255 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (255; 100%), Number=Sing (255; 100%), Degree=Pos (253; 99%), Definite=Ind (252; 99%).
ADJ tokens may have the following values of Gender:
Com(159; 62% of non-emptyGender): stor, själv, viktig, annan, lång, glad, ny, ensam, fin, gammalNeut(96; 38% of non-emptyGender): viktigt, svårt, nytt, dyrt, allmänmänskligt, eget, jämfört, jätte, möjligt, svensktEMPTY(439): många, olika, bra, nya, andra, bästa, bättre, flesta, första, mer
| Paradigm viktig | Neut | Com |
|---|---|---|
| viktigt | viktig |
DET
242 DET tokens (79% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (242; 100%), PronType=Art (214; 88%), Definite=Ind (168; 69%).
DET tokens may have the following values of Gender:
Com(170; 70% of non-emptyGender): en, den, ingen, vilken, denna, allNeut(72; 30% of non-emptyGender): ett, det, detta, inget, allt, et, någotEMPTY(64): de, varje, alla, några, dessa, vilka, inga, dem, no, varj
| Paradigm en | Neut | Com |
|---|---|---|
| ett | en | |
| Typo=Yes | et |
NUM
5 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Definite=Ind (5; 100%), Number=Sing (5; 100%), Case=Nom (4; 80%), NumType=Card (4; 80%).
NUM tokens may have the following values of Gender:
Com(1; 20% of non-emptyGender): enNeut(4; 80% of non-emptyGender): ett, enEMPTY(44): två, 1, 18, tre, 2, 4, fyra, 1-12, 10, 16
| Paradigm en | Neut | Com |
|---|---|---|
| Case=Nom|NumType=Card | ett, en | |
| en |
PROPN
5 PROPN tokens (3% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (5; 100%).
PROPN tokens may have the following values of Gender:
Com(2; 40% of non-emptyGender): Haga, SegerstadNeut(3; 60% of non-emptyGender): Mongoliet, LinsbiblioteketEMPTY(158): Sverige, Bagdad, Finland, Sund, Caracas, Haga, Paris, Peru, Sara, Segerstad
VERB
3 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3; 100%), Tense=Past (3; 100%), VerbForm=Part (3; 100%), Voice=Pass (3; 100%).
VERB tokens may have the following values of Gender:
Com(3; 100% of non-emptyGender): dömd, fylled, utsattEMPTY(1157): har, tycker, finns, kommer, ha, lära, blir, bor, göra, köpa
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (208; 74%),
NOUN –[nmod:poss]–> PRON (81; 58%),
NOUN –[conj]–> NOUN (73; 70%),
NOUN –[nmod]–> NOUN (63; 53%),
ADJ –[nsubj]–> PRON (28; 58%),
ADJ –[expl]–> PRON (21; 70%),
NOUN –[nmod:poss]–> NOUN (14; 54%),
NOUN –[compound]–> NOUN (7; 54%),
NOUN –[obl]–> NOUN (7; 54%),
NOUN –[appos]–> NOUN (4; 57%).