Treebank Statistics: UD_Swedish-PUD: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
5865 tokens (31%) have a non-empty value of Gender.
3109 types (50%) occur at least once with a non-empty value of Gender.
2474 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (3879; 20% of all tokens), DET (822; 4% of all tokens), PRON (679; 4% of all tokens), ADJ (474; 2% of all tokens), VERB (5; 0% of all tokens), NUM (3; 0% of all tokens), PROPN (3; 0% of all tokens).
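Counts like the ones above can be derived directly from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the function names (`parse_feats`, `read_tokens`, `feature_summary`) and the three-token `SAMPLE` are made up for illustration, and it assumes that word forms are compared case-sensitively when counting types.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Com|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_summary(conllu_text, feature="Gender"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    values, by_upos = Counter(), Counter()
    types_with, lemmas_with = set(), set()
    total = 0
    for form, lemma, upos, feats in read_tokens(conllu_text):
        total += 1
        value = feats.get(feature)
        if value is None:
            continue
        values[value] += 1
        by_upos[upos] += 1
        types_with.add(form)      # assumption: types are case-sensitive forms
        lemmas_with.add(lemma)
    return {
        "total_tokens": total,
        "tokens_with_feature": sum(values.values()),
        "values": dict(values),
        "by_upos": dict(by_upos),
        "types_with_feature": len(types_with),
        "lemmas_with_feature": len(lemmas_with),
    }


# Made-up three-token sentence ("Ett barn sover"), for illustration only.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "Ett", "en", "DET", "_", "Definite=Ind|Gender=Neut|Number=Sing|PronType=Art", "2", "det", "_", "_"],
    ["2", "barn", "barn", "NOUN", "_", "Case=Nom|Definite=Ind|Gender=Neut|Number=Sing", "3", "nsubj", "_", "_"],
    ["3", "sover", "sova", "VERB", "_", "Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act", "0", "root", "_", "_"],
])

summary = feature_summary(SAMPLE)
```

To run this on a real treebank, read the `.conllu` file into a string and pass it to `feature_summary`; the percentages on this page would then correspond to ratios such as `tokens_with_feature / total_tokens`.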
NOUN
3879 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (3746; 97%), Number=Sing (2750; 71%), Definite=Ind (2521; 65%).
NOUN tokens may have the following values of Gender:
Com (2804; 72% of non-empty Gender): personer, miljoner, grund, oktober, världen, del, delen, tiden, plats, dollar
Neut (1075; 28% of non-empty Gender): år, havet, fall, kriget, liv, antal, barn, åren, land, slutet
EMPTY (159): %, f.Kr., University, Ms, md, mr, morse, school, Association, Bank
| Paradigm val | Neut | Com |
|---|---|---|
| Definite=Def\|Number=Sing | valet | |
| Definite=Ind\|Number=Sing | val | val |
| Definite=Ind\|Number=Plur | val | |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (2121) occur with only one value of Gender.
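The "lexical feature" observation rests on how many lemmas of a given part of speech ever appear with more than one Gender value. A minimal sketch of that check follows. The function name `single_value_share` and the `observations` list are hypothetical; the input is assumed to be (lemma, UPOS, Gender) triples already extracted from the treebank, and the sample rows are invented for illustration, apart from reusing the lemma val, which the paradigm table shows with both values.

```python
from collections import defaultdict


def single_value_share(observations, upos, missing=None):
    """Return (lemmas with exactly one value, lemmas with any value, share).

    `observations` is an iterable of (lemma, upos_tag, gender) triples;
    triples whose gender equals `missing` are ignored.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, gender in observations:
        if tag == upos and gender != missing:
            values_by_lemma[lemma].add(gender)
    total = len(values_by_lemma)
    if total == 0:
        return 0, 0, 0.0
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, total, single / total


# Invented observations; only the shape of the data matters here.
observations = [
    ("år", "NOUN", "Neut"),
    ("år", "NOUN", "Neut"),
    ("val", "NOUN", "Neut"),
    ("val", "NOUN", "Com"),   # same lemma seen with a second value
    ("del", "NOUN", "Com"),
    ("stor", "ADJ", "Com"),
    ("stor", "ADJ", "Neut"),
    ("%", "NOUN", None),      # token without Gender is skipped
]

single, total, share = single_value_share(observations, "NOUN")
```

A share that rounds to 100% can still hide a few lemmas with two values, which is consistent with a paradigm such as val listing forms under both Neut and Com.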
DET
822 DET tokens (81% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (822; 100%), PronType=Art (745; 91%), Definite=Ind (476; 58%).
DET tokens may have the following values of Gender:
Com (577; 70% of non-empty Gender): en, den, denna, någon, ingen, all, ett, vilken
Neut (245; 30% of non-empty Gender): ett, det, detta, något, inget, vilket, allt
EMPTY (194): de, varje, dessa, alla, några, the, båda, a, inga, Die
| Paradigm en | Neut | Com |
|---|---|---|
| | ett | en, ett |
PRON
679 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (635; 94%), Definite=Def (608; 90%), PronType=Prs (604; 89%), Poss=EMPTY (602; 89%), Case=EMPTY (398; 59%).
PRON tokens may have the following values of Gender:
Com (406; 60% of non-empty Gender): han, jag, sin, den, hon, vi, honom, en, du, henne
Neut (273; 40% of non-empty Gender): det, detta, sitt, vilket, ett, allt, vårt, inget, något, alltihop
EMPTY (659): som, de, sig, hans, sina, dess, deras, hennes, vad, mer
| Paradigm den | Neut | Com |
|---|---|---|
| PronType=Dem | det | den |
| PronType=Prs | det | den |
ADJ
474 ADJ tokens (30% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (474; 100%), Degree=Pos (472; 100%), Number=Sing (472; 100%), Definite=Ind (453; 96%), Tense=EMPTY (394; 83%), VerbForm=EMPTY (394; 83%).
ADJ tokens may have the following values of Gender:
Com (331; 70% of non-empty Gender): stor, lång, egen, ensam, hög, liten, modern, politisk, ekonomisk, ny
Neut (143; 30% of non-empty Gender): annat, nytt, otroligt, sett, öppet, allmänt, möjligt, dåligt, eget, klart
EMPTY (1091): andra, första, nya, många, flera, stora, hela, senaste, samma, sista
| Paradigm stor | Neut | Com |
|---|---|---|
| | stort | stor |
Gender seems to be a lexical feature of ADJ. 91% of lemmas (296) occur with only one value of Gender.
VERB
5 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (5; 100%), Tense=Past (5; 100%), VerbForm=Part (5; 100%), Voice=Pass (5; 100%).
VERB tokens may have the following values of Gender:
Com (5; 100% of non-empty Gender): avskedad, besegrad, filmad, förbluffad, intervjuad
EMPTY (1966): har, sade, finns, säger, började, ha, hade, blev, få, göra
NUM
3 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (2; 67%).
NUM tokens may have the following values of Gender:
Com (2; 67% of non-empty Gender): en
Neut (1; 33% of non-empty Gender): ett
EMPTY (399): två, tre, 1, fyra, sex, 10, tio, 000, 2014, 2015
PROPN
3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (2; 67%).
PROPN tokens may have the following values of Gender:
Com (2; 67% of non-empty Gender): Karels, låglandseuropa
Neut (1; 33% of non-empty Gender): Panamanäset
EMPTY (1213): Kina, Storbritannien, Trump, USA, Frankrike, Hong, Italien, North, Medelhavet, Albanien
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (784; 83%),
NOUN –[nmod]–> NOUN (386; 58%),
NOUN –[conj]–> NOUN (141; 60%),
NOUN –[nmod:poss]–> NOUN (61; 55%),
NOUN –[nsubj]–> NOUN (32; 54%),
ADJ –[nsubj]–> NOUN (30; 52%),
NOUN –[appos]–> NOUN (22; 76%),
ADJ –[nsubj]–> PRON (21; 62%),
NOUN –[obl]–> NOUN (20; 56%),
ADJ –[expl]–> PRON (12; 80%).
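Agreement figures of this kind can be approximated by walking each dependency tree and comparing the Gender value of every child with that of its parent. The sketch below is one possible reading, not the method behind this page: the source does not state the denominator of the percentages, so the code assumes they are taken over parent-child pairs in which both nodes have a non-empty Gender. The function name `agreement_counts`, the token dictionaries and the two toy sentences (the second one deliberately mismatched) are all hypothetical.

```python
from collections import Counter


def agreement_counts(sentences, feature="Gender"):
    """Count parent-child pairs that share a value of `feature`.

    Each sentence is a list of dicts with keys: id, head, upos, deprel, feats.
    Returns (agree, both), two Counters keyed by
    (parent_upos, deprel, child_upos). `both` counts pairs where parent and
    child each have a non-empty value; `agree` counts those with equal values.
    """
    agree, both = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root token (head 0) has no parent node
            p_val = parent["feats"].get(feature)
            c_val = child["feats"].get(feature)
            if p_val is None or c_val is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if p_val == c_val:
                agree[key] += 1
    return agree, both


# Toy input: one agreeing and one (artificially) disagreeing det relation.
sentences = [
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "feats": {"Gender": "Neut"}},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Neut"}},
    ],
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "feats": {"Gender": "Com"}},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Neut"}},
    ],
]

agree, both = agreement_counts(sentences)
for key in both:
    parent_upos, deprel, child_upos = key
    share = agree[key] / both[key]
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {share:.0%})")
```

Keeping `agree` and `both` as separate counters makes it easy to swap in a different denominator, for example all relations of a given type, should that turn out to be the definition used for the percentages above.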