Treebank Statistics: UD_Swedish-PUD: Features: Gender
This feature is universal.
It occurs with 3 different values: Com, Masc, Neut.
5929 tokens (31%) have a non-empty value of Gender.
3136 types (51%) occur at least once with a non-empty value of Gender.
2502 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (3881; 20% instances), DET (821; 4% instances), PRON (708; 4% instances), ADJ (511; 3% instances), NUM (3; 0% instances), PROPN (3; 0% instances), VERB (2; 0% instances).
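Per-tag counts of this kind can be recomputed from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `gender_by_upos` are hypothetical, and the four-token sample is a made-up fragment rather than data from UD_Swedish-PUD. It only assumes the standard ten tab-separated CoNLL-U columns, with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Swedish-PUD).
SAMPLE = """\
1\ten\ten\tDET\t_\tDefinite=Ind|Gender=Com|Number=Sing\t2\tdet\t_\t_
2\tdel\tdel\tNOUN\t_\tCase=Nom|Definite=Ind|Gender=Com|Number=Sing\t0\troot\t_\t_
3\tav\tav\tADP\t_\t_\t4\tcase\t_\t_
4\thavet\thav\tNOUN\t_\tCase=Nom|Definite=Def|Gender=Neut|Number=Sing\t2\tnmod\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Com|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Gender."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender


total, with_gender = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

To run this on a real treebank, the text of a `.conllu` file can be passed in place of `SAMPLE`.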
NOUN
3881 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (3748; 97%), Number=Sing (2752; 71%), Definite=Ind (2523; 65%).
NOUN tokens may have the following values of Gender:
Com (2807; 72% of non-empty Gender): personer, miljoner, grund, oktober, världen, del, delen, tiden, plats, dollar
Neut (1074; 28% of non-empty Gender): år, havet, fall, kriget, liv, antal, barn, åren, land, slutet
EMPTY (154): %, f.Kr., University, Ms, md, mr, morse, school, Association, Bank
| Paradigm val | Neut | Com |
|---|---|---|
| Definite=Def\|Number=Sing | valet | |
| Definite=Ind\|Number=Sing | val | val |
| Definite=Ind\|Number=Plur | | val |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (2117) occur with only one value of Gender.
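The "lexical feature" check can be illustrated as follows: collect the set of Gender values seen with each lemma and report the share of lemmas that have exactly one. This is a sketch under assumptions, not the page's actual procedure; `single_valued_share` is a hypothetical helper and the input pairs are toy data. The lemma val is included with both values because the paradigm table lists it under Neut and Com, which suggests the 100% figure is rounded.

```python
from collections import defaultdict


def single_valued_share(pairs):
    """Return (number of lemmas, share of lemmas seen with exactly one Gender).

    `pairs` is an iterable of (lemma, gender) for tokens whose Gender is non-empty.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    if not values:
        return 0, 0.0  # avoid dividing by zero on empty input
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)


# Hypothetical toy input: two single-valued lemmas and one with two values.
pairs = [
    ("år", "Neut"),
    ("år", "Neut"),
    ("del", "Com"),
    ("val", "Neut"),
    ("val", "Com"),
]
n_lemmas, share = single_valued_share(pairs)
print(f"{n_lemmas} lemmas, {share:.0%} with a single Gender value")
```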
DET
821 DET tokens (80% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (820; 100%), Definite=Ind (475; 58%).
DET tokens may have the following values of Gender:
Com (578; 70% of non-empty Gender): en, den, denna, någon, ingen, all, det, ett, vilken
Neut (243; 30% of non-empty Gender): ett, det, detta, något, inget, vilket, De, allt
EMPTY (200): de, varje, dessa, alla, några, samma, the, a, inga, Die
| Paradigm en | Neut | Com |
|---|---|---|
| Definite=Def\|PronType=Art | det | |
| Definite=Ind | ett | en, ett |
| Definite=Ind\|PronType=Art | | en |
PRON
708 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (664; 94%), PronType=EMPTY (660; 93%), Poss=EMPTY (631; 89%), Definite=Def (609; 86%), Case=EMPTY (427; 60%).
PRON tokens may have the following values of Gender:
Com (406; 57% of non-empty Gender): han, jag, sin, hon, den, vi, honom, en, du, henne
Neut (302; 43% of non-empty Gender): det, detta, sitt, vad, vilket, ett, mycket, allt, vårt, allting
EMPTY (610): som, de, sig, hans, sina, dess, deras, hennes, dem, vilka
| Paradigm sin | Neut | Com |
|---|---|---|
| | sitt | sin |
ADJ
511 ADJ tokens (33% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (511; 100%), Number=Sing (511; 100%), Definite=Ind (481; 94%), Tense=EMPTY (411; 80%), VerbForm=EMPTY (411; 80%), Degree=Pos (408; 80%).
ADJ tokens may have the following values of Gender:
Com (331; 65% of non-empty Gender): stor, lång, egen, ensam, hög, liten, modern, politisk, direkt, ekonomisk
Masc (22; 4% of non-empty Gender): egyptiske, misstänkte, Simple, anglikanske, belgiske, brittiske, demokratiske, dominikanske, högste, kanadensiske
Neut (158; 31% of non-empty Gender): annat, nytt, otroligt, sett, öppet, allmänt, möjligt, stort, dåligt, eget
EMPTY (1049): andra, första, nya, många, flera, stora, hela, senaste, sista, brittiska
| Paradigm ny | Masc | Neut | Com |
|---|---|---|---|
| Definite=Def | nye | | |
| Definite=Ind | | nytt | ny |
NUM
3 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (2; 67%).
NUM tokens may have the following values of Gender:
Com (2; 67% of non-empty Gender): en
Neut (1; 33% of non-empty Gender): ett
EMPTY (399): två, tre, 1, fyra, sex, 10, tio, 000, 2014, 2015
PROPN
3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (2; 67%).
PROPN tokens may have the following values of Gender:
Com (2; 67% of non-empty Gender): Karels, låglandseuropa
Neut (1; 33% of non-empty Gender): Panamanäset
EMPTY (1212): Kina, Storbritannien, Trump, USA, Frankrike, Hong, Italien, North, Medelhavet, Albanien
VERB
2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2; 100%), Tense=Past (2; 100%), VerbForm=Part (2; 100%), Voice=EMPTY (2; 100%).
VERB tokens may have the following values of Gender:
Com (2; 100% of non-empty Gender): förlorad, tvungen
EMPTY (1966): har, sade, finns, säger, började, ha, hade, blev, få, göra
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (777; 82%),
NOUN –[nmod]–> NOUN (374; 58%),
NOUN –[conj]–> NOUN (142; 60%),
NOUN –[nmod:poss]–> NOUN (61; 55%),
NOUN –[nsubj]–> NOUN (33; 54%),
ADJ –[nsubj]–> NOUN (31; 53%),
NOUN –[appos]–> NOUN (23; 77%),
ADJ –[nsubj]–> PRON (22; 61%),
NOUN –[obl]–> NOUN (21; 57%),
PRON –[nmod]–> NOUN (20; 69%).
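An agreement count of this shape can be sketched as below. The page does not state how its percentages are computed, so the denominator used here (parent and child pairs where both nodes carry a Gender value) is an assumption. The function `agreement_counts` and the toy sentence are hypothetical as well.

```python
from collections import Counter

# Hypothetical toy sentence: (id, upos, gender or None, head id, deprel).
TOKENS = [
    (1, "DET", "Com", 2, "det"),
    (2, "NOUN", "Com", 0, "root"),
    (3, "ADP", None, 4, "case"),
    (4, "NOUN", "Neut", 2, "nmod"),
]


def agreement_counts(tokens):
    """Count agreeing pairs and all pairs with Gender on both nodes.

    Keys are (parent UPOS, relation, child UPOS).
    """
    by_id = {tok[0]: tok for tok in tokens}
    agree, both = Counter(), Counter()
    for _tid, upos, gender, head, deprel in tokens:
        parent = by_id.get(head)  # None for the root, whose head id is 0
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return agree, both


agree, both = agreement_counts(TOKENS)
for (parent_upos, rel, child_upos), n in agree.most_common(10):
    pct = 100 * n / both[(parent_upos, rel, child_upos)]
    print(f"{parent_upos} –[{rel}]–> {child_upos} ({n}; {pct:.0f}%)")
```

For a whole treebank, the two counters would be accumulated sentence by sentence, since token ids restart in every sentence.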