home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Com, Masc, Neut.

5929 tokens (31%) have a non-empty value of Gender. 3136 types (51%) occur at least once with a non-empty value of Gender. 2502 lemmas (50%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (3881; 20% instances), DET (821; 4% instances), PRON (708; 4% instances), ADJ (511; 3% instances), NUM (3; 0% instances), PROPN (3; 0% instances), VERB (2; 0% instances).

NOUN

3881 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (3748; 97%), Number=Sing (2752; 71%), Definite=Ind (2523; 65%).

NOUN tokens may have the following values of Gender:

Paradigm valNeutCom
Definite=Def|Number=Singvalet
Definite=Ind|Number=Singvalval
Definite=Ind|Number=Plurval

Gender seems to be lexical feature of NOUN. 100% lemmas (2117) occur only with one value of Gender.

DET

821 DET tokens (80% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (820; 100%), Definite=Ind (475; 58%).

DET tokens may have the following values of Gender:

Paradigm enNeutCom
Definite=Def|PronType=Artdet
Definite=Indetten, ett
Definite=Ind|PronType=Arten

PRON

708 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (664; 94%), PronType=EMPTY (660; 93%), Poss=EMPTY (631; 89%), Definite=Def (609; 86%), Case=EMPTY (427; 60%).

PRON tokens may have the following values of Gender:

Paradigm sinNeutCom
sittsin

ADJ

511 ADJ tokens (33% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (511; 100%), Number=Sing (511; 100%), Definite=Ind (481; 94%), Tense=EMPTY (411; 80%), VerbForm=EMPTY (411; 80%), Degree=Pos (408; 80%).

ADJ tokens may have the following values of Gender:

Paradigm nyMascNeutCom
Definite=Defnye
Definite=Indnyttny

NUM

3 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (2; 67%).

NUM tokens may have the following values of Gender:

PROPN

3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (2; 67%).

PROPN tokens may have the following values of Gender:

VERB

2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2; 100%), Tense=Past (2; 100%), VerbForm=Part (2; 100%), Voice=EMPTY (2; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (777; 82%), NOUN –[nmod]–> NOUN (374; 58%), NOUN –[conj]–> NOUN (142; 60%), NOUN –[nmod:poss]–> NOUN (61; 55%), NOUN –[nsubj]–> NOUN (33; 54%), ADJ –[nsubj]–> NOUN (31; 53%), NOUN –[appos]–> NOUN (23; 77%), ADJ –[nsubj]–> PRON (22; 61%), NOUN –[obl]–> NOUN (21; 57%), PRON –[nmod]–> NOUN (20; 69%).