home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: Features: Gender

This feature is universal. It occurs with 3 different values: Com, Masc, Neut.

29307 tokens (32%) have a non-empty value of Gender. 8382 types (59%) occur at least once with a non-empty value of Gender. 5919 lemmas (59%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (15603; 17% instances), PRON (7455; 8% instances), DET (3996; 4% instances), ADJ (2222; 2% instances), PROPN (25; 0% instances), X (3; 0% instances), VERB (2; 0% instances), NUM (1; 0% instances).

NOUN

15603 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (15154; 97%), Number=Sing (11320; 73%), Definite=Ind (10175; 65%).

NOUN tokens may have the following values of Gender:

Paradigm manNeutCom
Case=Gen|Definite=Def|Number=Singmannens
Case=Gen|Definite=Ind|Number=Singmans
Case=Nom|Definite=Def|Number=Singmannen
Case=Nom|Definite=Def|Number=Plurmännenmännen
Case=Nom|Definite=Ind|Number=Singman
Case=Nom|Definite=Ind|Number=Plurmänmän, man

Gender seems to be lexical feature of NOUN. 98% lemmas (4874) occur only with one value of Gender.

PRON

7455 PRON tokens (69% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (7058; 95%), Poss=EMPTY (6768; 91%), Definite=Def (6569; 88%), PronType=Prs (6503; 87%), Case=Nom (3796; 51%).

PRON tokens may have the following values of Gender:

Paradigm dennaMascNeutCom
dennedettaDenna

DET

3996 DET tokens (84% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3996; 100%), PronType=Art (3490; 87%), Definite=Ind (2794; 70%).

DET tokens may have the following values of Gender:

Paradigm enNeutCom
etten

ADJ

2222 ADJ tokens (36% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2220; 100%), Case=Nom (2219; 100%), Degree=Pos (2154; 97%), Definite=Ind (1912; 86%).

ADJ tokens may have the following values of Gender:

Paradigm annanMascNeutCom
Definite=Defandre
Definite=Indannatannan

PROPN

25 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (23; 92%), Number=Sing (23; 92%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (15) occur only with one value of Gender.

X

3 X tokens (11% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Case=Nom (3; 100%), Definite=Ind (3; 100%), Number=Sing (3; 100%).

X tokens may have the following values of Gender:

VERB

2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=EMPTY (2; 100%), Voice=EMPTY (2; 100%).

VERB tokens may have the following values of Gender:

NUM

1 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (3666; 84%), NOUN –[nmod]–> NOUN (1267; 59%), NOUN –[conj]–> NOUN (730; 63%), NOUN –[nmod:poss]–> PRON (602; 51%), NOUN –[nmod:poss]–> NOUN (229; 52%), ADJ –[nsubj]–> PRON (134; 54%), ADJ –[conj]–> ADJ (80; 63%), NOUN –[appos]–> NOUN (80; 71%), NOUN –[nsubj]–> NOUN (79; 72%), ADJ –[expl]–> PRON (60; 69%).