home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: Features: Gender

This feature is universal. It occurs with 2 different values: Com, Neut.

33932 tokens (33%) have a non-empty value of Gender. 9298 types (60%) occur at least once with a non-empty value of Gender. 6520 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (17665; 17% instances), PRON (8847; 9% instances), DET (4528; 4% instances), ADJ (2840; 3% instances), PROPN (34; 0% instances), NUM (9; 0% instances), VERB (6; 0% instances), X (3; 0% instances).

NOUN

17665 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (17114; 97%), Number=Sing (12907; 73%), Definite=Ind (11539; 65%).

NOUN tokens may have the following values of Gender:

Paradigm manNeutCom
Case=Gen|Definite=Def|Number=Singmannens
Case=Gen|Definite=Ind|Number=Singmans
Case=Nom|Definite=Def|Number=Singmannen
Case=Nom|Definite=Def|Number=Plurmännenmännen
Case=Nom|Definite=Ind|Number=Singman
Case=Nom|Definite=Ind|Number=Plurmänmän, man

Gender seems to be lexical feature of NOUN. 97% lemmas (5252) occur only with one value of Gender.

PRON

8847 PRON tokens (71% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (8416; 95%), Poss=EMPTY (8076; 91%), Definite=Def (7716; 87%), PronType=Prs (7634; 86%), Case=Nom (4526; 51%).

PRON tokens may have the following values of Gender:

Paradigm denNeutCom
Case=Nom|Definite=Def|PronType=Prsden
Definite=Def|ExtPos=ADV|PronType=PrsDet
Definite=Def|PronType=Artdet
Definite=Def|PronType=DemDetden
Definite=Def|PronType=Prsdetden
Definite=Ind|PronType=Prsdet

DET

4528 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4528; 100%), PronType=Art (3970; 88%), Definite=Ind (3177; 70%).

DET tokens may have the following values of Gender:

Paradigm enNeutCom
etten

ADJ

2840 ADJ tokens (40% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2838; 100%), Case=Nom (2832; 100%), Definite=Ind (2804; 99%), Degree=Pos (2796; 98%), Tense=EMPTY (2561; 90%), VerbForm=EMPTY (2551; 90%).

ADJ tokens may have the following values of Gender:

Paradigm annanNeutCom
annatannan

PROPN

34 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (32; 94%), Number=Sing (32; 94%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (20) occur only with one value of Gender.

NUM

9 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm enNeutCom
Definite=Indetten
NumType=CardEtt

VERB

6 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6; 100%), Voice=EMPTY (5; 83%), Tense=Past (4; 67%), VerbForm=Part (4; 67%).

VERB tokens may have the following values of Gender:

X

3 X tokens (12% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Case=Nom (3; 100%), Definite=Ind (3; 100%), Number=Sing (3; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4183; 85%), NOUN –[nmod]–> NOUN (1455; 59%), NOUN –[conj]–> NOUN (826; 64%), NOUN –[nmod:poss]–> NOUN (271; 56%), ADJ –[nsubj]–> PRON (208; 67%), ADJ –[conj]–> ADJ (158; 72%), NOUN –[nsubj]–> NOUN (109; 72%), NOUN –[appos]–> NOUN (95; 71%), ADJ –[expl]–> PRON (92; 81%), PRON –[nmod]–> NOUN (71; 56%).