This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-SweLL: Features: Gender

This feature is universal. It occurs with 2 different values: Com, Neut.

2878 tokens (33%) have a non-empty value of Gender. 1017 types (51%) occur at least once with a non-empty value of Gender. 780 lemmas (50%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags (count; share of all tokens in the treebank): NOUN (1455; 17%), PRON (913; 11%), ADJ (255; 3%), DET (242; 3%), NUM (5; 0%), PROPN (5; 0%), VERB (3; 0%).
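Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `iter_tokens`, `feature_counts_by_upos`) are made up for illustration, and `SAMPLE` is a hand-written fragment rather than data from the treebank.

```python
from collections import Counter

# Hand-made CoNLL-U fragment for illustration only (not taken from the treebank).
SAMPLE = (
    "1\tDet\tden\tPRON\t_\tDefinite=Def|Gender=Neut|Number=Sing|PronType=Prs\t3\texpl\t_\t_\n"
    "2\tär\tvara\tAUX\t_\tMood=Ind|Tense=Pres|VerbForm=Fin\t3\tcop\t_\t_\n"
    "3\tviktigt\tviktig\tADJ\t_\tCase=Nom|Definite=Ind|Degree=Pos|Gender=Neut|Number=Sing\t0\troot\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS column such as 'Case=Nom|Gender=Com' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield the 10 columns of every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols


def feature_counts_by_upos(conllu_text, feature="Gender"):
    """Return (tokens carrying the feature, all tokens), both keyed by UPOS."""
    with_feature, totals = Counter(), Counter()
    for cols in iter_tokens(conllu_text):
        upos = cols[3]
        totals[upos] += 1
        if feature in parse_feats(cols[5]):
            with_feature[upos] += 1
    return with_feature, totals


with_feature, totals = feature_counts_by_upos(SAMPLE)
for upos in sorted(totals):
    print(upos, with_feature[upos], totals[upos])
```

Dividing `with_feature[upos]` by `totals[upos]` gives a per-tag share like the ones reported in the sections below; dividing by the total number of tokens gives a share like the ones in the list above.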

NOUN

1455 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (1426; 98%), Definite=Ind (1068; 73%), Number=Sing (992; 68%).

NOUN tokens may have the following values of Gender:

Paradigm land (columns: Neut, Com)
Case=Gen|Definite=Def|Number=Sing: landets
Case=Nom|Definite=Def|Number=Sing: landet
Case=Nom|Definite=Def|Number=Plur: länderna
Case=Nom|Definite=Def|Number=Plur|Typo=Yes: Landerna
Case=Nom|Definite=Ind|Number=Sing: land
Case=Nom|Definite=Ind|Number=Plur: länder, land

Gender seems to be a lexical feature of NOUN: 98% of lemmas (606) occur with only one value of Gender.
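The "lexical feature" test amounts to checking how many lemmas are ever seen with more than one value. Here is a small sketch of that check, assuming the tokens of one part of speech are available as (lemma, features) pairs; the function name `single_value_share` and the example data are invented for illustration.

```python
from collections import defaultdict


def single_value_share(tokens, feature="Gender"):
    """tokens: iterable of (lemma, feats_dict) pairs for one part of speech.

    Returns (number of lemmas seen with exactly one value, share of such
    lemmas among all lemmas that carry the feature at least once).
    """
    values = defaultdict(set)
    for lemma, feats in tokens:
        if feature in feats:
            values[lemma].add(feats[feature])
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Invented example data: two lemmas with a stable value, one that varies.
example = [
    ("land", {"Gender": "Neut"}),
    ("land", {"Gender": "Neut"}),
    ("bil", {"Gender": "Com"}),
    ("x", {"Gender": "Com"}),
    ("x", {"Gender": "Neut"}),
    ("y", {}),  # no Gender value, so not counted at all
]
count, share = single_value_share(example)
print(count, round(share, 2))
```

A share close to 1, as reported for NOUN here, suggests the value is a property of the lemma rather than of the individual word form.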

PRON

913 PRON tokens (75% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (823; 90%), Number=Sing (812; 89%), Definite=Def (764; 84%), PronType=Prs (760; 83%), Case=Nom (472; 52%).

PRON tokens may have the following values of Gender:

Paradigm jag (columns: Neut, Com)
Case=Acc: mig
Case=Nom: jag
Poss=Yes: mitt (Neut), min (Com)

ADJ

255 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (255; 100%), Number=Sing (255; 100%), Degree=Pos (253; 99%), Definite=Ind (252; 99%).

ADJ tokens may have the following values of Gender:

Paradigm viktig (columns: Neut, Com)
viktigt (Neut), viktig (Com)

DET

242 DET tokens (79% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (242; 100%), PronType=Art (214; 88%), Definite=Ind (168; 69%).

DET tokens may have the following values of Gender:

Paradigm en (columns: Neut, Com)
ett (Neut), en (Com)
Typo=Yes: et

NUM

5 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Definite=Ind (5; 100%), Number=Sing (5; 100%), Case=Nom (4; 80%), NumType=Card (4; 80%).

NUM tokens may have the following values of Gender:

Paradigm en (columns: Neut, Com)
Case=Nom|NumType=Card: ett, en
en

PROPN

5 PROPN tokens (3% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (5; 100%).

PROPN tokens may have the following values of Gender:

VERB

3 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3; 100%), Tense=Past (3; 100%), VerbForm=Part (3; 100%), Voice=Pass (3; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (208; 74%), NOUN –[nmod:poss]–> PRON (81; 58%), NOUN –[conj]–> NOUN (73; 70%), NOUN –[nmod]–> NOUN (63; 53%), ADJ –[nsubj]–> PRON (28; 58%), ADJ –[expl]–> PRON (21; 70%), NOUN –[nmod:poss]–> NOUN (14; 54%), NOUN –[compound]–> NOUN (7; 54%), NOUN –[obl]–> NOUN (7; 54%), NOUN –[appos]–> NOUN (4; 57%).
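Agreement counts like these can be gathered by comparing each child node with its parent. The sketch below is illustrative only: the token dictionaries, the function name `agreement_by_relation`, and the example phrase are assumptions, and the denominator used here (pairs in which both nodes carry the feature) is a guess that may differ from the one behind the percentages above.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Gender"):
    """sentence: list of dicts with keys id, head, upos, deprel, feats.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs whose values match, and all pairs where both nodes have the feature.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 is the artificial root
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue  # agreement is undefined if either node lacks the feature
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return agree, both


# Constructed three-token phrase with one gender mismatch (en + land).
SENTENCE = [
    {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "feats": {"Gender": "Com"}},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Neut"}},
    {"id": 3, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Neut"}},
]
agree, both = agreement_by_relation(SENTENCE)
for key in sorted(both):
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```

Summing the two counters over all sentences of a treebank and sorting by the agreeing count would give a ranking comparable in shape to the list above.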