home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-HSE: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

2232 tokens (42%) have a non-empty value of Gender. 1636 types (70%) occur at least once with a non-empty value of Gender. 1085 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (1102; 21% instances), ADJ (399; 7% instances), PROPN (300; 6% instances), VERB (178; 3% instances), PRON (145; 3% instances), DET (91; 2% instances), NUM (14; 0% instances), AUX (3; 0% instances).

NOUN

1102 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (714; 65%).

NOUN tokens may have the following values of Gender:

Paradigm mąstyklaMascFem
Case=Accmąstyklą
Case=Genmąstyklos
Case=Locmąstykloje
Case=Nommąstykla

Gender seems to be lexical feature of NOUN. 99% lemmas (582) occur only with one value of Gender.

ADJ

399 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (374; 94%), Definite=Ind (366; 92%), Number=Sing (245; 61%).

ADJ tokens may have the following values of Gender:

Paradigm aiškusMascFemNeut
Case=Dat|Number=Singaiškiam
Case=Gen|Number=Singaiškios
Case=Nom|Number=Singaiškus
Polarity=Posaišku

PROPN

300 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (287; 96%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (142) occur only with one value of Gender.

VERB

178 VERB tokens (25% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (178; 100%), Mood=EMPTY (170; 96%), VerbForm=Part (169; 95%), Definite=Ind (165; 93%), Polarity=Pos (155; 87%), Number=Sing (101; 57%), Case=Nom (97; 54%), Voice=Act (96; 54%).

VERB tokens may have the following values of Gender:

Paradigm žinotiMascFemNeut
Aspect=Perf|Case=Nom|Number=Sing|Polarity=Pos|Tense=Past|Voice=Actžinojęs
Case=Loc|Number=Plur|Polarity=Neg|Tense=Pres|Voice=Actnežinančiose
Case=Nom|Number=Sing|Polarity=Pos|Tense=Pres|Voice=Actžinomas
Polarity=Pos|Tense=Pres|Voice=Passžinoma

PRON

145 PRON tokens (57% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (108; 74%), Person=EMPTY (78; 54%).

PRON tokens may have the following values of Gender:

Paradigm kurisMascFem
Case=Acc|Number=Singkurįkurią
Case=Acc|Number=Plurkuriuoskurias
Case=Dat|Number=Plurkuriems
Case=Gen|Number=Singkuriokurios
Case=Gen|Number=Plurkurių
Case=Ins|Number=Plurkuriais
Case=Loc|Number=Singkurioje
Case=Nom|Number=Singkuriskuri
Case=Nom|Number=Plurkuriekurios

DET

91 DET tokens (55% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (60; 66%).

DET tokens may have the following values of Gender:

Paradigm tasMascFem
Case=Acc|Number=Sing
Case=Gen|Definite=Ind|Number=Plur
Case=Gen|Number=Singtotos
Case=Ins|Number=Plurtais
Case=Loc|Number=Singtame
Case=Nom|Number=Singtastoji
Case=Nom|Number=PlurtieTos

NUM

14 NUM tokens (58% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc (8; 57%), Number=EMPTY (8; 57%).

NUM tokens may have the following values of Gender:

Paradigm trysMascFem
Case=Gentrijųtrijų
Case=Nomtrys

AUX

3 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (3; 100%), Person=EMPTY (3; 100%), Polarity=Pos (3; 100%), Tense=Pres (3; 100%), VerbForm=Part (3; 100%), Number=EMPTY (2; 67%), Voice=Act (2; 67%).

AUX tokens may have the following values of Gender:

Paradigm būtiMascNeut
Case=Nom|Number=Sing|Voice=Actesąs
Voice=Actesą
Voice=PassEsama

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (221; 87%), NOUN –[conj]–> NOUN (91; 69%), PROPN –[flat]–> PROPN (35; 85%), ADJ –[conj]–> ADJ (29; 94%), PROPN –[nmod]–> NOUN (28; 90%), NOUN –[acl]–> VERB (26; 84%), NOUN –[amod]–> VERB (22; 81%), ADJ –[nsubj]–> NOUN (21; 95%), PROPN –[conj]–> PROPN (15; 75%), VERB –[conj]–> VERB (14; 58%).