home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-LLCT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

132432 tokens (55%) have a non-empty value of Gender. 7500 types (80%) occur at least once with a non-empty value of Gender. 3088 lemmas (88%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (51209; 21% instances), DET (20197; 8% instances), PROPN (20153; 8% instances), PRON (17204; 7% instances), ADJ (12368; 5% instances), VERB (9756; 4% instances), NUM (1545; 1% instances).

NOUN

51209 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40995; 80%).

NOUN tokens may have the following values of Gender:

Paradigm heresMascFem
Case=Abl|Number=Singherede
Case=Abl|Number=Plurheredibus, heredes, eredibus, heridibus
Case=Acc|Number=Singheredem, heredes
Case=Acc|Number=Plurheredes, heredis, herides, heridisheredes
Case=Dat|Number=Plurheredibus, heridibus
Case=Gen|Number=Singheredis
Case=Gen|Number=Plurheredum
Case=Nom|Number=Singheredes, heres
Case=Nom|Number=Plurheredes, heredis

Gender seems to be lexical feature of NOUN. 97% lemmas (703) occur only with one value of Gender.

DET

20197 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (14970; 74%), Number=Sing (14783; 73%), Poss=EMPTY (14268; 71%), Person=EMPTY (13148; 65%).

DET tokens may have the following values of Gender:

Paradigm ipseMascFemNeut
Case=Abl|Number=Singipso, ipsum, isso, ipse, ipsiipsa, ipsam, ipsoipso, ipsum
Case=Abl|Number=Pluripsis, ipsiipsis, ipsiipsis
Case=Acc|Number=Singipso, ipsumipsa, ipsam, ipsasipsum, ipso, ipsu, ipsud
Case=Acc|Number=Pluripsos, ipsoipsas, ipsaipsa, ipsas
Case=Dat|Number=Singipsi, ipso, ipsumipsei, ipsi
Case=Dat|Number=Pluripsi, ipsis
Case=Gen|Number=Singipsiusipsius, ipse, ipssiusipsius
Case=Gen|Number=Pluripsorumipsarum
Case=Nom|Number=Singipse, ipsi, ipso, ipsumipsa, ipsam, ipseipsum, ipso
Case=Nom|Number=Pluripsi, ipsisipse, ipsae, ipsisipsa

PROPN

20153 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (20034; 99%).

PROPN tokens may have the following values of Gender:

Paradigm VarianusMascFemNeut
Case=AblVarianaVarianu
Case=AccVaianu, Variano

Gender seems to be lexical feature of PROPN. 98% lemmas (1807) occur only with one value of Gender.

PRON

17204 PRON tokens (100% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (12781; 74%), PronType=Prs (12640; 73%), Person=1 (9730; 57%).

PRON tokens may have the following values of Gender:

Paradigm quiMascFemNeut
Case=Abl|Number=Singquo, quod, quotqua, quam, quasquo, quod
Case=Abl|Number=Plurquibusquibusquibus
Case=Acc|Number=Singque, quemquam, quas, quaquod, quot, quo
Case=Acc|Number=Plurquosquas, quaque, quem
Case=Dat|Number=Singcuicuicui
Case=Dat|Number=Plurquibusquibus
Case=Gen|Number=Singcuius
Case=Gen|Number=Plurchorum
Case=Nom|Number=Singqui, quis, quitque, quem, quae, quaquod, quot, cod, quo
Case=Nom|Number=Plurquique, quem, quaeque, quem

ADJ

12368 ADJ tokens (93% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10245; 83%), NumType=EMPTY (9218; 75%).

ADJ tokens may have the following values of Gender:

Paradigm sanctusMascFemNeut
Case=Abl|Number=Singsancto, sanctumsanctasancto
Case=Acc|Number=Singsanctum, sanctosancta, sanctam
Case=Acc|Number=Plursancta
Case=Dat|Number=Singsancte
Case=Gen|Number=Singsanctisancte, sanctae, sanctem
Case=Gen|Number=Plursanctorum
Case=Nom|Number=Singsanctussancta, sanctam
Case=Nom|Number=Plursancte

VERB

9756 VERB tokens (33% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (9756; 100%), Person=EMPTY (9756; 100%), Number=Sing (8426; 86%), VerbForm=Part (7901; 81%), Voice=Pass (7892; 81%), Aspect=Perf (6044; 62%), Tense=Past (6044; 62%).

VERB tokens may have the following values of Gender:

Paradigm doMascFemNeut
Aspect=Imp|Case=Abl|Number=Sing|Tense=Pres|VerbForm=Gdv|Voice=Passdandum
Aspect=Imp|Case=Acc|Number=Sing|Tense=Pres|VerbForm=Ger|Voice=Passdandum
Aspect=Imp|Case=Acc|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actdante, dantemdante
Aspect=Imp|Case=Gen|Number=Sing|Tense=Pres|VerbForm=Ger|Voice=Passdandi
Aspect=Imp|Case=Nom|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actdantes
Aspect=Perf|Case=Abl|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passdata
Aspect=Perf|Case=Acc|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passdata, datam
Aspect=Perf|Case=Acc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passdatas
Aspect=Perf|Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passdata, datasdatum
Aspect=Perf|Case=Nom|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passdatidatedata

NUM

1545 NUM tokens (64% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1545; 100%), Case=Acc (1287; 83%), Number=Sing (916; 59%).

NUM tokens may have the following values of Gender:

Paradigm unusMascFemNeut
Case=Abluno, unumuna, unamuno
Case=Accuno, unum, unuuna, unamuno, unum
Case=Nomunusuna, unam

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (15334; 100%), PROPN –[appos]–> NOUN (7049; 100%), NOUN –[amod]–> ADJ (6363; 100%), PRON –[appos]–> PROPN (4878; 100%), NOUN –[conj]–> NOUN (3871; 69%), PROPN –[amod]–> ADJ (2258; 70%), NOUN –[acl]–> VERB (1851; 70%), NOUN –[nsubj]–> PRON (1848; 92%), PROPN –[acl]–> VERB (1822; 99%), VERB –[obl:arg]–> PROPN (1792; 86%).