home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-LLCT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

122626 tokens (51%) have a non-empty value of Gender. 7486 types (80%) occur at least once with a non-empty value of Gender. 3085 lemmas (88%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (51206; 21% instances), PROPN (20150; 8% instances), DET (20065; 8% instances), ADJ (12350; 5% instances), VERB (9754; 4% instances), PRON (8470; 3% instances), NUM (629; 0% instances), AUX (2; 0% instances).

NOUN

51206 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40994; 80%).

NOUN tokens may have the following values of Gender:

Paradigm heresMascFem
Case=Abl|Number=Singherede
Case=Abl|Number=Plurheredibus, heredes, eredibus, heridibus
Case=Acc|Number=Singheredem, heredes
Case=Acc|Number=Plurheredes, heredis, herides, heridisheredes
Case=Dat|Number=Plurheredibus, heridibus
Case=Gen|Number=Singheredis
Case=Gen|Number=Plurheredum
Case=Nom|Number=Singheredes, heres
Case=Nom|Number=Plurheredes, heredis

Gender seems to be lexical feature of NOUN. 97% lemmas (701) occur only with one value of Gender.

PROPN

20150 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (20031; 99%).

PROPN tokens may have the following values of Gender:

Paradigm VarianusMascFemNeut
Case=AblVarianaVarianu
Case=AccVaianu, Variano

Gender seems to be lexical feature of PROPN. 98% lemmas (1807) occur only with one value of Gender.

DET

20065 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (14838; 74%), Number=Sing (14697; 73%), Person[psor]=EMPTY (14136; 70%), Poss=EMPTY (14136; 70%).

DET tokens may have the following values of Gender:

Paradigm ipseMascFemNeut
Case=Abl|Number=Singipso, ipsum, isso, ipse, ipsiipsa, ipsam, ipsoipso, ipsum
Case=Abl|Number=Pluripsis, ipsiipsis, ipsiipsis
Case=Acc|Number=Singipso, ipsumipsa, ipsam, ipsasipsum, ipso, ipsu, ipsud
Case=Acc|Number=Pluripsos, ipsoipsas, ipsaipsa, ipsas
Case=Dat|Number=Singipsi, ipso, ipsumipsei, ipsi
Case=Dat|Number=Pluripsi, ipsis
Case=Gen|Number=Singipsiusipsius, ipse, ipssiusipsius
Case=Gen|Number=Pluripsorumipsarum
Case=Nom|Number=Singipse, ipsi, ipso, ipsumipsa, ipsam, ipseipsum, ipso
Case=Nom|Number=Pluripsi, ipsisipse, ipsae, ipsisipsa

ADJ

12350 ADJ tokens (93% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10249; 83%), NumType=EMPTY (9200; 74%).

ADJ tokens may have the following values of Gender:

Paradigm sanctusMascFemNeut
Case=Abl|Number=Singsancto, sanctumsanctasancto
Case=Acc|Number=Singsanctum, sanctosancta, sanctam
Case=Acc|Number=Plursancta
Case=Dat|Number=Singsancte
Case=Gen|Number=Singsanctisancte, sanctae, sanctem
Case=Gen|Number=Plursanctorum
Case=Nom|Number=Singsanctussancta, sanctam
Case=Nom|Number=Plursancte

VERB

9754 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (9754; 100%), Person=EMPTY (9754; 100%), Tense=EMPTY (9754; 100%), VerbForm=Part (9754; 100%), Number=Sing (8425; 86%), Voice=Pass (7897; 81%), Aspect=Perf (6044; 62%).

VERB tokens may have the following values of Gender:

Paradigm doMascFemNeut
Aspect=Imp|Case=Acc|Number=Sing|Voice=Actdante, dantemdante
Aspect=Imp|Case=Nom|Number=Plur|Voice=Actdantes
Aspect=Perf|Case=Abl|Number=Sing|Voice=Passdata
Aspect=Perf|Case=Acc|Number=Sing|Voice=Passdata, datam
Aspect=Perf|Case=Acc|Number=Plur|Voice=Passdatas
Aspect=Perf|Case=Nom|Number=Sing|Voice=Passdata, datasdatum
Aspect=Perf|Case=Nom|Number=Plur|Voice=Passdatidatedata
Aspect=Prosp|Case=Abl|Number=Sing|Voice=Passdandum
Aspect=Prosp|Case=Acc|Number=Sing|Voice=Passdandum
Aspect=Prosp|Case=Gen|Number=Sing|Voice=Passdandi

PRON

8470 PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6140; 72%), PronType=Prs (5218; 62%).

PRON tokens may have the following values of Gender:

Paradigm quiMascFemNeut
Case=Abl|Number=Singquo, quodqua, quam, quasquo, quod
Case=Abl|Number=Plurquibusquibusquibus
Case=Acc|Number=Singque, quemquam, quas, quaquod, quo
Case=Acc|Number=Plurquosquas, quaque, quem
Case=Dat|Number=Singcuicuicui
Case=Dat|Number=Plurquibusquibus
Case=Gen|Number=Singcuius
Case=Gen|Number=Plurchorum
Case=Nom|Number=Singqui, quitque, quem, quae, quaquod, cod, quo
Case=Nom|Number=Plurquique, quem, quaeque, quem

NUM

629 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (629; 100%), NumType=Card (629; 100%), Number=Plur (629; 100%), Case=Acc (551; 88%).

NUM tokens may have the following values of Gender:

Paradigm duoMascFemNeut
Case=Ablduobusduabusduobus
Case=Accduo, duosduas, duaduo
Case=Nomduodue, duae, duesduo

AUX

2 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (2; 100%), Mood=EMPTY (2; 100%), Person=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Part (2; 100%).

AUX tokens may have the following values of Gender:

Paradigm sumFemNeut
Case=Abl|Number=Singfutura
Case=Acc|Number=Plurfutura

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (15932; 100%), PROPN –[appos]–> NOUN (7052; 100%), NOUN –[amod]–> ADJ (6355; 100%), NOUN –[conj]–> NOUN (3823; 69%), PROPN –[amod]–> ADJ (2261; 70%), NOUN –[acl]–> VERB (1843; 78%), PROPN –[acl]–> VERB (1824; 96%), VERB –[obl:arg]–> PROPN (1796; 71%), PROPN –[det]–> DET (1529; 100%), PROPN –[nmod]–> NOUN (1240; 69%).