home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-LLCT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

132433 tokens (55%) have a non-empty value of Gender. 7501 types (80%) occur at least once with a non-empty value of Gender. 3088 lemmas (88%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (51206; 21% instances), PROPN (20149; 8% instances), DET (20013; 8% instances), PRON (18308; 8% instances), ADJ (12372; 5% instances), VERB (9754; 4% instances), NUM (629; 0% instances), AUX (2; 0% instances).

NOUN

51206 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40994; 80%).

NOUN tokens may have the following values of Gender:

Paradigm heresMascFem
Case=Abl|Number=Singherede
Case=Abl|Number=Plurheredibus, heredes, eredibus, heridibus
Case=Acc|Number=Singheredem, heredes
Case=Acc|Number=Plurheredes, heredis, herides, heridisheredes
Case=Dat|Number=Plurheredibus, heridibus
Case=Gen|Number=Singheredis
Case=Gen|Number=Plurheredum
Case=Nom|Number=Singheredes, heres
Case=Nom|Number=Plurheredes, heredis

Gender seems to be lexical feature of NOUN. 97% lemmas (701) occur only with one value of Gender.

PROPN

20149 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (20030; 99%).

PROPN tokens may have the following values of Gender:

Paradigm VarianusMascFemNeut
Case=AblVarianaVarianu
Case=AccVaianu, Variano

Gender seems to be lexical feature of PROPN. 98% lemmas (1807) occur only with one value of Gender.

DET

20013 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (14786; 74%), Number=Sing (14667; 73%), Person[psor]=EMPTY (14084; 70%), Poss=EMPTY (14084; 70%).

DET tokens may have the following values of Gender:

Paradigm ipseMascFemNeut
Case=Abl|Number=Singipso, ipsum, isso, ipse, ipsiipsa, ipsam, ipsoipso, ipsum
Case=Abl|Number=Pluripsis, ipsiipsis, ipsiipsis
Case=Acc|Number=Singipso, ipsumipsa, ipsam, ipsasipsum, ipso, ipsu, ipsud
Case=Acc|Number=Pluripsos, ipsoipsas, ipsaipsa, ipsas
Case=Dat|Number=Singipsi, ipso, ipsumipsei, ipsi
Case=Dat|Number=Pluripsi, ipsis
Case=Gen|Number=Singipsiusipsius, ipse, ipssiusipsius
Case=Gen|Number=Pluripsorumipsarum
Case=Nom|Number=Singipse, ipsi, ipso, ipsumipsa, ipsam, ipseipsum, ipso
Case=Nom|Number=Pluripsi, ipsisipse, ipsae, ipsisipsa

PRON

18308 PRON tokens (100% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (15009; 82%), Number=Sing (13815; 75%), Person=1 (9732; 53%).

PRON tokens may have the following values of Gender:

Paradigm quiMascFemNeut
Case=Abl|Number=Singquo, quod, quotqua, quam, quasquo, quod
Case=Abl|Number=Plurquibusquibusquibus
Case=Acc|Number=Singque, quemquam, quas, quaquod, quot, quo
Case=Acc|Number=Plurquosquas, quaque, quem
Case=Dat|Number=Singcuicuicui
Case=Dat|Number=Plurquibusquibus
Case=Gen|Number=Singcuius
Case=Gen|Number=Plurchorum
Case=Nom|Number=Singqui, quitque, quem, quae, quaquod, quot, cod, quo
Case=Nom|Number=Plurquique, quem, quaeque, quem

ADJ

12372 ADJ tokens (93% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10249; 83%), NumType=EMPTY (9222; 75%).

ADJ tokens may have the following values of Gender:

Paradigm sanctusMascFemNeut
Case=Abl|Number=Singsancto, sanctumsanctasancto
Case=Acc|Number=Singsanctum, sanctosancta, sanctam
Case=Acc|Number=Plursancta
Case=Dat|Number=Singsancte
Case=Gen|Number=Singsanctisancte, sanctae, sanctem
Case=Gen|Number=Plursanctorum
Case=Nom|Number=Singsanctussancta, sanctam
Case=Nom|Number=Plursancte

VERB

9754 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (9754; 100%), Person=EMPTY (9754; 100%), Tense=EMPTY (9754; 100%), VerbForm=Part (9754; 100%), Number=Sing (8425; 86%), Voice=Pass (7897; 81%), Aspect=Perf (6044; 62%).

VERB tokens may have the following values of Gender:

Paradigm doMascFemNeut
Aspect=Imp|Case=Acc|Number=Sing|Voice=Actdante, dantemdante
Aspect=Imp|Case=Nom|Number=Plur|Voice=Actdantes
Aspect=Perf|Case=Abl|Number=Sing|Voice=Passdata
Aspect=Perf|Case=Acc|Number=Sing|Voice=Passdata, datam
Aspect=Perf|Case=Acc|Number=Plur|Voice=Passdatas
Aspect=Perf|Case=Nom|Number=Sing|Voice=Passdata, datasdatum
Aspect=Perf|Case=Nom|Number=Plur|Voice=Passdatidatedata
Aspect=Prosp|Case=Abl|Number=Sing|Voice=Passdandum
Aspect=Prosp|Case=Acc|Number=Sing|Voice=Passdandum
Aspect=Prosp|Case=Gen|Number=Sing|Voice=Passdandi

NUM

629 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (629; 100%), Number=Plur (629; 100%), Case=Acc (551; 88%).

NUM tokens may have the following values of Gender:

Paradigm duoMascFemNeut
Case=Ablduobusduabusduobus
Case=Accduo, duosduas, duaduo
Case=Nomduodue, duae, duesduo

AUX

2 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (2; 100%), Mood=EMPTY (2; 100%), Person=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Part (2; 100%).

AUX tokens may have the following values of Gender:

Paradigm sumFemNeut
Case=Abl|Number=Singfutura
Case=Acc|Number=Plurfutura

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (15833; 100%), PROPN –[appos]–> NOUN (7052; 100%), NOUN –[amod]–> ADJ (6377; 100%), PROPN –[det]–> PRON (4883; 100%), NOUN –[conj]–> NOUN (3818; 69%), PROPN –[amod]–> ADJ (2261; 70%), NOUN –[acl]–> VERB (1851; 73%), PROPN –[acl]–> VERB (1824; 96%), VERB –[obl:arg]–> PROPN (1796; 71%), NOUN –[nsubj]–> PRON (1745; 92%).