home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-PROIEL: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

98149 tokens (49%) have a non-empty value of Gender. 17315 types (59%) occur at least once with a non-empty value of Gender. 6779 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (40357; 20% instances), PRON (18320; 9% instances), ADJ (16046; 8% instances), VERB (9008; 5% instances), DET (6948; 3% instances), PROPN (6421; 3% instances), NUM (997; 0% instances), AUX (52; 0% instances).

NOUN

40357 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (28016; 69%).

NOUN tokens may have the following values of Gender:

Paradigm diesFem,MascMascMasc,NeutFemNeut
Case=Abl|Number=Singdiediediedie
Case=Abl|Number=Plurdiebusdiebusdiebus
Case=Acc|Number=Singdiemdiem, diesdiem
Case=Acc|Number=Plurdiesdiesdies
Case=Dat|Number=Singdiei
Case=Gen|Number=Singdieidieidiei
Case=Gen|Number=Plurdierumdierumdierum
Case=Nom|Number=Singdiesdiesdiesdies
Case=Nom|Number=Plurdiesdiesdies

Gender seems to be lexical feature of NOUN. 97% lemmas (3138) occur only with one value of Gender.

PRON

18320 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (12234; 67%), PronType=Prs (11602; 63%).

PRON tokens may have the following values of Gender:

Paradigm quiFem,MascFem,NeutMascMasc,NeutFemNeut
Case=Abl|Number=Sing|PronType=Intquoquaquo
Case=Abl|Number=Sing|PronType=Relquo, quiquoquaquo
Case=Abl|Number=Plur|PronType=Intquibusquibusquibus
Case=Abl|Number=Plur|PronType=Relquibusquibusquibusquibus
Case=Acc|Number=Sing|PronType=Intquemquamquod
Case=Acc|Number=Sing|PronType=Relquemquam, quae, quemquod, quae
Case=Acc|Number=Plur|PronType=Intquosquasquae
Case=Acc|Number=Plur|PronType=Relquosquas, quaequae
Case=Dat|Number=Sing|PronType=Intquocui
Case=Dat|Number=Sing|PronType=Relcui, quoi, quocui, quoicui
Case=Dat|Number=Plur|PronType=Intquibus
Case=Dat|Number=Plur|PronType=Relquibusquibusquibus
Case=Gen|Number=Sing|PronType=Intcuius
Case=Gen|Number=Sing|PronType=Relcuius, quoiuscuius, quoiuscuius
Case=Gen|Number=Plur|PronType=Intquorum
Case=Gen|Number=Plur|PronType=Relquorumquorumquarumquorum
Case=Nom|Number=Sing|PronType=Intquiquaequod
Case=Nom|Number=Sing|PronType=Relquiquaequod, quae
Case=Nom|Number=Plur|PronType=Intquiquaequae
Case=Nom|Number=Plur|PronType=Relquaequiquaequae, QVAE

ADJ

16046 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9933; 62%), Degree=Pos (8621; 54%).

ADJ tokens may have the following values of Gender:

Paradigm illeFem,MascMascMasc,NeutFemNeut
Case=Abl|Number=Singilloilloillaillo
Case=Abl|Number=Plurillisillisillisillis
Case=Acc|Number=Singillum, illudillamillud
Case=Acc|Number=Plurillosillasilla
Case=Dat|Number=Singilli, illoilliilli
Case=Dat|Number=Plurillisillisillis
Case=Gen|Number=Singillius
Case=Gen|Number=Plurillorumillarumillorum
Case=Nom|Number=Singilleillaillud
Case=Nom|Number=Plurilliillaeilla

VERB

9008 VERB tokens (22% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (9008; 100%), Person=EMPTY (9008; 100%), VerbForm=Part (8269; 92%), Case=Nom (5739; 64%), Number=Sing (5656; 63%), Aspect=Perf (5017; 56%), Tense=Past (5017; 56%), Voice=Pass (5003; 56%).

VERB tokens may have the following values of Gender:

Paradigm facioFem,MascMascMasc,NeutFemNeut
Aspect=Perf|Case=Abl|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passfactofactafacto
Aspect=Perf|Case=Abl|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passfactisfactis
Aspect=Perf|Case=Acc|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passfactumfactamfactum
Aspect=Perf|Case=Acc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passfactosfactasfacta
Aspect=Perf|Case=Gen|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passfacti
Aspect=Perf|Case=Gen|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passfactorumfactorum
Aspect=Perf|Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passfactus, factafactafactum
Aspect=Perf|Case=Nom|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passfactifactaefacta
Case=Abl|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actfaciente
Case=Abl|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actfacientibus
Case=Abl|Number=Plur|VerbForm=Gdvfaciendis
Case=Acc|Number=Sing|Tense=Fut|VerbForm=Part|Voice=Actfacturum
Case=Acc|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actfacientem
Case=Acc|Number=Sing|VerbForm=Gdvfaciendumfaciendamfaciendum
Case=Acc|Number=Plur|Tense=Fut|VerbForm=Part|Voice=Actfacturosfacturas
Case=Dat|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actfacienti
Case=Gen|Number=Sing|VerbForm=Gdvfaciendi, faciundifaciendae
Case=Nom|Number=Sing|Tense=Fut|VerbForm=Part|Voice=Actfacturus
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actfaciensfaciensfaciensfaciens
Case=Nom|Number=Sing|VerbForm=Gdvfaciendafaciendum, facteon
Case=Nom|Number=Plur|Tense=Fut|VerbForm=Part|Voice=Actfacturi
Case=Nom|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actfacientes
Case=Nom|Number=Plur|VerbForm=Gdvfaciendifaciendaefacienda

DET

6948 DET tokens (88% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Reflex=EMPTY (5988; 86%), Number=Sing (4903; 71%), Person=EMPTY (3588; 52%), Poss=EMPTY (3588; 52%).

DET tokens may have the following values of Gender:

Paradigm omnisFem,MascFem,NeutMascMasc,NeutFemNeut
Case=Abl|Number=Singomniomniomni
Case=Abl|Number=Pluromnibusomnibusomnibusomnibusomnibusomnibus
Case=Acc|Number=Singomnem, omnisomnem, omnisomnemomnemomne
Case=Acc|Number=Pluromnes, omnisomnesomnesomnia
Case=Dat|Number=Singomniomni
Case=Dat|Number=Pluromnibusomnibusomnibus
Case=Gen|Number=Singomnis
Case=Gen|Number=Pluromniumomnium
Case=Nom|Number=Singomnisomnisomnisomnisomne
Case=Nom|Number=Pluromnesomnesomnesomnia
Case=Voc|Number=Singomnis
Case=Voc|Number=Pluromnesomnes

PROPN

6421 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6276; 98%).

PROPN tokens may have the following values of Gender:

Paradigm HierosolymaFem,MascFemNeut
Case=Abl|Number=PlurHierosolymisHierosolymis
Case=Acc|Number=SingHierosolymam, Hierosolyma
Case=Acc|Number=PlurHierosolymaHierosolyma
Case=Dat|Number=SingHierosolymae
Case=Dat|Number=PlurHierosolymis
Case=Nom|Number=SingHierosolyma

Gender seems to be lexical feature of PROPN. 97% lemmas (879) occur only with one value of Gender.

NUM

997 NUM tokens (59% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (507; 51%).

NUM tokens may have the following values of Gender:

Paradigm unusMascMasc,NeutFemNeut
Case=Abl|Number=Singunounounauno
Case=Acc|Number=Singunumunumunam, Vnamunum
Case=Acc|Number=Plurunas
Case=Dat|Number=Singuni, Vni, unouni
Case=Gen|Number=Singuniusuniusunius
Case=Nom|Number=Singunus, unumuna, Vnaunum

AUX

52 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=EMPTY (52; 100%), Mood=EMPTY (52; 100%), Person=EMPTY (52; 100%), Tense=Fut (52; 100%), VerbForm=Part (52; 100%), Voice=Act (52; 100%), Number=Sing (37; 71%).

AUX tokens may have the following values of Gender:

Paradigm sumMascMasc,NeutFemNeut
Case=Acc|Number=Singfuturumfuturamfuturum
Case=Acc|Number=Plurfuturosfutura
Case=Gen|Number=Plurfuturorumfuturarum
Case=Nom|Number=Singfuturusfuturafuturum
Case=Nom|Number=Plurfuturifuturaefutura

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4648; 70%), NOUN –[amod]–> ADJ (3074; 67%), NOUN –[conj]–> NOUN (1741; 54%), VERB –[nsubj:pass]–> NOUN (1307; 59%), ADJ –[conj]–> ADJ (571; 86%), PROPN –[flat:name]–> PROPN (480; 94%), ADJ –[nsubj]–> NOUN (435; 74%), PROPN –[appos]–> NOUN (427; 86%), PROPN –[conj]–> PROPN (338; 87%), ADJ –[nsubj]–> PRON (290; 82%).