Treebank Statistics: UD_Latin-LLCT: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
122626 tokens (51%) have a non-empty value of Gender.
7486 types (80%) occur at least once with a non-empty value of Gender.
3085 lemmas (88%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (51206; 21% instances), PROPN (20150; 8% instances), DET (20065; 8% instances), ADJ (12350; 5% instances), VERB (9754; 4% instances), PRON (8470; 3% instances), NUM (629; 0% instances), AUX (2; 0% instances).
NOUN
51206 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40994; 80%).
NOUN tokens may have the following values of Gender:
Fem(20581; 40% of non-emptyGender): ecclesie, manus, casa, terra, res, cartula, rebus, ecclesia, indictione, memorieMasc(23000; 45% of non-emptyGender): teste, filio, loco, notarius, presbitero, anno, domno, episcopus, presbiter, testisNeut(7625; 15% of non-emptyGender): signum, nomine, argentum, lato, caput, regni, imperii, tempore, mandato, capoEMPTY(6): [noun], [–], [pronoun]
| Paradigm heres | Masc | Fem |
|---|---|---|
| Case=Abl|Number=Sing | herede | |
| Case=Abl|Number=Plur | heredibus, heredes, eredibus, heridibus | |
| Case=Acc|Number=Sing | heredem, heredes | |
| Case=Acc|Number=Plur | heredes, heredis, herides, heridis | heredes |
| Case=Dat|Number=Plur | heredibus, heridibus | |
| Case=Gen|Number=Sing | heredis | |
| Case=Gen|Number=Plur | heredum | |
| Case=Nom|Number=Sing | heredes, heres | |
| Case=Nom|Number=Plur | heredes, heredis |
Gender seems to be lexical feature of NOUN. 97% lemmas (701) occur only with one value of Gender.
PROPN
20150 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (20031; 99%).
PROPN tokens may have the following values of Gender:
Fem(2051; 10% of non-emptyGender): Luca, Marie, Italia, Langubardiam, Lunata, Langobardiam, Langubardia, Mariae, Verriana, PisciaMasc(17383; 86% of non-emptyGender): Dei, Martini, Deo, Petri, Gherardus, Petrus, domini, Adalfridi, Fridiani, AndreasNeut(716; 4% of non-emptyGender): Sexto, Castronovo, Vuamo, Suborbano, Feruniano, Paterno, Sugrominio, Tempaniano, Asulari, TuringoEMPTY(102): [Propn]
| Paradigm Varianus | Masc | Fem | Neut |
|---|---|---|---|
| Case=Abl | Variana | Varianu | |
| Case=Acc | Vaianu, Variano |
Gender seems to be lexical feature of PROPN. 98% lemmas (1807) occur only with one value of Gender.
DET
20065 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (14838; 74%), Number=Sing (14697; 73%), Person[psor]=EMPTY (14136; 70%), Poss=EMPTY (14136; 70%).
DET tokens may have the following values of Gender:
Fem(8836; 44% of non-emptyGender): ipsa, mea, suprascripta, hanc, illa, suprascripte, ipsius, una, omnibus, huiusMasc(7877; 39% of non-emptyGender): qui, nostro, tuis, meis, vestro, ipsius, ipso, ipse, suprascripto, taliNeut(3352; 17% of non-emptyGender): omnia, uno, alio, hec, omnibus, vestro, quolibet, suo, ipso, ipsum
| Paradigm ipse | Masc | Fem | Neut |
|---|---|---|---|
| Case=Abl|Number=Sing | ipso, ipsum, isso, ipse, ipsi | ipsa, ipsam, ipso | ipso, ipsum |
| Case=Abl|Number=Plur | ipsis, ipsi | ipsis, ipsi | ipsis |
| Case=Acc|Number=Sing | ipso, ipsum | ipsa, ipsam, ipsas | ipsum, ipso, ipsu, ipsud |
| Case=Acc|Number=Plur | ipsos, ipso | ipsas, ipsa | ipsa, ipsas |
| Case=Dat|Number=Sing | ipsi, ipso, ipsum | ipsei, ipsi | |
| Case=Dat|Number=Plur | ipsi, ipsis | ||
| Case=Gen|Number=Sing | ipsius | ipsius, ipse, ipssius | ipsius |
| Case=Gen|Number=Plur | ipsorum | ipsarum | |
| Case=Nom|Number=Sing | ipse, ipsi, ipso, ipsum | ipsa, ipsam, ipse | ipsum, ipso |
| Case=Nom|Number=Plur | ipsi, ipsis | ipse, ipsae, ipsis | ipsa |
ADJ
12350 ADJ tokens (93% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10249; 83%), NumType=EMPTY (9200; 74%).
ADJ tokens may have the following values of Gender:
Fem(3943; 32% of non-emptyGender): sancte, bone, Lucane, decima, tertia, publica, quinta, cultis, incultis, LucenseMasc(6623; 54% of non-emptyGender): sancti, singulos, manifestu, decimo, vigisimo, bonos, humilis, tertio, sexto, propitioNeut(1784; 14% of non-emptyGender): integrum, livellario, duplum, prefinito, manifestum, cultum, incultum, Romanum, recto, purumEMPTY(963): quondam
| Paradigm sanctus | Masc | Fem | Neut |
|---|---|---|---|
| Case=Abl|Number=Sing | sancto, sanctum | sancta | sancto |
| Case=Acc|Number=Sing | sanctum, sancto | sancta, sanctam | |
| Case=Acc|Number=Plur | sancta | ||
| Case=Dat|Number=Sing | sancte | ||
| Case=Gen|Number=Sing | sancti | sancte, sanctae, sanctem | |
| Case=Gen|Number=Plur | sanctorum | ||
| Case=Nom|Number=Sing | sanctus | sancta, sanctam | |
| Case=Nom|Number=Plur | sancte |
VERB
9754 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (9754; 100%), Person=EMPTY (9754; 100%), Tense=EMPTY (9754; 100%), VerbForm=Part (9754; 100%), Number=Sing (8425; 86%), Voice=Pass (7897; 81%), Aspect=Perf (6044; 62%).
VERB tokens may have the following values of Gender:
Fem(3008; 31% of non-emptyGender): traditam, pertenentes, facta, sita, pegiorata, tradita, pertinentes, tenente, dicta, circumdataMasc(3455; 35% of non-emptyGender): rogatus, regnante, ingressus, coronatus, abitantes, facto, gubernans, dante, ordinatus, visuNeut(3291; 34% of non-emptyGender): actum, faciendum, conservata, designatas, abendi, usufructuandi, faciendi, adimpleta, gubernandi, adinpletaEMPTY(18969): subscripsi, debeamus, dedi, dedisti, legitur, tenet, conplevi, scribere, conponere, rogavimus
| Paradigm do | Masc | Fem | Neut |
|---|---|---|---|
| Aspect=Imp|Case=Acc|Number=Sing|Voice=Act | dante, dantem | dante | |
| Aspect=Imp|Case=Nom|Number=Plur|Voice=Act | dantes | ||
| Aspect=Perf|Case=Abl|Number=Sing|Voice=Pass | data | ||
| Aspect=Perf|Case=Acc|Number=Sing|Voice=Pass | data, datam | ||
| Aspect=Perf|Case=Acc|Number=Plur|Voice=Pass | datas | ||
| Aspect=Perf|Case=Nom|Number=Sing|Voice=Pass | data, datas | datum | |
| Aspect=Perf|Case=Nom|Number=Plur|Voice=Pass | dati | date | data |
| Aspect=Prosp|Case=Abl|Number=Sing|Voice=Pass | dandum | ||
| Aspect=Prosp|Case=Acc|Number=Sing|Voice=Pass | dandum | ||
| Aspect=Prosp|Case=Gen|Number=Sing|Voice=Pass | dandi |
PRON
8470 PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6140; 72%), PronType=Prs (5220; 62%).
PRON tokens may have the following values of Gender:
Fem(1858; 22% of non-emptyGender): quas, eas, que, eam, quam, tibi, eius, ea, qua, cuiMasc(5595; 66% of non-emptyGender): qui, tibi, eius, vobis, tu, te, vos, cui, que, quoNeut(1017; 12% of non-emptyGender): id, quod, que, aliquo, quibus, aliquid, eo, quit, quidquid, quoEMPTY(9809): ego, me, nos, mihi, nobis, se, nihil, nus, novis, sibi
| Paradigm qui | Masc | Fem | Neut |
|---|---|---|---|
| Case=Abl|Number=Sing | quo, quod | qua, quam, quas | quo, quod |
| Case=Abl|Number=Plur | quibus | quibus | quibus |
| Case=Acc|Number=Sing | que, quem | quam, quas, qua | quod, quo |
| Case=Acc|Number=Plur | quos | quas, qua | que, quem |
| Case=Dat|Number=Sing | cui | cui | cui |
| Case=Dat|Number=Plur | quibus | quibus | |
| Case=Gen|Number=Sing | cuius | ||
| Case=Gen|Number=Plur | chorum | ||
| Case=Nom|Number=Sing | qui, quit | que, quem, quae, qua | quod, cod, quo |
| Case=Nom|Number=Plur | qui | que, quem, quae | que, quem |
NUM
629 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (629; 100%), NumType=Card (629; 100%), Number=Plur (629; 100%), Case=Acc (551; 88%).
NUM tokens may have the following values of Gender:
Fem(387; 62% of non-emptyGender): duas, tres, due, dua, duae, duabus, dues, tre, trisMasc(225; 36% of non-emptyGender): duo, tres, duos, ducentos, tricentos, duocentos, duobus, quatringentos, quingentos, ducentusNeut(17; 3% of non-emptyGender): duo, milia, duobus, triaEMPTY(872): viginti, triginta, quinquaginta, decem, sex, quattuor, quinque, duodecim, centum, octo
| Paradigm duo | Masc | Fem | Neut |
|---|---|---|---|
| Case=Abl | duobus | duabus | duobus |
| Case=Acc | duo, duos | duas, dua | duo |
| Case=Nom | duo | due, duae, dues | duo |
AUX
2 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (2; 100%), Mood=EMPTY (2; 100%), Person=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Part (2; 100%).
AUX tokens may have the following values of Gender:
Fem(1; 50% of non-emptyGender): futuraNeut(1; 50% of non-emptyGender): futuraEMPTY(4125): est, fuerit, sum, fuit, fuerint, esse, fui, sunt, sit, sint
| Paradigm sum | Fem | Neut |
|---|---|---|
| Case=Abl|Number=Sing | futura | |
| Case=Acc|Number=Plur | futura |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (15674; 100%),
PROPN –[appos]–> NOUN (7052; 100%),
NOUN –[amod]–> ADJ (6355; 100%),
NOUN –[conj]–> NOUN (3825; 69%),
PROPN –[amod]–> ADJ (2261; 70%),
NOUN –[acl]–> VERB (1843; 78%),
PROPN –[acl]–> VERB (1824; 96%),
VERB –[obl:arg]–> PROPN (1796; 71%),
PROPN –[det]–> DET (1527; 100%),
PROPN –[nmod]–> NOUN (1240; 69%).