Treebank Statistics: UD_Italian-Old: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
40626 tokens (33%) have a non-empty value of Gender.
6319 types (51%) occur at least once with a non-empty value of Gender.
3833 lemmas (60%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (15904; 13% instances), DET (14215; 12% instances), ADJ (4865; 4% instances), PRON (3178; 3% instances), VERB (2401; 2% instances), AUX (48; 0% instances), ADV (14; 0% instances), PROPN (1; 0% instances).
NOUN
15904 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (12677; 80%).
NOUN tokens may have the following values of Gender:
Fem(7357; 46% of non-emptyGender): terra, gente, parte, mente, donna, vita, parole, luce, anima, vistaMasc(8547; 54% of non-emptyGender): occhi, mondo, maestro, ciel, viso, loco, duca, amor, lume, tempoEMPTY(49): i, quando, sì, O, P, diece, due, no, sei, tre
| Paradigm braccio | Masc | Fem |
|---|---|---|
| Number=Sing | braccio | |
| Number=Plur | braccia |
Gender seems to be lexical feature of NOUN. 98% lemmas (2519) occur only with one value of Gender.
DET
14215 DET tokens (95% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (12246; 86%), Number=Sing (11118; 78%), PronType=Art (9604; 68%), Definite=Def (9209; 65%).
DET tokens may have the following values of Gender:
Fem(5802; 41% of non-emptyGender): la, l’, le, sua, mia, una, quella, questa, tua, altraMasc(8413; 59% of non-emptyGender): il, ‘l, l’, li, lo, un, i, mio, suo, quelEMPTY(753): lor, qual, tal, ogne, più, altro, altra, altrui, alcun, quale
| Paradigm il | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Sing|PronType=Art | il, 'l, l', lo, l, sul, i | la, l' |
| Definite=Def|Number=Plur|PronType=Art | li, i, il, ', l', l, ne | le, l', il |
| Number=Plur|PronType=Dem | le |
ADJ
4865 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (3767; 77%).
ADJ tokens may have the following values of Gender:
Fem(2206; 45% of non-emptyGender): prima, bella, alta, santa, sola, divina, buona, umana, viva, lungaMasc(2659; 55% of non-emptyGender): alto, primo, buon, dolce, etterno, gran, santo, secondo, novo, vivoEMPTY(334): gran, tal, dolce, grande, più, maggior, dolente, forte, gravi, men
| Paradigm grande | Masc | Fem |
|---|---|---|
| Degree=Cmp|Number=Sing | grande | grande |
| Number=Sing | gran, grande | gran, grande, Grand' |
| Number=Plur | grandi, gran | gran, grandi |
PRON
3178 PRON tokens (22% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3175; 100%), Poss=EMPTY (3141; 99%), Clitic=EMPTY (2893; 91%), Number=Sing (2691; 85%), PronType=Prs (1642; 52%), Person=3 (1619; 51%).
PRON tokens may have the following values of Gender:
Fem(794; 25% of non-emptyGender): la, lei, quella, ella, le, l’, una, essa, questa, altraMasc(2384; 75% of non-emptyGender): lui, quel, li, elli, lo, colui, altro, un, el, queiEMPTY(11005): che, si, io, mi, ch’, tu, s’, ti, me, m’
| Paradigm quello | Masc | Fem |
|---|---|---|
| Number=Sing|Person=1 | quel, quei, quelli, quello, Quell' | quella |
| Number=Sing | quel, quei, quello, quelli, quell' | quella, Quell' |
| Number=Plur|Person=1 | quei, quelli, que' | quelle |
| Number=Plur | quei, quelli | quelle |
VERB
2401 VERB tokens (14% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2401; 100%), VerbForm=Part (2394; 100%), Number=Sing (1949; 81%), Aspect=Perf (1937; 81%), Person=EMPTY (1887; 79%), Voice=Pass (1703; 71%), Tense=EMPTY (1436; 60%).
VERB tokens may have the following values of Gender:
Fem(615; 26% of non-emptyGender): fatta, veduta, morta, volta, stretta, aperta, aperte, rotta, partita, scioltaMasc(1786; 74% of non-emptyGender): fatto, detto, tratto, messo, giunto, vòlto, venuto, fatti, morti, vintoEMPTY(14539): disse, fa, vidi, veder, vedi, fare, ha, fece, fé, va
| Paradigm fare | Masc | Fem |
|---|---|---|
| Aspect=Perf|Number=Sing|Person=1|Tense=Past | fatto | |
| Aspect=Perf|Number=Sing|Person=1|Tense=Past|Voice=Act | fatt' | |
| Aspect=Perf|Number=Sing|Person=2|Tense=Past | fatto | |
| Aspect=Perf|Number=Sing|Person=2|Tense=Past|Voice=Act | fatto, fatt' | |
| Aspect=Perf|Number=Sing|Person=3|Tense=Past | fatto, fatte | |
| Aspect=Perf|Number=Sing|Person=3|Tense=Past|Voice=Act | fatto, fatte, fatt' | |
| Aspect=Perf|Number=Sing|Person=3|Tense=Past|Voice=Pass | fatto | |
| Aspect=Perf|Number=Sing|Tense=Past | fatto | |
| Aspect=Perf|Number=Sing|Tense=Past|Voice=Pass | fatto | fatta |
| Aspect=Perf|Number=Sing|Voice=Pass | fatto, fatta, fatt', fatte | fatta |
| Aspect=Perf|Number=Plur|Tense=Past|Voice=Pass | fatti | |
| Aspect=Perf|Number=Plur|Voice=Pass | fatte | |
| Number=Sing|Person=1|Tense=Past|Voice=Pass | fatta | |
| Number=Sing|Person=3|Tense=Past | fatto | |
| Number=Sing|Person=3|Tense=Past|Voice=Act | fatto | |
| Number=Sing|Person=3|Tense=Past|Voice=Pass | fatta | |
| Number=Sing|Tense=Fut|Voice=Act | fatturo | |
| Number=Sing|Tense=Past | fatto | fatta |
| Number=Sing|Tense=Past|Voice=Act | fatto | fatt' |
| Number=Sing|Tense=Past|Voice=Pass | fatto | fatta |
| Number=Plur|Tense=Past | fatti | |
| Number=Plur|Tense=Past|Voice=Act | fatti | |
| Number=Plur|Tense=Past|Voice=Pass | fatti | |
| Number=Plur|Voice=Act | fatti |
AUX
48 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (48; 100%), VerbForm=Part (47; 98%), Aspect=Perf (46; 96%), Tense=Past (45; 94%), Number=Sing (42; 88%), Voice=EMPTY (42; 88%), Person=3 (25; 52%).
AUX tokens may have the following values of Gender:
Fem(6; 13% of non-emptyGender): state, dee, stataMasc(42; 88% of non-emptyGender): stato, è, fosse, fossero, fossi, son, potuto, stata, stati, avesseEMPTY(3450): è, fu, era, son, esser, fui, se’, avea, fosse, ha
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing|Person=1|Tense=Past | stato, fossi, son, sono, stati | |
| Number=Sing|Person=3|Tense=Past | stato, è, fosse, fossero, stata, state, stati | stata |
| Number=Sing|Voice=Pass | stato | |
| Number=Plur|Person=1|Tense=Past | fossimo | |
| Number=Plur|Person=2|Tense=Past | state | |
| Number=Plur|Person=3|Tense=Past | state | |
| Number=Plur|Tense=Past | state | |
| Number=Plur|Voice=Pass | state |
ADV
14 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (10; 71%).
ADV tokens may have the following values of Gender:
Masc(14; 100% of non-emptyGender): ben, poco, tosto, ‘ncontro, meno, molto, quanto, secondo, sùbito, tantoEMPTY(10367): non, sì, più, come, poi, così, là, già, qui, tanto
Gender seems to be lexical feature of ADV. 100% lemmas (10) occur only with one value of Gender.
PROPN
1 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): TesoroEMPTY(1874): Dio, Beatrice, Cristo, Virgilio, Maria, Pietro, Roma, Fiorenza, Carlo, Guido
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (10310; 96%),
NOUN –[amod]–> ADJ (2788; 92%),
NOUN –[det:poss]–> DET (1769; 93%),
NOUN –[acl]–> VERB (438; 73%),
NOUN –[conj]–> NOUN (387; 53%),
PRON –[det]–> DET (325; 61%),
ADJ –[conj]–> ADJ (223; 92%),
ADJ –[nsubj]–> NOUN (212; 93%),
ADJ –[det]–> DET (113; 95%),
DET –[det]–> DET (88; 85%).