Treebank Statistics: UD_Italian-VIT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
116577 tokens (42%) have a non-empty value of Gender.
12735 types (54%) occur at least once with a non-empty value of Gender.
8505 lemmas (54%) occur at least once with a non-empty value of Gender.
The feature is used with 14 part-of-speech tags: NOUN (54785; 20% instances), DET (37731; 13% instances), ADJ (12481; 4% instances), VERB (7838; 3% instances), PRON (2808; 1% instances), AUX (668; 0% instances), ADV (147; 0% instances), NUM (98; 0% instances), X (8; 0% instances), ADP (6; 0% instances), CCONJ (4; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances).
NOUN
54785 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (34650; 63%).
NOUN tokens may have the following values of Gender:
Fem(24082; 44% of non-emptyGender): società, attività, parte, legge, titolarità, provincia, città, sede, domanda, gestioneMasc(30703; 56% of non-emptyGender): anni, miliardi, anno, posti, presidente, punto, governo, stato, gruppo, lavoroEMPTY(3098): n., art., insegnanti, dpr, mila, a, b, docenti, dl, via
| Paradigm fine | Masc | Fem |
|---|---|---|
| Number=Sing | fine | fine, fin |
| Number=Plur | fini |
Gender seems to be lexical feature of NOUN. 99% lemmas (5507) occur only with one value of Gender.
DET
37731 DET tokens (86% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (34993; 93%), Definite=Def (30878; 82%), Number=Sing (25954; 69%).
DET tokens may have the following values of Gender:
Fem(15861; 42% of non-emptyGender): la, le, una, un’, sua, questa, tutte, queste, sue, quellaMasc(21870; 58% of non-emptyGender): il, i, un, gli, lo, questo, suo, tutti, questi, unoEMPTY(6173): l’, loro, ogni, tale, il, qualche, tali, che, quest’, cui
| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing | il, lo, gli, i, l' | la |
| Number=Plur | i, gli, il | le, il, i |
ADJ
12481 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7827; 63%).
ADJ tokens may have the following values of Gender:
Fem(5638; 45% of non-emptyGender): altre, nuova, italiana, altra, nuove, politica, stessa, pubblica, economica, politicheMasc(6843; 55% of non-emptyGender): altri, nuovo, economico, stesso, nuovi, scorso, altro, finanziario, ultimo, italianoEMPTY(7693): precedente, primo, grande, presente, ex, netto, generale, grandi, nazionale, sociale
| Paradigm altro | Masc | Fem |
|---|---|---|
| Number=Sing | altro | altra |
| Number=Sing|PronType=Dem | altro | |
| Number=Sing|PronType=Ind | altro | altra |
| Number=Plur | altri | altre |
VERB
7838 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7838; 100%), Person=EMPTY (7838; 100%), Tense=Past (7777; 99%), VerbForm=Part (7777; 99%), Number=Sing (5635; 72%).
VERB tokens may have the following values of Gender:
Fem(2293; 29% of non-emptyGender): prevista, indicate, presentata, comprese, effettuata, fatta, richiesta, data, previste, richiesteMasc(5545; 71% of non-emptyGender): fatto, detto, approvato, previsto, avuto, previsti, deciso, ottenuto, visto, datoEMPTY(13557): è, ha, fare, fa, far, hanno, dice, sono, avere, scade
| Paradigm fare | Masc | Fem |
|---|---|---|
| Number=Sing | fatto | fatta |
| Number=Plur | fatti | fatte |
PRON
2808 PRON tokens (29% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (2208; 79%), Number=Sing (2038; 73%), Person=EMPTY (1980; 71%).
PRON tokens may have the following values of Gender:
Fem(716; 25% of non-emptyGender): quella, la, quelle, le, una, essa, questa, queste, altra, esseMasc(2092; 75% of non-emptyGender): lo, quello, quale, quelli, quanto, questo, tutti, gli, li, altroEMPTY(6986): che, si, cui, ci, c’, ne, mi, dove, chi, quali
| Paradigm quello | Masc | Fem |
|---|---|---|
| Number=Sing | quello, quel | quella |
| Number=Plur | quelli | quelle |
AUX
668 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (668; 100%), Person=EMPTY (668; 100%), Tense=Past (667; 100%), VerbForm=Part (667; 100%), Number=Sing (504; 75%).
AUX tokens may have the following values of Gender:
Fem(203; 30% of non-emptyGender): stata, state, dovuta, fatta, volutaMasc(465; 70% of non-emptyGender): stato, stati, potuto, dovuto, voluto, fatto, dovuti, essere, volutiEMPTY(8754): è, ha, sono, essere, hanno, era, sarà, deve, può, sia
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing|PronType=Rel|Tense=Past|VerbForm=Part | stata | |
| Number=Sing|Tense=Past|VerbForm=Part | stato | stata |
| Number=Plur | essere | |
| Number=Plur|Tense=Past|VerbForm=Part | stati | state |
ADV
147 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (145; 99%).
ADV tokens may have the following values of Gender:
Fem(43; 29% of non-emptyGender): estremamente, inizialmente, costantemente, normalmente, celermente, contrariamente, lungamente, solamente, una, MolteMasc(104; 71% of non-emptyGender): volta, molto, poco, fa, lungo, troppo, no, seguito, casual, dietroEMPTY(10599): non, più, anche, solo, così, già, ancora, ieri, poi, sempre
| Paradigm molto | Masc | Fem |
|---|---|---|
| Number=Sing | molto | |
| Number=Plur | Molte |
Gender seems to be lexical feature of ADV. 97% lemmas (65) occur only with one value of Gender.
NUM
98 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (98; 100%).
NUM tokens may have the following values of Gender:
Fem(35; 36% of non-emptyGender): un’, terza, una, mezzaMasc(63; 64% of non-emptyGender): miliardi, milioni, un, primi, terzi, bis, rientro, unoEMPTY(6295): due, tre, cento, 15, 1, 5, 1973, 2, 20, 30
| Paradigm uno | Masc | Fem |
|---|---|---|
| un, uno | un', una |
X
8 X tokens (2% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (8; 100%).
X tokens may have the following values of Gender:
Fem(3; 38% of non-emptyGender): area, deregulation, mountainMasc(5; 63% of non-emptyGender): local, network, personal, show, word-processingEMPTY(384): joint, venture, station, work, baby, cd, sitter, computer, condicio, par
ADP
6 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Masc(6; 100% of non-emptyGender): per, Salvo, dietro, ne, nienteEMPTY(45569): di, a, in, per, da, con, su, tra, ad, come
CCONJ
4 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.
CCONJ tokens may have the following values of Gender:
Fem(1; 25% of non-emptyGender): essaMasc(3; 75% of non-emptyGender): altro, caso, quantoEMPTY(8175): e, ma, o, ed, sia, come, che, cioè, ovvero, nonché
INTJ
1 INTJ tokens (1% of all INTJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which INTJ and Gender co-occurred: Polarity=EMPTY (1; 100%).
INTJ tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): okEMPTY(77): no, sì, basta, Bè, Ecco, addio, avanti, macché, oh, Beh
PUNCT
1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.
PUNCT tokens may have the following values of Gender:
Fem(1; 100% of non-emptyGender): leEMPTY(31584): ,, ., “, ), -, (, :, ?, ;, «/em>
SCONJ
1 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.
SCONJ tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): addebitatiEMPTY(2213): che, se, perché, quando, mentre, come, qualora, poiché, affinché, ove
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (30363; 83%),
NOUN –[amod]–> ADJ (9797; 61%),
NOUN –[conj]–> NOUN (2595; 58%),
NOUN –[advcl]–> VERB (1297; 76%),
VERB –[nsubj:pass]–> NOUN (1076; 93%),
NOUN –[det:poss]–> DET (960; 77%),
VERB –[conj]–> VERB (338; 51%),
NOUN –[det:predet]–> DET (333; 95%),
NOUN –[nsubj]–> NOUN (162; 53%),
ADJ –[amod]–> ADJ (136; 56%).