Treebank Statistics: UD_Italian-PUD: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
12701 tokens (54%) have a non-empty value of Gender.
4689 types (77%) occur at least once with a non-empty value of Gender.
3977 lemmas (85%) occur at least once with a non-empty value of Gender.
The feature is used with 11 part-of-speech tags: NOUN (4384; 18% instances), DET (3783; 16% instances), PROPN (1739; 7% instances), ADJ (1599; 7% instances), PRON (609; 3% instances), VERB (449; 2% instances), AUX (60; 0% instances), ADP (54; 0% instances), NUM (20; 0% instances), ADV (2; 0% instances), CCONJ (2; 0% instances).
NOUN
4384 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3086; 70%).
NOUN tokens may have the following values of Gender:
Fem(1897; 43% of non-emptyGender): parte, città, persone, volta, guerra, vita, popolazione, regione, storia, crescitaMasc(2487; 57% of non-emptyGender): anni, anno, governo, secolo, stato, tempo, giorno, mondo, numero, periodoEMPTY(8): fronte, Più, Qualunque, chiunque, milione, post, verificar
| Paradigm fine | Masc | Fem |
|---|---|---|
| Number=Sing | fine | fine |
| Number=Plur | fini |
Gender seems to be lexical feature of NOUN. 98% lemmas (1785) occur only with one value of Gender.
DET
3783 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3492; 92%), Number=Sing (2838; 75%), Definite=EMPTY (2308; 61%).
DET tokens may have the following values of Gender:
Fem(1600; 42% of non-emptyGender): la, le, l’, una, un’, questa, altre, molte, queste, diverseMasc(2183; 58% of non-emptyGender): il, i, un, l’, gli, lo, questo, ciò, molti, altriEMPTY(2): Ciò, l’
| Paradigm il | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Sing | il, l', lo | la, l' |
| Definite=Def|Number=Plur | i, gli | le |
| Number=Sing | il, l', lo | la, l' |
| Number=Plur | i, gli | le, la |
PROPN
1739 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1598; 92%).
PROPN tokens may have the following values of Gender:
Fem(777; 45% of non-emptyGender): Cina, Francia, guerra, Europa, Gran, Hong, Italia, Kong, Russia, AlbaniaMasc(962; 55% of non-emptyGender): Mar, Stati, Uniti, Trump, Mediterraneo, Nord, Regno, Caraibi, Donald, JosephEMPTY(17): Doss, Target, ETA, Income, Mailis, Michael, Multi, Reach, Return, St.
| Paradigm Trump | Masc | Fem |
|---|---|---|
| Trump | Trump |
Gender seems to be lexical feature of PROPN. 97% lemmas (1161) occur only with one value of Gender.
ADJ
1599 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1112; 70%).
ADJ tokens may have the following values of Gender:
Fem(689; 43% of non-emptyGender): prima, maggior, gran, grande, alta, maggiore, meridionale, americana, nuova, secondaMasc(910; 57% of non-emptyGender): ultimi, nuovi, nuovo, stesso, primo, tutti, tutto, grande, scorso, ultimoEMPTY(16): ex, evidente, probabile, nord, poca, possibile, post, presente, significativamente, soddisfatta
| Paradigm nuovo | Masc | Fem |
|---|---|---|
| Number=Sing | nuovo | nuova |
| Number=Plur | nuovi | nuove |
PRON
609 PRON tokens (71% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (525; 86%), Number=Sing (439; 72%), Number[psor]=EMPTY (384; 63%), PronType=EMPTY (368; 60%).
PRON tokens may have the following values of Gender:
Fem(249; 41% of non-emptyGender): che, sua, loro, sue, cui, quella, propria, le, la, leiMasc(360; 59% of non-emptyGender): che, suo, lo, loro, questo, cui, suoi, gli, lui, qualiEMPTY(252): si, ci, c’, mi, ne, chi, me, noi, se, vi
| Paradigm che | Masc | Fem |
|---|---|---|
| Number=Sing | che | che |
| Number=Plur | che | che |
VERB
449 VERB tokens (22% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Aspect=EMPTY (449; 100%), Mood=EMPTY (449; 100%), Person=EMPTY (449; 100%), Tense=Past (424; 94%), Number=Sing (287; 64%), Voice=EMPTY (282; 63%).
VERB tokens may have the following values of Gender:
Fem(179; 40% of non-emptyGender): considerate, diventata, usata, basata, cresciute, lasciata, pubblicate, riguardanti, utilizzata, considerataMasc(270; 60% of non-emptyGender): fatto, utilizzato, inclusi, stato, accusato, detto, diretto, fondato, pubblicato, seguitiEMPTY(1605): affermato, avere, ha, afferma, aveva, iniziò, sono, detto, far, hanno
| Paradigm fare | Masc | Fem |
|---|---|---|
| Number=Sing | fatto | |
| Number=Sing|Tense=Past | fatto | fatta |
| Number=Sing|Tense=Past|Voice=Pass | fatto | |
| Number=Plur|Tense=Past | fatte |
AUX
60 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=EMPTY (60; 100%), Mood=EMPTY (60; 100%), Person=EMPTY (60; 100%), Tense=Past (60; 100%), Voice=EMPTY (60; 100%), Number=Sing (45; 75%).
AUX tokens may have the following values of Gender:
Fem(21; 35% of non-emptyGender): stata, stateMasc(39; 65% of non-emptyGender): stato, statiEMPTY(925): è, ha, sono, era, fu, essere, hanno, venne, può, erano
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata |
| Number=Plur | stati | state |
ADP
54 ADP tokens (1% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem(14; 26% of non-emptyGender): alla, della, alle, dalla, dell’, nellaMasc(40; 74% of non-emptyGender): al, del, dell’, all’, Da, Sulla, ai, allo, dagli, daiEMPTY(3754): di, in, a, per, da, con, su, come, tra, dopo
| Paradigm di | Masc | Fem |
|---|---|---|
| del, dell', dello | della, dell' |
NUM
20 NUM tokens (5% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem(2; 10% of non-emptyGender): unaMasc(18; 90% of non-emptyGender): uno, unEMPTY(424): due, tre, 10, milioni, 1, quattro, 3, sei, 20, 2014
| Paradigm uno | Masc | Fem |
|---|---|---|
| uno, un | una |
ADV
2 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (2; 100%).
ADV tokens may have the following values of Gender:
Masc(2; 100% of non-emptyGender): piùEMPTY(847): non, più, anche, Tuttavia, solo, ancora, inoltre, dove, già, prima
CCONJ
2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.
CCONJ tokens may have the following values of Gender:
Fem(1; 50% of non-emptyGender): cheMasc(1; 50% of non-emptyGender): cheEMPTY(580): e, ma, o, ed, sia, che, mentre, quindi, né, oppure
| Paradigm che | Masc | Fem |
|---|---|---|
| che | che |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3023; 100%),
NOUN –[amod]–> ADJ (1219; 99%),
PROPN –[det]–> DET (516; 100%),
PROPN –[flat]–> PROPN (401; 99%),
NOUN –[nmod]–> PROPN (276; 61%),
NOUN –[det:poss]–> PRON (225; 99%),
NOUN –[conj]–> NOUN (141; 59%),
PROPN –[amod]–> ADJ (130; 99%),
NOUN –[acl]–> VERB (124; 66%),
VERB –[nsubj:pass]–> NOUN (107; 94%).