Treebank Statistics: UD_Italian-Valico: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
2860 tokens (42%) have a non-empty value of Gender.
732 types (55%) occur at least once with a non-empty value of Gender.
587 lemmas (61%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (1012; 15% instances), DET (1008; 15% instances), VERB (384; 6% instances), ADJ (240; 4% instances), PRON (194; 3% instances), AUX (20; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances).
NOUN
1012 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (878; 87%).
NOUN tokens may have the following values of Gender:
Fem(399; 39% of non-emptyGender): donna, ragazza, terra, spalle, borsa, città, spalla, cosa, situazione, casaMasc(613; 61% of non-emptyGender): uomo, parco, ragazzo, giornale, amore, momento, banco, giorno, marito, fidanzatoEMPTY(18): amante, delinquente, sol, OCCHIALI, Parco, X, affari, diem, discaount, giornale
| Paradigm amico | Masc | Fem |
|---|---|---|
| Number=Sing | amico | amica |
| Number=Plur | amici |
Gender seems to be lexical feature of NOUN. 98% lemmas (286) occur only with one value of Gender.
DET
1008 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (919; 91%), Poss=EMPTY (905; 90%), PronType=Art (830; 82%), Definite=Def (605; 60%).
DET tokens may have the following values of Gender:
Fem(387; 38% of non-emptyGender): la, una, le, sua, un’, l’, mia, questa, altra, delleMasc(621; 62% of non-emptyGender): il, un, l’, suo, i, mio, questo, altro, lo, gliEMPTY(15): che, ogni, qualche, alcun, loro, tutti
| Paradigm suo | Masc | Fem |
|---|---|---|
| Number=Sing|Poss=Yes | suo | sua |
| Number=Plur|Poss=Yes | suoi | sue, suoe |
| Number=Plur | sui |
VERB
384 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (384; 100%), Person=EMPTY (384; 100%), Tense=Past (384; 100%), VerbForm=Part (384; 100%), Number=Sing (380; 99%).
VERB tokens may have the following values of Gender:
Fem(31; 8% of non-emptyGender): andata, salvata, arrivata, alzata, arrabbiata, arrabiata, caduta, chiamata, chiusa, comprataMasc(353; 92% of non-emptyGender): detto, visto, fatto, pensato, sentito, seduto, cominciato, andato, gridato, salvatoEMPTY(584): era, portava, aveva, leggendo, sembrava, leggeva, fare, andare, gridava, leggere
| Paradigm vedere | Masc | Fem |
|---|---|---|
| visto, veduto | vista |
ADJ
240 ADJ tokens (78% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (218; 91%).
ADJ tokens may have the following values of Gender:
Fem(89; 37% of non-emptyGender): bella, contenta, bionda, arrabiata, strana, arraviata, brutta, cattiva, destra, furiosaMasc(151; 63% of non-emptyGender): brutto, simpatico, bel, bello, carino, muscoloso, timido, improviso, nuovo, robustoEMPTY(69): grande, forte, felice, giovane, gentile, normale, incredibile, interessante, triste, SPLENDENTE
| Paradigm brutto | Masc | Fem |
|---|---|---|
| brutto | brutta |
PRON
194 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (172; 89%), PronType=Prs (149; 77%), Person=3 (144; 74%), Clitic=Yes (99; 51%).
PRON tokens may have the following values of Gender:
Fem(80; 41% of non-emptyGender): la, lei, le, l’, questa, quella, altre, molte, queste, unaMasc(114; 59% of non-emptyGender): lui, lo, l’, gli, quello, li, questo, nessuno, tutti, entrambiEMPTY(315): che, mi, si, me, niente, c’, io, qualcosa, se, ne
| Paradigm quello | Masc | Fem |
|---|---|---|
| quello | quella |
AUX
20 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (20; 100%), Person=EMPTY (20; 100%), Tense=Past (20; 100%), VerbForm=Part (20; 100%), Number=Sing (19; 95%).
AUX tokens may have the following values of Gender:
Fem(3; 15% of non-emptyGender): stataMasc(17; 85% of non-emptyGender): stato, dovuto, potuto, stati, viuto, volutoEMPTY(572): ha, ho, era, è, sono, aveva, stava, ero, avevo, essere
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata |
| Number=Plur | stati |
ADV
1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (1; 100%).
ADV tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): perEMPTY(390): non, molto, Ieri, poi, come, più, anche, così, invece, subito
NUM
1 NUM tokens (6% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (1; 100%).
NUM tokens may have the following values of Gender:
Fem(1; 100% of non-emptyGender): diecisetteEMPTY(17): due, 1, 10, 28, 30, 700, 9, 95, 99, catordieci
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (847; 96%),
NOUN –[amod]–> ADJ (129; 76%),
NOUN –[det:poss]–> DET (95; 93%),
VERB –[conj]–> VERB (60; 51%),
ADJ –[nsubj]–> NOUN (20; 69%),
ADJ –[conj]–> ADJ (15; 60%),
NOUN –[nsubj]–> NOUN (8; 73%),
ADJ –[det]–> DET (7; 100%),
VERB –[ccomp]–> NOUN (6; 67%),
VERB –[obl]–> ADJ (6; 67%).