Treebank Statistics: UD_Italian-ParlaMint: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
8463 tokens (41%) have a non-empty value of Gender.
1959 types (58%) occur at least once with a non-empty value of Gender.
1500 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (4065; 20% instances), DET (2728; 13% instances), ADJ (748; 4% instances), VERB (543; 3% instances), PRON (317; 2% instances), AUX (61; 0% instances), ADP (1; 0% instances).
NOUN
4065 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2723; 67%).
NOUN tokens may have the following values of Gender:
Fem(1761; 43% of non-emptyGender): famiglia, legge, adozione, carceri, comunità, possibilità, relatrice, responsabilità, polizia, parteMasc(2304; 57% of non-emptyGender): affidamento, emendamento, signor, ministro, detenuti, n., Governo, giorno, ordine, emendamentiEMPTY(151): minore, presidente, minori, familiari, fronte, rappresentante, collega, Capigruppo, agenti, grazie
| Paradigm senatore | Masc | Fem |
|---|---|---|
| Number=Sing | senatore | senatrice |
| Number=Plur | senatori | senatrici |
Gender seems to be lexical feature of NOUN. 99% lemmas (928) occur only with one value of Gender.
DET
2728 DET tokens (85% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (2297; 84%), Definite=Def (1976; 72%), Number=Sing (1858; 68%).
DET tokens may have the following values of Gender:
Fem(1154; 42% of non-emptyGender): la, le, una, questa, un’, queste, sua, propria, tutta, delleMasc(1574; 58% of non-emptyGender): il, i, un, gli, questo, lo, questi, tutti, tutto, suoEMPTY(491): l’, loro, ogni, tale, qualche, più, che, quell’, quest’, tal
| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing | il, lo | la |
| Number=Plur | i, gli | le |
ADJ
748 ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (511; 68%).
ADJ tokens may have the following values of Gender:
Fem(355; 47% of non-emptyGender): penitenziaria, affidataria, altre, carceraria, prima, stessa, affettiva, odierna, sanitaria, affidatarieMasc(393; 53% of non-emptyGender): contrario, altri, prolungato, primo, altro, stesso, vero, chiaro, penitenziari, penitenziarioEMPTY(383): possibile, familiare, sociali, presente, bis, generale, verbale, difficile, gravi, nazionale
| Paradigm penitenziario | Masc | Fem |
|---|---|---|
| Number=Sing | penitenziario | penitenziaria |
| Number=Plur | penitenziari | penitenziarie |
VERB
543 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (543; 100%), Person=EMPTY (543; 100%), VerbForm=Part (542; 100%), Tense=Past (540; 99%), Number=Sing (397; 73%).
VERB tokens may have the following values of Gender:
Fem(120; 22% of non-emptyGender): appoggiata, fatta, applicata, avanzata, stata, assunte, avvenute, convocata, costrette, effettuateMasc(423; 78% of non-emptyGender): fatto, presentato, detto, visto, adottato, dato, affidato, previsto, proposto, accadutoEMPTY(1497): ha, è, fare, avere, dire, tratta, garantire, chiedo, far, affrontare
| Paradigm fare | Masc | Fem |
|---|---|---|
| Number=Sing | fatto | fatta |
| Number=Plur | fatti |
PRON
317 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (255; 80%), Clitic=EMPTY (244; 77%), Person=EMPTY (215; 68%).
PRON tokens may have the following values of Gender:
Fem(77; 24% of non-emptyGender): lei, la, le, quella, questa, quelle, una, queste, altra, ellaMasc(240; 76% of non-emptyGender): lo, questo, quello, quanto, ciò, altro, altri, tutto, li, tuttiEMPTY(822): che, si, ci, cui, mi, chi, c’, noi, lei, io
| Paradigm questo | Masc | Fem |
|---|---|---|
| Number=Sing | questo | questa |
| Number=Plur | questi | queste |
AUX
61 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (61; 100%), Person=EMPTY (61; 100%), Tense=Past (61; 100%), VerbForm=Part (61; 100%), Number=Sing (36; 59%).
AUX tokens may have the following values of Gender:
Fem(19; 31% of non-emptyGender): state, stataMasc(42; 69% of non-emptyGender): stato, stati, dovuto, potutoEMPTY(925): è, sono, essere, ha, deve, hanno, ho, può, abbiamo, sia
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata |
| Number=Plur | stati | state |
ADP
1 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): finEMPTY(3064): di, in, a, per, da, con, su, ad, come, tra
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2510; 82%),
NOUN –[amod]–> ADJ (594; 69%),
NOUN –[compound]–> NOUN (59; 70%),
VERB –[conj]–> VERB (45; 58%),
ADJ –[conj]–> ADJ (30; 65%),
NOUN –[nsubj]–> NOUN (17; 74%),
NOUN –[appos]–> NOUN (14; 67%),
NOUN –[conj]–> PRON (8; 57%),
ADJ –[det]–> DET (7; 64%),
PRON –[acl]–> NOUN (7; 88%).