Treebank Statistics: UD_Italian-ParTUT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
24141 tokens (43%) have a non-empty value of Gender.
4787 types (57%) occur at least once with a non-empty value of Gender.
3451 lemmas (61%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (11176; 20% instances), DET (8168; 15% instances), ADJ (2538; 5% instances), VERB (1448; 3% instances), PRON (652; 1% instances), AUX (157; 0% instances), ADP (1; 0% instances), PROPN (1; 0% instances).
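Counts of this kind can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that produced this page: the helper names `parse_feats` and `gender_by_upos` and the two-token `SAMPLE` are hypothetical, and it assumes the standard 10-column CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_by_upos(conllu_text):
    """Count, per UPOS tag, the tokens that carry a non-empty Gender value."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if "Gender" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts

# Hypothetical two-token sample, made up for illustration (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = la società",
    "1\tla\til\tDET\tRD\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tsocietà\tsocietà\tNOUN\tS\tGender=Fem\t0\troot\t_\t_",
])

print(gender_by_upos(SAMPLE))
```

Run over the full treebank files, the same loop would yield per-tag totals comparable to the ones listed above, provided the file contents match the release these statistics were computed from.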
NOUN
11176 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (7231; 65%).
NOUN tokens may have the following values of Gender:
Fem (5373; 48% of non-empty Gender): società, commissione, parte, opera, opere, vita, attività, sicurezza, crescita, licenza
Masc (5803; 52% of non-empty Gender): anni, lavoro, programma, euro, parlamento, membri, modo, paesi, diritto, stati
EMPTY (341): presidente, onorevole, account, commissario, rappresentanti, e-mail, partecipanti, grazie, password, fine
| Paradigm signore | Masc | Fem |
|---|---|---|
| | signor | signora |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (2221) occur with only one value of Gender.
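The "lexical feature" test amounts to asking how many lemmas are ever seen with more than one Gender value. A minimal sketch of that check follows; the function name `single_value_share` and the sample pairs are hypothetical, and the exact criterion used to generate this page is an assumption.

```python
from collections import defaultdict

def single_value_share(pairs):
    """Given (lemma, gender) pairs for tokens of one part of speech that have
    a non-empty Gender, return (lemmas seen with exactly one value, all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical sample: "signore" occurs with both values, the other lemmas with one.
pairs = [
    ("società", "Fem"),
    ("anno", "Masc"),
    ("anno", "Masc"),
    ("signore", "Masc"),
    ("signore", "Fem"),
]
single, total = single_value_share(pairs)
print(f"{single} of {total} lemmas occur with only one Gender value")
```

A share close to 100% suggests the value is fixed per lemma (lexical); a low share suggests the value varies with the word form (inflectional), as with determiners and adjectives.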
DET
8168 DET tokens (86% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (7123; 87%), Definite=Def (6177; 76%), Number=Sing (5573; 68%).
DET tokens may have the following values of Gender:
Fem (3571; 44% of non-empty Gender): la, le, una, sua, un’, questa, sue, queste, alcuna, tutte
Masc (4597; 56% of non-empty Gender): il, i, un, gli, suo, lo, questo, tutti, alcuni, suoi
EMPTY (1351): l’, ogni, loro, tale, tali, qualsiasi, più, tal, cui, qualche
| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing | il, lo, l' | la |
| Number=Plur | i, gli | le |
ADJ
2538 ADJ tokens (60% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1583; 62%).
ADJ tokens may have the following values of Gender:
Fem (1137; 45% of non-empty Gender): economica, prima, relative, altre, pericolose, stessa, nuova, nuove, altra, direttrici
Masc (1401; 55% of non-empty Gender): altri, europeo, primo, nuovo, stesso, finanziario, altro, nuovi, necessario, relativi
EMPTY (1670): presente, sociale, importante, maggiore, possibile, grande, strutturali, intellettuale, principali, teatrali
| Paradigm altro | Masc | Fem |
|---|---|---|
| Number=Sing | altro | altra |
| Number=Plur | altri | altre |
VERB
1448 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1448; 100%), Person=EMPTY (1448; 100%), Tense=Past (1448; 100%), VerbForm=Part (1448; 100%), Number=Sing (961; 66%).
VERB tokens may have the following values of Gender:
Fem (486; 34% of non-empty Gender): data, presentata, concessa, pubblicate, applicate, considerata, messe, adottate, armonizzate, modificata
Masc (962; 66% of non-empty Gender): considerato, fatto, dato, avuto, visto, svolto, detto, portato, previsto, scritto
EMPTY (3276): ha, è, hanno, scrisse, far, fare, garantire, migliorare, creare, rappresenta
| Paradigm avere | Masc | Fem |
|---|---|---|
| | avuto | avuta |
PRON
652 PRON tokens (36% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (526; 81%), Number=Sing (472; 72%), Person=EMPTY (432; 66%).
PRON tokens may have the following values of Gender:
Fem (168; 26% of non-empty Gender): la, quella, le, questa, lei, una, essa, esse, molte, quelle
Masc (484; 74% of non-empty Gender): lo, ciò, quanto, quello, altri, uno, questo, tutti, tutto, alcuni
EMPTY (1146): che, si, cui, ci, ne, mi, vi, c’, noi, quale
| Paradigm quello | Masc | Fem |
|---|---|---|
| Number=Sing | quello, quel | quella |
| Number=Plur | quelli | quelle |
AUX
157 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (157; 100%), Person=EMPTY (157; 100%), Tense=Past (157; 100%), VerbForm=Part (157; 100%), Number=Sing (111; 71%).
AUX tokens may have the following values of Gender:
Fem (46; 29% of non-empty Gender): stata, state, andata, potuta
Masc (111; 71% of non-empty Gender): stato, stati, potuto, dovuto, andato, voluto
EMPTY (1887): è, sono, essere, ha, era, hanno, sia, può, fu, possono
| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata |
| Number=Plur | stati | state |
ADP
1 ADP token (0% of all ADP tokens) has a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): du
EMPTY (9012): di, a, in, per, da, su, con, come, ad, tra
PROPN
1 PROPN token (0% of all PROPN tokens) has a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): hye
EMPTY (2033): Shakespeare, Balzac, Facebook, Europa, Ucraina, Pericle, Stati, Uniti, Europea, Unione
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (6716; 84%),
NOUN –[amod]–> ADJ (2045; 60%),
NOUN –[conj]–> NOUN (486; 55%),
NOUN –[det:poss]–> DET (470; 85%),
NOUN –[acl]–> VERB (421; 57%),
VERB –[nsubj:pass]–> NOUN (327; 96%),
NOUN –[det:predet]–> DET (83; 98%),
ADJ –[conj]–> ADJ (81; 54%),
PRON –[nmod]–> NOUN (80; 73%),
NOUN –[nsubj]–> NOUN (61; 54%).
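Agreement counts like those listed above can be sketched by walking each dependency edge and comparing the Gender of parent and child. The code below is an illustrative assumption rather than the generator's actual logic: the name `agreement_counts`, the token dictionaries, and the sample sentence are hypothetical, and it only counts agreeing edges because the denominator behind the percentages is not stated on this page.

```python
from collections import Counter

def agreement_counts(sentence):
    """Count edges whose parent and child share the same non-empty Gender.

    `sentence` is a list of dicts with keys id, head, upos, deprel and gender
    (gender is None when the feature is empty). Keys of the result are
    (parent UPOS, relation, child UPOS) triples."""
    by_id = {tok["id"]: tok for tok in sentence}
    agree = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root token has head 0 and no parent
            continue
        if child["gender"] and child["gender"] == parent["gender"]:
            agree[(parent["upos"], child["deprel"], child["upos"])] += 1
    return agree

# Hypothetical sentence "la nuova società", annotated by hand for illustration.
sentence = [
    {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
]
for (parent, rel, child), n in agreement_counts(sentence).items():
    print(f"{parent} -[{rel}]-> {child}: {n}")
```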