Treebank Statistics: UD_Italian-ParTUT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
24141 tokens (43%) have a non-empty value of Gender
.
4787 types (57%) occur at least once with a non-empty value of Gender
.
3451 lemmas (61%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (11176; 20% instances), DET (8168; 15% instances), ADJ (2538; 5% instances), VERB (1448; 3% instances), PRON (652; 1% instances), AUX (157; 0% instances), ADP (1; 0% instances), PROPN (1; 0% instances).
NOUN
11176 NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (7231; 65%).
NOUN
tokens may have the following values of Gender
:
Fem
(5373; 48% of non-emptyGender
): società, commissione, parte, opera, opere, vita, attività, sicurezza, crescita, licenzaMasc
(5803; 52% of non-emptyGender
): anni, lavoro, programma, euro, parlamento, membri, modo, paesi, diritto, statiEMPTY
(341): presidente, onorevole, account, commissario, rappresentanti, e-mail, partecipanti, grazie, password, fine
Paradigm signore | Masc | Fem |
---|---|---|
signor | signora |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (2221) occur only with one value of Gender
.
DET
8168 DET tokens (86% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (7123; 87%), Definite=Def (6177; 76%), Number=Sing (5573; 68%).
DET
tokens may have the following values of Gender
:
Fem
(3571; 44% of non-emptyGender
): la, le, una, sua, un’, questa, sue, queste, alcuna, tutteMasc
(4597; 56% of non-emptyGender
): il, i, un, gli, suo, lo, questo, tutti, alcuni, suoiEMPTY
(1351): l’, ogni, loro, tale, tali, qualsiasi, più, tal, cui, qualche
Paradigm il | Masc | Fem |
---|---|---|
Number=Sing | il, lo, l' | la |
Number=Plur | i, gli | le |
ADJ
2538 ADJ tokens (60% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1583; 62%).
ADJ
tokens may have the following values of Gender
:
Fem
(1137; 45% of non-emptyGender
): economica, prima, relative, altre, pericolose, stessa, nuova, nuove, altra, direttriciMasc
(1401; 55% of non-emptyGender
): altri, europeo, primo, nuovo, stesso, finanziario, altro, nuovi, necessario, relativiEMPTY
(1670): presente, sociale, importante, maggiore, possibile, grande, strutturali, intellettuale, principali, teatrali
Paradigm altro | Masc | Fem |
---|---|---|
Number=Sing | altro | altra |
Number=Plur | altri | altre |
VERB
1448 VERB tokens (31% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (1448; 100%), Person=EMPTY (1448; 100%), Tense=Past (1448; 100%), VerbForm=Part (1448; 100%), Number=Sing (961; 66%).
VERB
tokens may have the following values of Gender
:
Fem
(486; 34% of non-emptyGender
): data, presentata, concessa, pubblicate, applicate, considerata, messe, adottate, armonizzate, modificataMasc
(962; 66% of non-emptyGender
): considerato, fatto, dato, avuto, visto, svolto, detto, portato, previsto, scrittoEMPTY
(3276): ha, è, hanno, scrisse, far, fare, garantire, migliorare, creare, rappresenta
Paradigm avere | Masc | Fem |
---|---|---|
avuto | avuta |
PRON
652 PRON tokens (36% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Clitic=EMPTY (526; 81%), Number=Sing (472; 72%), Person=EMPTY (432; 66%).
PRON
tokens may have the following values of Gender
:
Fem
(168; 26% of non-emptyGender
): la, quella, le, questa, lei, una, essa, esse, molte, quelleMasc
(484; 74% of non-emptyGender
): lo, ciò, quanto, quello, altri, uno, questo, tutti, tutto, alcuniEMPTY
(1146): che, si, cui, ci, ne, mi, vi, c’, noi, quale
Paradigm quello | Masc | Fem |
---|---|---|
Number=Sing | quello, quel | quella |
Number=Plur | quelli | quelle |
AUX
157 AUX tokens (8% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (157; 100%), Person=EMPTY (157; 100%), Tense=Past (157; 100%), VerbForm=Part (157; 100%), Number=Sing (111; 71%).
AUX
tokens may have the following values of Gender
:
Fem
(46; 29% of non-emptyGender
): stata, state, andata, potutaMasc
(111; 71% of non-emptyGender
): stato, stati, potuto, dovuto, andato, volutoEMPTY
(1887): è, sono, essere, ha, era, hanno, sia, può, fu, possono
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing | stato | stata |
Number=Plur | stati | state |
ADP
1 ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
ADP
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): duEMPTY
(9012): di, a, in, per, da, su, con, come, ad, tra
PROPN
1 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): hyeEMPTY
(2033): Shakespeare, Balzac, Facebook, Europa, Ucraina, Pericle, Stati, Uniti, Europea, Unione
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (6716; 84%),
NOUN –[amod]–> ADJ (2045; 60%),
NOUN –[conj]–> NOUN (486; 55%),
NOUN –[det:poss]–> DET (470; 85%),
NOUN –[acl]–> VERB (421; 57%),
VERB –[nsubj:pass]–> NOUN (327; 96%),
NOUN –[det:predet]–> DET (83; 98%),
ADJ –[conj]–> ADJ (81; 54%),
PRON –[nmod]–> NOUN (80; 73%),
NOUN –[nsubj]–> NOUN (61; 54%).