Treebank Statistics: UD_Italian-VIT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
116420 tokens (42%) have a non-empty value of Gender.
12733 types (54%) occur at least once with a non-empty value of Gender.
8515 lemmas (54%) occur at least once with a non-empty value of Gender.
The feature is used with 14 part-of-speech tags: NOUN (54622; 19% instances), DET (37733; 13% instances), ADJ (12473; 4% instances), VERB (7833; 3% instances), PRON (2754; 1% instances), AUX (668; 0% instances), ADV (156; 0% instances), NUM (98; 0% instances), X (59; 0% instances), ADP (8; 0% instances), SCONJ (8; 0% instances), CCONJ (6; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances).
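Counts of this kind can be reproduced from a CoNLL-U file by reading the FEATS column. The page does not say how its figures were computed, so the following is only a minimal sketch: the sample sentence is hand-made, the function names `parse_feats` and `gender_stats` are hypothetical, and treating types as case-sensitive word forms is an assumption.

```python
from collections import Counter

# Hand-made CoNLL-U sample for illustration; not taken from UD_Italian-VIT.
SAMPLE = (
    "# text = La società cresce.\n"
    "1\tLa\til\tDET\tRD\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_\n"
    "2\tsocietà\tsocietà\tNOUN\tS\tGender=Fem\t3\tnsubj\t_\t_\n"
    "3\tcresce\tcrescere\tVERB\tV\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_\n"
    "4\t.\t.\tPUNCT\tFS\t_\t3\tpunct\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_stats(text):
    """Count tokens, types and lemmas that carry a non-empty Gender value."""
    total = 0
    with_gender = 0
    values = Counter()   # Gender value -> token count
    by_upos = Counter()  # UPOS tag -> token count with Gender
    types = set()        # word forms seen with Gender (case-sensitive: an assumption)
    lemmas = set()       # lemmas seen with Gender
    for line in text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        gender = parse_feats(cols[5]).get("Gender")
        if gender is None:
            continue
        with_gender += 1
        values[gender] += 1
        by_upos[cols[3]] += 1
        types.add(cols[1])
        lemmas.add(cols[2])
    return {
        "total": total,
        "with_gender": with_gender,
        "values": values,
        "by_upos": by_upos,
        "types": len(types),
        "lemmas": len(lemmas),
    }


stats = gender_stats(SAMPLE)
share = 100 * stats["with_gender"] / stats["total"]
print(f"{stats['with_gender']} tokens ({share:.0f}%) have a non-empty value of Gender")
```

To run this on a real treebank, the sample string would be replaced by the contents of the treebank's `.conllu` files.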
NOUN
54622 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (34459; 63%).
NOUN tokens may have the following values of Gender:
Fem (24061; 44% of non-empty Gender): società, attività, parte, legge, titolarità, provincia, città, sede, domanda, gestione
Masc (30561; 56% of non-empty Gender): anni, miliardi, anno, posti, presidente, punto, governo, stato, gruppo, lavoro
EMPTY (3125): n., art., insegnanti, dpr, a, mila, b, docenti, dl, via
Paradigm fine | Masc | Fem |
---|---|---|
Number=Sing | fine | fine, fin |
Number=Plur | fini |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (5507) occur with only one value of Gender.
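The "lexical feature" observation rests on the share of lemmas that only ever appear with a single Gender value. A minimal sketch of that check follows, assuming the input is a list of (lemma, Gender) pairs already extracted from the treebank. The function name `single_value_lemma_share` and the sample pairs are hypothetical, and the page does not state exactly how its percentage was derived.

```python
from collections import defaultdict


def single_value_lemma_share(pairs):
    """Return (single, total, share) for lemmas seen with a non-empty Gender.

    pairs: iterable of (lemma, gender) tuples; gender may be None or "".
    """
    values_by_lemma = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens with an empty Gender
            values_by_lemma[lemma].add(gender)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    share = single / total if total else 0.0
    return single, total, share


# Illustrative pairs only; "fine" is listed above with both Masc and Fem forms.
pairs = [
    ("fine", "Masc"),
    ("fine", "Fem"),
    ("società", "Fem"),
    ("anno", "Masc"),
    ("anno", "Masc"),
    ("art.", None),
]
single, total, share = single_value_lemma_share(pairs)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests the value is fixed per lemma (lexical), while a low share suggests the feature is inflectional for that part of speech.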
DET
37733 DET tokens (86% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (34990; 93%), Definite=Def (30875; 82%), Number=Sing (25954; 69%).
DET tokens may have the following values of Gender:
Fem (15862; 42% of non-empty Gender): la, le, una, un’, sua, questa, tutte, queste, sue, quella
Masc (21871; 58% of non-empty Gender): il, i, un, gli, lo, questo, suo, tutti, questi, uno
EMPTY (6179): l’, loro, ogni, tale, il, qualche, tali, che, quest’, cui
Paradigm il | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing|PronType=Art | il, lo, gli, i, l' | la |
Definite=Def|Number=Plur|PronType=Art | i, gli, il | le, il, i |
Number=Sing | lo | |
Number=Plur | i | le |
ADJ
12473 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7816; 63%).
ADJ tokens may have the following values of Gender:
Fem (5639; 45% of non-empty Gender): altre, nuova, italiana, altra, nuove, politica, stessa, pubblica, economica, politiche
Masc (6834; 55% of non-empty Gender): altri, nuovo, economico, stesso, nuovi, scorso, altro, finanziario, ultimo, italiano
EMPTY (7650): precedente, primo, grande, presente, ex, netto, generale, grandi, nazionale, sociale
Paradigm altro | Masc | Fem |
---|---|---|
Number=Sing | altro | altra |
Number=Sing|PronType=Dem | altro | |
Number=Sing|PronType=Ind | altro | altra |
Number=Plur | altri | altre |
VERB
7833 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7833; 100%), Person=EMPTY (7833; 100%), Tense=Past (7772; 99%), VerbForm=Part (7772; 99%), Number=Sing (5633; 72%).
VERB tokens may have the following values of Gender:
Fem (2290; 29% of non-empty Gender): prevista, indicate, presentata, comprese, effettuata, fatta, richiesta, data, previste, richieste
Masc (5543; 71% of non-empty Gender): fatto, detto, approvato, previsto, avuto, previsti, deciso, ottenuto, visto, dato
EMPTY (13558): è, ha, fare, fa, far, hanno, dice, sono, avere, scade
Paradigm fare | Masc | Fem |
---|---|---|
Number=Sing | fatto, salvo | fatta |
Number=Plur | fatti | fatte |
PRON
2754 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (2154; 78%), Number=Sing (1987; 72%), Person=EMPTY (1926; 70%).
PRON tokens may have the following values of Gender:
Fem (717; 26% of non-empty Gender): quella, la, quelle, le, una, essa, questa, queste, altra, esse
Masc (2037; 74% of non-empty Gender): lo, quello, quale, quelli, questo, tutti, gli, li, lui, quanto
EMPTY (7049): che, si, cui, ci, c’, ne, mi, dove, chi, quali
Paradigm quello | Masc | Fem |
---|---|---|
Number=Sing | quello, quel | quella |
Number=Plur | quelli | quelle |
AUX
668 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (668; 100%), Person=EMPTY (668; 100%), Tense=Past (667; 100%), VerbForm=Part (667; 100%), Number=Sing (504; 75%).
AUX tokens may have the following values of Gender:
Fem (203; 30% of non-empty Gender): stata, state, dovuta, fatta, voluta
Masc (465; 70% of non-empty Gender): stato, stati, potuto, dovuto, voluto, fatto, dovuti, essere, voluti
EMPTY (8752): è, ha, sono, essere, hanno, era, sarà, deve, può, sia
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing|PronType=Rel|Tense=Past|VerbForm=Part | stata | |
Number=Sing|Tense=Past|VerbForm=Part | stato | stata |
Number=Plur | essere | |
Number=Plur|Tense=Past|VerbForm=Part | stati | state |
ADV
156 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (154; 99%).
ADV tokens may have the following values of Gender:
Fem (49; 31% of non-empty Gender): estremamente, inizialmente, costantemente, normalmente, celermente, contrariamente, lungamente, solamente, una, Molte
Masc (107; 69% of non-empty Gender): volta, molto, poco, fa, lungo, troppo, no, seguito, casual, dietro
EMPTY (10619): non, più, anche, solo, così, già, ancora, ieri, poi, sempre
Paradigm molto | Masc | Fem |
---|---|---|
Number=Sing | molto | |
Number=Plur | | Molte |
Gender seems to be a lexical feature of ADV. 97% of lemmas (71) occur with only one value of Gender.
NUM
98 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (98; 100%).
NUM tokens may have the following values of Gender:
Fem (35; 36% of non-empty Gender): un’, terza, una, mezza
Masc (63; 64% of non-empty Gender): miliardi, milioni, un, primi, terzi, bis, rientro, uno
EMPTY (6296): due, tre, cento, 15, 1, 5, 1973, 2, 20, 30
Paradigm uno | Masc | Fem |
---|---|---|
| un, uno | un', una |
X
59 X tokens (15% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (59; 100%).
X tokens may have the following values of Gender:
Fem (9; 15% of non-empty Gender): area, city, deregulation, force, legis, mountain, ope, private, task
Masc (50; 85% of non-empty Gender): personal, station, work, computer, bond, open, space, business, condicio, dragon
EMPTY (343): joint, venture, baby, cd, sitter, rom, condicio, est, facile, news
Gender seems to be a lexical feature of X. 100% of lemmas (27) occur with only one value of Gender.
ADP
8 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Masc (8; 100% of non-empty Gender): dietro, mezzo, per, ne, niente, rispetto, vicini
EMPTY (45582): di, a, in, per, da, con, su, tra, ad, come
SCONJ
8 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.
SCONJ tokens may have the following values of Gender:
Masc (8; 100% of non-empty Gender): quanto, addebitati, caso, nel
EMPTY (2232): che, se, perché, quando, mentre, come, qualora, quanto, poiché, affinché
CCONJ
6 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.
CCONJ tokens may have the following values of Gender:
Fem (1; 17% of non-empty Gender): essa
Masc (5; 83% of non-empty Gender): quanti, altro, caso, quanto
EMPTY (8259): e, ma, o, ed, come, sia, che, cioè, ovvero, nonché
INTJ
1 INTJ token (1% of all INTJ tokens) has a non-empty value of Gender.
The most frequent other feature values with which INTJ and Gender co-occurred: Polarity=EMPTY (1; 100%).
INTJ tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): ok
EMPTY (79): no, sì, basta, Bè, Ecco, addio, avanti, macché, oh, Beh
PUNCT
1 PUNCT token (0% of all PUNCT tokens) has a non-empty value of Gender.
PUNCT tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): le
EMPTY (31584): ,, ., “, ), -, (, :, ?, ;, «
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (30335; 83%),
NOUN –[amod]–> ADJ (9788; 61%),
NOUN –[conj]–> NOUN (2595; 58%),
NOUN –[advcl]–> VERB (1307; 76%),
VERB –[nsubj:pass]–> NOUN (1075; 93%),
NOUN –[det:poss]–> DET (961; 77%),
VERB –[conj]–> VERB (339; 51%),
NOUN –[det:predet]–> DET (331; 95%),
NOUN –[nsubj]–> NOUN (162; 53%),
ADJ –[amod]–> ADJ (137; 58%).
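An agreement table like the one above can be approximated by walking each dependency edge and comparing the Gender of parent and child. The sketch below is illustrative only: the token dictionaries, the function name `gender_agreement`, and the choice to take the percentage over edges where both nodes have a non-empty Gender are all assumptions, since the page does not state its denominator.

```python
from collections import Counter


def gender_agreement(sentence):
    """Count edges whose parent and child both carry Gender, and those that agree.

    sentence: list of dicts with keys id, head, upos, deprel, gender (or None).
    Returns (agree, both), two Counters keyed by (parent_upos, deprel, child_upos).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # head 0 (root) has no parent token
        if parent is None or not child["gender"] or not parent["gender"]:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Hand-made example: a Fem noun with an agreeing determiner and a
# deliberately mismatched adjective.
sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
]
agree, both = gender_agreement(sentence)
for (parent, rel, child), n in both.items():
    pct = 100 * agree[(parent, rel, child)] / n
    print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {pct:.0f}%)")
```

Summing the two counters over all sentences of a treebank and sorting by the agreeing count would give a ranking comparable in shape to the list above.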