home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-VIT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

116481 tokens (42%) have a non-empty value of Gender. 12734 types (54%) occur at least once with a non-empty value of Gender. 8506 lemmas (54%) occur at least once with a non-empty value of Gender. The feature is used with 14 part-of-speech tags: NOUN (54690; 20% instances), DET (37731; 13% instances), ADJ (12482; 4% instances), VERB (7838; 3% instances), PRON (2804; 1% instances), AUX (668; 0% instances), ADV (149; 0% instances), NUM (98; 0% instances), X (8; 0% instances), ADP (6; 0% instances), CCONJ (4; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances).

NOUN

54690 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (34527; 63%).

NOUN tokens may have the following values of Gender:

Paradigm fineMascFem
Number=Singfinefine, fin
Number=Plurfini

Gender seems to be lexical feature of NOUN. 99% lemmas (5508) occur only with one value of Gender.

DET

37731 DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (34990; 93%), Definite=Def (30875; 82%), Number=Sing (25954; 69%).

DET tokens may have the following values of Gender:

Paradigm ilMascFem
Definite=Def|Number=Sing|PronType=Artil, lo, gli, i, l'la
Definite=Def|Number=Plur|PronType=Arti, gli, ille, il, i
Number=Singil, lo
Number=Plurile

ADJ

12482 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7828; 63%).

ADJ tokens may have the following values of Gender:

Paradigm altroMascFem
Number=Singaltroaltra
Number=Sing|PronType=Demaltro
Number=Sing|PronType=Indaltroaltra
Number=Pluraltrialtre

VERB

7838 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7838; 100%), Person=EMPTY (7838; 100%), Tense=Past (7777; 99%), VerbForm=Part (7777; 99%), Number=Sing (5635; 72%).

VERB tokens may have the following values of Gender:

Paradigm fareMascFem
Number=Singfatto, salvofatta
Number=Plurfattifatte

PRON

2804 PRON tokens (29% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (2204; 79%), Number=Sing (2034; 73%), Person=EMPTY (1976; 70%).

PRON tokens may have the following values of Gender:

Paradigm quelloMascFem
Number=Singquello, quelquella
Number=Plurquelliquelle

AUX

668 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (668; 100%), Person=EMPTY (668; 100%), Tense=Past (667; 100%), VerbForm=Part (667; 100%), Number=Sing (504; 75%).

AUX tokens may have the following values of Gender:

Paradigm essereMascFem
Number=Sing|PronType=Rel|Tense=Past|VerbForm=Partstata
Number=Sing|Tense=Past|VerbForm=Partstatostata
Number=Pluressere
Number=Plur|Tense=Past|VerbForm=Partstatistate

ADV

149 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (147; 99%).

ADV tokens may have the following values of Gender:

Paradigm moltoMascFem
Number=Singmolto
Number=PlurMolte

Gender seems to be lexical feature of ADV. 97% lemmas (67) occur only with one value of Gender.

NUM

98 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (98; 100%).

NUM tokens may have the following values of Gender:

Paradigm unoMascFem
un, unoun', una

X

8 X tokens (2% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (8; 100%).

X tokens may have the following values of Gender:

ADP

6 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

CCONJ

4 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

INTJ

1 INTJ tokens (1% of all INTJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which INTJ and Gender co-occurred: Polarity=EMPTY (1; 100%).

INTJ tokens may have the following values of Gender:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

SCONJ

1 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (30364; 83%), NOUN –[amod]–> ADJ (9793; 61%), NOUN –[conj]–> NOUN (2596; 58%), NOUN –[advcl]–> VERB (1297; 76%), VERB –[nsubj:pass]–> NOUN (1075; 93%), NOUN –[det:poss]–> DET (960; 77%), VERB –[conj]–> VERB (338; 51%), NOUN –[det:predet]–> DET (333; 95%), NOUN –[nsubj]–> NOUN (162; 53%), ADJ –[amod]–> ADJ (137; 58%).