
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

36401 tokens (29%) have a non-empty value of Gender. 7007 types (40%) occur at least once with a non-empty value of Gender. 4486 lemmas (33%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags (token count; share of all tokens in the treebank): NOUN (16933; 14%), DET (12308; 10%), ADJ (3540; 3%), PRON (1931; 2%), VERB (1596; 1%), AUX (92; 0%), PROPN (1; 0%).
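Counts of this kind can be reproduced from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence and the names `SAMPLE`, `parse_feats`, and `gender_counts` are hypothetical, and it assumes that only syntactic words (integer IDs) are counted, skipping multiword-token ranges and empty nodes.

```python
from collections import Counter

# Hypothetical sample in CoNLL-U format (10 tab-separated columns per token).
SAMPLE = """\
# text = il nuovo governo !
1\til\til\tDET\tRD\tDefinite=Def|Gender=Masc|Number=Sing|PronType=Art\t3\tdet\t_\t_
2\tnuovo\tnuovo\tADJ\tA\tGender=Masc|Number=Sing\t3\tamod\t_\t_
3\tgoverno\tgoverno\tNOUN\tS\tGender=Masc|Number=Sing\t0\troot\t_\t_
4\t!\t!\tPUNCT\tFS\t_\t3\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS cell such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def gender_counts(conllu_text):
    """Return (tokens with Gender, all tokens, Counter of Gender tokens per UPOS)."""
    with_gender = total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip ranges like 1-2 and empty nodes like 1.1
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender += 1
            by_upos[cols[3]] += 1
    return with_gender, total, by_upos


with_gender, total, by_upos = gender_counts(SAMPLE)
print(f"{with_gender} of {total} tokens ({with_gender / total:.0%}) have Gender")
for upos, n in by_upos.most_common():
    print(f"{upos}: {n} ({n / total:.0%} of all tokens)")
```

The per-tag percentages are divided by the total token count, which matches the figures above (for example, 16933 NOUN tokens are about 14% of a treebank in which 36401 tokens make up 29%).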

NOUN

16933 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (11723; 69%).
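Co-occurrence figures such as the one above can be sketched as a count over the FEATS cells of tokens that carry Gender, with the number of Gender-bearing tokens as the denominator (11723 / 16933 is about 69%). The function name `cooccurring_values` and the sample cells are hypothetical; this is an illustration, not the page's own generator.

```python
from collections import Counter


def cooccurring_values(feats_cells, feature="Gender"):
    """Count the other Feature=Value pairs on tokens that carry `feature`."""
    counts, carriers = Counter(), 0
    prefix = feature + "="
    for cell in feats_cells:
        pairs = [] if cell == "_" else cell.split("|")
        if not any(p.startswith(prefix) for p in pairs):
            continue  # token has no value of the feature of interest
        carriers += 1
        counts.update(p for p in pairs if not p.startswith(prefix))
    return counts, carriers


# Hypothetical FEATS cells of five NOUN tokens; the last two lack Gender.
NOUN_FEATS = [
    "Gender=Masc|Number=Sing",
    "Gender=Fem|Number=Sing",
    "Gender=Masc|Number=Plur",
    "Number=Sing",
    "_",
]

counts, carriers = cooccurring_values(NOUN_FEATS)
for value, n in counts.most_common():
    print(f"{value} ({n}; {n / carriers:.0%})")
```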

NOUN tokens may have the following values of Gender:

| Paradigm partito | Masc | Fem |
|---|---|---|
| Number=Sing | partito, partitino | partita |
| Number=Plur | partiti | |

Gender seems to be a lexical feature of NOUN: 98% of lemmas (3184) occur with only one value of Gender.
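The lexical-feature test can be read as: group the observed Gender values by lemma and measure how many lemmas show exactly one value. A minimal sketch under that reading follows; `single_gender_share` and the observations are hypothetical and not taken from the treebank.

```python
from collections import defaultdict


def single_gender_share(observations):
    """observations: iterable of (lemma, gender) pairs for tokens with Gender.

    Returns (lemmas with exactly one value, all lemmas, share of the former).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), single / len(values)


# Hypothetical NOUN observations; "collega" occurs with both values.
OBSERVATIONS = [
    ("giorno", "Masc"), ("giorno", "Masc"),
    ("governo", "Masc"),
    ("casa", "Fem"),
    ("collega", "Masc"), ("collega", "Fem"),
]

single, total, share = single_gender_share(OBSERVATIONS)
print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value of Gender")
```

A share close to 100% suggests the value is fixed per lemma (lexical) rather than chosen by inflection, which is the pattern reported for NOUN here.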

DET

12308 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (11463; 93%), Definite=Def (9904; 80%), Number=Sing (9594; 78%).

DET tokens may have the following values of Gender:

| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing | il, lo | la |
| Number=Plur | i, gli, il | le, e |

ADJ

3540 ADJ tokens (71% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2690; 76%).

ADJ tokens may have the following values of Gender:

| Paradigm nuovo | Masc | Fem |
|---|---|---|
| Number=Sing | nuovo | nuova |
| Number=Plur | nuovi | nuove |

PRON

1931 PRON tokens (30% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1380; 71%), Clitic=EMPTY (1180; 61%), Person=EMPTY (1170; 61%).

PRON tokens may have the following values of Gender:

| Paradigm tutto | Masc | Fem |
|---|---|---|
| Number=Sing | tutto | tutta |
| Number=Plur | tutti | tutte |

VERB

1596 VERB tokens (14% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1596; 100%), VerbForm=Part (1596; 100%), Person=EMPTY (1595; 100%), Tense=Past (1595; 100%), Number=Sing (1472; 92%).

VERB tokens may have the following values of Gender:

| Paradigm fare | Masc | Fem |
|---|---|---|
| Number=Sing | fatto | fatta |
| Number=Plur | | fatte |

AUX

92 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (92; 100%), Person=EMPTY (92; 100%), Tense=Past (92; 100%), VerbForm=Part (92; 100%), Number=Sing (79; 86%).

AUX tokens may have the following values of Gender:

| Paradigm essere | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata, state, ststa |
| Number=Plur | stati | state |

PROPN

1 PROPN token (0% of all PROPN tokens) has a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (9108; 84%), NOUN –[amod]–> ADJ (2028; 70%), NOUN –[det:poss]–> DET (484; 86%), NOUN –[conj]–> NOUN (414; 53%), NOUN –[parataxis]–> NOUN (213; 52%), ADJ –[nsubj]–> NOUN (138; 58%), NOUN –[nsubj]–> NOUN (130; 50%), ADJ –[conj]–> ADJ (105; 52%), PRON –[det]–> DET (95; 65%), NOUN –[compound]–> NOUN (86; 53%).
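Agreement statistics like these can be sketched by comparing each node's Gender with that of its parent and grouping by the pattern (parent tag, relation, child tag). The page does not state the denominator of the percentages; the sketch below assumes it is every instance of the pattern, including instances where one node has no Gender. The token tuples and the name `agreement_stats` are hypothetical.

```python
from collections import Counter

# Hypothetical tokens: (id, upos, gender or None, head id, deprel).
SENTENCE = [
    (1, "DET", "Fem", 3, "det"),
    (2, "ADJ", "Fem", 3, "amod"),
    (3, "NOUN", "Fem", 0, "root"),
    (4, "DET", None, 5, "det"),
    (5, "NOUN", "Masc", 3, "conj"),
]


def agreement_stats(tokens):
    """Return (agree, seen): Counters keyed by (parent upos, deprel, child upos)."""
    by_id = {tok[0]: tok for tok in tokens}
    agree, seen = Counter(), Counter()
    for _id, upos, gender, head, deprel in tokens:
        parent = by_id.get(head)
        if parent is None:
            continue  # the root (head 0) has no parent node
        key = (parent[1], deprel, upos)
        seen[key] += 1  # assumption: every instance of the pattern counts
        if gender is not None and gender == parent[2]:
            agree[key] += 1
    return agree, seen


agree, seen = agreement_stats(SENTENCE)
for (parent, deprel, child), n in agree.most_common():
    share = n / seen[(parent, deprel, child)]
    print(f"{parent} -[{deprel}]-> {child} ({n}; {share:.0%})")
```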