home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-ParTUT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

24141 tokens (43%) have a non-empty value of Gender. 4787 types (57%) occur at least once with a non-empty value of Gender. 3451 lemmas (61%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (11176; 20% instances), DET (8168; 15% instances), ADJ (2538; 5% instances), VERB (1448; 3% instances), PRON (652; 1% instances), AUX (157; 0% instances), ADP (1; 0% instances), PROPN (1; 0% instances).

NOUN

11176 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (7231; 65%).

NOUN tokens may have the following values of Gender:

Paradigm signoreMascFem
signorsignora

Gender seems to be lexical feature of NOUN. 99% lemmas (2221) occur only with one value of Gender.

DET

8168 DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (7123; 87%), Definite=Def (6177; 76%), Number=Sing (5573; 68%).

DET tokens may have the following values of Gender:

Paradigm ilMascFem
Number=Singil, lo, l'la
Number=Pluri, glile

ADJ

2538 ADJ tokens (60% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1583; 62%).

ADJ tokens may have the following values of Gender:

Paradigm altroMascFem
Number=Singaltroaltra
Number=Pluraltrialtre

VERB

1448 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1448; 100%), Person=EMPTY (1448; 100%), Tense=Past (1448; 100%), VerbForm=Part (1448; 100%), Number=Sing (961; 66%).

VERB tokens may have the following values of Gender:

Paradigm avereMascFem
avutoavuta

PRON

652 PRON tokens (36% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (526; 81%), Number=Sing (472; 72%), Person=EMPTY (432; 66%).

PRON tokens may have the following values of Gender:

Paradigm quelloMascFem
Number=Singquello, quelquella
Number=Plurquelliquelle

AUX

157 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (157; 100%), Person=EMPTY (157; 100%), Tense=Past (157; 100%), VerbForm=Part (157; 100%), Number=Sing (111; 71%).

AUX tokens may have the following values of Gender:

Paradigm essereMascFem
Number=Singstatostata
Number=Plurstatistate

ADP

1 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

PROPN

1 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (6716; 84%), NOUN –[amod]–> ADJ (2045; 60%), NOUN –[conj]–> NOUN (486; 55%), NOUN –[det:poss]–> DET (470; 85%), NOUN –[acl]–> VERB (421; 57%), VERB –[nsubj:pass]–> NOUN (327; 96%), NOUN –[det:predet]–> DET (83; 98%), ADJ –[conj]–> ADJ (81; 54%), PRON –[nmod]–> NOUN (80; 73%), NOUN –[nsubj]–> NOUN (61; 54%).