home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-MarkIT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

17767 tokens (44%) have a non-empty value of Gender. 3855 types (64%) occur at least once with a non-empty value of Gender. 2915 lemmas (71%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: NOUN (7399; 18% instances), DET (5737; 14% instances), ADJ (2454; 6% instances), PRON (1391; 3% instances), VERB (718; 2% instances), AUX (61; 0% instances), PROPN (3; 0% instances), ADP (1; 0% instances), ADV (1; 0% instances), CCONJ (1; 0% instances), SCONJ (1; 0% instances).

NOUN

7399 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (5598; 76%).

NOUN tokens may have the following values of Gender:

Paradigm grazieMascFem
graziegrazie

Gender seems to be lexical feature of NOUN. 98% lemmas (1728) occur only with one value of Gender.

DET

5737 DET tokens (87% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (4632; 81%), Number=Sing (4398; 77%), Definite=Def (3780; 66%).

DET tokens may have the following values of Gender:

Paradigm ilMascFem
Definite=Def|Number=SingLa
Definite=Def|Number=Sing|PronType=Artil, lo, l'la, l', lo
Definite=Def|Number=Plur|PronType=Arti, gli, ille, la
Number=Singilla, L'
Number=Plurile

ADJ

2454 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1749; 71%).

ADJ tokens may have the following values of Gender:

Paradigm grandeMascFem
Degree=Abs|Number=Singgrandissima
Number=Singgrande, gran
Number=Plurgrandigrandi

PRON

1391 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (1147; 82%), Number=Sing (1032; 74%), PronType=Prs (857; 62%), Clitic=Yes (747; 54%).

PRON tokens may have the following values of Gender:

Paradigm loMascFem
Clitic=Yes|Number=Sing|Person=3la
Clitic=Yes|Number=Sing|Person=3|PronType=Prslo, l', gli, lila, le
Clitic=Yes|Number=Plur|Person=3|PronType=Prsli, glile
Definite=Def|Number=Singlo
Definite=Def|Number=Sing|PronType=Artlola
Number=Singla

VERB

718 VERB tokens (18% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (718; 100%), Mood=EMPTY (717; 100%), VerbForm=Part (714; 99%), Tense=Past (710; 99%), Number=Sing (534; 74%).

VERB tokens may have the following values of Gender:

Paradigm essereMascFem
Number=Singstatostata
Number=Plurstatistate

AUX

61 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (61; 100%), Person=EMPTY (61; 100%), Tense=Past (61; 100%), VerbForm=Part (61; 100%), Number=Sing (47; 77%).

AUX tokens may have the following values of Gender:

Paradigm essereMascFem
Number=Singstatostata
Number=Plurstatistate

PROPN

3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

ADP

1 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (1; 100%).

ADV tokens may have the following values of Gender:

CCONJ

1 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

SCONJ

1 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4737; 85%), NOUN –[amod]–> ADJ (1537; 82%), NOUN –[det:poss]–> DET (359; 93%), NOUN –[conj]–> NOUN (312; 56%), ADJ –[conj]–> ADJ (95; 84%), NOUN –[nsubj]–> NOUN (67; 54%), ADJ –[det]–> DET (65; 86%), ADJ –[nsubj]–> NOUN (55; 68%), NOUN –[det:predet]–> DET (53; 100%), VERB –[nsubj:pass]–> NOUN (51; 94%).