
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-KIParlaForest: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

6263 tokens (34%) have a non-empty value of Gender. 1546 types (53%) occur at least once with a non-empty value of Gender. 1271 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 13 part-of-speech tags: NOUN (2495; 13% of instances), DET (1812; 10% of instances), ADJ (665; 4% of instances), PRON (650; 3% of instances), VERB (389; 2% of instances), PROPN (111; 1% of instances), AUX (35; 0% of instances), INTJ (34; 0% of instances), NUM (34; 0% of instances), ADV (33; 0% of instances), ADP (2; 0% of instances), CCONJ (2; 0% of instances), X (1; 0% of instances).
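Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names and the toy `SAMPLE` sentence are invented for illustration and are not taken from the treebank.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT from UD_Italian-KIParlaForest).
SAMPLE = "\n".join([
    "# text = la lingua araba eh",
    "1\tla\til\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tlingua\tlingua\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "3\taraba\tarabo\tADJ\t_\tGender=Fem|Number=Sing\t2\tamod\t_\t_",
    "4\teh\teh\tINTJ\t_\t_\t2\tdiscourse\t_\t_",
])

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    share = 100 * with_gender[upos] / total[upos]
    print(f"{upos}: {with_gender[upos]} of {total[upos]} tokens ({share:.0f}%) have Gender")
```

Run over a real treebank file, the per-tag counts correspond to the first number in each parenthesis above, and dividing by the per-tag totals gives the "% of all NOUN tokens" style figures in the sections below.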

NOUN

2495 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1699; 68%).

NOUN tokens may have the following values of Gender:

Paradigm linguaMascFem
Number=Singlinguelingua
Number=Plurlingue

Gender seems to be a lexical feature of NOUN: 97% of lemmas (755) occur with only one value of Gender.
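The "lexical feature" check can be sketched as follows: collect, for each lemma of a given part of speech, the set of Gender values it occurs with, then count the lemmas whose set has exactly one member. This is an illustrative sketch; the function name and the toy token list are assumptions, not the page's actual implementation or data.

```python
from collections import defaultdict

def single_gender_lemmas(tokens, upos):
    """tokens: iterable of (lemma, upos, feats_dict).

    Returns (lemmas with exactly one Gender value, lemmas with any Gender value).
    """
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and "Gender" in feats:
            # UD allows comma-joined multivalues such as 'Fem,Masc'.
            values[lemma].update(feats["Gender"].split(","))
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Toy data (hypothetical): one lemma seen with both values, one with a single value.
TOKENS = [
    ("lingua", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("lingua", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("casa", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("casa", "NOUN", {"Gender": "Fem", "Number": "Plur"}),
    ("arabo", "ADJ", {"Gender": "Masc", "Number": "Sing"}),
]

single, seen = single_gender_lemmas(TOKENS, "NOUN")
print(f"{single} of {seen} NOUN lemmas occur with only one value of Gender")
```

A high share suggests the value is fixed per lemma (as with nouns), whereas a low share suggests inflection or agreement (as with adjectives and determiners).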

DET

1812 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (1555; 86%), Number=Sing (1335; 74%), Definite=Def (1167; 64%).

DET tokens may have the following values of Gender:

Paradigm ilMascFem
Definite=Def|Number=Sing|PronType=Artil, lo, lla, le
Definite=Def|Number=Plur|PronType=Arti, gli, ille, lo
Number=Sing|Person=3|PronType=Prslo, l'la
Number=Sing|PronType=Artla
Number=Plur|PronType=Arti

ADJ

665 ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (511; 77%).

ADJ tokens may have the following values of Gender:

Paradigm arabo   Masc    Fem
Number=Sing      arabo   araba
Number=Plur      arabi

PRON

650 PRON tokens (35% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (510; 78%), Person=EMPTY (383; 59%).

PRON tokens may have the following values of Gender:

Paradigm loMascFem
Definite=Def|Number=Sing|PronType=Artlo
Definite=Def|Number=Plur|PronType=Prsl'
Number=Sing|Person=3|PronType=Prslo, l', qual

VERB

389 VERB tokens (16% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (389; 100%), Person=EMPTY (389; 100%), Number=Sing (336; 86%), Tense=Past (334; 86%), VerbForm=Part (334; 86%).

VERB tokens may have the following values of Gender:

Paradigm essere  Masc    Fem
Number=Sing      stato   stata
Number=Plur      stati

PROPN

111 PROPN tokens (26% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (81; 73%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN: 100% of lemmas (56) occur with only one value of Gender.

AUX

35 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (35; 100%), Person=EMPTY (35; 100%), Tense=Past (27; 77%), VerbForm=Part (27; 77%), Number=Sing (26; 74%).

AUX tokens may have the following values of Gender:

Paradigm essereMascFem
_sonson
Number=Singero
Number=Sing|Tense=Past|VerbForm=Partstatostata
Number=Pluresser
Number=Plur|Tense=Past|VerbForm=Partstatistate

INTJ

34 INTJ tokens (4% of all INTJ tokens) have a non-empty value of Gender.

INTJ tokens may have the following values of Gender:

NUM

34 NUM tokens (20% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (25; 74%), NumType=Ord (19; 56%).

NUM tokens may have the following values of Gender:

Paradigm primoMascFem
_primi
Number=Sing|NumType=Ordprimoprima
Number=Plur|NumType=Ordprimi

ADV

33 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (29; 88%).

ADV tokens may have the following values of Gender:

Paradigm MascFem

ADP

2 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

CCONJ

2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

X

1 X token (0% of all X tokens) has a non-empty value of Gender.

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1388; 81%), NOUN –[amod]–> ADJ (341; 69%), NOUN –[conj]–> NOUN (46; 56%), PROPN –[det]–> DET (34; 51%), ADJ –[det]–> DET (24; 59%), NOUN –[det:poss]–> DET (23; 66%), ADJ –[nsubj]–> NOUN (22; 76%), DET –[reparandum]–> DET (16; 57%), NOUN –[parataxis]–> NOUN (15; 63%), INTJ –[discourse]–> INTJ (13; 100%).
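Agreement counts like these can be sketched by walking each dependency edge and comparing the Gender of parent and child. The sketch below is an assumption-laden illustration: the token dictionaries, the function name, and the toy sentence are hypothetical, and the denominator used here (edges where both nodes carry Gender) is an assumption that may differ from the one behind the percentages above.

```python
from collections import Counter

def gender_agreement(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, gender (None if absent).

    Returns (agree, both): counters keyed by (parent UPOS, deprel, child UPOS).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, both

# Toy sentence (hypothetical): the determiner agrees with the noun, the adjective does not.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
]

agree, both = gender_agreement(SENTENCE)
for (parent, rel, child), n in both.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n} agree")
```

Summing the counters over all sentences of a treebank and sorting by the agreement count gives a ranking in the same spirit as the list above.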