home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sicilian-STB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

3634 tokens (32%) have a non-empty value of Gender. 1067 types (47%) occur at least once with a non-empty value of Gender. 880 lemmas (58%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (1462; 13% instances), DET (1226; 11% instances), PRON (496; 4% instances), ADJ (326; 3% instances), VERB (116; 1% instances), AUX (8; 0% instances).

NOUN

1462 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1009; 69%).

NOUN tokens may have the following values of Gender:

Paradigm manuMascFem
_mani, manu
Number=Singmanumanu, mani
Number=Plurmanu, mani

Gender seems to be lexical feature of NOUN. 98% lemmas (587) occur only with one value of Gender.

DET

1226 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (1006; 82%), Number=Sing (974; 79%), Definite=Def (752; 61%).

DET tokens may have the following values of Gender:

Paradigm luMascFem
Number=Singlu, u, 'u, l', O, lala, a, l', 'a, u
Number=Plurli, i, l', la, 'i, luli, i, l', la

PRON

496 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (449; 91%), PronType=Prs (346; 70%), Person=3 (340; 69%).

PRON tokens may have the following values of Gender:

Paradigm ciMascFem
Clitic=Yes|Number=Singci, cci
Number=Singcci, cici, cci
Number=Plurccici

ADJ

326 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (284; 87%).

ADJ tokens may have the following values of Gender:

Paradigm menzuMascFem
menzumenz'

VERB

116 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (116; 100%), Person=EMPTY (116; 100%), Tense=Past (116; 100%), VerbForm=Part (116; 100%), Number=Sing (99; 85%).

VERB tokens may have the following values of Gender:

Paradigm vidiriMascFem
vistuvistu

AUX

8 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (8; 100%), Number=Sing (8; 100%), Person=EMPTY (8; 100%), Tense=Past (8; 100%), VerbForm=Part (8; 100%).

AUX tokens may have the following values of Gender:

Paradigm essiriMascFem
statustatu

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (993; 91%), NOUN –[amod]–> ADJ (168; 78%), NOUN –[det:poss]–> DET (47; 89%), NOUN –[conj]–> NOUN (34; 65%), PRON –[det]–> DET (19; 73%), ADJ –[conj]–> ADJ (16; 70%), NOUN –[det:predet]–> DET (16; 80%), PRON –[nmod]–> NOUN (11; 55%), ADJ –[nsubj]–> NOUN (9; 69%), ADJ –[obl]–> PRON (8; 89%).