Treebank Statistics: UD_Sicilian-STB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
3634 tokens (32%) have a non-empty value of Gender.
1067 types (47%) occur at least once with a non-empty value of Gender.
880 lemmas (58%) occur at least once with a non-empty value of Gender.
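The three coverage figures above can in principle be recomputed from the treebank's CoNLL-U files by inspecting the FEATS column of every word line. The following is a minimal sketch of that idea, not the script that produced this page: the names `parse_feats`, `gender_coverage` and `SAMPLE` are hypothetical, the sample is a toy fragment rather than treebank data, and counting types by exact (case-sensitive) word form is an assumption.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_coverage(conllu_text):
    """Return (with_gender, total) pairs for tokens, types and lemmas."""
    tokens_with, tokens_all = 0, 0
    types_with, types_all = set(), set()
    lemmas_with, lemmas_all = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        form, lemma, feats = cols[1], cols[2], parse_feats(cols[5])
        tokens_all += 1
        types_all.add(form)      # assumption: types are case-sensitive forms
        lemmas_all.add(lemma)
        if "Gender" in feats:
            tokens_with += 1
            types_with.add(form)
            lemmas_with.add(lemma)
    return ((tokens_with, tokens_all),
            (len(types_with), len(types_all)),
            (len(lemmas_with), len(lemmas_all)))


# Toy sample for illustration only; it is not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tla\tlu\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tmanu\tmanu\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "3\tè\tessiri\tAUX\t_\t_\t2\tcop\t_\t_",
])

tokens, types, lemmas = gender_coverage(SAMPLE)
print("tokens:", tokens, "types:", types, "lemmas:", lemmas)
```

Each pair can be turned into a percentage by dividing the first number by the second.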
The feature is used with 6 part-of-speech tags: NOUN (1462; 13% of instances), DET (1226; 11% of instances), PRON (496; 4% of instances), ADJ (326; 3% of instances), VERB (116; 1% of instances), AUX (8; 0% of instances).
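A breakdown by part-of-speech tag like the one above amounts to counting Gender-bearing tokens per UPOS value. The sketch below is one possible way to do this; `gender_by_upos` and `ROWS` are hypothetical names, the rows are toy data, and expressing each share relative to all tokens is an assumption (it is consistent with the figures on this page, but the page does not state its denominator).

```python
from collections import Counter


def gender_by_upos(rows):
    """Count tokens with a Gender feature per UPOS tag.

    `rows` is an iterable of (upos, feats) pairs, where `feats` is the raw
    CoNLL-U FEATS string ('_' when empty). Returns a list of
    (upos, count, percent_of_all_tokens), most frequent first.
    """
    counts = Counter()
    total = 0
    for upos, feats in rows:
        total += 1
        names = {pair.split("=", 1)[0] for pair in feats.split("|")}
        if "Gender" in names:
            counts[upos] += 1
    # Assumption: the share is relative to all tokens, not only gendered ones.
    return [(upos, n, round(100 * n / total)) for upos, n in counts.most_common()]


# Toy rows for illustration only; they are not taken from the treebank.
ROWS = [
    ("DET", "Definite=Def|Gender=Masc|Number=Sing|PronType=Art"),
    ("NOUN", "Gender=Masc|Number=Sing"),
    ("VERB", "Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin"),
    ("NOUN", "Gender=Fem|Number=Plur"),
]

for upos, count, percent in gender_by_upos(ROWS):
    print(f"{upos} ({count}; {percent}% of instances)")
```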
NOUN
1462 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1009; 69%).
NOUN tokens may have the following values of Gender:
Fem (714; 49% of non-empty Gender): vota, cosa, manu, Maistà, acqua, màchina, cosi, testa, facci, ura
Masc (748; 51% of non-empty Gender): Re, mari, pisci, funnu, omu, anni, occhi, jornu, tempu, puntu
EMPTY (43): missinisi, vurpi, Calabrisi, Vassìa, Zuccarata, Cucinu, Cumpari, Don, Patri, Riggitani
| Paradigm manu | Masc | Fem |
|---|---|---|
| _ | mani, manu | |
| Number=Sing | manu | manu, mani |
| Number=Plur | manu, mani | |
Gender seems to be a lexical feature of NOUN: 98% of lemmas (587) occur with only one value of Gender.
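The lexicality check behind this statement can be sketched as follows: collect the set of Gender values seen for each lemma and count the lemmas whose set has exactly one member. This is an illustrative sketch, not the generator's own code; `single_gender_share` and `PAIRS` are hypothetical names, and the pairs are toy data that only mirror the pattern reported above (for example, manu appearing with both values).

```python
from collections import defaultdict


def single_gender_share(pairs):
    """Return (n_single, n_total, percent) over lemmas seen with a Gender value.

    `pairs` is an iterable of (lemma, gender) tuples for the tokens of one
    UPOS tag; tokens without Gender are expected to be left out by the caller.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    total = len(values)
    percent = round(100 * single / total) if total else 0
    return single, total, percent


# Toy pairs for illustration only; counts do not reflect the treebank.
PAIRS = [
    ("manu", "Fem"),
    ("manu", "Masc"),
    ("Re", "Masc"),
    ("acqua", "Fem"),
    ("Re", "Masc"),
]

single, total, percent = single_gender_share(PAIRS)
print(f"{percent}% of lemmas ({single} of {total}) occur with only one value of Gender")
```

A high share suggests that the feature is a property of the lemma; a low share suggests inflection, as with determiners and adjectives.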
DET
1226 DET tokens (95% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (1006; 82%), Number=Sing (974; 79%), Definite=Def (752; 61%).
DET tokens may have the following values of Gender:
Fem (556; 45% of non-empty Gender): la, a, na, li, ‘na, sta, i, l’, so, ḍḍa
Masc (670; 55% of non-empty Gender): lu, un, u, li, stu, i, l’, n’, tutti, so
EMPTY (67): l’, quarchi, ogni, chi, so, st’, A, Quantu, l’, mè
| Paradigm lu | Masc | Fem |
|---|---|---|
| Number=Sing | lu, u, 'u, l', O, la | la, a, l', 'a, u |
| Number=Plur | li, i, l', la, 'i, lu | li, i, l', la |
PRON
496 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (449; 91%), PronType=Prs (346; 70%), Person=3 (340; 69%).
PRON tokens may have the following values of Gender:
Fem (122; 25% of non-empty Gender): la, ci, una, l’, a, àutra, cci, li, idda, ‘a
Masc (374; 75% of non-empty Gender): cci, lu, ci, iḍḍu, l’, chistu, u, nenti, tutti, iddu
EMPTY (702): si, ca, mi, cci, ti, chi, cc’, s’, nni, tu
| Paradigm ci | Masc | Fem |
|---|---|---|
| Clitic=Yes\|Number=Sing | ci, cci | |
| Number=Sing | cci, ci | ci, cci |
| Number=Plur | cci | ci |
ADJ
326 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (284; 87%).
ADJ tokens may have the following values of Gender:
Fem (158; 48% of non-empty Gender): sana, autra, beḍḍa, sula, calla, fabbricata, janca, prima, rutta, vecchia
Masc (168; 52% of non-empty Gender): menzu, marinu, veru, funnutu, cuntenti, mezzu, mortu, autru, bonu, sanu
EMPTY (61): gran, granni, megghiu, duci, nurmali, ‘ranni, fina, forti, grossi, riali
| Paradigm menzu | Masc | Fem |
|---|---|---|
| | menzu | menz' |
VERB
116 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (116; 100%), Person=EMPTY (116; 100%), Tense=Past (116; 100%), VerbForm=Part (116; 100%), Number=Sing (99; 85%).
VERB tokens may have the following values of Gender:
Fem (26; 22% of non-empty Gender): ‘mmarsamati, ‘ncatinati, Finuta, Misa, accattatu, accucchiatu, arrivutata, assittata, astutata, caduta
Masc (90; 78% of non-empty Gender): vistu, mortu, fattu, ntisu, dittu, statu, ‘ntisu, abbannunatu, chiamatu, murtu
EMPTY (1479): dissi, fari, diri, era, è, fici, vitti, dici, sapiri, jiri
| Paradigm vidiri | Masc | Fem |
|---|---|---|
| | vistu | vistu |
AUX
8 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (8; 100%), Number=Sing (8; 100%), Person=EMPTY (8; 100%), Tense=Past (8; 100%), VerbForm=Part (8; 100%).
AUX tokens may have the following values of Gender:
Fem (1; 13% of non-empty Gender): statu
Masc (7; 88% of non-empty Gender): pututu, statu
EMPTY (369): era, è, avìa, sugnu, avia, stava, putìa, èramu, èranu, avìssiru
| Paradigm essiri | Masc | Fem |
|---|---|---|
| | statu | statu |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (993; 91%),
NOUN –[amod]–> ADJ (168; 78%),
NOUN –[det:poss]–> DET (47; 89%),
NOUN –[conj]–> NOUN (34; 65%),
PRON –[det]–> DET (19; 73%),
ADJ –[conj]–> ADJ (16; 70%),
NOUN –[det:predet]–> DET (16; 80%),
PRON –[nmod]–> NOUN (11; 55%),
ADJ –[nsubj]–> NOUN (9; 69%),
ADJ –[obl]–> PRON (8; 89%).
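Agreement counts of this kind can be approximated by comparing the Gender value of each node with that of its head, grouped by parent tag, relation and child tag. The sketch below is an illustration under stated assumptions, not the method used to generate this page: `gender_of`, `agreement_by_relation` and `SENTENCE` are hypothetical names, the sentence is toy data, and restricting the denominator to pairs where both nodes carry Gender is an assumption, since the page does not say what its percentages are relative to.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_by_relation(sentence):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    `sentence` is a list of (id, upos, feats, head, deprel) tuples for one
    sentence. Returns {key: (n_agreeing, n_pairs_with_gender_on_both)}.
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for tok_id, upos, feats, head, deprel in sentence:
        parent = by_id.get(head)
        if parent is None:
            continue  # root (head 0) or a head outside this sentence
        child_g, parent_g = gender_of(feats), gender_of(parent[2])
        if child_g is None or parent_g is None:
            continue  # assumption: only pairs with Gender on both nodes count
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_g == parent_g:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Toy sentence for illustration only; it is not taken from the treebank.
SENTENCE = [
    (1, "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", 2, "det"),
    (2, "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    (3, "ADJ", "Gender=Masc|Number=Sing", 2, "amod"),
]

for (parent, rel, child), (n_agree, n_both) in agreement_by_relation(SENTENCE).items():
    print(f"{parent} -[{rel}]-> {child} ({n_agree} of {n_both} agree)")
```

Summing the per-sentence results over a whole treebank and sorting by the agreeing count would give a ranking comparable in shape to the list above.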