Treebank Statistics: UD_Old_Occitan-CorAG: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
3770 tokens (7%) have a non-empty value of Gender.
1123 types (16%) occur at least once with a non-empty value of Gender.
1 lemma (0%) occurs at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: VERB (1733; 3% of all tokens), PRON (1692; 3% of all tokens), ADJ (303; 1% of all tokens), AUX (40; 0% of all tokens), DET (2; 0% of all tokens).
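Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the function names `parse_feats` and `gender_by_upos` as well as the toy sentence in `SAMPLE` are illustrative inventions, not treebank data.

```python
from collections import Counter, defaultdict

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_by_upos(conllu_text):
    """Count Gender values per UPOS tag; tokens without Gender are counted as EMPTY."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed lines, multiword ranges (1-2), empty nodes (1.1)
        gender = parse_feats(cols[5]).get("Gender", "EMPTY")
        counts[cols[3]][gender] += 1
    return counts

# Toy input (hypothetical annotation, for illustration only).
SAMPLE = "\n".join([
    "# text = toy example, not taken from the treebank",
    "\t".join(["1", "lo", "lo", "PRON", "_", "Gender=Masc|Number=Sing", "3", "nsubj", "_", "_"]),
    "\t".join(["2", "es", "esser", "AUX", "_", "_", "3", "aux", "_", "_"]),
    "\t".join(["3", "feyt", "far", "VERB", "_", "Gender=Masc|VerbForm=Part", "0", "root", "_", "_"]),
])

counts = gender_by_upos(SAMPLE)
for upos, by_value in sorted(counts.items()):
    print(upos, dict(by_value))
```

A multi-valued feature such as `Gender=Fem,Masc` would be counted here under the literal string `Fem,Masc`; splitting on commas is left out to keep the sketch short.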
VERB
1733 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (1728; 100%), Person=EMPTY (1721; 99%), Tense=Past (1543; 89%), Number=Sing (1350; 78%), Voice=Pass (1043; 60%).
VERB tokens may have the following values of Gender:
Fem (404; 23% of non-empty Gender): feyte, feytes, feyta, dade, Conegude, passade, recebudes, audides, conoguda, bailhade
Masc (1326; 77% of non-empty Gender): feyt, judyat, establit, tengut, dat, passat, deyt, diit, feit, acostumat
Neut (3; 0% of non-empty Gender): Notum, Actum
EMPTY (4675): deu, far, es, pot, ha, dar, aver, judya, fe, fasse
PRON
1692 PRON tokens (43% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1166; 69%).
PRON tokens may have the following values of Gender:
Fem (196; 12% of non-empty Gender): la, las, aqueres, aquere, aqueras, ere, questas, -queres, autras, la-
Masc (1365; 81% of non-empty Gender): lo, los, -los, luy, hom, lor, asso, ac, eg, aqueg
Neut (131; 8% of non-empty Gender): so, ag, ac, ço, -ço, o, -quero, -quet, acquet
EMPTY (2221): qui, que, se, nos, y, en, autre, s’, l’, ne
ADJ
303 ADJ tokens (18% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=Part (303; 100%), Tense=Past (279; 92%), Voice=Pass (244; 81%), Number=Sing (173; 57%).
ADJ tokens may have the following values of Gender:
Fem (132; 44% of non-empty Gender): sobredeytas, sobredeyta, degude, degudas, susdites, irade, sabude, soberdiites, soberdites, connegude
Masc (171; 56% of non-empty Gender): sobredeyt, sobredeytz, sobredeyts, avantditz, quitis, dits, amat, amatz, deputat, ditz
EMPTY (1377): mayor, autres, autre, medix, medeys, present, son, nostre, medixe, presens
AUX
40 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (40; 100%), VerbForm=Part (40; 100%), Person=EMPTY (39; 98%), Tense=Past (37; 93%), Number=Sing (36; 90%).
AUX tokens may have the following values of Gender:
Fem (7; 18% of non-empty Gender): estade, estada, estadas, estades
Masc (33; 83% of non-empty Gender): estat, estant, estad, estats, estatz
EMPTY (1406): es, sera, fo, sie, son, aura, seran, esser, ere, a
DET
2 DET tokens (0% of all DET tokens) have a non-empty value of Gender.
DET tokens may have the following values of Gender:
Fem (1; 50% of non-empty Gender): augune
Masc (1; 50% of non-empty Gender): augun
EMPTY (6333): la, lo, -lo, las, l’, -los, los, son, li, un
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
VERB –[conj]–> VERB (174; 51%),
PRON –[nmod]–> PRON (12; 55%),
PRON –[acl]–> VERB (8; 62%),
PRON –[nmod]–> ADJ (2; 100%),
PRON –[conj]–> VERB (1; 100%),
PRON –[mark]–> VERB (1; 100%),
VERB –[parataxis]–> PRON (1; 100%).
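
One plausible way to compute such agreement figures is sketched below. It rests on an assumption this page does not confirm: that the first number is the count of parent-child pairs with identical Gender, and the percentage is that count divided by all pairs of the same pattern in which both nodes carry a non-empty Gender. The function name `agreement_stats`, the row layout, and the toy rows are hypothetical.

```python
from collections import Counter

def agreement_stats(sentence_rows):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS) pattern.

    sentence_rows: list of (id, upos, gender_or_None, head_id, deprel) for one sentence.
    Returns (agree, total), where total covers only pairs in which both nodes have Gender.
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, total = Counter(), Counter()
    for _tid, upos, gender, head, deprel in sentence_rows:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or gender is None or parent[2] is None:
            continue  # only pairs where both nodes have a non-empty Gender are compared
        key = (parent[1], deprel, upos)
        total[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return agree, total

# Toy sentence (hypothetical annotation, for illustration only).
rows = [
    (1, "VERB", "Masc", 0, "root"),
    (2, "VERB", "Masc", 1, "conj"),
    (3, "VERB", "Fem", 1, "conj"),
    (4, "PRON", None, 1, "obj"),
]
agree, total = agreement_stats(rows)
for key in total:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {total[key]} pairs agree")
```

In the toy input, one of the two VERB-conj-VERB pairs agrees, and the PRON child is skipped because it has no Gender value.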