home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cappadocian-AMGiC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

267 tokens (33%) have a non-empty value of Gender. 153 types (38%) occur at least once with a non-empty value of Gender. 131 lemmas (40%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (128; 16% instances), DET (68; 8% instances), PRON (50; 6% instances), ADJ (17; 2% instances), NUM (2; 0% instances), PROPN (1; 0% instances), VERB (1; 0% instances).

NOUN

128 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (111; 87%), Case=Acc (69; 54%).

NOUN tokens may have the following values of Gender:

Paradigm paráMascNeut
Number=Singparápará
Number=Plurpará

Gender seems to be lexical feature of NOUN. 99% lemmas (93) occur only with one value of Gender.

DET

68 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (66; 97%), Number=Sing (57; 84%), Definite=Def (52; 76%), Case=Acc (45; 66%).

DET tokens may have the following values of Gender:

Paradigm (ο)MascFemNeut
Number=Singtuči, čintu
Number=Plurtusta, čin

PRON

50 PRON tokens (53% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (48; 96%), Person=3 (48; 96%), PronType=Prs (40; 80%), Poss=EMPTY (35; 70%), Clitic=Yes (28; 56%).

PRON tokens may have the following values of Gender:

Paradigm (e)γóMascFemNeut
Case=Acc|Clitic=Yes|Number=Singdoči, ǰida, ta, to, do, δa
Case=Acc|Clitic=Yes|Number=Plurtus
Case=Acc|Number=Singzinta
Case=Gen|Clitic=Yes|Number=Sing|Poss=Yesdu, tučis, ǰis
Case=Gen|Number=Sing|Poss=Yeszis, čis, ǰis
Case=Nom|Clitic=Yes|Number=Singči

ADJ

17 ADJ tokens (77% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10; 59%).

ADJ tokens may have the following values of Gender:

Gender seems to be lexical feature of ADJ. 100% lemmas (16) occur only with one value of Gender.

NUM

2 NUM tokens (29% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc (2; 100%), NumType=Card (2; 100%), Number=Plur (2; 100%).

NUM tokens may have the following values of Gender:

PROPN

1 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=Voc (1; 100%), Number=Sing (1; 100%).

PROPN tokens may have the following values of Gender:

VERB

1 VERB tokens (1% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Aspect=Perf (1; 100%), Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=EMPTY (1; 100%), VerbForm=Part (1; 100%), Voice=Pass (1; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (59; 88%), NOUN –[amod]–> ADJ (10; 83%), NOUN –[conj]–> NOUN (3; 60%), ADJ –[conj]–> ADJ (1; 100%), ADJ –[det]–> DET (1; 100%), NOUN –[acl]–> VERB (1; 100%), NOUN –[appos]–> NOUN (1; 100%), NOUN –[nsubj]–> NOUN (1; 100%), PROPN –[amod]–> ADJ (1; 100%).