home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-UFAL: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

881 tokens (48%) have a non-empty value of Gender. 647 types (61%) occur at least once with a non-empty value of Gender. 500 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (436; 24% instances), PRON (120; 7% instances), VERB (98; 5% instances), ADJ (77; 4% instances), PROPN (72; 4% instances), DET (64; 3% instances), NUM (10; 1% instances), ADV (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances).

NOUN

436 NOUN tokens (78% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Compound=EMPTY (436; 100%), Number=Sing (368; 84%).

NOUN tokens may have the following values of Gender:

Paradigm अर्थMascNeut
Case=Acc|Number=Singअर्थम्अर्थम्
Case=Gen|Number=Singअर्थस्य
Case=Ins|Number=Singअर्थेन
Case=Nom|Number=Singअर्थः
Case=Nom|Number=Plurअर्थाः

Gender seems to be lexical feature of NOUN. 94% lemmas (283) occur only with one value of Gender.

PRON

120 PRON tokens (67% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (105; 88%), Person=EMPTY (101; 84%), PronType=Dem (78; 65%).

PRON tokens may have the following values of Gender:

Paradigm तद्MascFemNeut
Case=Abl|Number=Sing|PronType=Demतस्मात्तस्याःतस्मात्
Case=Acc|Number=Sing|PronType=Demतम्तत्
Case=Acc|Number=Plur|PronType=Demतान्तानि
Case=Dat|Number=Sing|PronType=Demतस्मै
Case=Gen|Number=Sing|Person=3|PronType=Demतस्यतस्य
Case=Gen|Number=Sing|Poss=Yes|PronType=Demतस्य
Case=Gen|Number=Sing|PronType=Demतस्य
Case=Gen|Number=Dual|PronType=Demतयोर्
Case=Gen|Number=Plur|PronType=Demतेषाम्
Case=Ins|Number=Sing|PronType=Demतेनतया
Case=Nom|Number=Sing|Person=3|PronType=Demतत्
Case=Nom|Number=Sing|PronType=Demस, सः, सह्सातत्, तद्
Case=Nom|Number=Dualतौ
Case=Nom|Number=Dual|Person=3|PronType=Demतौ
Case=Nom|Number=Plur|PronType=Demते

VERB

98 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (98; 100%), VerbForm=Part (98; 100%), Mood=EMPTY (97; 99%), Voice=Pass (88; 90%), Case=Nom (85; 87%), Number=Sing (82; 84%), Tense=Past (67; 68%).

VERB tokens may have the following values of Gender:

Paradigm कृMascNeut
Number=Sing|Tense=Futकर्तव्यःकर्तव्यम्, कार्यम्
Number=Sing|Tense=Pastकृतःकृतम्
Number=Plur|Tense=Futकर्तव्याः

ADJ

77 ADJ tokens (68% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Compound=EMPTY (77; 100%), Number=Sing (65; 84%), Case=Nom (41; 53%).

ADJ tokens may have the following values of Gender:

Paradigm महत्MascNeut
महान्महत्

Gender seems to be lexical feature of ADJ. 97% lemmas (66) occur only with one value of Gender.

PROPN

72 PROPN tokens (77% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Compound=EMPTY (72; 100%), Number=Sing (62; 86%), Case=Nom (48; 67%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (57) occur only with one value of Gender.

DET

64 DET tokens (91% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (57; 89%), PronType=Dem (45; 70%).

DET tokens may have the following values of Gender:

Paradigm तद्MascFemNeut
Case=Abl|Number=Singतस्मात्
Case=Acc|Number=Sing|Person=3तत्
Case=Acc|Number=Singतम्तांतत्
Case=Acc|Number=Plurतान्
Case=Gen|Number=Sing|Person=3तस्य
Case=Gen|Number=Singतस्य
Case=Ins|Number=Sing|Person=3तया
Case=Ins|Number=Singतेनतयातेन
Case=Loc|Number=Singतस्याम्
Case=Nom|Number=Sing|Person=3तत्
Case=Nom|Number=Singस, सःतत्
Case=Nom|Number=Plurते

NUM

10 NUM tokens (56% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Compound=EMPTY (10; 100%), NumType=Card (10; 100%), Number=Sing (7; 70%), Case=Nom (6; 60%).

NUM tokens may have the following values of Gender:

Paradigm त्रिMascFem
त्रयःतिस्रः

ADV

2 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

AUX

1 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Pres (1; 100%), VerbForm=Part (1; 100%), Voice=Act (1; 100%).

AUX tokens may have the following values of Gender:

PART

1 PART tokens (3% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (1; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (42; 84%), NOUN –[amod]–> ADJ (32; 54%), PROPN –[conj]–> PROPN (13; 65%), NOUN –[acl]–> VERB (12; 75%), NOUN –[nsubj]–> NOUN (7; 70%), ADJ –[conj]–> ADJ (5; 71%), PRON –[nsubj:cop]–> NOUN (5; 83%), VERB –[conj]–> VERB (5; 71%), PRON –[acl]–> PRON (4; 100%), PROPN –[conj]–> NOUN (4; 80%).