This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

922 tokens (54%) have a non-empty value of Gender. 689 types (70%) occur at least once with a non-empty value of Gender. 550 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags (count; percentage of all tokens): NOUN (478; 28%), PRON (123; 7%), PROPN (87; 5%), ADJ (85; 5%), VERB (73; 4%), DET (64; 4%), NUM (10; 1%), AUX (1; 0%), PART (1; 0%).
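Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout; the function names and the two-token sample are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, total

# Hypothetical two-token sample for illustration only.
SAMPLE = (
    "# text = सः कृतः\n"
    "1\tसः\tतद्\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing|PronType=Dem\t2\tnsubj\t_\t_\n"
    "2\tकृतः\tकृ\tVERB\t_\tCase=Nom|Gender=Masc|Number=Sing|VerbForm=Part\t0\troot\t_\t_\n"
)
with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], total[upos])
```

Dividing each per-tag count by the total number of tokens gives percentages comparable to those listed above.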

NOUN

478 NOUN tokens (93% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (375; 78%).

NOUN tokens may have the following values of Gender:

Paradigm अर्थ (Masc, Neut)
Case=Acc|Number=Sing: Masc अर्थम्; Neut अर्थम्
Case=Gen|Number=Sing: अर्थस्य
Case=Ins|Number=Sing: अर्थेन
Case=Nom|Number=Sing: Masc अर्थः; Neut अर्थम्, अर्थ
Case=Nom|Number=Plur: अर्थाः
Number=Sing: अर्थ

Gender seems to be a lexical feature of NOUN: 96% of lemmas (331) occur with only one value of Gender.
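The lexical-feature check can be sketched as counting, per lemma, how many distinct Gender values it occurs with. This is an illustration under assumptions: the function name is hypothetical, and the denominator (lemmas that carry Gender at least once) is a guess at how the page's percentage is computed.

```python
from collections import defaultdict

def single_gender_share(tokens):
    """tokens: iterable of (lemma, gender) pairs for one UPOS; gender is None if unset.

    Returns (lemmas with exactly one Gender value, lemmas with any Gender value).
    """
    values = defaultdict(set)
    for lemma, gender in tokens:
        if gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Illustrative input only; lemma and gender pairs mirror paradigms shown on this page.
tokens = [("अर्थ", "Masc"), ("अर्थ", "Neut"), ("कृ", "Masc"), ("त्रि", "Fem"), ("त्रि", "Fem")]
single, total = single_gender_share(tokens)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```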

PRON

123 PRON tokens (69% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (108; 88%), Person=EMPTY (102; 83%), PronType=Dem (80; 65%), Case=Nom (63; 51%).

PRON tokens may have the following values of Gender:

Paradigm तद् (Masc, Fem, Neut)
Case=Abl|Number=Sing|PronType=Dem: Masc तस्मात्; Fem तस्याः; Neut तस्मात्
Case=Acc|Number=Sing|PronType=Dem: तम्; तत्
Case=Acc|Number=Plur|PronType=Dem: तान्; तानि
Case=Dat|Number=Sing|PronType=Dem: तस्मै
Case=Gen|Number=Sing|Person=3|PronType=Dem: तस्य; तस्य
Case=Gen|Number=Sing|Poss=Yes|PronType=Dem: तस्य
Case=Gen|Number=Sing|PronType=Dem: तस्य
Case=Gen|Number=Dual|PronType=Dem: तयोर्
Case=Gen|Number=Plur|PronType=Dem: तेषाम्
Case=Ins|Number=Sing|PronType=Dem: तेन; तया
Case=Nom|Number=Sing|Person=3|PronType=Dem: तत्
Case=Nom|Number=Sing|PronType=Dem: Masc स, सः, सह्; Fem सा; Neut तत्, तद्
Case=Nom|Number=Dual: तौ
Case=Nom|Number=Dual|Person=3|PronType=Dem: तौ
Case=Nom|Number=Plur|PronType=Dem: ते

PROPN

87 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (75; 86%), Case=Nom (60; 69%).

PROPN tokens may have the following values of Gender:

Paradigm अर्थशास्त्र (Masc, Neut)
Number=Sing: Masc अर्थशास्त्र; Neut अर्थशास्त्रं
Number=Plur: अर्थशास्त्राणि

Gender seems to be a lexical feature of PROPN: 99% of lemmas (69) occur with only one value of Gender.

ADJ

85 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (71; 84%), Voice=EMPTY (67; 79%), Tense=EMPTY (66; 78%), VerbForm=EMPTY (65; 76%), Case=Nom (50; 59%).

ADJ tokens may have the following values of Gender:

Paradigm महत् (Masc, Neut)
Masc महान्; Neut महत्

Gender seems to be a lexical feature of ADJ: 97% of lemmas (75) occur with only one value of Gender.

VERB

73 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (73; 100%), VerbForm=Part (73; 100%), Mood=EMPTY (72; 99%), Case=Nom (65; 89%), Voice=Pass (65; 89%), Number=Sing (61; 84%), Tense=Past (50; 68%).

VERB tokens may have the following values of Gender:

Paradigm कृ (Masc, Neut)
Tense=Fut: Masc कर्तव्यः; Neut कर्तव्यम्
Tense=Past: Masc कृतः; Neut कृतम्
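A paradigm of this shape can be sketched as grouping the attested forms of one lemma by their remaining features and then by Gender. The function name and input layout below are hypothetical; the three forms are the ones listed for कृ on this page, with only Tense kept as the other feature for brevity.

```python
from collections import defaultdict

def build_paradigm(tokens):
    """tokens: iterable of (form, feats) pairs for one lemma; feats is a dict.

    Returns {other-features string: {gender: sorted list of forms}}.
    Tokens without a Gender value are ignored.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, feats in tokens:
        gender = feats.get("Gender")
        if gender is None:
            continue
        # Row label: every feature except Gender, in alphabetical order.
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != "Gender")
        table[rest][gender].add(form)
    return {row: {g: sorted(forms) for g, forms in cells.items()}
            for row, cells in table.items()}

# Illustrative input built from the forms shown for the lemma कृ.
tokens = [
    ("कृतः", {"Gender": "Masc", "Tense": "Past"}),
    ("कृतम्", {"Gender": "Neut", "Tense": "Past"}),
    ("कर्तव्यः", {"Gender": "Masc", "Tense": "Fut"}),
]
paradigm = build_paradigm(tokens)
for row, cells in paradigm.items():
    print(row, cells)
```

A row with no entry for a given gender simply means that no such form was attested in the input.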

Gender seems to be a lexical feature of VERB: 93% of lemmas (50) occur with only one value of Gender.

DET

64 DET tokens (94% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (56; 88%), PronType=Dem (45; 70%).

DET tokens may have the following values of Gender:

Paradigm तद् (Masc, Fem, Neut)
Case=Abl|Number=Sing: तस्मात्
Case=Acc|Number=Sing: Masc तम्; Fem तां; Neut तत्
Case=Acc|Number=Plur: तान्
Case=Gen|Number=Sing|Person=3: तस्य
Case=Gen|Number=Sing: तस्य
Case=Ins|Number=Sing|Person=3: तया
Case=Ins|Number=Sing: Masc तेन; Fem तया; Neut तेन
Case=Loc|Number=Sing: तस्याम्
Case=Nom|Number=Sing|Person=3: तत्
Case=Nom|Number=Sing: स, सः; तत्
Case=Nom|Number=Plur: ते

NUM

10 NUM tokens (56% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (10; 100%), Number=Sing (7; 70%), Case=Nom (6; 60%).

NUM tokens may have the following values of Gender:

Paradigm त्रि (Masc, Fem)
Masc त्रयः; Fem तिस्रः

AUX

1 AUX token (8% of all AUX tokens) has a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Pres (1; 100%), VerbForm=Part (1; 100%), Voice=Act (1; 100%).

AUX tokens may have the following values of Gender:

PART

1 PART token (3% of all PART tokens) has a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (1; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (48; 79%), NOUN –[det]–> DET (46; 84%), PROPN –[conj]–> PROPN (21; 84%), NOUN –[det]–> PRON (7; 88%), NOUN –[nsubj]–> NOUN (6; 67%), ADJ –[conj]–> ADJ (5; 71%), PRON –[nsubj:cop]–> NOUN (5; 83%), PROPN –[conj]–> NOUN (5; 71%), VERB –[conj]–> VERB (5; 71%), ADJ –[nsubj:cop]–> NOUN (4; 100%).
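Agreement figures of this kind can be sketched as follows. This is a minimal illustration, not the page generator's method: the function name and token layout are hypothetical, and it assumes the percentage is the share of agreeing pairs among parent-child pairs where both nodes carry a Gender value.

```python
from collections import Counter

def gender_agreement(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, gender (None if unset).

    Returns {(parent_upos, deprel, child_upos): (agreeing pairs, comparable pairs)}.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, comparable = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # head 0 is the root, so there is no parent
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        comparable[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], n) for key, n in comparable.items()}

# Hypothetical sentence fragment for illustration only.
sentence = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Masc"},
    {"id": 3, "head": 2, "upos": "DET", "deprel": "det", "gender": "Neut"},
]
result = gender_agreement(sentence)
for (parent, deprel, child), (same, pairs) in result.items():
    print(f"{parent} -[{deprel}]-> {child}: {same} of {pairs} agree")
```

Summing these per-sentence counts over the whole treebank and sorting by the number of agreeing pairs would give a ranking like the one above.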