home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-Vedic: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

16092 tokens (59%) have a non-empty value of Gender. 5364 types (71%) occur at least once with a non-empty value of Gender. 2675 lemmas (79%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (8810; 32% instances), PRON (3281; 12% instances), ADJ (2236; 8% instances), VERB (1052; 4% instances), ADV (304; 1% instances), NUM (238; 1% instances), DET (148; 1% instances), AUX (23; 0% instances).

NOUN

8810 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (6240; 71%).

NOUN tokens may have the following values of Gender:

Paradigm rūpaMascFemNeut
_rūpa
Case=Acc|Number=Singrūpamrūpāmrūpam
Case=Acc|Number=Plurrūpāṇi, rūpāni
Case=Dat|Number=Singrūpāya
Case=Gen|Number=Plurrūpāṇāmrūpāṇām
Case=Ins|Number=Singrūpeṇa
Case=Nom|Number=Singrūpaḥrūpārūpam
Case=Nom|Number=Plurrūpāḥrūpāṇi

Gender seems to be lexical feature of NOUN. 92% lemmas (1446) occur only with one value of Gender.

PRON

3281 PRON tokens (79% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2626; 80%).

PRON tokens may have the following values of Gender:

Paradigm tadMascFemNeut
Case=Abl|Number=Singtasmāttasyāḥtasmāt
Case=Abl|Number=Plurtebhyaḥ
Case=Acc|Number=Singtamtāmtat, tad, ṭat
Case=Acc|Number=Dualtautete
Case=Acc|Number=Plurtān, tāṁtāḥtāni, tā
Case=Dat|Number=Singtasmaitasyai
Case=Dat|Number=Plurtebhyaḥ
Case=Gen|Number=Singtasyatasyāḥtasya
Case=Gen|Number=Dualtayoḥ
Case=Gen|Number=Plurteṣāmtāsāmteṣām
Case=Ins|Number=Singtenatayātena
Case=Ins|Number=Plurtaiḥ, tebhiḥtābhiḥtaiḥ
Case=Loc|Number=Singtasmintasyāmtasmin
Case=Loc|Number=Plurtāsu
Case=Nom|Number=Singsa, saḥtat
Case=Nom|Number=Dualtautete
Case=Nom|Number=Plurtetāḥtāni, tā

ADJ

2236 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1716; 77%), Case=Nom (1157; 52%).

ADJ tokens may have the following values of Gender:

Paradigm sarvaMascFemNeut
Case=Abl|Number=Plursarvebhyaḥ
Case=Acc|Number=Singsarvamsarvāmsarvam
Case=Acc|Number=Plursarvān
Case=Dat|Number=Plursarvebhyaḥ
Case=Gen|Number=Plursarvāsām
Case=Ins|Number=Singsarveṇa
Case=Loc|Number=Plursarveṣu
Case=Nom|Number=Singsarvaḥsarvāsarvam
Case=Nom|Number=Plursarvesarvāḥ

VERB

1052 VERB tokens (20% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1052; 100%), Person=EMPTY (1052; 100%), VerbForm=Part (990; 94%), Number=Sing (846; 80%), Case=Nom (683; 65%), Tense=Past (610; 58%).

VERB tokens may have the following values of Gender:

Paradigm bhūMascFemNeut
Case=Acc|Number=Sing|Tense=Futbhaviṣyat
Case=Acc|Number=Sing|Tense=Pastbhūtam
Case=Ins|Number=Sing|Tense=Pastbhūtayā
Case=Loc|Number=Sing|Tense=Pastbhūte
Case=Nom|Number=Sing|Tense=Futbhaviṣyat
Case=Nom|Number=Sing|Tense=Pastbhūtaḥbhūtam
Case=Nom|Number=Dual|Tense=Pastbhūtau

ADV

304 ADV tokens (10% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Number=Sing (302; 99%), Case=Acc (253; 83%).

ADV tokens may have the following values of Gender:

Paradigm tadMascNeut
Case=Abltasmāt
Case=Acctat, tad
Case=Gentasya
Case=Instenatena
Case=Nomsatat

NUM

238 NUM tokens (82% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm triMascFemNeut
Case=Acctrīntisraḥtrī, trīṇi
Case=Gentrayāṇām
Case=Instisṛbhiḥ
Case=Loctriṣutriṣu
Case=Nomtrayaḥtisraḥtrīṇi, trī

DET

148 DET tokens (90% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Plur (106; 72%).

DET tokens may have the following values of Gender:

Paradigm sarvaMascFemNeut
Case=Abl|Number=Plursarvābhyaḥ
Case=Acc|Number=Singsarvamsarvam
Case=Acc|Number=Plursarvānsarvāḥsarvā, sarvāṇi
Case=Dat|Number=Singsarvasyai
Case=Dat|Number=Plursarvebhyaḥsarvābhyaḥ
Case=Gen|Number=Singsarvasya
Case=Gen|Number=Plursarveṣāmsarvāsāmsarveṣām
Case=Ins|Number=Singsarveṇasarveṇa
Case=Ins|Number=Plursarvaiḥsarvābhiḥsarvaiḥ
Case=Loc|Number=Plursarveṣusarvāsu
Case=Nom|Number=Singsarvaḥsarvam
Case=Nom|Number=Plursarvesarvāḥsarvāṇi

AUX

23 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (23; 100%), Person=EMPTY (23; 100%), Tense=Pres (23; 100%), Number=Sing (22; 96%).

AUX tokens may have the following values of Gender:

Paradigm asMascFem
Case=Accsantamsatīm
Case=Gensataḥ
Case=Nomsansatī

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (555; 88%), NOUN –[conj]–> NOUN (546; 62%), NOUN –[det]–> PRON (452; 98%), NOUN –[nsubj]–> PRON (285; 79%), NOUN –[nsubj]–> NOUN (193; 54%), NOUN –[acl]–> VERB (179; 72%), ADJ –[nsubj]–> NOUN (165; 94%), ADJ –[conj]–> ADJ (154; 93%), NOUN –[det]–> DET (138; 91%), NOUN –[acl]–> NOUN (118; 81%).