Treebank Statistics: UD_Sanskrit-Vedic: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
16092 tokens (59%) have a non-empty value of Gender
.
5364 types (71%) occur at least once with a non-empty value of Gender
.
2675 lemmas (79%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (8810; 32% instances), PRON (3281; 12% instances), ADJ (2236; 8% instances), VERB (1052; 4% instances), ADV (304; 1% instances), NUM (238; 1% instances), DET (148; 1% instances), AUX (23; 0% instances).
NOUN
8810 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (6240; 71%).
NOUN
tokens may have the following values of Gender
:
Fem
(1991; 23% of non-emptyGender
): kṛtyā, vaśā, devatāḥ, diśaḥ, āpaḥ, iḍā, vāc, prajā, devatayā, diśamMasc
(4782; 54% of non-emptyGender
): devāḥ, agniḥ, deva, indraḥ, indra, yajñasya, devānām, agne, kāmaḥ, paśavaḥNeut
(2037; 23% of non-emptyGender
): anna, rūpa, brahma, vīryam, namaḥ, rūpam, manaḥ, annam, viṣam, dhāmaEMPTY
(32): anaḍvān, upasatsu, āśiṣam, anaḍuhaḥ, panthām, virāḍbhyām, śriyai, avayāḥ, havyavāṭ, maghonaḥ
Paradigm rūpa | Masc | Fem | Neut |
---|---|---|---|
_ | rūpa | ||
Case=Acc|Number=Sing | rūpam | rūpām | rūpam |
Case=Acc|Number=Plur | rūpāṇi, rūpāni | ||
Case=Dat|Number=Sing | rūpāya | ||
Case=Gen|Number=Plur | rūpāṇām | rūpāṇām | |
Case=Ins|Number=Sing | rūpeṇa | ||
Case=Nom|Number=Sing | rūpaḥ | rūpā | rūpam |
Case=Nom|Number=Plur | rūpāḥ | rūpāṇi |
Gender
seems to be lexical feature of NOUN
. 92% lemmas (1446) occur only with one value of Gender
.
PRON
3281 PRON tokens (79% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (2626; 80%).
PRON
tokens may have the following values of Gender
:
Fem
(561; 17% of non-emptyGender
): sā, tāḥ, tām, yāḥ, iyam, etāḥ, eṣā, yām, imāḥ, yāMasc
(2120; 65% of non-emptyGender
): yaḥ, sa, enam, tam, asya, saḥ, asmai, ye, te, eṣaNeut
(600; 18% of non-emptyGender
): tat, yat, etat, idam, tena, tāni, etena, kim, yena, aparamEMPTY
(867): te, naḥ, tvā, tvam, me, aham, mā, vaḥ, vayam, mām
Paradigm tad | Masc | Fem | Neut |
---|---|---|---|
Case=Abl|Number=Sing | tasmāt | tasyāḥ | tasmāt |
Case=Abl|Number=Plur | tebhyaḥ | ||
Case=Acc|Number=Sing | tam | tām | tat, tad, ṭat |
Case=Acc|Number=Dual | tau | te | te |
Case=Acc|Number=Plur | tān, tāṁ | tāḥ | tāni, tā |
Case=Dat|Number=Sing | tasmai | tasyai | |
Case=Dat|Number=Plur | tebhyaḥ | ||
Case=Gen|Number=Sing | tasya | tasyāḥ | tasya |
Case=Gen|Number=Dual | tayoḥ | ||
Case=Gen|Number=Plur | teṣām | tāsām | teṣām |
Case=Ins|Number=Sing | tena | tayā | tena |
Case=Ins|Number=Plur | taiḥ, tebhiḥ | tābhiḥ | taiḥ |
Case=Loc|Number=Sing | tasmin | tasyām | tasmin |
Case=Loc|Number=Plur | tāsu | ||
Case=Nom|Number=Sing | sa, saḥ | sā | tat |
Case=Nom|Number=Dual | tau | te | te |
Case=Nom|Number=Plur | te | tāḥ | tāni, tā |
ADJ
2236 ADJ tokens (94% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1716; 77%), Case=Nom (1157; 52%).
ADJ
tokens may have the following values of Gender
:
Fem
(472; 21% of non-emptyGender
): uttamām, svayā, prathamām, apagā, kāminī, uttamayā, vichandasaḥ, abhirūpā, atithimatī, brāhmaṇaspatyāmMasc
(1309; 59% of non-emptyGender
): prathamaḥ, ūrdhvaḥ, medhyaḥ, priyaḥ, jāḥ, kṛtam, sarve, aindram, devatyaḥ, pāpmāNeut
(455; 20% of non-emptyGender
): priyam, sarvam, arasam, abhirūpam, parokṣam, pratyakṣam, mahat, svena, bṛhat, indriyamEMPTY
(137): kṛṣṇa, bahu, hiraṇya, viśva, amṛta, dīrgha, mahā, sat, uttara, śiti
Paradigm sarva | Masc | Fem | Neut |
---|---|---|---|
Case=Abl|Number=Plur | sarvebhyaḥ | ||
Case=Acc|Number=Sing | sarvam | sarvām | sarvam |
Case=Acc|Number=Plur | sarvān | ||
Case=Dat|Number=Plur | sarvebhyaḥ | ||
Case=Gen|Number=Plur | sarvāsām | ||
Case=Ins|Number=Sing | sarveṇa | ||
Case=Loc|Number=Plur | sarveṣu | ||
Case=Nom|Number=Sing | sarvaḥ | sarvā | sarvam |
Case=Nom|Number=Plur | sarve | sarvāḥ |
VERB
1052 VERB tokens (20% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (1052; 100%), Person=EMPTY (1052; 100%), VerbForm=Part (990; 94%), Number=Sing (846; 80%), Case=Nom (683; 65%), Tense=Past (610; 58%).
VERB
tokens may have the following values of Gender
:
Fem
(170; 16% of non-emptyGender
): upahūtā, kṛtā, samṛddhāḥ, jātā, bibhratī, pratibuddhāḥ, abhīṣṭāḥ, avasṛṣṭā, avattā, bhūtayāMasc
(686; 65% of non-emptyGender
): vidvān, yajamānaḥ, jātaḥ, anūcyaḥ, yajamānasya, yajamānāya, yajamānam, gṛhītaḥ, upahūtaḥ, upāptaḥNeut
(196; 19% of non-emptyGender
): samṛddham, kriyamāṇam, anūcyam, kartavyam, kṛtam, otāni, upahūtam, ādṛtyam, anūcyāni, avadīyamānasyaEMPTY
(4231): bhavati, veda, anvāha, ālabheta, āhuḥ, āha, uvāca, anuvyacalat, abhavat, dadhāti
Paradigm bhū | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing|Tense=Fut | bhaviṣyat | ||
Case=Acc|Number=Sing|Tense=Past | bhūtam | ||
Case=Ins|Number=Sing|Tense=Past | bhūtayā | ||
Case=Loc|Number=Sing|Tense=Past | bhūte | ||
Case=Nom|Number=Sing|Tense=Fut | bhaviṣyat | ||
Case=Nom|Number=Sing|Tense=Past | bhūtaḥ | bhūtam | |
Case=Nom|Number=Dual|Tense=Past | bhūtau |
ADV
304 ADV tokens (10% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: Number=Sing (302; 99%), Case=Acc (253; 83%).
ADV
tokens may have the following values of Gender
:
Masc
(14; 5% of non-emptyGender
): tena, sam, etena, tasya, saNeut
(290; 95% of non-emptyGender
): tat, etat, yat, tena, idam, adaḥ, aparam, catuḥ, etad, tasmātEMPTY
(2640): vai, evam, hi, ā, atha, tasmāt, sam, su, pra, vi
Paradigm tad | Masc | Neut |
---|---|---|
Case=Abl | tasmāt | |
Case=Acc | tat, tad | |
Case=Gen | tasya | |
Case=Ins | tena | tena |
Case=Nom | sa | tat |
NUM
238 NUM tokens (82% of all NUM
tokens) have a non-empty value of Gender
.
NUM
tokens may have the following values of Gender
:
Fem
(54; 23% of non-emptyGender
): tisraḥ, trayastriṃśat, aṣṭau, aṣṭābhiḥ, dvayoḥ, catasraḥ, catasṛbhiḥ, pañca, ekayā, pañcāśatMasc
(47; 20% of non-emptyGender
): sapta, trayaḥ, dvau, pañca, aṣṭau, caturaḥ, dvādaśā, ekam, ekaḥ, trīnNeut
(137; 58% of non-emptyGender
): śatam, śata, sahasra, sapta, dvādaśa, sahasram, nava, śatāni, ekādaśa, trīṇiEMPTY
(54): eka, dvi, aṣṭa, catur, daśa, ekādaśa, pañca, tri, dvis, dvādaśa
Paradigm tri | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | trīn | tisraḥ | trī, trīṇi |
Case=Gen | trayāṇām | ||
Case=Ins | tisṛbhiḥ | ||
Case=Loc | triṣu | triṣu | |
Case=Nom | trayaḥ | tisraḥ | trīṇi, trī |
DET
148 DET tokens (90% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Plur (106; 72%).
DET
tokens may have the following values of Gender
:
Fem
(47; 32% of non-emptyGender
): sarvāḥ, sarvābhyaḥ, sarvābhiḥ, viśvāḥ, itithīm, sarvasyai, sarvāsām, svayā, ubhayyaḥ, sarvāsuMasc
(53; 36% of non-emptyGender
): sarvān, sarve, sarveṣām, viśve, ubhayoḥ, sarvam, viśvaiḥ, viśvebhiḥ, anyake, bahaveNeut
(48; 32% of non-emptyGender
): viśvā, sarvāṇi, viśvam, sarvam, sarvaiḥ, sarveṣām, sarvā, viśvāni, sarvasya, svenaEMPTY
(17): viśva, sarva, puru
Paradigm sarva | Masc | Fem | Neut |
---|---|---|---|
Case=Abl|Number=Plur | sarvābhyaḥ | ||
Case=Acc|Number=Sing | sarvam | sarvam | |
Case=Acc|Number=Plur | sarvān | sarvāḥ | sarvā, sarvāṇi |
Case=Dat|Number=Sing | sarvasyai | ||
Case=Dat|Number=Plur | sarvebhyaḥ | sarvābhyaḥ | |
Case=Gen|Number=Sing | sarvasya | ||
Case=Gen|Number=Plur | sarveṣām | sarvāsām | sarveṣām |
Case=Ins|Number=Sing | sarveṇa | sarveṇa | |
Case=Ins|Number=Plur | sarvaiḥ | sarvābhiḥ | sarvaiḥ |
Case=Loc|Number=Plur | sarveṣu | sarvāsu | |
Case=Nom|Number=Sing | sarvaḥ | sarvam | |
Case=Nom|Number=Plur | sarve | sarvāḥ | sarvāṇi |
AUX
23 AUX tokens (8% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (23; 100%), Person=EMPTY (23; 100%), Tense=Pres (23; 100%), Number=Sing (22; 96%).
AUX
tokens may have the following values of Gender
:
Fem
(3; 13% of non-emptyGender
): satīm, satīMasc
(20; 87% of non-emptyGender
): san, santam, bhavantaḥ, sataḥEMPTY
(278): asi, bhavati, astu, syāt, asaḥ, bhavanti, āsīt, santu, asat, āsan
Paradigm as | Masc | Fem |
---|---|---|
Case=Acc | santam | satīm |
Case=Gen | sataḥ | |
Case=Nom | san | satī |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (555; 88%),
NOUN –[conj]–> NOUN (546; 62%),
NOUN –[det]–> PRON (452; 98%),
NOUN –[nsubj]–> PRON (285; 79%),
NOUN –[nsubj]–> NOUN (193; 54%),
NOUN –[acl]–> VERB (179; 72%),
ADJ –[nsubj]–> NOUN (165; 94%),
ADJ –[conj]–> ADJ (154; 93%),
NOUN –[det]–> DET (138; 91%),
NOUN –[acl]–> NOUN (118; 81%).