This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-Vedic: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

115253 tokens (56%) have a non-empty value of Gender. 24860 types (69%) occur at least once with a non-empty value of Gender. 10366 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags; the percentage after each count is the share of all tokens in the treebank: NOUN (64911; 31%), PRON (21606; 10%), ADJ (16621; 8%), VERB (8501; 4%), NUM (2021; 1%), ADV (1058; 1%), DET (442; 0%), AUX (93; 0%).
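Counts of this kind can be recomputed from the treebank's CoNLL-U files. The sketch below is a minimal illustration, assuming the standard ten-column CoNLL-U layout (UPOS in column 4, FEATS in column 6). The `SAMPLE` sentence and the helper names `parse_feats` and `gender_counts` are made up for this example; they are not taken from the treebank or from any UD tool.

```python
from collections import Counter

# Hypothetical three-token sample in CoNLL-U format (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = illustrative only",
    "1\tsaḥ\ttad\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tdet\t_\t_",
    "2\tsomaḥ\tsoma\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_",
    "3\tgacchati\tgam\tVERB\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_",
])

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts(conllu_text):
    """Return (total tokens, Counter of UPOS tags for tokens that have Gender)."""
    total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            by_upos[cols[3]] += 1
    return total, by_upos

total, by_upos = gender_counts(SAMPLE)
for upos, n in by_upos.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

Dividing each per-tag count by the total token count, rather than by the number of Gender-bearing tokens, matches the figures above: the eight percentages sum to roughly the 56% reported for the feature as a whole.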

NOUN

64911 NOUN tokens (90% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Compound=EMPTY (64911; 100%), Number=Sing (49261; 76%).
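A co-occurrence line like the one above can be sketched as a simple cross-tabulation. The example assumes that `EMPTY` stands for "this feature is absent on the token", which is how an entry such as `Compound=EMPTY (64911; 100%)` reads most naturally; the token list, the `OTHER_FEATURES` selection, and the function name `cooccurrence` are hypothetical.

```python
from collections import Counter

# Hypothetical (UPOS, FEATS) pairs; not taken from the treebank.
TOKENS = [
    ("NOUN", "Case=Nom|Gender=Masc|Number=Sing"),
    ("NOUN", "Case=Acc|Gender=Neut|Number=Sing"),
    ("NOUN", "Case=Nom|Gender=Fem|Number=Plur"),
    ("NOUN", "Compound=Yes"),  # no Gender, so this token is ignored
]
OTHER_FEATURES = ["Compound", "Number"]  # features to cross-tabulate with Gender

def cooccurrence(tokens, upos, feature, others):
    """Count values of `others` on tokens of `upos` that carry `feature`."""
    n = 0
    counts = Counter()
    for tag, feats_str in tokens:
        feats = {} if feats_str == "_" else dict(
            pair.split("=", 1) for pair in feats_str.split("|")
        )
        if tag != upos or feature not in feats:
            continue
        n += 1
        for other in others:
            # Assumption: a missing feature is reported with the value EMPTY.
            counts[f"{other}={feats.get(other, 'EMPTY')}"] += 1
    return n, counts

n, counts = cooccurrence(TOKENS, "NOUN", "Gender", OTHER_FEATURES)
for value, k in counts.most_common():
    print(f"{value} ({k}; {100 * k / n:.0f}%)")
```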

NOUN tokens may have the following values of Gender:

Paradigm soma (columns: Masc / Fem / Neut)
Case=Abl|Number=Sing: somāt
Case=Acc|Number=Sing: somam
Case=Acc|Number=Plur: somān, somāṁ / somāḥ
Case=Dat|Number=Sing: somāya
Case=Gen|Number=Sing: somasya
Case=Gen|Number=Plur: somānām
Case=Ins|Number=Sing: somena
Case=Ins|Number=Plur: somaiḥ, somebhiḥ
Case=Loc|Number=Sing: some
Case=Loc|Number=Plur: someṣu
Case=Nom|Number=Sing: somaḥ / somam
Case=Nom|Number=Plur: somāḥ, somāsaḥ
Case=Voc|Number=Sing: soma, somaiḥ

PRON

21606 PRON tokens (78% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (17126; 79%).

PRON tokens may have the following values of Gender:

Paradigm tad (columns: Masc / Fem / Neut)
Case=Abl|Number=Sing: tasmāt / tasyāḥ / tasmāt
Case=Abl|Number=Plur: tebhyaḥ / tābhyaḥ / tebhyaḥ
Case=Acc|Number=Sing: tam, ṭam / tām / tat, tad, ṭat
Case=Acc|Number=Dual: tau, tā / te / te
Case=Acc|Number=Plur: tān, tāṁ / tāḥ / tāni, tā
Case=Dat|Number=Sing: tasmai / tasyai / tasmai
Case=Dat|Number=Dual: tābhyām / tābhyām
Case=Dat|Number=Plur: tebhyaḥ / tābhyaḥ
Case=Gen|Number=Sing: tasya / tasyāḥ / tasya
Case=Gen|Number=Dual: tayoḥ / tayoḥ
Case=Gen|Number=Plur: teṣām / tāsām / teṣām
Case=Ins|Number=Sing: tena / tayā / tena
Case=Ins|Number=Dual: tābhyām / tābhyām / tābhyām
Case=Ins|Number=Plur: taiḥ, tebhiḥ / tābhiḥ / taiḥ
Case=Loc|Number=Sing: tasmin / tasyām / tasmin, sasmin
Case=Loc|Number=Dual: tayoḥ
Case=Loc|Number=Plur: teṣu / tāsu / teṣu
Case=Nom|Number=Sing: saḥ, sa / tat
Case=Nom|Number=Dual: tau, tā / te / te
Case=Nom|Number=Plur: te / tāḥ / tāni, tā

ADJ

16621 ADJ tokens (91% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (12658; 76%).

ADJ tokens may have the following values of Gender:

Paradigm uttara (columns: Masc / Fem / Neut)
Case=Abl|Number=Sing: uttarāt, uttarasmāt
Case=Acc|Number=Sing: uttaram / uttarām / uttaram
Case=Acc|Number=Dual: uttarau / uttare
Case=Acc|Number=Plur: uttarān / uttarāḥ / uttarāṇi
Case=Dat|Number=Sing: uttarāyai
Case=Dat|Number=Plur: uttarebhyaḥ
Case=Gen|Number=Sing: uttarasya / uttarasya
Case=Gen|Number=Plur: uttarāṇām
Case=Ins|Number=Sing: uttareṇa / uttarayā / uttareṇa, uttarena
Case=Ins|Number=Dual: uttarābhyām / uttarābhyām
Case=Ins|Number=Plur: uttaraiḥ / uttarābhiḥ / uttaraiḥ
Case=Loc|Number=Sing: uttare, uttarasmin / uttarasyām, uttarāyām / uttarasmin
Case=Loc|Number=Dual: uttarayoḥ / uttarayoḥ
Case=Loc|Number=Plur: uttareṣu / uttarāsu
Case=Nom|Number=Sing: uttaraḥ / uttarā / uttaram
Case=Nom|Number=Dual: uttarau / uttare
Case=Nom|Number=Plur: uttare / uttarāṇi, uttarā

VERB

8501 VERB tokens (21% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (8501; 100%), Person=EMPTY (8501; 100%), VerbForm=Part (8000; 94%), Number=Sing (6697; 79%), Case=Nom (5342; 63%), Tense=Past (4912; 58%).

VERB tokens may have the following values of Gender:

Paradigm kṛ (columns: Masc / Fem / Neut)
Case=Abl|Number=Plur|Tense=Past|VerbForm=Part: kṛtebhyaḥ
Case=Acc|Number=Sing|Tense=Fut|VerbForm=Part: kariṣyantīm
Case=Acc|Number=Sing|Tense=Past|VerbForm=Part: kṛtam / kṛtām / kṛtam
Case=Acc|Number=Sing|Tense=Pres|VerbForm=Part: kṛṇvantam / kṛṇvānām
Case=Acc|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Pass: kriyamāṇām / kriyamāṇam
Case=Acc|Number=Sing|VerbForm=Gdv: kāryam
Case=Acc|Number=Plur|Tense=Fut|VerbForm=Part: kariṣyataḥ
Case=Acc|Number=Plur|Tense=Past|VerbForm=Part: kṛtāḥ / kṛtā, kṛtāni
Case=Acc|Number=Plur|Tense=Pres|VerbForm=Part: kurvataḥ
Case=Acc|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Pass: kriyamāṇā
Case=Dat|Number=Sing|Tense=Past|VerbForm=Part: cakruṣe
Case=Dat|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Pass: kriyamāṇāya
Case=Gen|Number=Sing|Tense=Fut|VerbForm=Part: kariṣyataḥ
Case=Gen|Number=Sing|Tense=Past|VerbForm=Part: kṛtasya, cakrivasaḥ / kṛtasya
Case=Gen|Number=Sing|Tense=Pres|VerbForm=Part: kurvataḥ, kṛṇvataḥ
Case=Gen|Number=Plur|Tense=Past|VerbForm=Part: kṛtānām
Case=Gen|Number=Plur|Tense=Pres|VerbForm=Part: kurvatām
Case=Ins|Number=Sing|Tense=Past|VerbForm=Part: kṛtena / kṛtena
Case=Ins|Number=Sing|VerbForm=Gdv: kartvena
Case=Ins|Number=Plur|Tense=Past|VerbForm=Part: kṛtābhiḥ
Case=Loc|Number=Sing|Tense=Past|VerbForm=Part: kṛte
Case=Loc|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Pass: kriyamāṇe
Case=Loc|Number=Plur|Tense=Past|VerbForm=Part: kṛtāsu
Case=Nom|Number=Sing|Tense=Fut|VerbForm=Part: kariṣyan
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part: kṛtaḥ, cakrāṇaḥ, krān / kṛtā / kṛtam
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part: kṛṇvan, kurvan, kṛṇvānaḥ, kurvāṇaḥ / kṛṇvatī, kṛṇvānā
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Pass: kriyamāṇam
Case=Nom|Number=Sing|VerbForm=Gdv: kāryaḥ, kartavyaḥ / kāryā, kartavyā / kāryam, kartavyam, kṛtyam, kartvam
Case=Nom|Number=Dual|Tense=Past|VerbForm=Part: kṛte
Case=Nom|Number=Dual|VerbForm=Gdv: kartavyau, kāryau / kartavye
Case=Nom|Number=Plur|Tense=Fut|VerbForm=Part: kariṣyantaḥ
Case=Nom|Number=Plur|Tense=Past|VerbForm=Part: kṛtāḥ, krantaḥ / kṛtāni
Case=Nom|Number=Plur|Tense=Pres|VerbForm=Part: kṛṇvānāḥ, kurvantaḥ, kurvāṇāḥ, kṛṇvantaḥ
Case=Nom|Number=Plur|VerbForm=Gdv: kāryāḥ / kartavyāḥ, kāryāḥ / kartvāni, kāryāṇi
Case=Voc|Number=Sing|Tense=Past|VerbForm=Part: kṛtam

NUM

2021 NUM tokens (70% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Compound=EMPTY (2021; 100%), Case=Nom (1034; 51%).

NUM tokens may have the following values of Gender:

Paradigm eka (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: ekam / ekām / ekam
Case=Gen|Number=Sing: ekasya / ekasyāḥ / ekasya
Case=Gen|Number=Plur: ekeṣām
Case=Ins|Number=Sing: ekena / ekayā / ekena
Case=Loc|Number=Sing: ekasmin / ekasyām / ekasmin
Case=Nom|Number=Sing: ekaḥ / ekā / ekam
Case=Nom|Number=Plur: eke

ADV

1058 ADV tokens (8% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Paradigm tad (columns: Masc / Neut)
Case=Abl: tasmāt
Case=Acc: tam / tat, tad
Case=Ins: tad, tena
Case=Nom: sa, saḥ, tad / tat, tad

DET

442 DET tokens (78% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Compound=EMPTY (442; 100%), Number=Sing (237; 54%).

DET tokens may have the following values of Gender:

Paradigm viśva (columns: Masc / Fem / Neut)
Case=Abl|Number=Sing: viśvasmāt, viśvāt
Case=Abl|Number=Plur: viśvebhyaḥ
Case=Acc|Number=Sing: viśvam / viśvām / viśvam
Case=Acc|Number=Plur: viśvān / viśvāḥ / viśvā, viśvāni
Case=Dat|Number=Sing: viśvasmai
Case=Dat|Number=Plur: viśvebhyaḥ
Case=Gen|Number=Sing: viśvasya / viśvasyāḥ / viśvasya
Case=Gen|Number=Plur: viśveṣām / viśvāsām
Case=Ins|Number=Sing: viśvena
Case=Ins|Number=Plur: viśvebhiḥ, viśvaiḥ / viśvābhiḥ / viśvebhiḥ
Case=Loc|Number=Plur: viśveṣu / viśvāsu / viśveṣu
Case=Nom|Number=Sing: viśvaḥ / viśvā / viśvam
Case=Nom|Number=Plur: viśve, viśvāḥ / viśvāḥ / viśvā, viśvāni

AUX

93 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (93; 100%), Person=EMPTY (93; 100%), Tense=Pres (93; 100%), Number=Sing (86; 92%).

AUX tokens may have the following values of Gender:

Paradigm as (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: santam / satīm / sat
Case=Acc|Number=Plur: satīḥ
Case=Dat|Number=Plur: sadbhyaḥ
Case=Gen|Number=Sing: sataḥ / sataḥ
Case=Gen|Number=Plur: satām
Case=Ins|Number=Sing: satā
Case=Loc|Number=Sing: sati
Case=Nom|Number=Sing: san / satī / sat
Case=Nom|Number=Dual: satī
Case=Nom|Number=Plur: santaḥ

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> PRON (4003; 96%), NOUN –[conj]–> NOUN (3445; 61%), NOUN –[amod]–> ADJ (3153; 79%), NOUN –[nsubj]–> PRON (1801; 81%), NOUN –[flat]–> NOUN (1391; 54%), ADJ –[conj]–> ADJ (1379; 97%), NOUN –[acl]–> VERB (1232; 77%), NOUN –[acl]–> ADJ (1125; 94%), NOUN –[nsubj]–> NOUN (1090; 53%), ADJ –[nsubj]–> NOUN (955; 94%).
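One way to read these figures is as a tally over dependency edges: for each parent/relation/child pattern, count the edges on which both nodes carry Gender and the subset on which the two values match. The sketch below illustrates that reading for a single sentence. It assumes the percentage is taken over edges where both nodes have a Gender value, which this page does not state explicitly. The sample sentence (including its deliberate gender mismatch) and the function names `read_sentence` and `gender_agreement` are hypothetical.

```python
from collections import Counter

# Hypothetical one-sentence sample in CoNLL-U format
# (columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC).
SAMPLE = "\n".join([
    "1\tsaḥ\ttad\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tdet\t_\t_",
    "2\tsomaḥ\tsoma\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t4\tnsubj\t_\t_",
    "3\tuttarā\tuttara\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_",
    "4\tgacchati\tgam\tVERB\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_",
])

def read_sentence(conllu_text):
    """Map token ID -> dict with upos, feats, head, deprel (one sentence only)."""
    tokens = {}
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges and empty nodes
            continue
        feats = {} if cols[5] == "_" else dict(
            pair.split("=", 1) for pair in cols[5].split("|")
        )
        tokens[int(cols[0])] = {
            "upos": cols[3], "feats": feats,
            "head": int(cols[6]), "deprel": cols[7],
        }
    return tokens

def gender_agreement(tokens):
    """Return (agree, both) Counters keyed by (parent UPOS, deprel, child UPOS)."""
    agree, both = Counter(), Counter()
    for child in tokens.values():
        parent = tokens.get(child["head"])  # None for the root (head 0)
        if parent is None:
            continue
        g_child = child["feats"].get("Gender")
        g_parent = parent["feats"].get("Gender")
        if g_child is None or g_parent is None:
            continue  # assumption: only edges with Gender on both ends count
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if g_child == g_parent:
            agree[key] += 1
    return agree, both

agree, both = gender_agreement(read_sentence(SAMPLE))
for (parent, rel, child), n in both.items():
    k = agree[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({k}; {100 * k / n:.0f}%)")
```

Under this reading, a low percentage for a pattern such as NOUN –[conj]–> NOUN is expected, since coordinated nouns each carry their own lexical gender, whereas modifiers such as determiners and adjectives normally copy the gender of their head.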