This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home sa/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Sanskrit)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

752 tokens (62%) have a non-empty value of Gender. 553 types (70%) occur at least once with a non-empty value of Gender. 456 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (406; 34% instances), PRON (123; 10% instances), VERB (59; 5% instances), DET (54; 4% instances), ADJ (45; 4% instances), PROPN (40; 3% instances), X (16; 1% instances), NUM (9; 1% instances).

NOUN

406 NOUN tokens (90% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (363; 89%).

NOUN tokens may have the following values of Gender:

Paradigm अर्थMascNeut
Case=Acc|Number=Singअर्थम्अर्थम्, अर्था
Case=Nom|Number=Singअर्थःअर्थम्, अर्थ
Case=Nom|Number=Plurअर्थः, अर्थाः

Gender seems to be lexical feature of NOUN. 96% lemmas (299) occur only with one value of Gender.

PRON

123 PRON tokens (82% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (110; 89%), Person=EMPTY (94; 76%), PronType=Dem (64; 52%), Case=Nom (63; 51%).

PRON tokens may have the following values of Gender:

Paradigm तद्MascFemNeut
Case=Abl|Number=Singतस्मात्तस्मात्
Case=Abl|Number=Plurतान्
Case=Acc|Number=Singतत्, तम्
Case=Acc|Number=Plurतानि
Case=Dat|Number=Singतस्मै
Case=Gen|Number=Sing|Poss=Yesतस्य
Case=Gen|Number=Singतस्यतस्याःतस्य
Case=Gen|Number=Plurतेषाम्
Case=Ins|Number=Singतेनतया
Case=Loc|Number=Singतस्याम्
Case=Nom|Number=Singसः, स, सह्तत्, सा, स
Case=Nom|Number=Plurते

VERB

59 VERB tokens (25% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Voice=EMPTY (59; 100%), Person=EMPTY (59; 100%), Mood=EMPTY (59; 100%), Tense=EMPTY (58; 98%), Number=Sing (53; 90%), VerbForm=Part (37; 63%), Case=Nom (34; 58%).

VERB tokens may have the following values of Gender:

Paradigm गतMascNeut
Case=Locगते
Case=Nomगतः

Gender seems to be lexical feature of VERB. 96% lemmas (50) occur only with one value of Gender.

DET

54 DET tokens (93% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (48; 89%), PronType=Dem (42; 78%).

DET tokens may have the following values of Gender:

Paradigm तद्MascFemNeut
Case=Abl|Number=Singतस्मात्
Case=Abl|Number=Plurतान्
Case=Acc|Number=Singतत्, तम्तत्, तम्
Case=Dat|Number=Sing|Poss=Yesतस्मै
Case=Gen|Number=Sing|Poss=Yesतस्य
Case=Gen|Number=Singतस्य
Case=Ins|Number=Singतेनतया
Case=Ins|Number=Plurतस्य
Case=Nom|Number=Singस, सःतत्
Case=Nom|Number=Plur|Poss=Yesते
Case=Nom|Number=Plurते

ADJ

45 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (39; 87%), Case=Acc (33; 73%).

ADJ tokens may have the following values of Gender:

Paradigm महत्MascNeut
Case=Accमहान्महत्
Case=Nomमहान्

Gender seems to be lexical feature of ADJ. 95% lemmas (38) occur only with one value of Gender.

PROPN

40 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (40; 100%), Case=Nom (33; 83%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (30) occur only with one value of Gender.

X

16 X tokens (80% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Person=EMPTY (16; 100%), Voice=EMPTY (16; 100%), Number=Sing (13; 81%), Case=Nom (11; 69%), VerbForm=EMPTY (10; 63%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (16) occur only with one value of Gender.

NUM

9 NUM tokens (69% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (9; 100%), Number=Sing (6; 67%).

NUM tokens may have the following values of Gender:

Paradigm त्रिMascFem
त्रयःतिस्रः

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (34; 76%), NOUN –[nmod]–> NOUN (33; 56%), NOUN –[det]–> DET (27; 69%), NOUN –[nmod]–> PRON (22; 54%), NOUN –[nmod]–> PROPN (8; 73%), PROPN –[conj]–> PROPN (7; 100%), NOUN –[det]–> PRON (6; 67%), NOUN –[nsubj]–> NOUN (6; 86%), PRON –[acl]–> NOUN (4; 100%), X –[nsubj]–> NOUN (3; 75%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]