home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hindi-HDTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

185891 tokens (53%) have a non-empty value of Gender. 15289 types (80%) occur at least once with a non-empty value of Gender. 12384 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 14 part-of-speech tags: NOUN (77241; 22% instances), PROPN (37640; 11% instances), ADP (27147; 8% instances), VERB (21870; 6% instances), AUX (13188; 4% instances), PRON (3516; 1% instances), ADJ (3379; 1% instances), ADV (1379; 0% instances), DET (464; 0% instances), PART (35; 0% instances), NUM (16; 0% instances), X (13; 0% instances), SCONJ (2; 0% instances), PUNCT (1; 0% instances).

NOUN

77241 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (77214; 100%), Number=Sing (62647; 81%), Case=Acc (41556; 54%).

NOUN tokens may have the following values of Gender:

Paradigm सरकारMascFem
Case=Acc|Number=Singसरकार
Case=Acc|Number=Sing|Person=3सरकारसरकार
Case=Acc|Number=Plur|Person=3सरकारोंसरकारों
Case=Nom|Number=Sing|Person=3सरकार
Case=Nom|Number=Plur|Person=3सरकारें, सरकार

Gender seems to be lexical feature of NOUN. 95% lemmas (5986) occur only with one value of Gender.

PROPN

37640 PROPN tokens (88% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (37556; 100%), Number=Sing (37408; 99%), Case=Nom (19783; 53%).

PROPN tokens may have the following values of Gender:

Paradigm प्रधानमंत्रीMascFem
Case=Accप्रधानमंत्री
Case=Nomप्रधानमंत्रीप्रधानमंत्री
प्रधानमंत्री

Gender seems to be lexical feature of PROPN. 95% lemmas (6477) occur only with one value of Gender.

ADP

27147 ADP tokens (37% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Post (26201; 97%), Number=Sing (21798; 80%), Case=Acc (13774; 51%).

ADP tokens may have the following values of Gender:

Paradigm काMascFem
AdpType=Postकी
AdpType=Post|Case=Accकेकी
AdpType=Post|Case=Acc,Gen|Number=Sing|Poss=Yesका, केकी
AdpType=Post|Case=Acc,Gen|Number=Plur|Poss=Yesकेकी
AdpType=Post|Case=Acc|Number=Singके, का, कीकी, के, का
AdpType=Post|Case=Acc|Number=Sing|Person=3केकी
AdpType=Post|Case=Acc|Number=Sing|Person=3|Polite=Formके
AdpType=Post|Case=Acc|Number=Plurके, काकी, के
AdpType=Post|Case=Acc|Number=Plur|Person=3केकी
AdpType=Post|Case=Nomकी
AdpType=Post|Case=Nom|Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Past|VerbForm=Finके
AdpType=Post|Case=Nom|Number=Singका, के, कीकी, के
AdpType=Post|Case=Nom|Number=Sing|Person=2|Polite=Formके
AdpType=Post|Case=Nom|Number=Sing|Person=3का, कीकी
AdpType=Post|Case=Nom|Number=Sing|Person=3|Polite=Formके
AdpType=Post|Case=Nom|Number=Plurके, काकी
AdpType=Post|Case=Nom|Number=Plur|Person=3के
AdpType=Post|Number=Singकेकी
AdpType=Post|Number=Plur|Person=3के
Case=Acc|Number=Singकेकी
Case=Acc|Number=Plurके
Case=Nom|Number=Singकाकी
Case=Nom|Number=Sing|Person=3|Polite=Formके

Gender seems to be lexical feature of ADP. 91% lemmas (87) occur only with one value of Gender.

VERB

21870 VERB tokens (65% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Case=EMPTY (21628; 99%), Number=Sing (18171; 83%), Voice=Act (16932; 77%), VerbForm=Part (16003; 73%), Aspect=Perf (13658; 62%), Person=EMPTY (10994; 50%).

VERB tokens may have the following values of Gender:

Paradigm करMascFem
_कर
Aspect=Imp|Case=Acc|Number=Sing|VerbForm=Partकरते
Aspect=Imp|Echo=Rdp|Number=Sing|VerbForm=Partकरते
Aspect=Imp|Number=Sing|Person=1|VerbForm=Part|Voice=Actकरता
Aspect=Imp|Number=Sing|Person=2|Polite=Form|VerbForm=Part|Voice=Actकरते
Aspect=Imp|Number=Sing|Person=3|Polite=Form|VerbForm=Partकरते
Aspect=Imp|Number=Sing|Person=3|Polite=Form|VerbForm=Part|Voice=Actकरतेकरती
Aspect=Imp|Number=Sing|Person=3|VerbForm=Partकरते
Aspect=Imp|Number=Sing|Person=3|VerbForm=Part|Voice=Actकरता, करते, करवाताकरती
Aspect=Imp|Number=Sing|VerbForm=Partकरते, करताकरती
Aspect=Imp|Number=Sing|VerbForm=Part|Voice=Actकरता, करतेकरती
Aspect=Imp|Number=Plur|Person=1|VerbForm=Part|Voice=Actकरते
Aspect=Imp|Number=Plur|Person=3|VerbForm=Part|Voice=Actकरतेकरती
Aspect=Imp|Number=Plur|VerbForm=Partकरतेकरती
Aspect=Imp|Number=Plur|VerbForm=Part|Voice=Actकरतेकरती, करतीं
Aspect=Perf|Number=Sing|Person=1|VerbForm=Part|Voice=Actकी
Aspect=Perf|Number=Sing|Person=3|Polite=Form|VerbForm=Part|Voice=Actकिए, करा, किये
Aspect=Perf|Number=Sing|Person=3|VerbForm=Partकिए, कियेकिए, की
Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Actकिया, करा, करवाया, किए, करकी, करा, कर
Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Passकिया, करवाया, करा, करायाकी
Aspect=Perf|Number=Sing|VerbForm=Partकिए, किया, कियेकी
Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Actकिया, करवाया, करा, कियेकी
Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Passकिया, करा, करवायाकी
Aspect=Perf|Number=Plur|Person=3|Polite=Form|VerbForm=Part|Voice=Actकिए
Aspect=Perf|Number=Plur|Person=3|VerbForm=Partकिएकी
Aspect=Perf|Number=Plur|Person=3|VerbForm=Part|Voice=Actकिए, करा, किये, कियाकी, कीं
Aspect=Perf|Number=Plur|Person=3|VerbForm=Part|Voice=Passकिए, कियेकी
Aspect=Perf|Number=Plur|VerbForm=Partकिएकी
Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Actकिए, किये, कराकी, कीं
Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Passकिए, कियेकी
Aspect=Perf|VerbForm=Partकिया, किएकी
Aspect=Perf|VerbForm=Part|Voice=Actकिया, किएकी
Aspect=Perf|VerbForm=Part|Voice=Passकिया, किएकी
Case=Acc|Number=Sing|Person=3|Polite=Form|VerbForm=Inf|Voice=Actकरने
Case=Acc|Number=Sing|Person=3|VerbForm=Inf|Voice=Actकरनेकरने
Case=Acc|Number=Sing|VerbForm=Infकरने
Case=Acc|VerbForm=Infकरनेकरने
Case=Acc|VerbForm=Inf|Voice=Actकरने
Case=Nom|Number=Sing|Person=3|VerbForm=Inf|Voice=Actकरना
Case=Nom|Number=Sing|Person=3|Voice=Actकर
Case=Nom|Number=Sing|Voice=Actकर
Mood=Ind|Number=Sing|Person=1|Tense=Fut|VerbForm=Fin|Voice=Actकरूंगा, करूँगाकरुंगी, करूँगी
Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Fut|VerbForm=Fin|Voice=Actकरेंगेकरेंगी
Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actकरेगाकरेगी, करूंगी
Mood=Ind|Number=Plur|Person=1|Tense=Fut|VerbForm=Fin|Voice=Actकरेंगे, करेगें
Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Finकरेंगे
Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actकरेंगे, करवाएंगेकरेंगी
Mood=Ind|Number=Plur|Tense=Fut|VerbForm=Fin|Voice=Actकरेंगे
Number=Sing|Person=1|Voice=Actकर
Number=Sing|Person=2|Polite=Form|Voice=Actकर, करवा
Number=Sing|Person=3कर
Number=Sing|Person=3|Polite=Form|VerbForm=Inf|Voice=Actकरने
Number=Sing|Person=3|Polite=Form|Voice=Actकरकर
Number=Sing|Person=3|VerbForm=Inf|Voice=Actकरना, करने, करानाकरनी, करने
Number=Sing|Person=3|VerbForm=Inf|Voice=Passकरनी
Number=Sing|Person=3|Voice=Actकरकर, की
Number=Sing|Person=3|Voice=Passकरकर
Number=Sing|VerbForm=Infकरना
Number=Sing|VerbForm=Inf|Voice=Actकरना, करनेकरनी, करने
Number=Sing|VerbForm=Inf|Voice=Passकरनी
Number=Sing|Voice=Actकरकर
Number=Sing|Voice=Passकरकर
Number=Plur|Person=1|Voice=Actकर
Number=Plur|Person=3|VerbForm=Inf|Voice=Actकरनेकरनी
Number=Plur|Person=3|Voice=Actकरकर
Number=Plur|Person=3|Voice=Passकरकर
Number=Plur|VerbForm=Inf|Voice=Actकरने
Number=Plur|Voice=Actकरकर
Number=Plur|Voice=Passकरकर
VerbForm=Inf|Voice=Actकरना, करनेकरनी
VerbForm=Inf|Voice=Passकरने
Voice=Actकरकर

AUX

13188 AUX tokens (51% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Voice=EMPTY (12687; 96%), Number=Sing (10801; 82%), Person=EMPTY (10721; 81%), Tense=EMPTY (9714; 74%), Mood=EMPTY (9661; 73%), VerbForm=Part (9420; 71%), Aspect=Perf (8273; 63%).

AUX tokens may have the following values of Gender:

Paradigm हैMascFem
Aspect=Perf|Number=Plur|Person=3|VerbForm=Partहैं
Case=Nom|Number=Singहै
Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Pres|VerbForm=Finहैं
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Finहैहै
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Finहैं
Number=Singहै
Number=Plurहैं
Number=Plur|Person=3हों, हैं
Number=Plur|Person=3|Voice=Actहों

PRON

3516 PRON tokens (24% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (3505; 100%), Polite=EMPTY (2826; 80%), Case=Acc,Gen (2038; 58%), Poss=Yes (2038; 58%), Number=Sing (2026; 58%), Person=3 (2010; 57%).

PRON tokens may have the following values of Gender:

Paradigm वहMascFem
Case=Acc,Dat|Number=Sing|Person=3|Polite=Formउन्हेंउनकी
Case=Acc,Erg|Number=Sing|Person=3|Polite=Formउन्होंनेउन्होंने
Case=Acc,Erg|Number=Sing|Person=3उन्होंने, उसनेउसने
Case=Acc,Erg|Number=Plur|Person=3उन्होंने
Case=Acc,Gen|Number=Sing|Person=3|Polite=Form|Poss=Yesउनके, उनका, उनकीउनकी
Case=Acc,Gen|Number=Sing|Person=3|Poss=Yesउसके, उसका, उनका, उनकेउसकी, उनकी
Case=Acc,Gen|Number=Plur|Person=3|Polite=Form|Poss=Yesउनके
Case=Acc,Gen|Number=Plur|Person=3|Poss=Yesउनके, उनका, उसके, उनकीउनकी
Case=Acc,Gen|Person=3|Polite=Form|Poss=Yesउनकी
Case=Acc,Gen|Poss=Yesउनकी
Case=Acc,Ins|Number=Sing|Person=3|Polite=Formउनसे
Case=Acc,Ins|Number=Plurउनके
Case=Acc|Number=Sing|Person=3|Polite=Formउन
Case=Acc|Number=Sing|Person=3उसी
Case=Nom|Number=Sing|Person=3|Polite=Formवे
Case=Nom|Number=Sing|Person=3वहीवह
Case=Nom|Number=Plur|Person=3वे

ADJ

3379 ADJ tokens (16% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2827; 84%).

ADJ tokens may have the following values of Gender:

Paradigm पूराMascFem
Case=Acc|Number=Singपूरे, पूरापूरी
Case=Acc|Number=Sing|Person=3पूरी
Case=Acc|Number=Plurपूरेपूरी
Case=Nom|Echo=Rdp|Number=Singपूरी
Case=Nom|Number=Singपूरा, पूरे, पूरीपूरी
Case=Nom|Number=Plurपूरे
Number=Singपूरा, पूरेपूरी
Number=Plurपूरेपूरी

ADV

1379 ADV tokens (42% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: AdvType=EMPTY (1376; 100%), Number=Sing (1374; 100%), AdpType=Post (1372; 99%), Person=3 (1370; 99%), Case=Nom (1134; 82%).

ADV tokens may have the following values of Gender:

Paradigm दूरMascFem
Case=Accदूर
Case=Nomदूरदूर

Gender seems to be lexical feature of ADV. 92% lemmas (35) occur only with one value of Gender.

DET

464 DET tokens (6% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (364; 78%), Number=Sing (295; 64%), PronType=Dem (270; 58%), Case=Nom (252; 54%).

DET tokens may have the following values of Gender:

Paradigm यहMascFem
Case=Acc|Number=Sing|Person=3इस, इसी, यहीइसी, ऐसी
Case=Acc|Number=Singइसी
Case=Acc|Number=Plur|Person=3इन्हींइन्हीं
Case=Nom|Number=Sing|Person=3यही, यहइसी, यह

PART

35 PART tokens (0% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (35; 100%), PronType=EMPTY (35; 100%).

PART tokens may have the following values of Gender:

Paradigm साMascFem
Case=Accसी
Case=Acc|Number=Singसी
Case=Nomसी
Case=Nom|Number=Singसासी
Case=Nom|Number=Plurसेसी
Number=Singसासी
Number=Plurसेसी

NUM

16 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (16; 100%).

NUM tokens may have the following values of Gender:

X

13 X tokens (9% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (13; 100%).

X tokens may have the following values of Gender:

Paradigm बड़ाMascFem
Case=Accबड़ेबड़ी
Case=Nomबड़ा

SCONJ

2 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[compound]–> PROPN (8069; 54%), NOUN –[nmod]–> NOUN (6841; 51%), VERB –[nsubj]–> NOUN (5110; 60%), VERB –[compound]–> NOUN (5012; 59%), NOUN –[nmod]–> PROPN (4558; 55%), NOUN –[compound]–> NOUN (3607; 52%), VERB –[nsubj]–> PROPN (2918; 53%), NOUN –[nmod]–> PRON (2706; 78%), PROPN –[nmod]–> NOUN (1620; 59%), PROPN –[nmod]–> PROPN (1614; 69%).