
This page pertains to UD version 2.

Treebank Statistics: UD_Hindi-PUD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

This is a layered feature with the following layers: Gender, Gender[psor].
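As a rough illustration of what a layered feature looks like in practice, the sketch below parses a CoNLL-U FEATS string and separates a feature name from its layer. The helper names `parse_feats` and `split_layer` and the example FEATS value are hypothetical and not taken from the treebank.

```python
from __future__ import annotations


def parse_feats(feats: str) -> dict[str, str]:
    """Parse a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def split_layer(name: str) -> tuple[str, str | None]:
    """Split 'Gender[psor]' into ('Gender', 'psor'); a plain name has layer None."""
    if name.endswith("]") and "[" in name:
        base, layer = name[:-1].split("[", 1)
        return base, layer
    return name, None


# Made-up FEATS value for a possessive pronoun (illustration only).
feats = parse_feats("Case=Nom|Gender=Fem|Gender[psor]=Masc|Number=Sing")
gender_layers = {n: v for n, v in feats.items() if split_layer(n)[0] == "Gender"}
```

Here `gender_layers` keeps both the plain `Gender` value and the `Gender[psor]` value, which is the distinction the two layers listed above encode.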

11116 tokens (47%) have a non-empty value of Gender. 3503 types (68%) occur at least once with a non-empty value of Gender. 2993 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags; counts are given with their share of all tokens in the treebank: NOUN (4997; 21%), VERB (1555; 7%), PROPN (1338; 6%), ADP (1285; 5%), AUX (1221; 5%), PRON (408; 2%), ADJ (273; 1%), DET (39; 0%).
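Counts of this kind could be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page; the function name `gender_coverage` and the two-word sample sentence are made up for illustration.

```python
from collections import Counter

# Made-up sample in CoNLL-U format (10 tab-separated columns per token).
SAMPLE = """\
# text = लड़की आई ।
1\tलड़की\tलड़की\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tnsubj\t_\t_
2\tआई\tआ\tVERB\t_\tGender=Fem|Number=Sing|Person=3\t0\troot\t_\t_
3\t।\t।\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def gender_coverage(conllu: str):
    """Return (tokens per UPOS, tokens per UPOS with a non-empty Gender)."""
    total = Counter()
    with_gender = Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        upos, feats = cols[3], cols[5]
        total[upos] += 1
        # 'Gender=' matches the plain layer only, not 'Gender[psor]='.
        if any(f.startswith("Gender=") for f in feats.split("|")):
            with_gender[upos] += 1
    return total, with_gender


total, with_gender = gender_coverage(SAMPLE)
share = sum(with_gender.values()) / sum(total.values())
```

Dividing each per-tag count by the total number of tokens gives percentages comparable to the ones listed above.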

NOUN

4997 NOUN tokens (89% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4463; 89%), Case=Acc (2505; 50%).

NOUN tokens may have the following values of Gender:

Paradigm बार: Masc / Fem
बार / बार

Gender seems to be a lexical feature of NOUN: 98% of lemmas (1794) occur with only one value of Gender.
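The "lexical feature" heuristic can be sketched as follows: collect the set of Gender values seen for each lemma and count how many lemmas have exactly one. The function name `single_gender_share` is hypothetical, and apart from बार (shown above with both values) the lemma/gender pairs are made up for illustration.

```python
from collections import defaultdict


def single_gender_share(pairs):
    """pairs: iterable of (lemma, gender); returns (single-valued lemmas, all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens without a Gender value
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


pairs = [
    ("बार", "Masc"), ("बार", "Fem"),  # occurs with both values
    ("लड़की", "Fem"),
    ("घर", "Masc"), ("घर", "Masc"),
]
single, total_lemmas = single_gender_share(pairs)
```

On this toy input, 2 of 3 lemmas occur with a single value; a share close to 100% is what suggests the feature is lexical rather than inflectional.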

VERB

1555 VERB tokens (63% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=EMPTY (1539; 99%), Person=3 (1497; 96%), Number=Sing (1380; 89%), Mood=Ind (1059; 68%), Tense=EMPTY (829; 53%).
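A co-occurrence list like the one above could be computed as sketched here. The sketch assumes that `EMPTY` stands for "the feature is absent on this token"; that reading, the function name `cooccurrence`, and the toy feature dictionaries are assumptions rather than something this page states.

```python
from collections import Counter


def cooccurrence(feature_dicts, target="Gender", others=("Number", "VerbForm")):
    """Count values of `others` on tokens that carry `target`; absent -> 'EMPTY'."""
    counts = Counter()
    n = 0
    for feats in feature_dicts:
        if target not in feats:
            continue
        n += 1
        for name in others:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return n, counts


# Toy tokens, each already parsed into a feature dict (illustration only).
tokens = [
    {"Gender": "Masc", "Number": "Sing"},
    {"Gender": "Fem", "Number": "Sing", "VerbForm": "Inf"},
    {"Number": "Plur"},  # no Gender, so it is skipped
]
n, counts = cooccurrence(tokens)
```

Each count divided by `n` gives the percentage of Gender-bearing tokens that also show that value.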

VERB tokens may have the following values of Gender:

Paradigm करना: Masc / Fem
Aspect=Imp|Mood=Imp|Number=Plur|Person=3|Tense=Fut: करेंगे
Aspect=Imp|Mood=Imp|Number=Plur|Person=3|Tense=Pres: करें
Aspect=Imp|Mood=Ind|Number=Sing|Person=1: करता
Aspect=Imp|Mood=Ind|Number=Sing|Person=1|Tense=Fut: करूंगा
Aspect=Imp|Mood=Ind|Number=Sing|Person=3: करता, करना, करे / करती, करत
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polite=Form: करती
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Fut: करेगा / करेगी
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres: करता, करते
Aspect=Imp|Mood=Ind|Number=Plur|Person=3: करते / करती
Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Polite=Form: करते
Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres: करें
Aspect=Perf|Mood=Ind|Number=Sing|Person=3: किया, किए / की
Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past: किया, की / की, कीं
Aspect=Perf|Mood=Ind|Number=Plur|Person=3: की
Aspect=Perf|Mood=Ind|Number=Plur|Person=3|Tense=Past: किए / कीं
Number=Sing: किए
Number=Sing|Person=3: किया, करते, किए, कर, करने, कियकियाकी, करनी
Number=Sing|Person=3|VerbForm=Inf: करनी
Number=Plur|Person=3: किए, करते, किया / की
Number=Plur|Person=3|VerbForm=Inf: करने

PROPN

1338 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1337; 100%), Case=Acc (712; 53%).

PROPN tokens may have the following values of Gender:

Paradigm ट्रम्प: Masc / Fem
Case=Acc: ट्रम्प / ट्रम्प
Case=Nom: ट्रम्प

Gender seems to be a lexical feature of PROPN: 99% of lemmas (905) occur with only one value of Gender.

ADP

1285 ADP tokens (27% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: Case=Gen (1200; 93%), Number=Sing (758; 59%).

ADP tokens may have the following values of Gender:

Paradigm का: Masc / Fem
का / की
Number=Sing: के, का
Number=Plur: के

AUX

1221 AUX tokens (89% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=3 (1209; 99%), Number=Sing (1020; 84%), Mood=EMPTY (622; 51%), Aspect=EMPTY (620; 51%).

AUX tokens may have the following values of Gender:

Paradigm है: Masc / Fem
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polite=Form: हैं
Aspect=Imp|Mood=Ind|Number=Sing|Person=3: है / है
Aspect=Imp|Mood=Ind|Number=Plur|Person=3: हैं / हैं
Number=Sing|Person=1: हूं
Number=Sing|Person=3|Polite=Form: हैं / हैं
Number=Sing|Person=3: है, हैं / है, हूं, हैं
Number=Plur|Person=3: हैं / हैं, है

PRON

408 PRON tokens (36% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (365; 89%), Gender[psor]=EMPTY (309; 76%), Number[psor]=EMPTY (300; 74%), PronType=EMPTY (297; 73%), Case=Nom (246; 60%), Person=3 (240; 59%).

PRON tokens may have the following values of Gender:

Paradigm वह: Masc / Fem
Case=Acc|Number=Sing|Number[psor]=Sing|PronType=Prs: उसके, उनके / उसकी
Case=Acc|Number=Sing: उसे, उसने, उसके, उससे / उसे, उसके, उसने
Case=Acc|Number=Sing|Polite=Form: उन्हें / उन्हें
Case=Acc|Number=Plur|Number[psor]=Plur|PronType=Prs: उनके
Case=Acc|Number=Plur: उन्होंने, उनके, उन्हें, वे / उन्हें, उन्होंने, वे
Case=Acc: उनके
Case=Nom|Number=Sing|Number[psor]=Sing|Polite=Form|PronType=Prs: उनकी
Case=Nom|Number=Sing|Number[psor]=Sing|PronType=Prs: उसका, उसके / उसकी, उनकी
Case=Nom|Number=Sing|Number[psor]=Plur|Polite=Form|PronType=Prs: उनकी
Case=Nom|Number=Sing|Number[psor]=Plur|PronType=Prs: उसका / उनकी
Case=Nom|Number=Sing: वह, उसका, उनका, वे, वो / उसकी, वह, उनकी, वो
Case=Nom|Number=Plur|Number[psor]=Plur|Polite=Form|PronType=Prs: उनकी
Case=Nom|Number=Plur|Number[psor]=Plur|PronType=Prs: उनकी
Case=Nom|Number=Plur: वे, उनका / उनकी
Case=Nom: उनका
Case=Nom|PronType=Prs: उनका
Number=Sing|Number[psor]=Sing|PronType=Prs: उसकी
Number=Sing: उसका

ADJ

273 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Gender.

ADJ tokens may have the following values of Gender:

Paradigm नया: Masc / Fem
_: नयी
Case=Acc|Number=Sing: नये
Case=Acc|Number=Plur: नये
Case=Nom|Number=Sing: नया
Case=Nom|Number=Plur: नये

DET

39 DET tokens (4% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (39; 100%), Number=Sing (23; 59%).

DET tokens may have the following values of Gender:

Paradigm पूरा: Masc / Fem
_: पूरी
Case=Acc|Number=Sing: पूरे
Case=Nom|Number=Sing: पूरी

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: VERB –[obj]–> NOUN (629; 53%), VERB –[aux]–> AUX (575; 65%), NOUN –[compound]–> NOUN (221; 56%), NOUN –[nmod:poss]–> PRON (213; 70%), PROPN –[flat:name]–> PROPN (194; 94%), VERB –[aux:pass]–> AUX (171; 86%), VERB –[nsubj]–> NOUN (161; 51%), VERB –[nsubj]–> PROPN (159; 72%), NOUN –[compound]–> PROPN (122; 73%), NOUN –[conj]–> NOUN (121; 58%).
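One way to count agreeing parent/child pairs is sketched below. The denominator used here (pairs in which both nodes carry a Gender value) is an assumption; this page does not say what the percentages are relative to. The names `gender_of` and `agreement_counts`, the ASCII arrow in the keys, and the toy sentence are likewise hypothetical.

```python
from collections import Counter


def gender_of(feats: str):
    """Return the plain-layer Gender value from a FEATS string, or None."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(rows):
    """rows: (id, upos, feats, head, deprel) tuples for one sentence."""
    by_id = {row[0]: row for row in rows}
    agree, both = Counter(), Counter()
    for _tid, upos, feats, head, deprel in rows:
        if head == 0:
            continue  # the root has no parent
        parent = by_id[head]
        g_child, g_parent = gender_of(feats), gender_of(parent[2])
        if g_child is None or g_parent is None:
            continue  # only compare pairs where both nodes have Gender
        key = f"{parent[1]} -[{deprel}]-> {upos}"
        both[key] += 1
        if g_child == g_parent:
            agree[key] += 1
    return agree, both


# Toy sentence (illustration only): the subject agrees, the auxiliary does not.
rows = [
    (1, "NOUN", "Gender=Fem|Number=Sing", 2, "nsubj"),
    (2, "VERB", "Gender=Fem|Number=Sing", 0, "root"),
    (3, "AUX", "Gender=Masc|Number=Sing", 2, "aux"),
]
agree, both = agreement_counts(rows)
```

Summing `agree` and `both` over all sentences and dividing per relation would yield an agreement rate under the stated assumption.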