
This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

74073 tokens (54%) have a non-empty value of Gender. 8867 types (82%) occur at least once with a non-empty value of Gender. 7852 lemmas (82%) occur at least once with a non-empty value of Gender. The feature is used with 14 part-of-speech tags, listed here with the token count and the rounded share of all tokens in the treebank: NOUN (32504; 24%), PROPN (16889; 12%), ADP (10419; 8%), VERB (7550; 5%), AUX (3999; 3%), ADJ (1441; 1%), PRON (614; 0%), ADV (441; 0%), DET (115; 0%), PART (47; 0%), NUM (45; 0%), CCONJ (6; 0%), X (2; 0%), SCONJ (1; 0%).
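Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `gender_counts_by_upos`, `row`) are hypothetical, and the sample sentences and their annotations are invented for illustration.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text, feature="Gender"):
    """Return (tokens carrying the feature, all tokens), both keyed by UPOS."""
    with_feature, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if feature in parse_feats(cols[5]):
            with_feature[upos] += 1
    return with_feature, total


def row(idx, form, upos, feats):
    """Build one tab-separated CoNLL-U line for the toy sample (hypothetical helper)."""
    return "\t".join([str(idx), form, form, upos, "_", feats, "0", "root", "_", "_"])


# Toy sample; the annotations are invented, not taken from UD_Urdu-UDTB.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "حکومت", "NOUN", "Case=Nom|Gender=Fem|Number=Sing|Person=3"),
    row(2, "کی", "ADP", "AdpType=Post|Gender=Fem|Number=Sing"),
    row(3, "اور", "CCONJ", "_"),
    "",
    "# sent_id = toy-2",
    row(1, "میں", "ADP", "AdpType=Post"),
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag coverage reported in the sections below, for example the share of NOUN tokens that have a non-empty Gender.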

NOUN

32504 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (32398; 100%), Number=Sing (27377; 84%), Case=Acc (17103; 53%).

NOUN tokens may have the following values of Gender:

Paradigm حکومت    Masc    Fem
Case=Acc|Number=Sing    حکومت    حکومت
Case=Acc|Number=Plur    حکومتوں    حکومتوں, حکومتیں
Case=Nom|Number=Sing    حکومت    حکومت
Case=Nom|Number=Plur    حکومتیں    حکومتیں
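A paradigm table like the one above groups the attested word forms of one lemma by their remaining features (rows) and by Gender value (columns). The sketch below shows one way to build such a grouping; the function name `paradigm` and the toy tokens are hypothetical, and labelling an empty feature set as `_` is an assumption.

```python
from collections import defaultdict


def paradigm(tokens, lemma, feature="Gender"):
    """Group forms of `lemma` by their other features (row) and `feature` value (column).

    `tokens` is an iterable of (form, lemma, feats_dict) triples.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, token_lemma, feats in tokens:
        if token_lemma != lemma or feature not in feats:
            continue
        # Row label: all features except the one under study, in sorted order.
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[rest or "_"][feats[feature]].add(form)
    return {r: {v: sorted(forms) for v, forms in cols.items()} for r, cols in table.items()}


# Toy tokens; the annotations are invented for illustration.
TOKENS = [
    ("حکومت", "حکومت", {"Case": "Nom", "Gender": "Fem", "Number": "Sing"}),
    ("حکومتیں", "حکومت", {"Case": "Nom", "Gender": "Fem", "Number": "Plur"}),
    ("حکومت", "حکومت", {"Case": "Nom", "Gender": "Masc", "Number": "Sing"}),
    ("ملک", "ملک", {"Case": "Nom", "Gender": "Masc", "Number": "Sing"}),
]

for row_label, columns in sorted(paradigm(TOKENS, "حکومت").items()):
    print(row_label, columns.get("Masc", []), columns.get("Fem", []))
```

A row that lists the same form under both Masc and Fem, as several rows in these tables do, means the lemma was annotated with both values somewhere in the data.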

PROPN

16889 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (16879; 100%), Number=Sing (16829; 100%), Case=Nom (9869; 58%).

PROPN tokens may have the following values of Gender:

Paradigm پی    Masc    Fem
Case=Acc    پی    پی
Case=Nom    پی, بی    پی

Gender seems to be a lexical feature of PROPN: 96% of lemmas (3558) occur with only one value of Gender.
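The "lexical feature" observation rests on how many lemmas of a given tag are attested with a single Gender value. A minimal sketch of that check follows; the function name `single_valued_lemmas` and the toy tokens are hypothetical, and the exact criterion used to generate this page is not stated here.

```python
from collections import defaultdict


def single_valued_lemmas(tokens, upos, feature="Gender"):
    """Return (lemmas seen with exactly one value, lemmas seen with any value).

    `tokens` is an iterable of (upos, lemma, feats_dict) triples.
    """
    values_by_lemma = defaultdict(set)
    for tag, lemma, feats in tokens:
        if tag == upos and feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Toy tokens; the annotations are invented for illustration.
TOKENS = [
    ("PROPN", "پاکستان", {"Gender": "Masc"}),
    ("PROPN", "پاکستان", {"Gender": "Masc"}),
    ("PROPN", "پی", {"Gender": "Masc"}),
    ("PROPN", "پی", {"Gender": "Fem"}),
    ("PROPN", "دہلی", {"Gender": "Fem"}),
    ("NOUN", "حکومت", {"Gender": "Fem"}),  # different tag, ignored
]

single, seen = single_valued_lemmas(TOKENS, "PROPN")
print(f"{single} of {seen} lemmas ({round(100 * single / seen)}%) have one Gender value")
```

A high share suggests that Gender is a property of the lemma rather than something that varies by inflection, which is the reading the page gives for PROPN.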

ADP

10419 ADP tokens (37% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: Number=Sing (10069; 97%), AdpType=Post (9908; 95%), Case=Nom (5615; 54%).

ADP tokens may have the following values of Gender:

Paradigm کاMascFem
AdpType=Postکے
AdpType=Post|Case=Accکی
AdpType=Post|Case=Acc|Number=Singکے, کا, کی, سے, نےکی, کے, والی, کا
AdpType=Post|Case=Acc|Number=Sing|Person=3کا, کے
AdpType=Post|Case=Acc|Number=Sing|Person=3|Polite=Formکے
AdpType=Post|Case=Acc|Number=Plurکےکی
AdpType=Post|Case=Acc|Number=Plur|Person=3|Polite=Formکے
AdpType=Post|Case=Nom|Number=Singکا, کے, کیکی, کا, کو, کے
AdpType=Post|Case=Nom|Number=Sing|Person=3کی
AdpType=Post|Case=Nom|Number=Sing|Person=3|Polite=Formکے
AdpType=Post|Case=Nom|Number=Plurکےکی
AdpType=Post|Case=Nom|Number=Plur|Person=3کے
AdpType=Post|Case=Nom|Number=Plur|Person=3|Polite=Formکے
AdpType=Post|Number=Singکاکی
AdpType=Post|Number=Sing|Person=3کا
Aspect=Perf|Case=Nom|Number=Sing|VerbForm=Partکی
Case=Accکے
Case=Acc|Number=Singکے
Case=Acc|Number=Plurکے
Case=Nom|Number=Singکےکی

VERB

7550 VERB tokens (59% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Case=EMPTY (7440; 99%), Person=EMPTY (6616; 88%), VerbForm=Part (6484; 86%), Number=Sing (6409; 85%), Aspect=Perf (5388; 71%), Voice=Act (4884; 65%).
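In these lists, a value of EMPTY appears to mean that the feature is absent from the token, so "Case=EMPTY (7440; 99%)" would read as "99% of the VERB tokens with Gender have no Case". The sketch below shows one way such co-occurrence counts could be produced. It is an illustration under that assumption: the function name `cooccurring_values` is hypothetical, the toy tokens are invented, and the choice to tally only features seen on at least one selected token is mine.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender"):
    """Count other feature values on tokens of `upos` that carry `feature`.

    `tokens` is an iterable of (upos, feats_dict) pairs. A feature that is
    absent from a token is counted under the pseudo-value "EMPTY".
    """
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    # Assumption: only features seen on at least one selected token are tallied.
    names = sorted({name for feats in selected for name in feats if name != feature})
    counts = Counter()
    for feats in selected:
        for name in names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, len(selected)


# Toy tokens; the annotations are invented for illustration.
TOKENS = [
    ("VERB", {"Gender": "Masc", "Number": "Sing", "VerbForm": "Part"}),
    ("VERB", {"Gender": "Fem", "Number": "Sing", "VerbForm": "Part", "Person": "3"}),
    ("VERB", {"Gender": "Masc", "Number": "Plur"}),
    ("VERB", {"Number": "Sing"}),  # no Gender, so not selected
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),  # different tag, not selected
]

counts, n = cooccurring_values(TOKENS, "VERB")
for value, count in counts.most_common():
    print(f"{value} ({count}; {round(100 * count / n)}%)")
```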

VERB tokens may have the following values of Gender:

Paradigm کرناMascFem
Aspect=Imp|Number=Sing|Person=3|Polite=Form|VerbForm=Part|Voice=Actکرتےکرتیں
Aspect=Imp|Number=Sing|VerbForm=Partکرتے, کرتاکرتی, کرتے
Aspect=Imp|Number=Sing|VerbForm=Part|Voice=Actکرتا, کرتےکرتی
Aspect=Imp|Number=Plur|VerbForm=Partکرتے
Aspect=Imp|Number=Plur|VerbForm=Part|Voice=Actکرتےکرتیں
Aspect=Perf|Case=Acc|Number=Sing|Person=3|VerbForm=Partکیے
Aspect=Perf|Case=Acc|Number=Sing|VerbForm=Partکئے, کیےکی
Aspect=Perf|Case=Nom|Number=Sing|VerbForm=Part|Voice=Actکی
Aspect=Perf|Number=Sing|Person=3|Polite=Form|VerbForm=Part|Voice=Actکئے
Aspect=Perf|Number=Sing|Person=3|VerbForm=Partکئے, کیا, کیے
Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Actکیا, کئے, کر, کیکی
Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Passکئے, کیےکی
Aspect=Perf|Number=Sing|VerbForm=Partکیا, کئے, کرتے, کیےکی, کیے
Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Actکیا, کیے, کئے, کر, کیکی, کریں
Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Passکیا, کئے, کیےکی
Aspect=Perf|Number=Plur|Person=3|VerbForm=Partکیے
Aspect=Perf|Number=Plur|Person=3|VerbForm=Part|Voice=Actکئے, کیے, کیںکیں
Aspect=Perf|Number=Plur|Person=3|VerbForm=Part|Voice=Passکیے
Aspect=Perf|Number=Plur|VerbForm=Partکیے, کئے
Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Actکیے, کئے, کریںکیں
Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Passکیے, کئے
Aspect=Perf|Person=3|VerbForm=Partکیے
Aspect=Perf|VerbForm=Part|Voice=Actکیے
Case=Acc|Number=Sing|Person=3|Voice=Passکی
Case=Acc|Number=Plur|VerbForm=Infکرنے
Case=Acc|VerbForm=Infکرنے
Case=Nom|Number=Sing|VerbForm=Infکرنا
Case=Nom|Number=Sing|Voice=Actکی
Case=Nom|Number=Plur|Person=3|Voice=Actکر
Case=Nom|VerbForm=Infکرنے
Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Fut|VerbForm=Fin|Voice=Actکریں_گے, کریںگے
Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actکریں_گے, کریگا, کرےگاکرےگی, کریں_گی, کریگی
Mood=Ind|Number=Sing|Tense=Fut|VerbForm=Fin|Voice=Actکریں_گےکرےگی, کریں_گی
Mood=Ind|Number=Plur|Person=3|Polite=Form|Tense=Fut|VerbForm=Fin|Voice=Actکرینگے
Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actکریں_گے, کرےنگےکریں_گی
Mood=Ind|Number=Plur|Tense=Fut|VerbForm=Fin|Voice=Actکریں_گے, کرےنگےکریں_گی
Mood=Sub|Number=Sing|Person=3|Polite=Form|VerbForm=Fin|Voice=Actکریں_گے, کریں
Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Actکریں, کرے
Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Passکریں
Mood=Sub|Number=Sing|VerbForm=Fin|Voice=Actکریں
Mood=Sub|Number=Sing|VerbForm=Fin|Voice=Passکریں
Mood=Sub|Number=Plur|Person=3|VerbForm=Fin|Voice=Actکریں
Mood=Sub|Number=Plur|Person=3|VerbForm=Fin|Voice=Passکریںکریں
Mood=Sub|Number=Plur|VerbForm=Fin|Voice=Actکریں
Mood=Sub|Number=Plur|VerbForm=Fin|Voice=Passکریں
Number=Sing|Person=3کیے, کر
Number=Sing|Person=3|VerbForm=Inf|Voice=Passکرنی
Number=Sing|Person=3|Voice=Actکرےگا, کیا, کر, کریں_گےکرےگی, کی
Number=Sing|VerbForm=Infکرنا
Number=Sing|VerbForm=Inf|Voice=Actکرنا, کرنےکرنی
Number=Sing|VerbForm=Inf|Voice=Passکرناکرنی
Number=Sing|Voice=Actکیا, کر, کرےگا, کہاکریں, کی
Number=Sing|Voice=Passکیاکی
Number=Plur|VerbForm=Infکرنے
Number=Plur|VerbForm=Inf|Voice=Actکرنےکرنی
Number=Plur|VerbForm=Inf|Voice=Passکرنے
VerbForm=Infکرنے

AUX

3999 AUX tokens (44% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Voice=EMPTY (3739; 93%), Number=Sing (2943; 74%), Person=EMPTY (2650; 66%), Tense=EMPTY (2450; 61%), Mood=EMPTY (2394; 60%), VerbForm=Part (2282; 57%).

AUX tokens may have the following values of Gender:

Paradigm ہےMascFem
Aspect=Perf|Number=Sing|VerbForm=Partہوا, ہےہے
Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Pres|VerbForm=Finہیںہیں
Mood=Ind|Number=Sing|Person=3|Polite=Form|Tense=Pres|VerbForm=Fin|Voice=Actہیںہیں
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Finہے, ہیںہیں
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actہیں, ہےہیں
Mood=Ind|Number=Sing|Tense=Pres|VerbForm=Finہیں, ہےہیں
Mood=Ind|Number=Sing|Tense=Pres|VerbForm=Fin|Voice=Actہیں
Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actہوں_گے
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Finہیں, ہےںہیں
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actہیںہیں
Mood=Ind|Number=Plur|Tense=Pres|VerbForm=Finہیں, ہےہیں
Mood=Ind|Number=Plur|Tense=Pres|VerbForm=Fin|Voice=Actہیںہیں

ADJ

1441 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (926; 64%).

ADJ tokens may have the following values of Gender:

Paradigm والاMascFem
_والی
Case=Accوالی
Case=Acc|Number=Singوالے, والاوالی
Case=Acc|Number=Sing|Person=3والا
Case=Acc|Number=Plurوالے, والوں
Case=Nomوالی
Case=Nom|Number=Singوالا, والےوالی
Case=Nom|Number=Sing|Person=3والا
Case=Nom|Number=Plurوالے
Number=Singوالاوالی
Number=Sing|Person=3والا, والےوالی
Number=Plurوالےوالی

PRON

614 PRON tokens (11% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (562; 92%), Number=Sing (406; 66%), Polite=EMPTY (403; 66%), Case=Acc (308; 50%).

PRON tokens may have the following values of Gender:

Paradigm وہMascFem
Case=Acc,Dat|Number=Sing|Polite=Form|PronType=Prsانہیں, انہوں, انھیں, انھوں, اُنھیںانہیں
Case=Acc,Dat|Number=Sing|PronType=Prsانھیں
Case=Acc,Dat|Number=Plur|PronType=Prsانھیں, انہیں
Case=Acc|Number=Singاُساُس
Case=Acc|Number=Sing|Polite=Formاُن, انہوں
Case=Acc|Number=Sing|Polite=Form|PronType=Prsانہوں, اُنھوں, انھوں, ان
Case=Acc|Number=Sing|PronType=Prsاُس, انہوں, اُن
Case=Acc|Number=Plurاُن
Case=Acc|Number=Plur|Polite=Form|PronType=Prsانہوں
Case=Acc|Number=Plur|PronType=Prsانہوںان
Case=Nom|Number=Sing|Polite=Form|PronType=Prsانھوں, اُنھیں
Case=Nom|Number=Plur|PronType=Prsانہیں
Number=Sing|Polite=Form|PronType=Prsانہوں

ADV

441 ADV tokens (32% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: AdvType=EMPTY (440; 100%), Number=Sing (437; 99%), Person=3 (408; 93%), Case=Nom (398; 90%), AdpType=Post (372; 84%).

ADV tokens may have the following values of Gender:

Paradigm جانبMascFem
AdpType=Post|Case=Accجانب
AdpType=Post|Case=Nomجانب, عنقریبجانب
Case=Accجانب

Gender seems to be a lexical feature of ADV: 98% of lemmas (53) occur with only one value of Gender.

DET

115 DET tokens (5% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (102; 89%), Person=EMPTY (96; 83%), PronType=Dem (91; 79%), Case=Nom (68; 59%).

DET tokens may have the following values of Gender:

Paradigm یہMascFem
Case=Acc|Number=Singاسیاسی
Case=Acc|Number=Plurایسی
Case=Nom|Number=Sing|Person=3یہ, اسی
Case=Nom|Number=Singاس, اسیاسی
Case=Nom|Number=Plurایسی
Number=Sing|Person=3یہ

PART

47 PART tokens (2% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (47; 100%), PronType=EMPTY (47; 100%).

PART tokens may have the following values of Gender:

Paradigm مسٹرMascFem
Case=Nomمسٹرمسٹر
مسٹر

Gender seems to be a lexical feature of PART: 95% of lemmas (19) occur with only one value of Gender.

NUM

45 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (45; 100%).

NUM tokens may have the following values of Gender:

Paradigm تینMascFem
تینوںتینوں

Gender seems to be a lexical feature of NUM: 96% of lemmas (22) occur with only one value of Gender.

CCONJ

6 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

X

2 X tokens (14% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (2; 100%), Number=Sing (2; 100%), Person=3 (2; 100%).

X tokens may have the following values of Gender:

SCONJ

1 SCONJ token (0% of all SCONJ tokens) has a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[compound]–> PROPN (7500; 92%), NOUN –[nmod]–> NOUN (4376; 69%), NOUN –[nmod]–> PROPN (2271; 77%), NOUN –[compound]–> NOUN (1863; 81%), PROPN –[nmod]–> NOUN (1175; 88%), VERB –[nsubj]–> PROPN (932; 55%), NOUN –[conj]–> NOUN (837; 76%), PROPN –[conj]–> PROPN (690; 93%), PROPN –[nmod]–> PROPN (650; 89%), PROPN –[compound]–> NOUN (414; 93%).
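Agreement statistics of this kind compare the Gender of each node with the Gender of its syntactic parent. The sketch below counts agreeing pairs per (parent tag, relation, child tag) pattern. It assumes that the percentage is taken over pairs in which both nodes have a Gender value, which this page does not state; the function name `gender_agreement` and the toy sentence are hypothetical.

```python
from collections import Counter


def gender_agreement(sentence):
    """Count parent-child pairs with Gender on both nodes, and those that agree.

    `sentence` is a list of dicts with keys id, head, upos, deprel, feats.
    Keys of the returned counters are (parent UPOS, relation, child UPOS).
    """
    by_id = {token["id"]: token for token in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent node
        parent_gender = parent["feats"].get("Gender")
        child_gender = child["feats"].get("Gender")
        if parent_gender is None or child_gender is None:
            continue  # agreement is only defined when both nodes have Gender
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_gender == child_gender:
            agree[key] += 1
    return agree, both


# Toy sentence; the tree and the annotations are invented for illustration.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "PROPN", "deprel": "compound", "feats": {"Gender": "Masc"}},
    {"id": 2, "head": 3, "upos": "PROPN", "deprel": "nsubj", "feats": {"Gender": "Masc"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Gender": "Fem"}},
    {"id": 4, "head": 3, "upos": "NOUN", "deprel": "obj", "feats": {}},
]

agree, both = gender_agreement(SENTENCE)
for key in sorted(both):
    parent_upos, relation, child_upos = key
    share = round(100 * agree[key] / both[key])
    print(f"{parent_upos} -[{relation}]-> {child_upos} ({agree[key]}; {share}%)")
```

Summing the two counters over all sentences of the treebank and sorting by the agreeing count would give a ranking comparable in shape to the one listed above.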