Treebank Statistics: UD_Kangri-KDTB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1173 tokens (47%) have a non-empty value of Gender.
787 types (75%) occur at least once with a non-empty value of Gender.
699 lemmas (74%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (492; 20% instances), VERB (269; 11% instances), PRON (112; 4% instances), AUX (97; 4% instances), ADJ (92; 4% instances), PROPN (78; 3% instances), DET (18; 1% instances), NUM (15; 1% instances).
NOUN
492 NOUN tokens (89% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (492; 100%), Case=Nom (445; 90%), Number=Sing (396; 80%).
NOUN tokens may have the following values of Gender:
Fem(184; 37% of non-emptyGender): लोकां, अम्मा, माता, ग्रांएं, जरूरत, ज़रूरत, सलाह, हवा, कताब, किताबMasc(308; 63% of non-emptyGender): घरे, कमरे, मन्दरे, घर, पता, पाणिए, प्रतिशत, फायदा, बजे, बरखाEMPTY(63): गल्ल, कवता, पैदा, अप्पूं, इसा, एह, कम, कम्म, कुर्तू, खाणा
| Paradigm लोक | Masc | Fem |
|---|---|---|
| Number=Sing | लोक | |
| Number=Plur | लोक | लोकां |
Gender seems to be lexical feature of NOUN. 98% lemmas (373) occur only with one value of Gender.
VERB
269 VERB tokens (79% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (250; 93%), Case=EMPTY (164; 61%), Person=3 (153; 57%), Aspect=EMPTY (142; 53%), Voice=EMPTY (141; 52%).
VERB tokens may have the following values of Gender:
Fem(135; 50% of non-emptyGender): होई, करी, दित्ती, लगी, कित्ती, दी, आई, हुन्दी, ओंदी, खुल्लीMasc(134; 50% of non-emptyGender): दित्ता, दा, आया, करदे, हुन्दा, हुन्दे, होया, ओआ, करना, पीन्दाEMPTY(73): दे, होई, लेई, जा, ढेई, पढ़ने, बचाणे, रखणे, है, उठदे
| Paradigm होणा | Masc | Fem |
|---|---|---|
| Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Act | होई | |
| Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Pass | होई | |
| Aspect=Perf|Number=Sing|VerbForm=Part | होईके | होई |
| Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Act | होया, हुणा | होई, हुणी, होणी |
| Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Act | हुन्दे, होए | |
| Aspect=Perf|VerbForm=Part|Voice=Act | होई | |
| Case=Acc|VerbForm=Inf | होणे | |
| Case=Nom|Number=Sing|Person=3 | हुन्दा, होईके | हुन्दी |
| Case=Nom|Number=Plur|Person=3 | होणें | |
| Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act | होणा | |
| Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act | हुन्दे | |
| Number=Plur|Person=3|VerbForm=Inf|Voice=Act | हुन्दे |
PRON
112 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (103; 92%), PronType=EMPTY (100; 89%), Case=Nom (98; 88%), Person=3 (97; 87%).
PRON tokens may have the following values of Gender:
Fem(42; 38% of non-emptyGender): सैह, तिसदी, तिह्नां, इसदा, इसदी, किछ, तिन्नी, मेरी, अपणियां, अपणीMasc(70; 63% of non-emptyGender): मिंजो, सैह, तिसजो, तिसा, असां, एह, तुसां, मेरिया, म्हारे, अपणेEMPTY(96): तुसां, मैं, तिह्नां, असां, इसते, तिन्नी, तिस, इस, इसदे, सैह
| Paradigm सैह | Masc | Fem |
|---|---|---|
| सैह | सैह |
Gender seems to be lexical feature of PRON. 90% lemmas (37) occur only with one value of Gender.
AUX
97 AUX tokens (39% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (89; 92%), Person=EMPTY (83; 86%), Mood=EMPTY (70; 72%), Tense=EMPTY (70; 72%), VerbForm=Part (62; 64%), Voice=EMPTY (61; 63%), Aspect=Perf (52; 54%).
AUX tokens may have the following values of Gender:
Fem(31; 32% of non-emptyGender): गेई, थी, करदी, चाहिदी, पेई, कित्ता, गई, जा, जाणी, पोंदीMasc(66; 68% of non-emptyGender): गेया, था, करदा, थे, गे, गेइयो, चाहिदा, रेह्या, सकदा, कित्ताEMPTY(149): है, हन, जा, सकदे, हैं, कढदे, करदे, करना, गेइयो, गै
| Paradigm जाणा | Masc | Fem |
|---|---|---|
| Aspect=Perf|Number=Sing|Person=3|VerbForm=Part|Voice=Act | गेइयो, गेया | |
| Aspect=Perf|Number=Sing|VerbForm=Part | गेया, गिया, गेइयो, गेयो | गेई, गई, जाणी |
| Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Act | गेया | |
| Aspect=Perf|Number=Plur|VerbForm=Part | गे | |
| Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Act | जांदे | |
| Case=Nom|Number=Sing|Person=3 | गे, गेइयो | |
| Case=Nom|Number=Plur|Person=3 | गे | |
| Number=Sing|Person=3|Voice=Act | जा |
ADJ
92 ADJ tokens (71% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (80; 87%), Number=Sing (68; 74%), Person=3 (53; 58%).
ADJ tokens may have the following values of Gender:
Fem(31; 34% of non-emptyGender): बड़ी, केइयां, नीली, बड्डी, अधिष्ठात्री, अपणी, अपणेयां, उच्ची, काली, काळेयांMasc(61; 66% of non-emptyGender): खरे, छैळ, अगले, अपणा, पक्का, पहला, बड़ा, बड्डा, मता, वाळेEMPTY(37): खुश, घट्ट, जरा, लग्ग, अपणें, असली, अहम, औणे, कमज़ोर, खराब
| Paradigm सारा | Masc | Fem |
|---|---|---|
| Number=Sing | सारा | सारी |
| Number=Plur | सारे |
PROPN
78 PROPN tokens (89% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (78; 100%), Case=Nom (70; 90%), Number=Sing (70; 90%).
PROPN tokens may have the following values of Gender:
Fem(16; 21% of non-emptyGender): दुर्गा, रामें, कांगड़ें, चौधरिएं, ज़मीन, धन्नुए, बज्रेश्वरी, ब्रजेश्वरिया, मीना, योजनाMasc(62; 79% of non-emptyGender): कांगड़े, अमेरिका, धर्मशाला, राजकुमार, राजुए, अमरीका, इंगलैण्ड, कटोचां, कन्हैया, काळुएEMPTY(10): शर्मा, 2020, कराळी, गांधी, चौहान, बी, मोहने, शर्माजी, सनीचरवारे
| Paradigm राम | Masc | Fem |
|---|---|---|
| Number=Sing | राम | |
| Number=Plur | रामें |
Gender seems to be lexical feature of PROPN. 99% lemmas (67) occur only with one value of Gender.
DET
18 DET tokens (31% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Case=Nom (17; 94%), PronType=EMPTY (17; 94%), Person=3 (16; 89%), Number=Sing (13; 72%).
DET tokens may have the following values of Gender:
Fem(4; 22% of non-emptyGender): इतणा, केइयाँ, मतियाँ, सारेयांMasc(14; 78% of non-emptyGender): एह, मते, इक्को, इसा, इह्नां, एह्, तिन्ना, दोयो, सारे, सैहEMPTY(40): कोई, इस, इसा, एह, इन्हां, इह्नां, घट्ट, हर, इत्थू, कितणे
Gender seems to be lexical feature of DET. 100% lemmas (15) occur only with one value of Gender.
NUM
15 NUM tokens (37% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Nom (15; 100%), NumType=EMPTY (15; 100%), Number=Sing (15; 100%), Person=3 (15; 100%).
NUM tokens may have the following values of Gender:
Fem(4; 27% of non-emptyGender): इक, त्रींह, त्रीह्नी, पैंतीMasc(11; 73% of non-emptyGender): इक्क, दोयो, पंज, सतEMPTY(26): दो, 8, चौंह, दोयो, 15, 19, 25, 250, 300, 35000
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod]–> NOUN (39; 62%),
NOUN –[amod]–> ADJ (28; 61%),
NOUN –[nmod]–> PRON (17; 57%),
NOUN –[nmod]–> ADJ (11; 69%),
VERB –[amod]–> NOUN (7; 54%),
NOUN –[compound]–> ADJ (6; 55%),
VERB –[nsubj]–> PROPN (6; 60%),
NOUN –[amod]–> NOUN (5; 56%),
NOUN –[compound]–> PROPN (4; 80%),
VERB –[obl]–> PROPN (4; 67%).