home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

67547 tokens (42%) have a non-empty value of Gender. 13230 types (74%) occur at least once with a non-empty value of Gender. 6693 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (37696; 23% instances), VERB (12823; 8% instances), ADJ (7901; 5% instances), PRON (7125; 4% instances), NUM (1384; 1% instances), AUX (600; 0% instances), DET (18; 0% instances).

NOUN

37696 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (27851; 74%), Definite=EMPTY (25936; 69%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 98% lemmas (3955) occur only with one value of Gender.

VERB

12823 VERB tokens (81% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Polarity=EMPTY (11101; 87%), VerbType=EMPTY (11101; 87%), Number=Sing (8813; 69%), Person=3 (8131; 63%), VerbForm=EMPTY (7865; 61%), Voice=Act (6805; 53%).

VERB tokens may have the following values of Gender:

Paradigm היהFem,MascMascFem
HebSource=ConvUncertainHead|Number=Sing|Person=3|Tense=Futתהיה
HebSource=ConvUncertainHead|Number=Sing|Person=3|Tense=Pastהיההיתה
HebSource=ConvUncertainHead|Number=Plur|Person=3|Tense=Pastהיו
Mood=Imp|Number=Sing|Person=2הייה, היה
Number=Sing|Person=1|Tense=Pastהייתי
Number=Sing|Person=2|Tense=Futתהיה
Number=Sing|Person=2|Tense=Pastהיית
Number=Sing|Person=3|Tense=Futיהיהתהיה
Number=Sing|Person=3|Tense=Pastהיההיתה, הייתה
Number=Plur|Person=1|Tense=Futנהיה
Number=Plur|Person=1|Tense=Pastהיינו
Number=Plur|Person=2|Tense=Pastהייתם
Number=Plur|Person=3|Tense=Futיהיותהיינה
Number=Plur|Person=3|Tense=Pastהיו

ADJ

7901 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (5619; 71%).

ADJ tokens may have the following values of Gender:

PRON

7125 PRON tokens (97% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (6448; 90%), PronType=Prs (5890; 83%), Number=Sing (5211; 73%), Case=EMPTY (4437; 62%).

PRON tokens may have the following values of Gender:

Paradigm הואFem,MascMascFem
Case=Acc|Number=Sing|Person=1_אני
Case=Acc|Number=Sing|Person=2_אתה
Case=Acc|Number=Sing|Person=3_הוא_היא
Case=Acc|Number=Plur|Person=2_אתם
Case=Acc|Number=Plur|Person=3_הם_הן
Case=Gen|Number=Sing|Person=1_אני
Case=Gen|Number=Sing|Person=2_אתה_את
Case=Gen|Number=Sing|Person=3_הוא_היא
Case=Gen|Number=Plur|Person=1_אנחנו
Case=Gen|Number=Plur|Person=2_אתם
Case=Gen|Number=Plur|Person=3_הם_הן
HebSource=ConvUncertainHead|Number=Sing|Person=3_הוא, הוא_היא, היא
HebSource=ConvUncertainHead|Number=Plur|Person=1_אנחנו
HebSource=ConvUncertainHead|Number=Plur|Person=3_הם, הם_הן, הן
Number=Sing|Person=1אני, _אני
Number=Sing|Person=2_אתה, אתה_את, את
Number=Sing|Person=3_הוא, הוא_היא, היא
Number=Plur|Person=1_אנחנו, אנו, אנחנו
Number=Plur|Person=2_אתם, אתם
Number=Plur|Person=3_הם, הם_הן, הן

NUM

1384 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (1007; 73%), Definite=EMPTY (960; 69%).

NUM tokens may have the following values of Gender:

Paradigm שמונהMascFem
Definite=Consשמונה
שמונהשמונה

Gender seems to be lexical feature of NUM. 99% lemmas (70) occur only with one value of Gender.

AUX

600 AUX tokens (71% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbType=Mod (600; 100%), Tense=EMPTY (494; 82%), VerbForm=EMPTY (486; 81%), Number=Sing (478; 80%), Person=1,2,3 (472; 79%).

AUX tokens may have the following values of Gender:

DET

18 DET tokens (0% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=EMPTY (18; 100%).

DET tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6169; 97%), VERB –[nsubj]–> NOUN (3204; 67%), NOUN –[nmod]–> NOUN (2687; 52%), NOUN –[nmod:poss]–> PRON (1348; 50%), NOUN –[acl:relcl]–> VERB (1332; 65%), NOUN –[conj]–> NOUN (1213; 60%), VERB –[conj]–> VERB (1040; 72%), VERB –[nsubj]–> PRON (767; 76%), NOUN –[nmod:poss]–> NOUN (521; 52%), NOUN –[amod]–> PRON (518; 90%).