home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

67547 tokens (42%) have a non-empty value of Gender. 13230 types (74%) occur at least once with a non-empty value of Gender. 6693 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (37696; 23% instances), VERB (11272; 7% instances), ADJ (7901; 5% instances), PRON (7125; 4% instances), AUX (2151; 1% instances), NUM (1384; 1% instances), DET (18; 0% instances).

NOUN

37696 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (27851; 74%), Definite=EMPTY (25936; 69%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 98% lemmas (3955) occur only with one value of Gender.

VERB

11272 VERB tokens (79% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (7597; 67%), VerbForm=EMPTY (7030; 62%), Voice=Act (6805; 60%), Person=3 (6636; 59%), Tense=Past (5698; 51%).

VERB tokens may have the following values of Gender:

Paradigm אמרFem,MascMascFem
HebBinyan=PAAL|HebSource=ConvUncertainHead|Number=Sing|Person=1,2,3|VerbForm=Part|Voice=Actאומרת
HebBinyan=PAAL|HebSource=ConvUncertainHead|Number=Sing|Person=3|Tense=Past|Voice=Actאמר
HebBinyan=PAAL|Number=Sing|Person=1,2,3|VerbForm=Part|Voice=Actאומראומרת
HebBinyan=PAAL|Number=Sing|Person=1|Tense=Past|Voice=Actאמרתי
HebBinyan=PAAL|Number=Sing|Person=2|Tense=Fut|Voice=Actתאמר
HebBinyan=PAAL|Number=Sing|Person=2|Tense=Past|Voice=Actאמרת
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Fut|Voice=Actתאמר
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Past|Voice=Actאמראמרה
HebBinyan=PAAL|Number=Plur|Person=1,2,3|VerbForm=Part|Voice=Actאומריםאומרות
HebBinyan=PAAL|Number=Plur|Person=1|Tense=Past|Voice=Actאמרנו
HebBinyan=PAAL|Number=Plur|Person=3|Tense=Past|Voice=Actאמרו
Mood=Imp|Number=Sing|Person=2אמור
Number=Sing|Person=1|Tense=Futאומר
Number=Sing|Person=3|Tense=Futיאמר
Number=Plur|Person=3|Tense=Futיאמרו

ADJ

7901 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (5619; 71%).

ADJ tokens may have the following values of Gender:

PRON

7125 PRON tokens (97% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (6448; 90%), PronType=Prs (5848; 82%), Number=Sing (5211; 73%), Case=EMPTY (4520; 63%).

PRON tokens may have the following values of Gender:

Paradigm הואFem,MascMascFem
Case=Acc|Number=Sing|Person=1_אני
Case=Acc|Number=Sing|Person=2_אתה
Case=Acc|Number=Sing|Person=3_הוא_היא
Case=Acc|Number=Plur|Person=2_אתם
Case=Acc|Number=Plur|Person=3_הם_הן
Case=Gen|Number=Sing|Person=1_אני
Case=Gen|Number=Sing|Person=2_אתה_את
Case=Gen|Number=Sing|Person=3_הוא_היא
Case=Gen|Number=Plur|Person=1_אנחנו
Case=Gen|Number=Plur|Person=2_אתם
Case=Gen|Number=Plur|Person=3_הם_הן
HebSource=ConvUncertainHead|Number=Sing|Person=3_הוא, הוא_היא, היא
HebSource=ConvUncertainHead|Number=Plur|Person=1_אנחנו
HebSource=ConvUncertainHead|Number=Plur|Person=3_הם, הם_הן, הן
Number=Sing|Person=1אני, _אני
Number=Sing|Person=2_אתה, אתה_את, את
Number=Sing|Person=3_הוא, הוא_היא, היא
Number=Plur|Person=1_אנחנו, אנו, אנחנו
Number=Plur|Person=2_אתם, אתם
Number=Plur|Person=3_הם, הם_הן, הן

AUX

2151 AUX tokens (86% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (1694; 79%), VerbType=Cop (1551; 72%), Person=3 (1513; 70%), VerbForm=EMPTY (1321; 61%), Polarity=Pos (1235; 57%), Tense=EMPTY (1213; 56%).

AUX tokens may have the following values of Gender:

Paradigm היהFem,MascMascFem
HebSource=ConvUncertainHead|Number=Sing|Person=3|Tense=Futתהיה
HebSource=ConvUncertainHead|Number=Plur|Person=3|Tense=Pastהיו
Mood=Imp|Number=Sing|Person=2הייה, היה
Number=Sing|Person=1|Tense=Pastהייתי
Number=Sing|Person=2|Tense=Futתהיה
Number=Sing|Person=2|Tense=Pastהיית
Number=Sing|Person=3|Tense=Futיהיהתהיה
Number=Sing|Person=3|Tense=Pastהיההיתה
Number=Plur|Person=1|Tense=Futנהיה
Number=Plur|Person=1|Tense=Pastהיינו
Number=Plur|Person=2|Tense=Pastהייתם
Number=Plur|Person=3|Tense=Futיהיו
Number=Plur|Person=3|Tense=Pastהיו

NUM

1384 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (1007; 73%), Definite=EMPTY (960; 69%).

NUM tokens may have the following values of Gender:

Paradigm שמונהMascFem
Definite=Consשמונה
שמונהשמונה

Gender seems to be lexical feature of NUM. 99% lemmas (70) occur only with one value of Gender.

DET

18 DET tokens (0% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=EMPTY (18; 100%).

DET tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6169; 97%), VERB –[nsubj]–> NOUN (3183; 67%), NOUN –[nmod]–> NOUN (2715; 52%), NOUN –[nmod:poss]–> PRON (1348; 50%), NOUN –[acl:relcl]–> VERB (1309; 62%), NOUN –[conj]–> NOUN (1213; 60%), VERB –[conj]–> VERB (1023; 70%), VERB –[nsubj]–> PRON (762; 76%), NOUN –[det]–> PRON (619; 90%), NOUN –[nmod:poss]–> NOUN (521; 52%).