
This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTwiki: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; one combination has been observed: Fem|Masc.

59939 tokens (43%) have a non-empty value of Gender. 10723 types (75%) occur at least once with a non-empty value of Gender. 6238 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (34541; 25% of instances), VERB (8830; 6% of instances), ADJ (8672; 6% of instances), PRON (5266; 4% of instances), AUX (922; 1% of instances), NUM (858; 1% of instances), PROPN (780; 1% of instances), SYM (58; 0% of instances), DET (12; 0% of instances).
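Per-tag counts of this kind could be recomputed from the treebank's CoNLL-U files roughly as follows. This is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `gender_counts_by_upos` are hypothetical, the three-token sample is invented, and it assumes the standard 10-column CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with a Gender value per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only ordinary tokens: skip ranges like 1-2 and empty nodes like 1.1.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

# Invented sample rows (placeholder forms and lemmas, not treebank data).
ROWS = [
    ["1", "w1", "l1", "NOUN", "_", "Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["2", "w2", "l2", "ADJ", "_", "Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
    ["3", "w3", "l3", "ADP", "_", "_", "1", "case", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections below, for example the share of NOUN tokens that carry a Gender value.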

NOUN

34541 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (25487; 74%), Definite=EMPTY (25149; 73%).

NOUN tokens may have the following values of Gender:

Paradigm פנים (columns: Fem,Masc; Masc; Fem)
Definite=Cons|Number=Plurפניפניפני
Number=Singפנים
Number=Plurפניפני, פניםפנים, פני

Gender seems to be a lexical feature of NOUN: 96% of lemmas (3642) occur with only one value of Gender.
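One way to read the "lexical feature" test is: group the observed Gender values by lemma and count how many lemmas show exactly one value. The sketch below follows that reading. It is an assumption about the method, not the generator's actual code; `single_value_share` is a hypothetical name, the lemma labels are invented, and a combined value such as `Fem,Masc` is treated as a distinct value of its own.

```python
from collections import defaultdict

def single_value_share(pairs):
    """pairs: iterable of (lemma, gender_value) observations for one POS tag.

    Returns (lemmas seen with exactly one Gender value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Invented observations: lemma_a varies in Gender, the other two do not.
observations = [
    ("lemma_a", "Masc"),
    ("lemma_a", "Fem"),
    ("lemma_b", "Fem"),
    ("lemma_b", "Fem"),
    ("lemma_c", "Masc"),
]
single, total = single_value_share(observations)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that Gender is fixed per lemma rather than inflectional for that tag.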

VERB

8830 VERB tokens (83% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (8655; 98%), VerbForm=EMPTY (6222; 70%), Number=Sing (6179; 70%), Tense=Past (5806; 66%), Voice=Act (5432; 62%).

VERB tokens may have the following values of Gender:

Paradigm כלל (columns: Fem,Masc; Masc; Fem)
HebBinyan=HIFIL|Number=Sing|Tense=Past|Voice=Actכללה
HebBinyan=PAAL|Number=Sing|Tense=Fut|Voice=Actיכלול
HebBinyan=PAAL|Number=Sing|Tense=Past|Voice=Actכללכללה
HebBinyan=PAAL|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actכולל, כללכוללת
HebBinyan=PAAL|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Passכלול
HebBinyan=PAAL|Number=Sing|VerbForm=Part|Voice=Actכולל
HebBinyan=PAAL|Number=Plur|Tense=Past|Voice=Actכללוכללו
HebBinyan=PAAL|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actכולליםכוללות
HebBinyan=PAAL|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Passכלולים
HebBinyan=PIEL|Number=Plur|Tense=Past|Voice=Actכללו

ADJ

8672 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (6285; 72%).

ADJ tokens may have the following values of Gender:

Paradigm רב (columns: Masc; Fem)
Definite=Cons|Number=Singרברבת
Definite=Cons|Number=Plurרבי
Number=Singרברבה
Number=Plurרביםרבות

PRON

5266 PRON tokens (93% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (5070; 96%), PronType=Prs (4339; 82%), Number=Sing (3978; 76%), Poss=EMPTY (3142; 60%), Case=EMPTY (3086; 59%), Definite=EMPTY (2958; 56%).

PRON tokens may have the following values of Gender:

Paradigm הוא (columns: Fem,Masc; Masc; Fem)
Case=Acc|Definite=Def|Number=Sing|Person=3ו
Case=Acc|Definite=Def|Number=Plur|Person=3ם
Case=Acc|Number=Sing|Person=1ני
Case=Acc|Number=Sing|Person=3ו, הו, וֹה
Case=Acc|Number=Plur|Person=3ם
Case=Gen|Definite=Def|Number=Sing|Person=1|Poss=Yesיי
Case=Gen|Definite=Def|Number=Sing|Person=2|Poss=Yesךך, ה
Case=Gen|Definite=Def|Number=Sing|Person=3|Poss=Yesו, ם, וֹ, י, ןה, הּ, ו, ך, ם
Case=Gen|Definite=Def|Number=Sing|Person=3|Poss=Yes|Typo=Yesהם
Case=Gen|Definite=Def|Number=Sing|Person=3ו
Case=Gen|Definite=Def|Number=Sing|Poss=Yesו
Case=Gen|Definite=Def|Number=Plur|Person=1|Poss=Yesנו
Case=Gen|Definite=Def|Number=Plur|Person=3|Poss=Yesם, הםן, הן, ם, ה
Case=Gen|Definite=Def|Number=Plur|Person=3|Poss=Yes|Typo=Yesם, ן
Case=Gen|Number=Sing|Person=3|Poss=Yesו
Definite=Def|Number=Sing|Person=3וה
Number=Sing|Person=1אני, יאני, י, ניאני
Number=Sing|Person=2אתה, ך, ךָאת, ך
Number=Sing|Person=3|Polarity=Posהוא, היא, הםהיא, הוא, י
Number=Sing|Person=3ו, הוא, ך, וֹ, ה, יו, םה, היא, ך, את, הן
Number=Sing|Person=3|Typo=Yesהו
Number=Sing|Polarity=Posהואהיא
Number=Plur|Person=1נונו, אנחנו, אנו
Number=Plur|Person=2כם
Number=Plur|Person=3|Polarity=Posהם, םהן
Number=Plur|Person=3הם, ם, ןהן, ן, הם
Number=Plur|Person=3|Typo=Yesהם
Number=Plur|Polarity=Posהם
Number=Plurהן

AUX

922 AUX tokens (96% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=EMPTY (799; 87%), Person=3 (712; 77%), Number=Sing (647; 70%), VerbType=EMPTY (591; 64%), HebBinyan=PAAL (541; 59%), Tense=Past (499; 54%).

AUX tokens may have the following values of Gender:

Paradigm היה (columns: Fem,Masc; Masc; Fem)
Number=Sing|Person=1|Polarity=Pos|Tense=Pastהייתיהייתי
Number=Sing|Person=1|Polarity=Pos|Tense=Past|VerbType=Copהייתי
Number=Sing|Person=1|Tense=Futאהיה
Number=Sing|Person=1|Tense=Past|VerbType=Copהייתי
Number=Sing|Person=3הייתה
Number=Sing|Person=3|Polarity=Pos|Tense=Futיהיהתהיה
Number=Sing|Person=3|Polarity=Pos|Tense=Fut|VerbType=Copיהיהתהיה
Number=Sing|Person=3|Polarity=Pos|Tense=Pastהיההייתה
Number=Sing|Person=3|Polarity=Pos|Tense=Past|Typo=Yes|VerbType=Copהייתה, היה
Number=Sing|Person=3|Polarity=Pos|Tense=Past|VerbType=Copהיההייתה
Number=Sing|Person=3|Tense=Futיהיהתהא
Number=Sing|Person=3|Tense=Fut|VerbType=Copיהיה, יהאתהיה, תהא
Number=Sing|Person=3|Tense=Pastהיההייתה
Number=Sing|Person=3|Tense=Past|VerbType=Copהיההייתה
Number=Sing|Person=3|VerbType=Copהייתה
Number=Plur|Person=1|Polarity=Pos|Tense=Pastהיינו
Number=Plur|Person=1|Tense=Fut|VerbType=Copנהיה
Number=Plur|Person=3|Polarity=Pos|Tense=Futיהיויהיויהיו
Number=Plur|Person=3|Polarity=Pos|Tense=Fut|VerbType=Copיהיותהיינה
Number=Plur|Person=3|Polarity=Pos|Tense=Pastהיוהיוהיו
Number=Plur|Person=3|Polarity=Pos|Tense=Past|Typo=Yes|VerbType=Copהיה
Number=Plur|Person=3|Polarity=Pos|Tense=Past|VerbType=Copהיוהיוהיו
Number=Plur|Person=3|Tense=Fut|VerbType=Copיהיו
Number=Plur|Person=3|Tense=Pastהיוהיו
Number=Plur|Person=3|Tense=Past|VerbType=Copהיוהיוהיו

NUM

858 NUM tokens (27% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (731; 85%).

NUM tokens may have the following values of Gender:

Paradigm שלושים (columns: Fem,Masc; Masc; Fem)
_שלושיםשלושים
NumType=Cardשלושיםשלושים

PROPN

780 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Paradigm עין (columns: Masc; Fem)
Definite=Consעין
עיןעין

Gender seems to be a lexical feature of PROPN: 99% of lemmas (262) occur with only one value of Gender.

SYM

58 SYM tokens (40% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

DET

12 DET tokens (0% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=EMPTY (12; 100%), Definite=Cons (11; 92%).

DET tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6646; 99%), NOUN –[compound]–> NOUN (3613; 52%), VERB –[nsubj]–> NOUN (2822; 88%), NOUN –[nmod]–> NOUN (2225; 51%), NOUN –[acl:relcl]–> VERB (1841; 82%), NOUN –[conj]–> NOUN (1309; 62%), NOUN –[nmod:poss]–> PRON (1208; 55%), VERB –[conj]–> VERB (1085; 79%), NOUN –[nmod:poss]–> NOUN (748; 52%), VERB –[nsubj:pass]–> NOUN (717; 96%).
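Agreement counts like these could be gathered by comparing each token's Gender with that of its syntactic parent. The following is a minimal sketch under stated assumptions: `gender_of` and `agreement_counts` are hypothetical names, the sample sentence is invented, and the percentage is assumed to be taken over parent-child pairs in which both nodes carry a Gender value, which this page does not confirm.

```python
from collections import Counter

def gender_of(feats):
    """Extract the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair[len("Gender="):]
    return None

def agreement_counts(sentence_rows):
    """sentence_rows: 10-column CoNLL-U token rows of one sentence.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs that agree in Gender, and all pairs where both have Gender.
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, both = Counter(), Counter()
    for row in sentence_rows:
        parent = by_id.get(row[6])
        if parent is None:
            continue  # root (HEAD=0) has no parent token
        child_gender = gender_of(row[5])
        parent_gender = gender_of(parent[5])
        if child_gender is None or parent_gender is None:
            continue
        key = (parent[3], row[7], row[3])
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, both

# Invented sentence: one noun with one agreeing and one non-agreeing adjective.
sentence = [
    ["1", "w1", "l1", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    ["2", "w2", "l2", "ADJ", "_", "Gender=Fem|Number=Sing", "1", "amod", "_", "_"],
    ["3", "w3", "l3", "ADJ", "_", "Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
]
agree, both = agreement_counts(sentence)
key = ("NOUN", "amod", "ADJ")
print(key, agree[key], both[key])
```

Summing these counters over all sentences and ranking by `agree` would give a list in the same shape as the one above, with `agree[key] / both[key]` as the agreement rate for each relation.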