This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; one combination has been observed: Fem|Masc.

67075 tokens (42%) have a non-empty value of Gender. 13230 types (74%) occur at least once with a non-empty value of Gender. 6692 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (37499; 23% of all tokens), VERB (11302; 7%), ADJ (8289; 5%), PRON (7451; 5%), NUM (1369; 1%), AUX (1147; 1%), DET (18; 0%).
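Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page; the function names `parse_feats` and `gender_counts_by_upos` and the three-token sample are illustrative assumptions.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Count, per UPOS tag, the tokens that carry a non-empty Gender feature."""
    counts = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts, total

# Hypothetical sample sentence (not copied from the treebank).
SAMPLE = "\n".join([
    "1\tהוא\tהוא\tPRON\t_\tGender=Masc|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_",
    "2\tאמר\tאמר\tVERB\t_\tGender=Masc|Number=Sing|Person=3|Tense=Past\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

counts, total = gender_counts_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

The percentage is taken over all tokens in the input, which matches how the per-tag figures above relate to the overall 42%.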

NOUN

37499 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (27680; 74%), Definite=EMPTY (25799; 69%).
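A co-occurrence list like the one above could be computed roughly as follows. This is a sketch under assumptions: the helper name `cooccurring_values` is hypothetical, a missing feature is counted as `EMPTY`, and the cut-off of more than 50% is a guess at how the list is filtered, not something this page states.

```python
from collections import Counter

def cooccurring_values(feature_dicts, target="Gender", threshold=0.5):
    """List feature=value pairs seen on more than `threshold` of the tokens
    that have `target`; an absent feature is counted as EMPTY."""
    with_target = [f for f in feature_dicts if target in f]
    other_names = {name for f in with_target for name in f if name != target}
    counts = Counter()
    for feats in with_target:
        for name in other_names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    n = len(with_target)
    return [(fv, c, c / n) for fv, c in counts.most_common() if c / n > threshold]

# Hypothetical FEATS dicts for tokens of one part-of-speech tag.
TOKENS = [
    {"Gender": "Masc", "Number": "Sing"},
    {"Gender": "Fem", "Number": "Sing", "Definite": "Cons"},
    {"Gender": "Masc", "Number": "Plur"},
    {"Gender": "Fem", "Number": "Sing"},
    {"Number": "Sing"},  # no Gender, so it is ignored
]

for fv, c, share in cooccurring_values(TOKENS):
    print(f"{fv} ({c}; {share:.0%})")
```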

NOUN tokens may have the following values of Gender:

Gender seems to be a lexical feature of NOUN. 98% of lemmas (3955) occur with only one value of Gender.
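The "lexical feature" test amounts to checking how many lemmas are ever seen with more than one value. A minimal sketch follows, assuming the share is taken over lemmas that occur at least once with a non-empty Gender; the function name `single_valued_lemmas` and the placeholder lemmas are hypothetical.

```python
from collections import defaultdict

def single_valued_lemmas(pairs):
    """Return (count, share) of lemmas seen with exactly one Gender value.

    pairs: iterable of (lemma, gender) for tokens of one UPOS tag;
    gender is None when the token has no Gender feature.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)

# Hypothetical observations; the lemma names are placeholders.
PAIRS = [
    ("lemma_a", "Masc"), ("lemma_a", "Masc"),
    ("lemma_b", "Fem"),
    ("lemma_c", "Masc"), ("lemma_c", "Fem"),
    ("lemma_d", None),
]

count, share = single_valued_lemmas(PAIRS)
print(f"{share:.0%} of lemmas ({count}) occur with only one value of Gender")
```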

VERB

11302 VERB tokens (79% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (7671; 68%), VerbForm=EMPTY (7026; 62%), Voice=Act (6710; 59%), Person=3 (6611; 58%), Tense=Past (5664; 50%).

VERB tokens may have the following values of Gender:

Paradigm אמר (Gender values: Fem,Masc; Masc; Fem)
HebBinyan=PAAL|Number=Sing|Person=1,2,3|VerbForm=Part|Voice=Act: אומר / אומרת
HebBinyan=PAAL|Number=Sing|Person=1|Tense=Past|Voice=Act: אמרתי
HebBinyan=PAAL|Number=Sing|Person=2|Tense=Fut|Voice=Act: תאמר
HebBinyan=PAAL|Number=Sing|Person=2|Tense=Past|Voice=Act: אמרת
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Fut|Voice=Act: תאמר
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Past|Voice=Act: אמר / אמרה
HebBinyan=PAAL|Number=Plur|Person=1,2,3|VerbForm=Part|Voice=Act: אומרים / אומרות
HebBinyan=PAAL|Number=Plur|Person=1|Tense=Past|Voice=Act: אמרנו
HebBinyan=PAAL|Number=Plur|Person=3|Tense=Past|Voice=Act: אמרו
Mood=Imp|Number=Sing|Person=2: אמור
Number=Sing|Person=1|Tense=Fut: אומר
Number=Sing|Person=3|Tense=Fut: יאמר
Number=Plur|Person=3|Tense=Fut: יאמרו

ADJ

8289 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (5932; 72%).

ADJ tokens may have the following values of Gender:

PRON

7451 PRON tokens (97% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (6782; 91%), PronType=Prs (5796; 78%), Number=Sing (5499; 74%), Case=EMPTY (4866; 65%).

PRON tokens may have the following values of Gender:

Paradigm הוא (Gender values: Fem,Masc; Masc; Fem)
Case=Acc|Number=Sing|Person=1|PronType=Prs: _אני
Case=Acc|Number=Sing|Person=2|PronType=Prs: _אתה
Case=Acc|Number=Sing|Person=3|PronType=Prs: _הוא / _היא
Case=Acc|Number=Plur|Person=2|PronType=Prs: _אתם
Case=Acc|Number=Plur|Person=3|PronType=Prs: _הם / _הן
Case=Gen|Number=Sing|Person=1|PronType=Prs: _אני
Case=Gen|Number=Sing|Person=2|PronType=Prs: _אתה / _את
Case=Gen|Number=Sing|Person=3|PronType=Prs: _הוא / _היא
Case=Gen|Number=Plur|Person=1|PronType=Prs: _אנחנו
Case=Gen|Number=Plur|Person=2|PronType=Prs: _אתם
Case=Gen|Number=Plur|Person=3|PronType=Prs: _הם / _הן
Number=Sing|Person=1|PronType=Prs: אני, _אני
Number=Sing|Person=2|PronType=Prs: _אתה, אתה / _את, את
Number=Sing|Person=3|Polarity=Pos: הוא / היא
Number=Sing|Person=3|PronType=Prs: _הוא, הוא / _היא, היא
Number=Plur|Person=1|PronType=Prs: _אנחנו, אנו, אנחנו
Number=Plur|Person=2|PronType=Prs: _אתם, אתם
Number=Plur|Person=3|Polarity=Pos: הם / הן
Number=Plur|Person=3|PronType=Prs: _הם, הם / _הן, הן

NUM

1369 NUM tokens (42% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (994; 73%), Definite=EMPTY (949; 69%).

NUM tokens may have the following values of Gender:

Paradigm שמונה (Gender values: Masc; Fem)
Definite=Cons: שמונה
(no other features): שמונה / שמונה

Gender seems to be a lexical feature of NUM. 99% of lemmas (69) occur with only one value of Gender.

AUX

1147 AUX tokens (93% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbType=Cop (1147; 100%), Person=3 (1092; 95%), Number=Sing (881; 77%), Polarity=Pos (837; 73%), VerbForm=EMPTY (831; 72%), Tense=Past (681; 59%).

AUX tokens may have the following values of Gender:

Paradigm היה (Gender values: Fem,Masc; Masc; Fem)
Mood=Imp|Number=Sing|Person=2: הייה, היה
Number=Sing|Person=1|Tense=Past: הייתי
Number=Sing|Person=2|Tense=Fut: תהיה
Number=Sing|Person=2|Tense=Past: היית
Number=Sing|Person=3|Tense=Fut: יהיה / תהיה
Number=Sing|Person=3|Tense=Past: היה / היתה
Number=Plur|Person=1|Tense=Fut: נהיה
Number=Plur|Person=1|Tense=Past: היינו
Number=Plur|Person=2|Tense=Past: הייתם
Number=Plur|Person=3|Tense=Fut: יהיו
Number=Plur|Person=3|Tense=Past: היו

DET

18 DET tokens (0% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=EMPTY (18; 100%).

DET tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6139; 97%), VERB –[nsubj]–> NOUN (3221; 68%), NOUN –[nmod]–> NOUN (2755; 52%), NOUN –[nmod:poss]–> PRON (1338; 50%), NOUN –[acl:relcl]–> VERB (1321; 65%), NOUN –[conj]–> NOUN (1209; 60%), VERB –[conj]–> VERB (1034; 72%), VERB –[nsubj]–> PRON (763; 75%), NOUN –[det]–> PRON (617; 90%), NOUN –[nmod:poss]–> NOUN (513; 52%).
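Agreement figures of this kind can be approximated by comparing each node with its parent. The sketch below rests on assumptions that this page does not confirm: a pair is counted only when both nodes have a Gender value, the two agree when their value strings are identical, and the percentage is taken over the counted pairs of the same pattern. The function name `agreement_by_relation` and the token dict layout are hypothetical.

```python
from collections import Counter

def agreement_by_relation(sentence, feature="Gender"):
    """Map 'PARENT -[deprel]-> CHILD' to (agreeing pairs, share of agreeing pairs).

    sentence: list of dicts with keys id, head, upos, deprel, feats.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0, which is not a token
            continue
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue
        key = f'{parent["upos"]} -[{child["deprel"]}]-> {child["upos"]}'
        total[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}

# Hypothetical sentence: one noun with two adjectival modifiers.
SENTENCE = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Masc"}},
]

for pattern, (n, share) in agreement_by_relation(SENTENCE).items():
    print(f"{pattern} ({n}; {share:.0%})")
```

For a whole treebank, the two counters would be accumulated across sentences before the shares are computed.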