home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: Features: Number

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing. Some words have combined values of the feature; 2 combinations have been observed: Dual|Plur, Plur|Sing.

67539 tokens (42%) have a non-empty value of Number. 13229 types (74%) occur at least once with a non-empty value of Number. 6689 lemmas (64%) occur at least once with a non-empty value of Number. The feature is used with 6 part-of-speech tags: NOUN (37706; 23% instances), VERB (11272; 7% instances), ADJ (7901; 5% instances), PRON (7125; 4% instances), AUX (2151; 1% instances), NUM (1384; 1% instances).

NOUN

37706 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=EMPTY (25940; 69%), Gender=Masc (23438; 62%).

NOUN tokens may have the following values of Number:

Paradigm שנהSingDualPlur
Definite=Consשנתשנות
Definite=Cons|HebSource=ConvUncertainHeadשנות
Definite=Defשנה_
שנהשנתייםשנים
HebSource=ConvUncertainHeadשנתיים

VERB

11272 VERB tokens (79% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=EMPTY (7030; 62%), Voice=Act (6805; 60%), Person=3 (6636; 59%), Gender=Masc (6501; 58%), Tense=Past (5698; 51%).

VERB tokens may have the following values of Number:

Paradigm אמרSingPlur
Gender=Masc|HebBinyan=PAAL|HebSource=ConvUncertainHead|Person=3|Tense=Past|Voice=Actאמר
Gender=Masc|HebBinyan=PAAL|Person=1,2,3|VerbForm=Part|Voice=Actאומראומרים
Gender=Masc|HebBinyan=PAAL|Person=2|Tense=Fut|Voice=Actתאמר
Gender=Masc|HebBinyan=PAAL|Person=2|Tense=Past|Voice=Actאמרת
Gender=Masc|HebBinyan=PAAL|Person=3|Tense=Past|Voice=Actאמר
Gender=Masc|Mood=Imp|Person=2אמור
Gender=Masc|Person=3|Tense=Futיאמר
Gender=Fem,Masc|HebBinyan=PAAL|Person=1|Tense=Past|Voice=Actאמרתיאמרנו
Gender=Fem,Masc|HebBinyan=PAAL|Person=3|Tense=Past|Voice=Actאמרו
Gender=Fem,Masc|Person=1|Tense=Futאומר
Gender=Fem,Masc|Person=3|Tense=Futיאמרו
Gender=Fem|HebBinyan=PAAL|HebSource=ConvUncertainHead|Person=1,2,3|VerbForm=Part|Voice=Actאומרת
Gender=Fem|HebBinyan=PAAL|Person=1,2,3|VerbForm=Part|Voice=Actאומרתאומרות
Gender=Fem|HebBinyan=PAAL|Person=3|Tense=Fut|Voice=Actתאמר
Gender=Fem|HebBinyan=PAAL|Person=3|Tense=Past|Voice=Actאמרה

ADJ

7901 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (4837; 61%).

ADJ tokens may have the following values of Number:

PRON

7125 PRON tokens (97% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=3 (6448; 90%), PronType=Prs (5848; 82%), Gender=Masc (4742; 67%), Case=EMPTY (4520; 63%).

PRON tokens may have the following values of Number:

AUX

2151 AUX tokens (86% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbType=Cop (1551; 72%), Person=3 (1513; 70%), Gender=Masc (1326; 62%), VerbForm=EMPTY (1321; 61%), Polarity=Pos (1235; 57%), Tense=EMPTY (1213; 56%).

AUX tokens may have the following values of Number:

Paradigm היהSingPlur
Gender=Masc|Mood=Imp|Person=2הייה, היה
Gender=Masc|Person=2|Tense=Futתהיה
Gender=Masc|Person=2|Tense=Pastהייתהייתם
Gender=Masc|Person=3|Tense=Futיהיה
Gender=Masc|Person=3|Tense=Pastהיה
Gender=Fem,Masc|HebSource=ConvUncertainHead|Person=3|Tense=Pastהיו
Gender=Fem,Masc|Person=1|Tense=Futנהיה
Gender=Fem,Masc|Person=1|Tense=Pastהייתיהיינו
Gender=Fem,Masc|Person=3|Tense=Futיהיו
Gender=Fem,Masc|Person=3|Tense=Pastהיו
Gender=Fem|HebSource=ConvUncertainHead|Person=3|Tense=Futתהיה
Gender=Fem|Person=3|Tense=Futתהיה
Gender=Fem|Person=3|Tense=Pastהיתה

NUM

1384 NUM tokens (42% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: Definite=EMPTY (960; 69%), Gender=Masc (885; 64%).

NUM tokens may have the following values of Number:

Paradigm שניSingPlur
Definite=Consשני
שני

Number seems to be lexical feature of NUM. 99% lemmas (70) occur only with one value of Number.

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (6174; 97%), NOUN –[compound:smixut]–> NOUN (4291; 59%), VERB –[nsubj]–> NOUN (4099; 87%), NOUN –[nmod]–> NOUN (3282; 63%), NOUN –[nmod:poss]–> PRON (1706; 64%), NOUN –[acl:relcl]–> VERB (1670; 79%), NOUN –[conj]–> NOUN (1528; 76%), VERB –[conj]–> VERB (1122; 76%), VERB –[nsubj]–> PRON (973; 96%), NOUN –[det]–> PRON (673; 98%).