home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: Features: Number

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing. Some words have combined values of the feature; 2 combinations have been observed: Dual|Plur, Plur|Sing.

67539 tokens (42%) have a non-empty value of Number. 13229 types (74%) occur at least once with a non-empty value of Number. 6689 lemmas (64%) occur at least once with a non-empty value of Number. The feature is used with 6 part-of-speech tags: NOUN (37706; 23% instances), VERB (12823; 8% instances), ADJ (7901; 5% instances), PRON (7125; 4% instances), NUM (1384; 1% instances), AUX (600; 0% instances).

NOUN

37706 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=EMPTY (25940; 69%), Gender=Masc (23438; 62%).

NOUN tokens may have the following values of Number:

Paradigm שנהSingDualPlur
Definite=Consשנתשנות
Definite=Cons|HebSource=ConvUncertainHeadשנות
Definite=Defשנה_
שנהשנתייםשנים
HebSource=ConvUncertainHeadשנתיים

VERB

12823 VERB tokens (81% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Polarity=EMPTY (11101; 87%), VerbType=EMPTY (11101; 87%), Person=3 (8131; 63%), VerbForm=EMPTY (7865; 61%), Gender=Masc (7388; 58%), Voice=Act (6805; 53%).

VERB tokens may have the following values of Number:

Paradigm היהSingPlur
Gender=Masc|HebSource=ConvUncertainHead|Person=3|Tense=Pastהיה
Gender=Masc|Mood=Imp|Person=2הייה, היה
Gender=Masc|Person=2|Tense=Futתהיה
Gender=Masc|Person=2|Tense=Pastהייתהייתם
Gender=Masc|Person=3|Tense=Futיהיה
Gender=Masc|Person=3|Tense=Pastהיה
Gender=Fem,Masc|HebSource=ConvUncertainHead|Person=3|Tense=Pastהיו
Gender=Fem,Masc|Person=1|Tense=Futנהיה
Gender=Fem,Masc|Person=1|Tense=Pastהייתיהיינו
Gender=Fem,Masc|Person=3|Tense=Futיהיו
Gender=Fem,Masc|Person=3|Tense=Pastהיו
Gender=Fem|HebSource=ConvUncertainHead|Person=3|Tense=Futתהיה
Gender=Fem|HebSource=ConvUncertainHead|Person=3|Tense=Pastהיתה
Gender=Fem|Person=3|Tense=Futתהיהתהיינה
Gender=Fem|Person=3|Tense=Pastהיתה, הייתה

ADJ

7901 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (4837; 61%).

ADJ tokens may have the following values of Number:

PRON

7125 PRON tokens (97% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=3 (6448; 90%), PronType=Prs (5890; 83%), Gender=Masc (4742; 67%), Case=EMPTY (4437; 62%).

PRON tokens may have the following values of Number:

NUM

1384 NUM tokens (42% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: Definite=EMPTY (960; 69%), Gender=Masc (885; 64%).

NUM tokens may have the following values of Number:

Paradigm שניSingPlur
Definite=Consשני
שני

Number seems to be lexical feature of NUM. 99% lemmas (70) occur only with one value of Number.

AUX

600 AUX tokens (71% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbType=Mod (600; 100%), Tense=EMPTY (494; 82%), VerbForm=EMPTY (486; 81%), Person=1,2,3 (472; 79%), Gender=Masc (439; 73%).

AUX tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (6174; 97%), NOUN –[compound:smixut]–> NOUN (4291; 59%), VERB –[nsubj]–> NOUN (4122; 87%), NOUN –[nmod]–> NOUN (3245; 63%), VERB –[obl]–> NOUN (2868; 51%), NOUN –[acl:relcl]–> VERB (1707; 83%), NOUN –[nmod:poss]–> PRON (1706; 64%), NOUN –[conj]–> NOUN (1528; 76%), VERB –[conj]–> VERB (1137; 79%), VERB –[nsubj]–> PRON (978; 96%).