Treebank Statistics: UD_Hebrew-HTB: Features: Definite
This feature is universal, but the value Cons is language-specific.
It occurs with 2 different values: Cons, Def.
13389 tokens (8%) have a non-empty value of Definite.
3051 types (17%) occur at least once with a non-empty value of Definite.
2067 lemmas (20%) occur at least once with a non-empty value of Definite.
The feature is used with 7 part-of-speech tags: NOUN (11797; 7% of all tokens), DET (885; 1% of all tokens), NUM (425; 0% of all tokens), ADJ (104; 0% of all tokens), VERB (84; 0% of all tokens), PRON (83; 0% of all tokens), ADP (11; 0% of all tokens).
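Counts like the ones above can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, assuming the standard 10-column tab-separated layout (UPOS in column 4, FEATS in column 6). The function name `count_feature_by_upos` and the sample rows are hypothetical illustrations, not taken from UD_Hebrew-HTB or from the tool that generated this page.

```python
from collections import Counter


def count_feature_by_upos(conllu_text, feature="Definite"):
    """Count tokens per UPOS tag that carry a non-empty value of `feature`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        tok_id, upos, feats = cols[0], cols[3], cols[5]
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in tok_id or "." in tok_id:
            continue
        if feats == "_":
            continue
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        if feature in pairs:
            counts[upos] += 1
    return counts


# Toy rows for illustration only (assumed, not copied from the treebank).
sample = "\n".join([
    "# text = toy example",
    "1\tבית\tבית\tNOUN\t_\tDefinite=Cons|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "2\tה\tה\tDET\t_\tPronType=Art\t3\tdet\t_\t_",
    "3\tספר\tספר\tNOUN\t_\tGender=Masc|Number=Sing\t1\tcompound\t_\t_",
])
result = count_feature_by_upos(sample)
print(dict(result))
```

Dividing each count by the total number of tokens with that tag gives the per-tag percentages reported in the sections below.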
NOUN
11797 NOUN tokens (31% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (8582; 73%), Gender=Masc (7287; 62%).
NOUN tokens may have the following values of Definite:
- Cons (9303; 79% of non-empty Definite): בית, משרד, יום, שר, תל, פי, ידי, ראש, חברת, בתי
- Def (2494; 21% of non-empty Definite): יד_, שם_, דבר_, בית_, חבר_, תפקיד_, פנים_, דרך_, חיים_, חלק_
- EMPTY (26249): משטרה, %, משפט, ממשלה, חברה, ארץ, שנים, פועל, שנה, ש”ח
| Paradigm בית | Def | Cons |
|---|---|---|
| Number=Sing | בית_ | בית |
| Number=Plur | בית_ | בתי |
DET
885 DET tokens (5% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: PronType=EMPTY (885; 100%).
DET tokens may have the following values of Definite:
- Cons (885; 100% of non-empty Definite): כל, כמה, רוב, הרבה, שום, מספר, אף, מרבית, מחצית, מעט
- EMPTY (16396): ה, ה_
Definite seems to be a lexical feature of DET. 100% of lemmas (19) occur with only one value of Definite.
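The "lexical feature" observation can be checked by grouping feature values by lemma and counting how many lemmas show a single value. The sketch below assumes the input is a list of (lemma, value) pairs already extracted for one part-of-speech tag. Whether an empty value counts as a value of its own is left to the caller and is not confirmed by this page. The helper name `single_value_lemma_share` and the toy pairs are hypothetical.

```python
from collections import defaultdict


def single_value_lemma_share(pairs):
    """Return (lemmas with exactly one distinct value, total lemmas)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, total


# Toy pairs for illustration only (assumed, not real treebank counts).
toy = [
    ("כל", "Cons"),
    ("כל", "Cons"),
    ("רוב", "Cons"),
    ("בית", "Cons"),
    ("בית", "Def"),
]
single, total = single_value_lemma_share(toy)
print(f"{single} of {total} lemmas occur with only one value")
```

A share of 100% means no lemma alternates between values, which suggests the value is tied to the lemma rather than to inflection.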
NUM
425 NUM tokens (13% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: Number=Plur (317; 75%), Gender=Masc (245; 58%).
NUM tokens may have the following values of Definite:
- Cons (425; 100% of non-empty Definite): שני, שתי, אחד, אלפי, מאות, עשרות, שלוש, שלושת, אחת, מיליוני
- EMPTY (2863): אחד, אחת, 1, 0, מיליון, אלף, 2, שלושה, 3, שני
Definite seems to be a lexical feature of NUM. 100% of lemmas (23) occur with only one value of Definite.
ADJ
104 ADJ tokens (1% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Number=Sing (71; 68%), Gender=Masc (66; 63%).
ADJ tokens may have the following values of Definite:
- Cons (104; 100% of non-empty Definite): חסר, רב, חסרת, חסרות, חסרי, גדול, דמוי, משולל, רבת, דל
- EMPTY (8312): אחרים, ראשון, לאומי, גדול, חדש, אחר, קשה, ראשונה, צריך, רבים
Definite seems to be a lexical feature of ADJ. 100% of lemmas (41) occur with only one value of Definite.
VERB
84 VERB tokens (1% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Person=1,2,3 (84; 100%), Tense=EMPTY (84; 100%), VerbForm=Part (84; 100%), Voice=Act (76; 90%), Gender=Masc (61; 73%), Number=Sing (44; 52%).
VERB tokens may have the following values of Definite:
- Cons (84; 100% of non-empty Definite): ממלא, מחזיקות, אוזלת, מצופה, נותן, רווי, יוצאת, לובשי, מיידי, מכבי
- EMPTY (14204): יש, אין, אמר, יכול, אומר, נראה, עבר, מדובר, היו, חולים
Definite seems to be a lexical feature of VERB. 100% of lemmas (54) occur with only one value of Definite.
PRON
83 PRON tokens (1% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Case=EMPTY (83; 100%), Person=3 (83; 100%), PronType=Prs (83; 100%), Number=Sing (66; 80%), Gender=Masc (44; 53%).
PRON tokens may have the following values of Definite:
- Def (83; 100% of non-empty Definite): אותו, אותה, אותם, אותן
- EMPTY (7623): _הוא, _הם, _היא, הוא, זה, היא, הם, זו, כך, אלה
ADP
11 ADP tokens (0% of all ADP tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADP and Definite co-occurred: Case=EMPTY (11; 100%).
ADP tokens may have the following values of Definite:
- Def (11; 100% of non-empty Definite): אותו_, אותו
- EMPTY (26575): ב, ל, של, של, את, מ, על, כ, עם, ל_