home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-PostRab: POS Tags: NUM

There are 45 NUM lemmas (3%), 57 NUM types (3%) and 144 NUM tokens (2%). Out of 13 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: אחת, שתיים, שני, ארבע, עשרה, ד, שלוש, י, ראשון, א

The 10 most frequent NUM types: אחד, שני, ארבע, אחת, ד, י’, עשר, עשרה, ראשון, שניה

The 10 most frequent ambiguous lemmas: שני (NUM 9, ADJ 3), ד (SCONJ 30, NUM 6, ADP 1), ראשון (ADJ 7, NUM 5), י”ב (NUM 2, ADV 1), מחצה (NOUN 2, NUM 2), אחד (NOUN 1, NUM 1), ב (ADP 386, NUM 1), ה (DET 512, SCONJ 11, NUM 1, PROPN 1), ה’ (PROPN 6, NUM 1), יד (NOUN 33, NUM 1)

The 10 most frequent ambiguous types: אחד (NUM 29, NOUN 1), שני (NUM 11, ADJ 3), ד (SCONJ 30, NUM 4, ADP 1), ראשון (ADJ 4, NUM 4), שנים (NOUN 3, NUM 3), י”ב (NUM 2, ADV 1), שתי (NUM 2, NOUN 1), ה (DET 512, PRON 102, SCONJ 11, NUM 1), ה’ (PROPN 6, NUM 1), י (PRON 37, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.266667 (the average of all parts of speech is 1.440168).

The 1st highest number of forms (5) was observed with the lemma “שתיים”: שני, שניה, שנים, שתי, שתים.

The 2nd highest number of forms (4) was observed with the lemma “שלוש”: ג׳, שלוש, שלש, שלשה.

The 3rd highest number of forms (2) was observed with the lemma “א”: א, א’.

NUM occurs with 2 features: Number (106; 74% instances), Gender (105; 73% instances)

NUM occurs with 6 feature-value pairs: Gender=Fem, Gender=Fem,Masc, Gender=Masc, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 9 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (46 tokens). Examples: אחד, ראשון, מחצה, עשר, שני, חד, חמישי, רביעי, רמח, רפה

Relations

NUM nodes are attached to their parents using 23 different relations: nummod (76; 53% instances), obl (13; 9% instances), compound:smixut (10; 7% instances), conj (10; 7% instances), nsubj (6; 4% instances), obj (4; 3% instances), fixed (3; 2% instances), nmod (3; 2% instances), amod (2; 1% instances), nsubj:cop (2; 1% instances), obl:unmarked (2; 1% instances), xcomp (2; 1% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), dep (1; 1% instances), dislocated (1; 1% instances), flat (1; 1% instances), nmod:poss (1; 1% instances), nmod:tmod (1; 1% instances), nmod:unmarked (1; 1% instances), parataxis (1; 1% instances), root (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (93; 65% instances), VERB (27; 19% instances), NUM (14; 10% instances), PRON (5; 3% instances), ADJ (2; 1% instances), PROPN (2; 1% instances), (1; 1% instances)

93 (65%) NUM nodes are leaves.

36 (25%) NUM nodes have one child.

9 (6%) NUM nodes have two children.

6 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 15 different relations: case (19; 25% instances), nmod (12; 16% instances), det (10; 13% instances), conj (8; 11% instances), cc (7; 9% instances), mark (5; 7% instances), fixed (3; 4% instances), nsubj:cop (3; 4% instances), compound:smixut (2; 3% instances), flat (2; 3% instances), acl:relcl (1; 1% instances), cop (1; 1% instances), nmod:tmod (1; 1% instances), nummod (1; 1% instances), obl (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: ADP (20; 26% instances), NUM (14; 18% instances), NOUN (13; 17% instances), DET (10; 13% instances), CCONJ (7; 9% instances), PRON (5; 7% instances), SCONJ (3; 4% instances), ADV (1; 1% instances), AUX (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)