home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: POS Tags: ADJ

There are 1292 ADJ lemmas (12%), 2481 ADJ types (13%) and 8032 ADJ tokens (5%). Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent ADJ lemmas: _, אחר, רב, חדש, גדול, לאומי, אחרון, ישראלי, אמריקני, טוב

The 10 most frequent ADJ types: אחרים, ראשון, גדול, לאומי, חדש, אחר, ראשונה, רבים, רב, טוב

The 10 most frequent ambiguous lemmas: _ (VERB 420, NOUN 368, ADJ 231, ADP 190, ADV 174, PRON 130, CCONJ 113, AUX 99, X 86, SCONJ 47, PART 34, DET 33), אחר (ADJ 214, ADP 47), רב (ADJ 177, NOUN 48, VERB 35, ADV 3), גדול (ADJ 148, NOUN 1), לאומי (ADJ 129, PROPN 1), ישראלי (ADJ 106, NOUN 12), אמריקני (ADJ 98, NOUN 16), טוב (ADJ 97, NOUN 11, ADV 3), נוסף (ADJ 93, VERB 18), שונה (ADJ 83, VERB 3)

The 10 most frequent ambiguous types: ראשון (ADJ 77, PROPN 19, NUM 5), גדול (ADJ 70, NOUN 1), לאומי (ADJ 68, PROPN 1), אחר (ADJ 54, ADP 47, CCONJ 2), רבים (ADJ 49, VERB 35), רב (ADJ 47, NOUN 39, ADV 3), טוב (ADJ 46, NOUN 7, ADV 3), ישראלי (ADJ 43, NOUN 5), אחרת (ADJ 39, ADV 9), יהודי (ADJ 38, NOUN 14)

Morphology

The form / lemma ratio of ADJ is 1.920279 (the average of all parts of speech is 1.709692).

The 1st highest number of forms (56) was observed with the lemma “_”: אזוטרי, אידאליסטי, אלמנטרי, אקספרסיווי, אשמה, בית”ריות, בכורה, בלטיות, בלטית, בנאליות, גרמניה, דומה, דרמאטי, הבעתי, המוצעת, ויזואלית, חובבניות, חוליגאנים, טכסיים, טרפים, יהודיה, יורקית, ייצוגי, יתר, כווייתים, כולל, כוללת, לבדך, מגוייס, מגוררת, מדוייקות, מהולל, מהוללת, מהפכני, נ”ל, סבאי, סיטרי, סימבולי, סימבולית, סיעודי, ספורטיווית, עיצוביים, עלוב, פונדמליסטיים, פלורנטיני, קולינארית, קולינרי, קיים, ראשון, ראשונה, ראשונות, ראשונים, רצוי, רשמית, שיפוטיות, תיאמן.

The 2nd highest number of forms (7) was observed with the lemma “ותיק”: וותיק, וותיקה, וותיקות, וותיקים, ותיק, ותיקה, ותיקים.

The 3rd highest number of forms (7) was observed with the lemma “יהודי”: יהודי, יהודיה, יהודיות, יהודייה, יהודיים, יהודים, יהודית.

ADJ occurs with 5 features: Gender (7901; 98% instances), Number (7901; 98% instances), HebSource (146; 2% instances), Definite (106; 1% instances), Abbr (10; 0% instances)

ADJ occurs with 8 feature-value pairs: Abbr=Yes, Definite=Cons, Gender=Fem, Gender=Masc, HebSource=ConvUncertainHead, HebSource=ConvUncertainLabel, Number=Plur, Number=Sing

ADJ occurs with 27 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (3230 tokens). Examples: ראשון, לאומי, גדול, חדש, אחר, טוב, אמריקאי, רב, יהודי, ישראלי

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (6481; 81% instances), conj (417; 5% instances), root (263; 3% instances), dep (174; 2% instances), acl:relcl (145; 2% instances), advmod (89; 1% instances), nmod (79; 1% instances), ccomp (65; 1% instances), iobj (59; 1% instances), obl (55; 1% instances), advcl (37; 0% instances), appos (32; 0% instances), nsubj (31; 0% instances), parataxis (20; 0% instances), flat:name (15; 0% instances), conj:discourse (13; 0% instances), fixed (11; 0% instances), compound:smixut (10; 0% instances), obj (10; 0% instances), nmod:poss (9; 0% instances), det (6; 0% instances), acl (5; 0% instances), nsubj:cop (4; 0% instances), advmod:phrase (1; 0% instances), case (1; 0% instances)

Parents of ADJ nodes belong to 15 different parts of speech: NOUN (6713; 84% instances), VERB (420; 5% instances), ADJ (392; 5% instances), (263; 3% instances), PROPN (106; 1% instances), ADV (56; 1% instances), PRON (24; 0% instances), AUX (23; 0% instances), ADP (10; 0% instances), X (10; 0% instances), DET (5; 0% instances), NUM (4; 0% instances), CCONJ (3; 0% instances), SCONJ (2; 0% instances), PUNCT (1; 0% instances)

3401 (42%) ADJ nodes are leaves.

3309 (41%) ADJ nodes have one child.

528 (7%) ADJ nodes have two children.

794 (10%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 12.

Children of ADJ nodes are attached using 36 different relations: det:def (3037; 39% instances), advmod (1028; 13% instances), punct (885; 11% instances), nsubj (457; 6% instances), conj (446; 6% instances), cc (401; 5% instances), mark (283; 4% instances), dep (224; 3% instances), obl (216; 3% instances), aux (190; 2% instances), case (148; 2% instances), compound:smixut (84; 1% instances), iobj (69; 1% instances), advcl (50; 1% instances), cop (48; 1% instances), nsubj:cop (46; 1% instances), xcomp (20; 0% instances), acl:relcl (17; 0% instances), case:gen (16; 0% instances), amod (15; 0% instances), parataxis (15; 0% instances), conj:discourse (11; 0% instances), ccomp (10; 0% instances), nummod (10; 0% instances), appos (9; 0% instances), det (8; 0% instances), advmod:phrase (7; 0% instances), fixed (6; 0% instances), obj (5; 0% instances), case:acc (4; 0% instances), nmod:poss (4; 0% instances), aux:q (3; 0% instances), acl (2; 0% instances), det:quant (1; 0% instances), dislocated (1; 0% instances), obl:tmod (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: DET (3077; 40% instances), NOUN (971; 12% instances), PUNCT (912; 12% instances), ADV (742; 10% instances), VERB (525; 7% instances), CCONJ (422; 5% instances), ADJ (392; 5% instances), SCONJ (294; 4% instances), PRON (152; 2% instances), ADP (144; 2% instances), PROPN (63; 1% instances), AUX (34; 0% instances), NUM (23; 0% instances), PART (20; 0% instances), X (6; 0% instances)