home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTknesset: POS Tags: ADJ

There are 680 ADJ lemmas (13%), 1101 ADJ types (13%) and 3090 ADJ tokens (5%). Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 3 in number of types and 8 in number of tokens.

The 10 most frequent ADJ lemmas: אחר, חשוב, גדול, נכון, רב, ראשון, יהודי, טוב, ערבי, חדש

The 10 most frequent ADJ types: חשוב, נכון, יהודי, גדול, טוב, שני, ראשון, אחרים, קשה, ראשונה

The 10 most frequent ambiguous lemmas: אחר (ADJ 82, ADP 20), גדול (ADJ 69, NOUN 3, ADV 1, PROPN 1), נכון (ADJ 64, ADV 2, INTJ 1), רב (ADJ 63, NOUN 37, ADV 2, PROPN 2, VERB 1), ראשון (ADJ 61, PROPN 5), יהודי (ADJ 60, NOUN 39, PROPN 1), טוב (ADJ 56, ADV 5, NOUN 5), ערבי (ADJ 53, NOUN 33), חדש (ADJ 48, NOUN 2), שני (ADJ 47, PROPN 2)

The 10 most frequent ambiguous types: נכון (ADJ 48, ADV 2, INTJ 1), יהודי (ADJ 35, NOUN 10, PROPN 1), גדול (ADJ 34, ADV 1, NOUN 1), טוב (ADJ 34, ADV 5, NOUN 5), שני (NUM 44, ADJ 33, PROPN 2), ראשון (ADJ 31, PROPN 5), קשה (ADJ 28, ADV 1), ראשונה (ADJ 27, ADV 1), אחר (ADJ 26, ADP 20), בר (ADJ 22, PROPN 2)

Morphology

The form / lemma ratio of ADJ is 1.619118 (the average of all parts of speech is 1.545540).

The 1st highest number of forms (6) was observed with the lemma “מסוים”: מסויים, מסויימים, מסוים, מסוימות, מסוימים, מסוימת.

The 2nd highest number of forms (5) was observed with the lemma “יהודי”: יהודי, יהודיות, יהודיים, יהודים, יהודית.

The 3rd highest number of forms (5) was observed with the lemma “ישראלי”: ישראלי, ישראליות, ישראליים, ישראלים, ישראלית.

ADJ occurs with 6 features: Gender (3090; 100% instances), Number (3090; 100% instances), NumType (75; 2% instances), Definite (39; 1% instances), Abbr (2; 0% instances), Typo (1; 0% instances)

ADJ occurs with 8 feature-value pairs: Abbr=Yes, Definite=Cons, Gender=Fem, Gender=Masc, NumType=Ord, Number=Plur, Number=Sing, Typo=Yes

ADJ occurs with 13 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (1351 tokens). Examples: חשוב, נכון, יהודי, גדול, טוב, אחר, ברור, חדש, לאומי, ערבי

Relations

ADJ nodes are attached to their parents using 29 different relations: amod (2282; 74% instances), conj (220; 7% instances), root (179; 6% instances), acl:relcl (90; 3% instances), ccomp (51; 2% instances), parataxis (46; 1% instances), advcl (41; 1% instances), obl (40; 1% instances), xcomp (24; 1% instances), compound (17; 1% instances), obj (15; 0% instances), nsubj (14; 0% instances), advmod (12; 0% instances), csubj (11; 0% instances), appos (10; 0% instances), nmod (9; 0% instances), fixed (6; 0% instances), acl (4; 0% instances), obl:unmarked (4; 0% instances), dep (3; 0% instances), nmod:poss (3; 0% instances), reparandum (2; 0% instances), case (1; 0% instances), discourse (1; 0% instances), flat (1; 0% instances), list (1; 0% instances), nsubj:outer (1; 0% instances), nsubj:pass (1; 0% instances), orphan (1; 0% instances)

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (2318; 75% instances), VERB (251; 8% instances), ADJ (193; 6% instances), (179; 6% instances), PROPN (88; 3% instances), PRON (32; 1% instances), ADV (10; 0% instances), NUM (8; 0% instances), ADP (3; 0% instances), INTJ (3; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), SYM (1; 0% instances)

1058 (34%) ADJ nodes are leaves.

1313 (42%) ADJ nodes have one child.

264 (9%) ADJ nodes have two children.

455 (15%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 9.

Children of ADJ nodes are attached using 36 different relations: det (1061; 29% instances), advmod (535; 14% instances), punct (416; 11% instances), nsubj (311; 8% instances), obl (232; 6% instances), conj (231; 6% instances), cc (186; 5% instances), mark (185; 5% instances), cop (126; 3% instances), case (72; 2% instances), csubj (54; 1% instances), advcl (51; 1% instances), compound (50; 1% instances), compound:affix (39; 1% instances), parataxis (37; 1% instances), nmod (24; 1% instances), ccomp (18; 0% instances), vocative (17; 0% instances), obl:unmarked (11; 0% instances), aux (10; 0% instances), acl:relcl (7; 0% instances), amod (7; 0% instances), obj (7; 0% instances), dislocated (5; 0% instances), nsubj:outer (5; 0% instances), xcomp (5; 0% instances), appos (4; 0% instances), nmod:poss (3; 0% instances), acl (2; 0% instances), dep (2; 0% instances), discourse (2; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), list (1; 0% instances), orphan (1; 0% instances), reparandum (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: DET (1084; 29% instances), ADV (569; 15% instances), NOUN (429; 12% instances), PUNCT (416; 11% instances), PRON (277; 7% instances), ADJ (193; 5% instances), VERB (187; 5% instances), CCONJ (186; 5% instances), SCONJ (175; 5% instances), AUX (98; 3% instances), ADP (77; 2% instances), PROPN (22; 1% instances), NUM (4; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)