home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: POS Tags: ADV

There are 299 ADV lemmas (3%), 353 ADV types (2%) and 6108 ADV tokens (4%). Out of 16 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: לא, גם, רק, יותר, _, מה, עוד, כך, ביותר, אתמול

The 10 most frequent ADV types: לא, גם, רק, יותר, מה, עוד, כך, ביותר, אתמול, כבר

The 10 most frequent ambiguous lemmas: _ (VERB 420, NOUN 368, ADJ 231, ADP 190, ADV 174, PRON 130, CCONJ 113, AUX 99, X 86, SCONJ 47, PART 34, DET 33), מה (ADV 168, PRON 5), עוד (ADV 144, NOUN 1, PROPN 1), כך (PRON 205, ADV 119, PROPN 33), מי (ADV 96, PRON 3), שם (ADV 93, NOUN 87, VERB 18), כן (ADV 72, PRON 3, ADJ 1, NOUN 1), אי (ADV 62, NOUN 21), כאן (ADV 50, VERB 1), פחות (ADV 38, ADJ 3)

The 10 most frequent ambiguous types: יותר (ADV 232, VERB 1), עוד (ADV 141, ADP 4, NOUN 1), כך (PRON 205, ADV 119, PROPN 35), מי (ADV 96, NOUN 3), שם (ADV 93, NOUN 39, VERB 6), אין (VERB 154, ADV 76, AUX 17, NOUN 2), כן (ADV 73, PRON 51), אי (ADV 62, NOUN 20), כאן (ADV 50, VERB 1), מדי (ADV 43, NOUN 1)

Morphology

The form / lemma ratio of ADV is 1.180602 (the average of all parts of speech is 1.709692).

The 1st highest number of forms (46) was observed with the lemma “_”: אין, אל, אם, אף, ביה, בינוני, בסיבוב, בקלות, בתמים, דו, הותר, חזרה, חלילה, טובה, יתרה, כ, כאמור, כדומה, כו, כמו, כן, לאו, להתראות, למחר, לעילא, לשם, מין, מיניה, מעין, מערבית, מצויין, מתמיד, נא, נהדר, נוסף, נוספים, סוף, סמוך, עין, עצם, פתע, קל, רגלית, שלא, שלישית, תו.

The 2nd highest number of forms (4) was observed with the lemma “לבד”: לבד, לבדה, לבדו, לבדם.

The 3rd highest number of forms (4) was observed with the lemma “עוד”: עוד, עודו, עודן, עודנה.

ADV occurs with 5 features: Polarity (1055; 17% instances), PronType (390; 6% instances), HebSource (297; 5% instances), Prefix (235; 4% instances), Abbr (1; 0% instances)

ADV occurs with 6 feature-value pairs: Abbr=Yes, HebSource=ConvUncertainHead, HebSource=ConvUncertainLabel, Polarity=Neg, Prefix=Yes, PronType=Int

ADV occurs with 11 feature combinations. The most frequent feature combination is _ (4225 tokens). Examples: גם, רק, יותר, עוד, כך, ביותר, אתמול, כבר, שם, אז

Relations

ADV nodes are attached to their parents using 28 different relations: advmod (4018; 66% instances), det (524; 9% instances), dep (375; 6% instances), obl:tmod (231; 4% instances), fixed (196; 3% instances), advmod:phrase (154; 3% instances), nsubj (84; 1% instances), conj (76; 1% instances), obl (67; 1% instances), root (66; 1% instances), case (54; 1% instances), obj (38; 1% instances), nmod (36; 1% instances), aux:q (34; 1% instances), parataxis (34; 1% instances), iobj (25; 0% instances), advcl (22; 0% instances), nsubj:cop (17; 0% instances), nmod:poss (14; 0% instances), amod (11; 0% instances), appos (9; 0% instances), ccomp (6; 0% instances), acl (5; 0% instances), compound:smixut (5; 0% instances), acl:relcl (3; 0% instances), dislocated (2; 0% instances), acl:inf (1; 0% instances), conj:discourse (1; 0% instances)

Parents of ADV nodes belong to 15 different parts of speech: VERB (2841; 47% instances), NOUN (1103; 18% instances), ADJ (742; 12% instances), ADP (476; 8% instances), ADV (305; 5% instances), AUX (232; 4% instances), PRON (124; 2% instances), PROPN (68; 1% instances), (66; 1% instances), DET (51; 1% instances), CCONJ (46; 1% instances), NUM (39; 1% instances), SCONJ (12; 0% instances), PUNCT (2; 0% instances), X (1; 0% instances)

5285 (87%) ADV nodes are leaves.

456 (7%) ADV nodes have one child.

225 (4%) ADV nodes have two children.

142 (2%) ADV nodes have three or more children.

The highest child degree of a ADV node is 10.

Children of ADV nodes are attached using 30 different relations: dep (284; 20% instances), advmod (187; 13% instances), case (164; 11% instances), fixed (163; 11% instances), acl:relcl (155; 11% instances), punct (152; 11% instances), cc (84; 6% instances), conj (43; 3% instances), nsubj (35; 2% instances), det (27; 2% instances), case:gen (25; 2% instances), obl (23; 2% instances), mark (16; 1% instances), xcomp (13; 1% instances), advcl (12; 1% instances), case:acc (11; 1% instances), det:def (8; 1% instances), aux (6; 0% instances), ccomp (6; 0% instances), cop (4; 0% instances), nsubj:cop (4; 0% instances), acl:inf (3; 0% instances), iobj (3; 0% instances), acl (2; 0% instances), advmod:inf (2; 0% instances), appos (2; 0% instances), conj:discourse (2; 0% instances), dislocated (2; 0% instances), parataxis (2; 0% instances), obj (1; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: ADV (305; 21% instances), VERB (215; 15% instances), ADP (208; 14% instances), PUNCT (187; 13% instances), NOUN (130; 9% instances), CCONJ (126; 9% instances), DET (58; 4% instances), ADJ (56; 4% instances), PRON (52; 4% instances), SCONJ (37; 3% instances), PART (35; 2% instances), AUX (14; 1% instances), PROPN (9; 1% instances), NUM (7; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)