home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: POS Tags: ADV

There are 319 ADV lemmas (3%), 374 ADV types (2%) and 6337 ADV tokens (4%). Out of 15 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: לא, גם, רק, יותר, _, מה, עוד, כך, ביותר, אתמול

The 10 most frequent ADV types: לא, גם, רק, יותר, מה, עוד, כך, ביותר, אתמול, כבר

The 10 most frequent ambiguous lemmas: _ (NOUN 368, AUX 268, VERB 251, ADJ 231, ADV 177, CCONJ 110, X 86, PRON 57, SCONJ 47, DET 33), מה (ADV 168, PRON 5), עוד (ADV 144, ADP 4, NOUN 1, PROPN 1), כך (PRON 205, ADV 119, PROPN 35), מי (ADV 96, PRON 3), שם (ADV 93, NOUN 87, VERB 18), כן (ADV 72, PRON 3, ADJ 1, NOUN 1), אי (ADV 62, NOUN 21), כאן (ADV 50, VERB 1), אף (CCONJ 53, ADV 46, DET 20, NOUN 13)

The 10 most frequent ambiguous types: יותר (ADV 232, VERB 1), עוד (ADV 141, ADP 4, NOUN 1), כך (PRON 205, ADV 119, PROPN 35), מי (ADV 96, NOUN 3), שם (ADV 93, NOUN 39, VERB 6), אין (VERB 154, ADV 76, AUX 17, NOUN 2), כן (ADV 73, PRON 51), אי (ADV 62, NOUN 20), אף (ADV 60, CCONJ 53, DET 20, NOUN 12), כאן (ADV 50, VERB 1)

Morphology

The form / lemma ratio of ADV is 1.172414 (the average of all parts of speech is 1.701251).

The 1st highest number of forms (48) was observed with the lemma “_”: אחר, אין, אל, אם, אף, ביה, בינוני, בסיבוב, בקלות, בתמים, דו, הותר, ו, חזרה, חלילה, טובה, יתרה, כ, כאמור, כדומה, כו, כמו, כן, לאו, להתראות, למחר, לעילא, לשם, מין, מיניה, מעין, מערבית, מצויין, מתמיד, נא, נהדר, נוסף, נוספים, סוף, סמוך, עין, עצם, פתע, קל, רגלית, שלא, שלישית, תו.

The 2nd highest number of forms (4) was observed with the lemma “לבד”: לבד, לבדה, לבדו, לבדם.

The 3rd highest number of forms (4) was observed with the lemma “עוד”: עוד, עודו, עודן, עודנה.

ADV occurs with 5 features: Polarity (1055; 17% instances), PronType (390; 6% instances), HebSource (297; 5% instances), Prefix (235; 4% instances), Abbr (1; 0% instances)

ADV occurs with 6 feature-value pairs: Abbr=Yes, HebSource=ConvUncertainHead, HebSource=ConvUncertainLabel, Polarity=Neg, Prefix=Yes, PronType=Int

ADV occurs with 11 feature combinations. The most frequent feature combination is _ (4454 tokens). Examples: גם, רק, יותר, עוד, כך, ביותר, אתמול, כבר, שם, אז

Relations

ADV nodes are attached to their parents using 22 different relations: advmod (4788; 76% instances), det (514; 8% instances), compound:affix (235; 4% instances), fixed (232; 4% instances), obl (90; 1% instances), nsubj (84; 1% instances), conj (76; 1% instances), root (66; 1% instances), case (54; 1% instances), obj (38; 1% instances), nmod (36; 1% instances), mark:q (34; 1% instances), advcl (22; 0% instances), nsubj:cop (17; 0% instances), nmod:poss (14; 0% instances), appos (9; 0% instances), acl (6; 0% instances), amod (6; 0% instances), ccomp (6; 0% instances), compound:smixut (5; 0% instances), acl:relcl (3; 0% instances), dislocated (2; 0% instances)

Parents of ADV nodes belong to 15 different parts of speech: VERB (2948; 47% instances), NOUN (1122; 18% instances), ADJ (755; 12% instances), ADP (491; 8% instances), ADV (335; 5% instances), AUX (291; 5% instances), PRON (126; 2% instances), PROPN (72; 1% instances), (66; 1% instances), DET (51; 1% instances), NUM (39; 1% instances), CCONJ (25; 0% instances), SCONJ (12; 0% instances), PUNCT (3; 0% instances), X (1; 0% instances)

5413 (85%) ADV nodes are leaves.

551 (9%) ADV nodes have one child.

232 (4%) ADV nodes have two children.

141 (2%) ADV nodes have three or more children.

The highest child degree of a ADV node is 10.

Children of ADV nodes are attached using 27 different relations: fixed (317; 21% instances), advmod (214; 14% instances), dep (175; 11% instances), case (163; 11% instances), acl:relcl (155; 10% instances), punct (152; 10% instances), cc (108; 7% instances), conj (51; 3% instances), nsubj (35; 2% instances), obl (27; 2% instances), case:gen (25; 2% instances), det (25; 2% instances), mark (16; 1% instances), advcl (13; 1% instances), xcomp (13; 1% instances), case:acc (11; 1% instances), cop (10; 1% instances), det:def (8; 1% instances), acl (7; 0% instances), ccomp (6; 0% instances), nsubj:cop (4; 0% instances), parataxis (3; 0% instances), appos (2; 0% instances), compound:affix (2; 0% instances), dislocated (2; 0% instances), discourse (1; 0% instances), obj (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: ADV (335; 22% instances), ADP (296; 19% instances), VERB (210; 14% instances), PUNCT (187; 12% instances), CCONJ (144; 9% instances), NOUN (132; 9% instances), DET (58; 4% instances), ADJ (56; 4% instances), PRON (53; 3% instances), SCONJ (37; 2% instances), AUX (19; 1% instances), PROPN (10; 1% instances), NUM (7; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)