home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: POS Tags: ADV

There are 332 ADV lemmas (3%), 400 ADV types (2%) and 6551 ADV tokens (4%). Out of 15 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: לא, גם, רק, יותר, _, מה, עוד, כך, ביותר, אתמול

The 10 most frequent ADV types: לא, גם, רק, יותר, מה, עוד, כך, ביותר, אתמול, כבר

The 10 most frequent ambiguous lemmas: _ (NOUN 365, VERB 326, ADJ 230, ADV 192, AUX 169, CCONJ 109, X 76, PRON 57, SCONJ 46, DET 33), מה (ADV 166, PRON 5), עוד (ADV 144, ADP 4, NOUN 1, PROPN 1), כך (PRON 204, ADV 117, PROPN 35), אפשר (ADV 98, VERB 34), מי (ADV 96, PRON 3), שם (ADV 91, NOUN 87, VERB 18), כן (ADV 71, PRON 3, ADJ 1, NOUN 1), אי (ADV 62, NOUN 21), אף (ADV 58, CCONJ 38, DET 20, NOUN 13)

The 10 most frequent ambiguous types: יותר (ADV 230, VERB 1), עוד (ADV 141, ADP 4, NOUN 1), כך (PRON 204, ADV 117, PROPN 35), מי (ADV 96, NOUN 3), אין (VERB 152, ADV 92, NOUN 2), שם (ADV 91, NOUN 39, VERB 6), אף (ADV 72, CCONJ 38, DET 20, NOUN 12), כן (ADV 72, PRON 51), אי (ADV 62, NOUN 20), יש (VERB 210, ADV 49)

Morphology

The form / lemma ratio of ADV is 1.204819 (the average of all parts of speech is 1.702584).

The 1st highest number of forms (61) was observed with the lemma “_”: אחר, אין, אל, אם, אסורים, אף, אפשר, ביה, בינוני, בסיבוב, בקלות, בתמים, דו, דומה, הותר, הרי, ו, חזרה, חלילה, טובה, יתרה, כ, כאמור, כדומה, כו, כמו, כן, לאו, להתראות, למחר, לעילא, לראשונה, מיניה, מעוניינות, מעוניינים, מעין, מערבית, מצויין, מתמיד, נא, נאלצים, נהדר, נוסף, נוספים, נכונים, סבור, סבורה, סבורים, סוף, סמוך, עומדת, עין, עצם, עתידה, פלוס, פתע, קל, רגלית, שלא, שלישית, תו.

The 2nd highest number of forms (4) was observed with the lemma “לבד”: לבד, לבדה, לבדו, לבדם.

The 3rd highest number of forms (4) was observed with the lemma “עוד”: עוד, עודו, עודן, עודנה.

ADV occurs with 4 features: Polarity (1034; 16% instances), PronType (386; 6% instances), Prefix (235; 4% instances), Abbr (1; 0% instances)

ADV occurs with 4 feature-value pairs: Abbr=Yes, Polarity=Neg, Prefix=Yes, PronType=Int

ADV occurs with 5 feature combinations. The most frequent feature combination is _ (4895 tokens). Examples: גם, רק, יותר, עוד, כך, ביותר, אתמול, כבר, אפשר, שם

Relations

ADV nodes are attached to their parents using 24 different relations: advmod (5322; 81% instances), compound:affix (235; 4% instances), fixed (230; 4% instances), root (176; 3% instances), conj (106; 2% instances), obl (91; 1% instances), nsubj (79; 1% instances), ccomp (42; 1% instances), advcl (37; 1% instances), obj (37; 1% instances), nmod (36; 1% instances), mark:q (34; 1% instances), acl:relcl (31; 0% instances), case (18; 0% instances), nmod:poss (14; 0% instances), nsubj:cop (14; 0% instances), dep (10; 0% instances), appos (9; 0% instances), acl (8; 0% instances), nsubj:outer (7; 0% instances), amod (6; 0% instances), compound:smixut (5; 0% instances), dislocated (2; 0% instances), parataxis (2; 0% instances)

Parents of ADV nodes belong to 14 different parts of speech: VERB (3061; 47% instances), NOUN (1171; 18% instances), ADJ (852; 13% instances), ADP (489; 7% instances), ADV (415; 6% instances), (176; 3% instances), PRON (144; 2% instances), PROPN (73; 1% instances), DET (51; 1% instances), AUX (41; 1% instances), NUM (40; 1% instances), CCONJ (25; 0% instances), SCONJ (12; 0% instances), X (1; 0% instances)

5409 (83%) ADV nodes are leaves.

534 (8%) ADV nodes have one child.

312 (5%) ADV nodes have two children.

296 (5%) ADV nodes have three or more children.

The highest child degree of a ADV node is 14.

Children of ADV nodes are attached using 30 different relations: punct (422; 18% instances), fixed (279; 12% instances), advmod (254; 11% instances), dep (175; 7% instances), xcomp (172; 7% instances), case (167; 7% instances), acl:relcl (153; 6% instances), cc (148; 6% instances), obl (125; 5% instances), mark (98; 4% instances), conj (71; 3% instances), advcl (65; 3% instances), nsubj (48; 2% instances), ccomp (30; 1% instances), det (29; 1% instances), cop (28; 1% instances), compound:affix (25; 1% instances), csubj (23; 1% instances), case:gen (20; 1% instances), parataxis (12; 1% instances), case:acc (11; 0% instances), acl (7; 0% instances), obj (7; 0% instances), dislocated (5; 0% instances), nsubj:cop (5; 0% instances), mark:q (3; 0% instances), appos (2; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), nsubj:outer (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: VERB (484; 20% instances), PUNCT (422; 18% instances), ADV (415; 17% instances), ADP (291; 12% instances), NOUN (204; 9% instances), CCONJ (188; 8% instances), SCONJ (118; 5% instances), PRON (74; 3% instances), ADJ (73; 3% instances), DET (59; 2% instances), AUX (30; 1% instances), PROPN (16; 1% instances), NUM (9; 0% instances), X (3; 0% instances), INTJ (1; 0% instances)