
This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: POS Tags: AUX

There are 42 AUX lemmas (0%), 99 AUX types (1%) and 843 AUX tokens (1%). Out of 16 observed tags, the rank of AUX is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent AUX lemmas: _, אפשר, יכול, צריך, יש, עלול, אמור, חייב, מוכן, עשוי

The 10 most frequent AUX types: אפשר, יכול, יש, צריך, קשה, יכולה, ייתכן, ניתן, אמור, עלול

The 10 most frequent ambiguous lemmas: _ (VERB 420, NOUN 368, ADJ 231, ADP 190, ADV 174, PRON 130, CCONJ 113, AUX 99, X 86, SCONJ 47, PART 34, DET 33), אפשר (AUX 98, VERB 34), יכול (AUX 84, VERB 2), צריך (AUX 68, ADJ 1), יש (VERB 214, AUX 49), אמור (AUX 39, ADJ 1), חייב (AUX 35, VERB 15, NOUN 3, ADJ 1), עשוי (AUX 29, ADJ 15), ניתן (VERB 43, AUX 27), קשה (ADJ 43, AUX 27, ADV 11)

The 10 most frequent ambiguous types: יש (VERB 211, AUX 49), צריך (AUX 49, ADJ 1), קשה (ADJ 29, AUX 27, ADV 11), יכולה (AUX 26, VERB 1), ניתן (AUX 23, VERB 17), אמור (AUX 22, VERB 2, ADJ 1), חייב (AUX 19, ADJ 1, NOUN 1, VERB 1), אין (VERB 154, ADV 76, AUX 17, NOUN 2), חשוב (ADJ 18, AUX 16), עשוי (AUX 14, ADJ 6, VERB 1)

Morphology

The form / lemma ratio of AUX is 2.357143 (the average of all parts of speech is 1.709692).
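The reported ratio appears to be the number of distinct AUX word forms (types) divided by the number of distinct AUX lemmas. The following minimal sketch checks that reading against the counts given earlier on this page (99 types, 42 lemmas). The function name `form_lemma_ratio` is hypothetical and used only for illustration.

```python
# Counts taken from the AUX summary on this page.
aux_types = 99    # distinct AUX word forms
aux_lemmas = 42   # distinct AUX lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (hypothetical helper)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(aux_types, aux_lemmas)
print(f"{ratio:.6f}")  # prints 2.357143
```

Under this assumption the result agrees with the figure reported above to six decimal places.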

The 1st highest number of forms (20) was observed with the lemma “_”: אסורים, אפשר, דומה, זקוק, יכול, יכולה, מאפשר, מאפשרות, מאפשרים, מאפשרת, מעוניינות, מעוניינים, נאלצים, נכונים, סבור, סבורה, סבורים, עומדת, עתידה, תוכל.

The 2nd highest number of forms (9) was observed with the lemma “יכול”: יוכל, יוכלו, יכול, יכולה, יכולות, יכולים, יכולנו, נוכל, תוכל.

The 3rd highest number of forms (4) was observed with the lemma “אמור”: אמור, אמורה, אמורות, אמורים.

AUX occurs with 7 features: VerbType (843; 100% instances), Gender (600; 71% instances), Number (600; 71% instances), Person (517; 61% instances), VerbForm (114; 14% instances), Tense (106; 13% instances), HebSource (23; 3% instances)

AUX occurs with 14 feature-value pairs: Gender=Fem, Gender=Fem,Masc, Gender=Masc, HebSource=ConvUncertainHead, Number=Plur, Number=Sing, Person=1, Person=1,2,3, Person=2, Person=3, Tense=Fut, Tense=Past, VerbForm=Part, VerbType=Mod

AUX occurs with 29 feature combinations. The most frequent feature combination is VerbType=Mod (233 tokens). Examples: אפשר, יש, יכול, ייתכן, צריך, אין, קשה, יכולה, ניתן, אמור

Relations

AUX nodes are attached to their parents using 15 different relations: root (360; 43% instances), acl:relcl (137; 16% instances), conj (119; 14% instances), ccomp (94; 11% instances), advcl (53; 6% instances), dep (30; 4% instances), obl (23; 3% instances), conj:discourse (7; 1% instances), iobj (7; 1% instances), acl (5; 1% instances), appos (3; 0% instances), advmod (2; 0% instances), aux (1; 0% instances), nsubj:cop (1; 0% instances), parataxis (1; 0% instances)
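As a sanity check on the list above, the sketch below recomputes each relation's share of the AUX tokens. It assumes the percentages are the raw counts divided by the total and rounded to the nearest integer; the variable names are illustrative only.

```python
# Relation counts for AUX nodes, copied from the list above.
relation_counts = {
    "root": 360, "acl:relcl": 137, "conj": 119, "ccomp": 94, "advcl": 53,
    "dep": 30, "obl": 23, "conj:discourse": 7, "iobj": 7, "acl": 5,
    "appos": 3, "advmod": 2, "aux": 1, "nsubj:cop": 1, "parataxis": 1,
}

# The counts should add up to the number of AUX tokens.
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

The counts sum to 843, the number of AUX tokens, and the rounded shares reproduce the percentages listed above.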

Parents of AUX nodes belong to 12 different parts of speech, counting the empty tag of the artificial root node: [root, no tag] (360; 43% instances), VERB (246; 29% instances), NOUN (144; 17% instances), ADJ (34; 4% instances), AUX (26; 3% instances), ADV (14; 2% instances), PRON (9; 1% instances), CCONJ (4; 0% instances), DET (2; 0% instances), PROPN (2; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)

5 (1%) AUX nodes are leaves.

7 (1%) AUX nodes have one child.

171 (20%) AUX nodes have two children.

660 (78%) AUX nodes have three or more children.

The highest child degree of an AUX node is 13.
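The child-degree figures above partition the AUX tokens into four groups. The sketch below verifies that the groups add up to the token total and recomputes the rounded shares; the dictionary keys are illustrative labels, and rounding to the nearest whole percent is an assumption.

```python
# Number of AUX nodes by number of children, copied from the lines above.
degree_counts = {
    "leaf (0 children)": 5,
    "1 child": 7,
    "2 children": 171,
    "3 or more children": 660,
}

# The four groups should cover every AUX token exactly once.
total_nodes = sum(degree_counts.values())

# Share of each group, rounded to a whole percent (assumed rounding rule).
degree_shares = {
    label: round(100 * n / total_nodes) for label, n in degree_counts.items()
}

print(total_nodes)
for label, pct in degree_shares.items():
    print(f"{label}: {degree_counts[label]} ({pct}%)")
```

The four groups sum to 843, and the shares come out as 1%, 1%, 20% and 78%, matching the reported values.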

Children of AUX nodes are attached using 24 different relations: punct (717; 22% instances), xcomp (684; 21% instances), nsubj (362; 11% instances), mark (316; 10% instances), advmod (276; 8% instances), obl (227; 7% instances), cc (140; 4% instances), advcl (111; 3% instances), aux (104; 3% instances), conj (81; 2% instances), iobj (68; 2% instances), ccomp (31; 1% instances), obj (29; 1% instances), case (28; 1% instances), dep (27; 1% instances), parataxis (27; 1% instances), aux:q (8; 0% instances), advmod:phrase (7; 0% instances), conj:discourse (7; 0% instances), obl:tmod (6; 0% instances), dislocated (5; 0% instances), cop (2; 0% instances), det (2; 0% instances), nsubj:cop (2; 0% instances)

Children of AUX nodes belong to 13 different parts of speech: VERB (1065; 33% instances), PUNCT (728; 22% instances), NOUN (473; 14% instances), SCONJ (321; 10% instances), ADV (232; 7% instances), PRON (155; 5% instances), CCONJ (148; 5% instances), PROPN (54; 2% instances), AUX (26; 1% instances), ADJ (23; 1% instances), ADP (22; 1% instances), DET (17; 1% instances), NUM (3; 0% instances)