home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: POS Tags: AUX

There are 48 AUX lemmas (0%), 130 AUX types (1%) and 2487 AUX tokens (2%). Out of 15 observed tags, the rank of AUX is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent AUX lemmas: היה, הוא, אינו, _, אפשר, יכול, צריך, יש, עלול, אמור

The 10 most frequent AUX types: היה, הוא, היא, היו, היתה, אינו, אפשר, להיות, יהיה, אינה

The 10 most frequent ambiguous lemmas: היה (AUX 778, VERB 148), הוא (PRON 5615, AUX 394), _ (NOUN 368, AUX 268, VERB 251, ADJ 231, ADV 177, CCONJ 110, X 86, PRON 57, SCONJ 47, DET 33), אפשר (AUX 98, VERB 34), יכול (AUX 84, VERB 2), צריך (AUX 68, ADJ 1), יש (VERB 214, AUX 49), אמור (AUX 39, ADJ 1), חייב (AUX 35, VERB 15, NOUN 3, ADJ 1), עשוי (AUX 29, ADJ 15)

The 10 most frequent ambiguous types: היה (AUX 385, VERB 38, X 1), הוא (PRON 554, AUX 164), היא (PRON 198, AUX 164), היו (AUX 139, VERB 46), היתה (AUX 132, VERB 22), יהיה (AUX 86, VERB 24), הם (PRON 209, AUX 49), יש (VERB 211, AUX 49), צריך (AUX 49, ADJ 1), תהיה (AUX 37, VERB 15)

Morphology

The form / lemma ratio of AUX is 2.708333 (the average of all parts of speech is 1.701251).

The 1st highest number of forms (29) was observed with the lemma “_”: אינך, איננו, אסורים, אפשר, דומה, היה, היו, הייתי, היתה, זקוק, יהיה, יהיו, יכול, יכולה, מאפשר, מאפשרות, מאפשרים, מאפשרת, מעוניינות, מעוניינים, נאלצים, נכונים, סבור, סבורה, סבורים, עומדת, עתידה, תהיה, תוכל.

The 2nd highest number of forms (13) was observed with the lemma “היה”: היה, היו, הייה, היינו, היית, הייתי, הייתם, היתה, יהיה, יהיו, להיות, נהיה, תהיה.

The 3rd highest number of forms (9) was observed with the lemma “יכול”: יוכל, יוכלו, יכול, יכולה, יכולות, יכולים, יכולנו, נוכל, תוכל.

AUX occurs with 9 features: VerbType (2487; 100% instances), Gender (2151; 86% instances), Number (2151; 86% instances), Person (2068; 83% instances), Polarity (1640; 66% instances), Tense (938; 38% instances), VerbForm (919; 37% instances), HebSource (32; 1% instances), Mood (3; 0% instances)

AUX occurs with 19 feature-value pairs: Gender=Fem, Gender=Fem,Masc, Gender=Masc, HebSource=ConvUncertainHead, Mood=Imp, Number=Plur, Number=Sing, Person=1, Person=1,2,3, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Past, VerbForm=Inf, VerbForm=Part, VerbType=Cop, VerbType=Mod

AUX occurs with 62 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|Person=3|Polarity=Pos|Tense=Past|VerbType=Cop (384 tokens). Examples: היה

Relations

AUX nodes are attached to their parents using 16 different relations: cop (1111; 45% instances), aux (670; 27% instances), advmod (317; 13% instances), root (118; 5% instances), acl:relcl (71; 3% instances), conj (50; 2% instances), advcl (48; 2% instances), ccomp (28; 1% instances), xcomp (26; 1% instances), dep (23; 1% instances), obl (8; 0% instances), acl (7; 0% instances), appos (3; 0% instances), parataxis (3; 0% instances), det (2; 0% instances), fixed (2; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: VERB (1139; 46% instances), NOUN (634; 25% instances), ADJ (295; 12% instances), AUX (219; 9% instances), (118; 5% instances), PROPN (38; 2% instances), PRON (20; 1% instances), ADV (19; 1% instances), NUM (3; 0% instances), DET (2; 0% instances)

1444 (58%) AUX nodes are leaves.

176 (7%) AUX nodes have one child.

330 (13%) AUX nodes have two children.

537 (22%) AUX nodes have three or more children.

The highest child degree of a AUX node is 12.

Children of AUX nodes are attached using 23 different relations: punct (784; 25% instances), obl (474; 15% instances), nsubj (400; 13% instances), mark (382; 12% instances), advmod (314; 10% instances), cc (151; 5% instances), advcl (142; 5% instances), cop (107; 3% instances), conj (96; 3% instances), aux (51; 2% instances), dep (45; 1% instances), obj (44; 1% instances), ccomp (32; 1% instances), compound:affix (23; 1% instances), parataxis (21; 1% instances), case (19; 1% instances), nsubj:cop (11; 0% instances), mark:q (9; 0% instances), dislocated (5; 0% instances), det (2; 0% instances), fixed (2; 0% instances), csubj (1; 0% instances), xcomp (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: PUNCT (798; 26% instances), NOUN (636; 20% instances), SCONJ (373; 12% instances), ADV (291; 9% instances), VERB (281; 9% instances), AUX (219; 7% instances), PRON (175; 6% instances), CCONJ (149; 5% instances), PROPN (80; 3% instances), ADJ (56; 2% instances), ADP (30; 1% instances), DET (18; 1% instances), NUM (8; 0% instances), X (2; 0% instances)