home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-HTB: POS Tags: AUX

There are 4 AUX lemmas (0%), 27 AUX types (0%) and 1239 AUX tokens (1%). Out of 15 observed tags, the rank of AUX is: 14 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent AUX lemmas: היה, אינו, _, הינו

The 10 most frequent AUX types: היה, היו, היתה, אינו, להיות, יהיה, אינה, אינם, תהיה, אינן

The 10 most frequent ambiguous lemmas: היה (AUX 774, VERB 146), _ (NOUN 365, VERB 326, ADJ 230, ADV 192, AUX 169, CCONJ 109, X 76, PRON 57, SCONJ 46, DET 33)

The 10 most frequent ambiguous types: היה (AUX 382, VERB 38, X 1), היו (AUX 139, VERB 46), היתה (AUX 132, VERB 22), יהיה (AUX 85, VERB 24), תהיה (AUX 37, VERB 14), יהיו (AUX 24, VERB 22), היינו (AUX 8, CCONJ 2, ADV 1), הייה (AUX 2, VERB 1), נהיה (AUX 2, VERB 1)

Morphology

The form / lemma ratio of AUX is 6.750000 (the average of all parts of speech is 1.702584).

The 1st highest number of forms (14) was observed with the lemma “היה”: היה, היו, הייה, היינו, היית, הייתי, הייתם, היתה, יהיה, יהיו, להיות, נהיה, תהיה, תהייה.

The 2nd highest number of forms (9) was observed with the lemma “_”: אינך, איננו, היה, היו, הייתי, היתה, יהיה, יהיו, תהיה.

The 3rd highest number of forms (8) was observed with the lemma “אינו”: אינה, אינו, איני, אינכם, אינם, אינן, איננה, אינני.

AUX occurs with 8 features: VerbType (1239; 100% instances), Polarity (1236; 100% instances), Gender (1147; 93% instances), Number (1147; 93% instances), Person (1147; 93% instances), Tense (828; 67% instances), VerbForm (405; 33% instances), Mood (3; 0% instances)

AUX occurs with 16 feature-value pairs: Gender=Fem, Gender=Fem,Masc, Gender=Masc, Mood=Imp, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Past, VerbForm=Inf, VerbForm=Part, VerbType=Cop

AUX occurs with 26 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|Person=3|Polarity=Pos|Tense=Past|VerbType=Cop (381 tokens). Examples: היה

Relations

AUX nodes are attached to their parents using 13 different relations: cop (1035; 84% instances), xcomp (76; 6% instances), acl:relcl (42; 3% instances), root (33; 3% instances), conj (15; 1% instances), advcl (11; 1% instances), dep (8; 1% instances), acl (6; 0% instances), ccomp (5; 0% instances), obl (3; 0% instances), appos (2; 0% instances), parataxis (2; 0% instances), csubj (1; 0% instances)

Parents of AUX nodes belong to 9 different parts of speech: VERB (422; 34% instances), ADJ (367; 30% instances), NOUN (361; 29% instances), (33; 3% instances), ADV (30; 2% instances), PROPN (11; 1% instances), PRON (8; 1% instances), NUM (4; 0% instances), AUX (3; 0% instances)

1046 (84%) AUX nodes are leaves.

74 (6%) AUX nodes have one child.

48 (4%) AUX nodes have two children.

71 (6%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 19 different relations: obl (182; 39% instances), punct (62; 13% instances), mark (60; 13% instances), nsubj (38; 8% instances), advmod (35; 8% instances), conj (15; 3% instances), dep (15; 3% instances), obj (15; 3% instances), cc (14; 3% instances), advcl (12; 3% instances), case (5; 1% instances), fixed (2; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances), csubj (1; 0% instances), mark:q (1; 0% instances), nsubj:cop (1; 0% instances), xcomp (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: NOUN (154; 33% instances), PUNCT (62; 13% instances), SCONJ (59; 13% instances), ADV (41; 9% instances), ADJ (35; 8% instances), VERB (32; 7% instances), PROPN (26; 6% instances), PRON (19; 4% instances), CCONJ (15; 3% instances), ADP (8; 2% instances), NUM (6; 1% instances), AUX (3; 1% instances), X (2; 0% instances), DET (1; 0% instances)