Treebank Statistics: UD_Hebrew-HTB: POS Tags: AUX
There are 48 AUX
lemmas (0%), 130 AUX
types (1%) and 2487 AUX
tokens (2%).
Out of 15 observed tags, the rank of AUX
is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.
The 10 most frequent AUX
lemmas: היה, הוא, אינו, _, אפשר, יכול, צריך, יש, עלול, אמור
The 10 most frequent AUX
types: היה, הוא, היא, היו, היתה, אינו, אפשר, להיות, יהיה, אינה
The 10 most frequent ambiguous lemmas: היה (AUX 778, VERB 148), הוא (PRON 5615, AUX 394), _ (NOUN 366, AUX 268, VERB 251, ADJ 231, ADV 177, CCONJ 110, X 86, PRON 57, SCONJ 47, DET 33), אפשר (AUX 98, VERB 34), יכול (AUX 84, VERB 2), צריך (AUX 68, ADJ 1), יש (VERB 214, AUX 49), אמור (AUX 39, ADJ 1), חייב (AUX 35, VERB 15, NOUN 3, ADJ 1), עשוי (AUX 29, ADJ 15)
The 10 most frequent ambiguous types: היה (AUX 385, VERB 38, X 1), הוא (PRON 554, AUX 164), היא (PRON 198, AUX 164), היו (AUX 139, VERB 46), היתה (AUX 132, VERB 22), יהיה (AUX 86, VERB 24), הם (PRON 209, AUX 49), יש (VERB 211, AUX 49), צריך (AUX 49, ADJ 1), תהיה (AUX 37, VERB 15)
- היה
- הוא
- היא
- היו
- היתה
- יהיה
- הם
- יש
- צריך
- תהיה
Morphology
The form / lemma ratio of AUX
is 2.708333 (the average of all parts of speech is 1.701287).
The 1st highest number of forms (29) was observed with the lemma “_”: אינך, איננו, אסורים, אפשר, דומה, היה, היו, הייתי, היתה, זקוק, יהיה, יהיו, יכול, יכולה, מאפשר, מאפשרות, מאפשרים, מאפשרת, מעוניינות, מעוניינים, נאלצים, נכונים, סבור, סבורה, סבורים, עומדת, עתידה, תהיה, תוכל.
The 2nd highest number of forms (13) was observed with the lemma “היה”: היה, היו, הייה, היינו, היית, הייתי, הייתם, היתה, יהיה, יהיו, להיות, נהיה, תהיה.
The 3rd highest number of forms (9) was observed with the lemma “יכול”: יוכל, יוכלו, יכול, יכולה, יכולות, יכולים, יכולנו, נוכל, תוכל.
AUX
occurs with 8 features: VerbType (2487; 100% instances), Gender (2151; 86% instances), Number (2151; 86% instances), Person (2068; 83% instances), Polarity (1640; 66% instances), Tense (938; 38% instances), VerbForm (919; 37% instances), Mood (3; 0% instances)
AUX
occurs with 18 feature-value pairs: Gender=Fem
, Gender=Fem,Masc
, Gender=Masc
, Mood=Imp
, Number=Plur
, Number=Sing
, Person=1
, Person=1,2,3
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Fut
, Tense=Past
, VerbForm=Inf
, VerbForm=Part
, VerbType=Cop
, VerbType=Mod
AUX
occurs with 46 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing|Person=3|Polarity=Pos|Tense=Past|VerbType=Cop
(384 tokens).
Examples: היה
Relations
AUX
nodes are attached to their parents using 16 different relations: cop (1111; 45% instances), aux (670; 27% instances), advmod (317; 13% instances), root (118; 5% instances), acl:relcl (71; 3% instances), conj (50; 2% instances), advcl (48; 2% instances), ccomp (28; 1% instances), xcomp (26; 1% instances), dep (23; 1% instances), obl (8; 0% instances), acl (7; 0% instances), appos (3; 0% instances), parataxis (3; 0% instances), det (2; 0% instances), fixed (2; 0% instances)
Parents of AUX
nodes belong to 10 different parts of speech: VERB (1139; 46% instances), NOUN (634; 25% instances), ADJ (295; 12% instances), AUX (219; 9% instances), (118; 5% instances), PROPN (38; 2% instances), PRON (20; 1% instances), ADV (19; 1% instances), NUM (3; 0% instances), DET (2; 0% instances)
1462 (59%) AUX
nodes are leaves.
249 (10%) AUX
nodes have one child.
324 (13%) AUX
nodes have two children.
452 (18%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 8.
Children of AUX
nodes are attached using 23 different relations: obl (474; 18% instances), nsubj (400; 15% instances), mark (382; 15% instances), advmod (314; 12% instances), punct (264; 10% instances), advcl (142; 5% instances), cc (138; 5% instances), cop (107; 4% instances), conj (96; 4% instances), aux (51; 2% instances), dep (44; 2% instances), obj (44; 2% instances), ccomp (32; 1% instances), compound:affix (23; 1% instances), parataxis (21; 1% instances), case (19; 1% instances), nsubj:cop (11; 0% instances), mark:q (9; 0% instances), dislocated (5; 0% instances), det (2; 0% instances), fixed (2; 0% instances), csubj (1; 0% instances), xcomp (1; 0% instances)
Children of AUX
nodes belong to 14 different parts of speech: NOUN (636; 25% instances), SCONJ (373; 14% instances), ADV (291; 11% instances), VERB (281; 11% instances), PUNCT (264; 10% instances), AUX (219; 8% instances), PRON (175; 7% instances), CCONJ (149; 6% instances), PROPN (80; 3% instances), ADJ (56; 2% instances), ADP (30; 1% instances), DET (18; 1% instances), NUM (8; 0% instances), X (2; 0% instances)