home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTknesset: POS Tags: X

There are 40 X lemmas (1%), 40 X types (0%) and 60 X tokens (0%). Out of 16 observed tags, the rank of X is: 8 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: אללה, על, הבאב, the, Welcome, to, הא, פר, Life, added

The 10 most frequent X types: אללה, על, הבאב, the, Welcome, to, הא, פר, Life, added

The 10 most frequent ambiguous lemmas: על (ADP 651, X 6, PROPN 5, ADV 2), פר (X 2, ADP 1), ב (ADP 2394, ADV 7, X 1), דה (ADV 2, X 1), כו’ (ADV 1, X 1), כולי (ADV 1, X 1), סטטוס (PROPN 1, X 1), קוו (PROPN 1, X 1)

The 10 most frequent ambiguous types: על (ADP 592, X 6, PROPN 5, ADV 2), פר (X 2, ADP 1), אפ (NOUN 1, X 1), ב (ADP 2393, ADV 7, X 1), דה (ADV 2, X 1), כו’ (ADV 1, X 1), כולי (ADV 2, ADJ 1, X 1), סטטוס (PROPN 1, X 1), קוו (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.545540).

The 1st highest number of forms (1) was observed with the lemma “Life”: Life.

The 2nd highest number of forms (1) was observed with the lemma “Welcome”: Welcome.

The 3rd highest number of forms (1) was observed with the lemma “added”: added.

X occurs with 1 features: Foreign (43; 72% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (43 tokens). Examples: אללה, על, הבאב, הא, פר, Life, blue, of, out, the

Relations

X nodes are attached to their parents using 18 different relations: flat (24; 40% instances), dep (10; 17% instances), compound (4; 7% instances), fixed (4; 7% instances), conj (3; 5% instances), nsubj (2; 3% instances), xcomp (2; 3% instances), appos (1; 2% instances), case (1; 2% instances), ccomp (1; 2% instances), nmod (1; 2% instances), nmod:poss (1; 2% instances), nmod:unmarked (1; 2% instances), obj (1; 2% instances), obl (1; 2% instances), obl:unmarked (1; 2% instances), parataxis (1; 2% instances), root (1; 2% instances)

Parents of X nodes belong to 5 different parts of speech: X (35; 58% instances), NOUN (14; 23% instances), VERB (9; 15% instances), ADJ (1; 2% instances), (1; 2% instances)

31 (52%) X nodes are leaves.

12 (20%) X nodes have one child.

1 (2%) X nodes have two children.

16 (27%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 12 different relations: punct (25; 32% instances), flat (24; 30% instances), dep (7; 9% instances), case (4; 5% instances), det (4; 5% instances), fixed (4; 5% instances), nmod (4; 5% instances), cc (2; 3% instances), nmod:poss (2; 3% instances), advmod (1; 1% instances), compound (1; 1% instances), conj (1; 1% instances)

Children of X nodes belong to 10 different parts of speech: X (35; 44% instances), PUNCT (25; 32% instances), NOUN (5; 6% instances), ADP (4; 5% instances), DET (3; 4% instances), CCONJ (2; 3% instances), PRON (2; 3% instances), ADV (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)