home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Egyptian-PC: POS Tags: X

There are 52 X lemmas (2%), 68 X types (2%) and 243 X tokens (1%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 6 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: _, ꞽ, nb, f, n, […], k, ꜣꜣꜣ, ḫwrr, hꞽ

The 10 most frequent X types: […], {⸗ꞽ}, {nb}, {⸗f}, {n}, ꜣꜣꜣ, {⸗k}, ḫwrr, hꞽ, {mṭw}

The 10 most frequent ambiguous lemmas: _ (X 93, VERB 1), (PRON 235, X 43, VERB 39, INTJ 37), nb (ADJ 108, NOUN 68, X 10), f (PRON 1973, X 9), n (ADP 1278, ADJ 461, SCONJ 23, X 8, NOUN 2, PRON 2), k (PRON 2337, X 6), ꜣꜣꜣ (X 6, VERB 1), hꞽ (INTJ 7, NOUN 3, X 3, VERB 1), mṭw (NOUN 519, X 3), r (ADP 717, NOUN 36, ADJ 25, SCONJ 6, X 3)

The 10 most frequent ambiguous types: […] (X 81, VERB 1), ꜣꜣꜣ (X 6, VERB 1), hꞽ (INTJ 7, X 3, NOUN 2), bꞽtꞽ (NOUN 2, X 1), hy (NOUN 4, VERB 2, X 1), šw (VERB 4, NOUN 3, X 1), ḥꜣ (ADP 36, NOUN 1, VERB 1, X 1), ꞽč (VERB 27, X 1)

Morphology

The form / lemma ratio of X is 1.307692 (the average of all parts of speech is 1.926618).

The 1st highest number of forms (12) was observed with the lemma “_”: […], […]n, […]nw, […]t, […]tꞽ, […]˹w˺, m[…], n[…], {⸗f}, ś[…], ˹ꜣčw˺, ꞽ[…]m.

The 2nd highest number of forms (6) was observed with the lemma “[…]”: […]r, […]ꞽ, č[…], ˹n˺[…], ˹ś˺[…], ꞽ[…].

The 3rd highest number of forms (2) was observed with the lemma “k”: {k}, {⸗k}.

X occurs with 2 features: Typo (203; 84% instances), Foreign (28; 12% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, Typo=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Typo=Yes (203 tokens). Examples: […], {⸗ꞽ}, {nb}, {⸗f}, {n}, {⸗k}, {mṭw}, {r}, {č̣(ṭ)}, […]ꞽ

Relations

X nodes are attached to their parents using 19 different relations: reparandum (104; 43% instances), dep (77; 32% instances), flat:foreign (15; 6% instances), root (13; 5% instances), obl (9; 4% instances), conj (6; 2% instances), ccomp:speech (4; 2% instances), nsubj (3; 1% instances), flat:name (2; 1% instances), appos (1; 0% instances), case (1; 0% instances), ccomp:obj (1; 0% instances), dislocated:nsubj (1; 0% instances), nmod (1; 0% instances), nmod:nisba (1; 0% instances), nmod:poss (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: VERB (72; 30% instances), PRON (64; 26% instances), X (34; 14% instances), NOUN (29; 12% instances), PROPN (16; 7% instances), (13; 5% instances), ADP (7; 3% instances), PART (5; 2% instances), ADJ (3; 1% instances)

181 (74%) X nodes are leaves.

33 (14%) X nodes have one child.

13 (5%) X nodes have two children.

16 (7%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 24 different relations: dep (43; 36% instances), flat:foreign (15; 13% instances), case (12; 10% instances), obl (7; 6% instances), conj (5; 4% instances), amod (4; 3% instances), nmod (4; 3% instances), nsubj (4; 3% instances), vocative (4; 3% instances), reparandum (3; 3% instances), advcl (2; 2% instances), advmod (2; 2% instances), appos (2; 2% instances), parataxis (2; 2% instances), acl (1; 1% instances), acl:relcl (1; 1% instances), cop (1; 1% instances), det (1; 1% instances), dislocated:nsubj (1; 1% instances), mark (1; 1% instances), nmod:poss (1; 1% instances), nummod (1; 1% instances), obj (1; 1% instances), punct (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: NOUN (38; 32% instances), X (34; 29% instances), ADP (14; 12% instances), PRON (9; 8% instances), VERB (8; 7% instances), PROPN (6; 5% instances), ADJ (4; 3% instances), DET (3; 3% instances), ADV (1; 1% instances), NUM (1; 1% instances), PUNCT (1; 1% instances)