
This page pertains to UD version 2.

Treebank Statistics: UD_Greek: POS Tags: X

There are 495 X lemmas (8%), 495 X types (4%) and 972 X tokens (2%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
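The lemma, type and token counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct word form compared case-sensitively, and the helper names (`iter_words`, `tag_counts`, `row`) and the toy sample are invented for illustration.

```python
def iter_words(conllu_text):
    """Yield (form, lemma, upos) for each syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, tag):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, upos in iter_words(conllu_text):
        if upos == tag:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos):
    """Build one 10-column CoNLL-U line, leaving unused columns as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Toy data, invented for illustration (not taken from UD_Greek).
SAMPLE = "\n".join([
    "# text = toy",
    row(1, "το", "ο", "DET"),
    row(2, "σκορ", "σκορ", "X"),
    row(3, "Κάστρο", "Κάστρο", "X"),
    row(4, "σκορ", "σκορ", "X"),
    "",
])

print(tag_counts(SAMPLE, "X"))
```

Running the same function over every tag and sorting by each count would give the ranks reported above.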

The 10 most frequent X lemmas: αλ, Κάστρο, σκορ, Αφγανιστάν, Ουάσιγκτον, Σαντόρουμ, Μιτ, Ρικ, Ρόμνεϊ, Θιβέτ

The 10 most frequent X types: αλ, Κάστρο, σκορ, Αφγανιστάν, Ουάσιγκτον, Σαντόρουμ, Μιτ, Ρικ, Ρόμνεϊ, Θιβέτ

The 10 most frequent ambiguous lemmas: σκορ (X 19, NOUN 1), Μπάμπα (X 7, PROPN 1), Κάιντα (X 6, PROPN 4), Τζων (X 5, PROPN 4), απαρτχάιντ (X 5, NOUN 1), Πούτιν (X 4, PROPN 2), Αλαμπάμα (X 3, PROPN 2), Βλαντιμίρ (X 3, PROPN 1), Γκεβάρα (X 3, ADV 1), Μάλι (PROPN 6, X 3)

The 10 most frequent ambiguous types: Κάστρο (X 20, PROPN 9), σκορ (X 19, NOUN 1), Μπάμπα (X 7, PROPN 1), Κάιντα (X 6, PROPN 4), Τζων (X 5, PROPN 4), Πούτιν (X 4, PROPN 2), Αλαμπάμα (X 3, PROPN 2), Βλαντιμίρ (X 3, PROPN 1), Γκεβάρα (X 3, ADV 1), Μάλι (PROPN 6, X 3)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.772686).

The highest number of forms (2) was observed with the lemma “Άμστερνταμ”: Άμστερνταμ, Αμστερνταμ.

The 2nd highest number of forms (1) was observed with the lemma “ABC”: ABC.

The 3rd highest number of forms (1) was observed with the lemma “AIPAC”: AIPAC.
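The page does not state how the form / lemma ratio is computed. As a hedged sketch, the code below assumes it is the number of distinct forms summed over all lemmas, divided by the number of lemmas, and ranks lemmas by how many forms they have. The function names are hypothetical; the (form, lemma) pairs reuse the three lemmas listed above, with one repeated pair added to show that repeated tokens do not add forms.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct word forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return dict(forms)


def form_lemma_ratio(pairs):
    """Assumed definition: distinct forms summed over lemmas / number of lemmas."""
    forms = forms_per_lemma(pairs)
    return sum(len(s) for s in forms.values()) / len(forms)


# (form, lemma) pairs; the repeated ABC pair is an invented duplicate token.
pairs = [
    ("Άμστερνταμ", "Άμστερνταμ"),
    ("Αμστερνταμ", "Άμστερνταμ"),
    ("ABC", "ABC"),
    ("AIPAC", "AIPAC"),
    ("ABC", "ABC"),
]

# Rank lemmas by number of forms, breaking ties alphabetically.
ranked = sorted(forms_per_lemma(pairs).items(), key=lambda kv: (-len(kv[1]), kv[0]))
for lemma, forms in ranked:
    print(lemma, len(forms), sorted(forms))
print(form_lemma_ratio(pairs))
```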

X occurs with 1 feature: Foreign (972; 100% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 1 feature combination. The most frequent feature combination is Foreign=Yes (972 tokens). Examples: αλ, Κάστρο, σκορ, Αφγανιστάν, Ουάσιγκτον, Σαντόρουμ, Μιτ, Ρικ, Ρόμνεϊ, Θιβέτ

Relations

X nodes are attached to their parents using 13 different relations: nmod (591; 61% instances), nsubj (147; 15% instances), obl (82; 8% instances), conj (60; 6% instances), obj (39; 4% instances), appos (38; 4% instances), orphan (6; 1% instances), root (3; 0% instances), obl:arg (2; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: NOUN (344; 35% instances), X (291; 30% instances), VERB (268; 28% instances), PROPN (30; 3% instances), ADJ (19; 2% instances), ADV (8; 1% instances), NUM (5; 1% instances), PRON (3; 0% instances), no tag, i.e. the artificial root (3; 0% instances), ADP (1; 0% instances)

346 (36%) X nodes are leaves.

187 (19%) X nodes have one child.

251 (26%) X nodes have two children.

188 (19%) X nodes have three or more children.

The highest child degree of an X node is 11.
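The leaf and child-count figures above can be derived from the HEAD column of each sentence. The sketch below is an illustration under stated assumptions, not the generator's code: each sentence is assumed to be already parsed into (id, upos, head) tuples, head 0 stands for the artificial root, and the function names and toy sentence are invented.

```python
from collections import Counter


def child_degrees(sentence):
    """Return {token id: number of children} for one sentence."""
    degrees = {tid: 0 for tid, _, _ in sentence}
    for _, _, head in sentence:
        if head in degrees:  # head 0 is the artificial root, not a token
            degrees[head] += 1
    return degrees


def degree_buckets(sentences, tag):
    """Bucket nodes with the given UPOS tag by child count; also return the maximum."""
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        degrees = child_degrees(sentence)
        for tid, upos, _ in sentence:
            if upos != tag:
                continue
            d = degrees[tid]
            max_degree = max(max_degree, d)
            if d == 0:
                buckets["leaf"] += 1
            elif d == 1:
                buckets["one"] += 1
            elif d == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets, max_degree


# Toy sentence, invented for illustration: (id, upos, head).
toy = [[(1, "DET", 2), (2, "X", 3), (3, "VERB", 0), (4, "X", 2), (5, "PUNCT", 3)]]
buckets, max_degree = degree_buckets(toy, "X")
print(dict(buckets), max_degree)
```

In the toy sentence, token 2 has two children (tokens 1 and 4) and token 4 has none, so one X node lands in the "two" bucket and one in "leaf".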

Children of X nodes are attached using 18 different relations: det (478; 35% instances), nmod (317; 23% instances), punct (174; 13% instances), case (151; 11% instances), compound (70; 5% instances), conj (63; 5% instances), amod (31; 2% instances), cc (30; 2% instances), acl:relcl (14; 1% instances), appos (14; 1% instances), nummod (4; 0% instances), acl (3; 0% instances), advmod (3; 0% instances), orphan (3; 0% instances), cop (2; 0% instances), aux (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: DET (478; 35% instances), X (291; 21% instances), PUNCT (174; 13% instances), ADP (150; 11% instances), NOUN (115; 8% instances), NUM (40; 3% instances), PROPN (30; 2% instances), ADJ (29; 2% instances), CCONJ (27; 2% instances), VERB (14; 1% instances), ADV (6; 0% instances), AUX (2; 0% instances), PART (2; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)