home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-ESLSpok: POS Tags: X

There are 1 X lemmas (6%), 58 X types (2%) and 72 X tokens (0%). Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 6 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: -, ku, nante, V, V., bye, m, nabe, nandakke, nandarou

The 10 most frequent ambiguous lemmas: _ (PUNCT 3316, NOUN 3083, PRON 2869, VERB 2552, ADV 1444, AUX 1302, DET 1271, ADP 1136, CCONJ 1124, ADJ 1032, PART 891, PROPN 490, SCONJ 267, INTJ 235, NUM 228, X 72)

The 10 most frequent ambiguous types: - (PUNCT 21, X 4), V (X 2, NOUN 1), bye (X 2, INTJ 1), A (DET 4, PROPN 1, X 1), D (NOUN 1, X 1), end (NOUN 7, X 1), nan (ADV 1, X 1), road (NOUN 2, X 1), term (NOUN 2, X 1), up (ADP 29, ADV 1, X 1)

Morphology

The form / lemma ratio of X is 58.000000 (the average of all parts of speech is 146.187500).

The 1st highest number of forms (58) was observed with the lemma “_”: -, A, D, M., V, V., atto, bye, chotto, cycle, daro, dattakana, end, hai, hitoe, igusa, iu, iukana, iundaro, ja, joei, jukai, juni, ka, kaishain, koreha, ku, m, ma, mae, mates, matte, nabe, nan, nandakke, nandaro, nandarou, nandattakana, nante, nawatobi, nnto, osarusanha, regi, road, rui, sandan, shinkansen, sorekara, telebi, term, tyotto, up, wakan, wakannaissune, worker, yakiniku, yami, zenekon.

X does not occur with any features.

Relations

X nodes are attached to their parents using 11 different relations: dep (26; 36% instances), goeswith (22; 31% instances), flat:foreign (13; 18% instances), root (4; 6% instances), compound (1; 1% instances), conj (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), obl (1; 1% instances), xcomp (1; 1% instances)

Parents of X nodes belong to 9 different parts of speech: NOUN (21; 29% instances), VERB (15; 21% instances), X (12; 17% instances), PROPN (7; 10% instances), ADJ (6; 8% instances), (4; 6% instances), ADV (3; 4% instances), INTJ (3; 4% instances), PRON (1; 1% instances)

59 (82%) X nodes are leaves.

6 (8%) X nodes have one child.

4 (6%) X nodes have two children.

3 (4%) X nodes have three or more children.

The highest child degree of a X node is 4.

Children of X nodes are attached using 9 different relations: flat:foreign (12; 50% instances), punct (3; 13% instances), case (2; 8% instances), cc (2; 8% instances), amod (1; 4% instances), cop (1; 4% instances), dep (1; 4% instances), det (1; 4% instances), nsubj (1; 4% instances)

Children of X nodes belong to 9 different parts of speech: X (12; 50% instances), PUNCT (3; 13% instances), ADP (2; 8% instances), CCONJ (2; 8% instances), ADJ (1; 4% instances), ADV (1; 4% instances), AUX (1; 4% instances), DET (1; 4% instances), PRON (1; 4% instances)