Treebank Statistics: UD_English-ESLSpok: POS Tags: X
There is 1 X lemma (6%), 58 X types (2%) and 72 X tokens (0%).
Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 6 in number of types and 16 in number of tokens.
The 10 most frequent X lemmas: _
The 10 most frequent X types: -, ku, nante, V, V., bye, m, nabe, nandakke, nandarou
The 10 most frequent ambiguous lemmas: _ (PUNCT 3316, NOUN 3083, PRON 2869, VERB 2552, ADV 1444, AUX 1302, DET 1271, ADP 1136, CCONJ 1124, ADJ 1032, PART 891, PROPN 490, SCONJ 267, INTJ 235, NUM 228, X 72)
The 10 most frequent ambiguous types: - (PUNCT 21, X 4), V (X 2, NOUN 1), bye (X 2, INTJ 1), A (DET 4, PROPN 1, X 1), D (NOUN 1, X 1), end (NOUN 7, X 1), nan (ADV 1, X 1), road (NOUN 2, X 1), term (NOUN 2, X 1), up (ADP 29, ADV 1, X 1)
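The ambiguous types listed above are word forms that occur with more than one POS tag. A minimal sketch of how such a list could be derived, assuming the input is a plain sequence of (form, tag) pairs; the function name `ambiguous_types` and the toy tokens are hypothetical and are not taken from the tool that generated this page:

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens):
    """Return forms seen with more than one tag, with per-tag counts.

    tagged_tokens: iterable of (form, upos) pairs (assumed input shape).
    """
    tags_by_form = defaultdict(Counter)
    for form, tag in tagged_tokens:
        tags_by_form[form][tag] += 1
    # Keep only forms that were observed with at least two different tags.
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if len(tags) > 1
    }


# Toy input, not the real treebank counts.
toy_tokens = [
    ("-", "PUNCT"), ("-", "X"),
    ("ku", "X"),
    ("bye", "X"), ("bye", "INTJ"), ("bye", "X"),
]
toy_result = ambiguous_types(toy_tokens)
```

In this toy input "ku" is dropped because it only ever carries the tag X, while "-" and "bye" are kept with their per-tag counts.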
Morphology
The form / lemma ratio of X is 58.000000 (the average of all parts of speech is 146.187500).
The highest number of forms (58) was observed with the lemma “_”: -, A, D, M., V, V., atto, bye, chotto, cycle, daro, dattakana, end, hai, hitoe, igusa, iu, iukana, iundaro, ja, joei, jukai, juni, ka, kaishain, koreha, ku, m, ma, mae, mates, matte, nabe, nan, nandakke, nandaro, nandarou, nandattakana, nante, nawatobi, nnto, osarusanha, regi, road, rui, sandan, shinkansen, sorekara, telebi, term, tyotto, up, wakan, wakannaissune, worker, yakiniku, yami, zenekon.
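The ratio above is consistent with dividing the number of distinct forms by the number of distinct lemmas (58 forms over the single lemma "_" gives 58.0). A minimal sketch of that reading, assuming case-sensitive forms and (form, lemma) pairs as input; `form_lemma_ratio` is a hypothetical helper, and the exact definition used by the generating tool is an assumption:

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Distinct forms per lemma, averaged over all lemmas.

    pairs: iterable of (form, lemma) for the tokens of one POS tag
    (assumed input shape). Forms are compared case-sensitively.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Toy input: three tokens, two distinct forms, one placeholder lemma.
toy_ratio = form_lemma_ratio([("ku", "_"), ("nante", "_"), ("ku", "_")])
```

Because every X token here shares the placeholder lemma "_", the ratio simply equals the number of distinct X forms.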
X does not occur with any features.
Relations
X nodes are attached to their parents using 11 different relations: dep (26; 36% instances), goeswith (22; 31% instances), flat:foreign (13; 18% instances), root (4; 6% instances), compound (1; 1% instances), conj (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), obl (1; 1% instances), xcomp (1; 1% instances)
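The percentages above follow from the raw counts, which sum to the 72 X tokens. A short sketch that recomputes them, assuming each share is rounded to the nearest whole percent; the variable names are illustrative only:

```python
# Relation counts for X nodes, copied from the statistics above.
relation_counts = {
    "dep": 26, "goeswith": 22, "flat:foreign": 13, "root": 4,
    "compound": 1, "conj": 1, "nmod": 1, "nsubj": 1,
    "obj": 1, "obl": 1, "xcomp": 1,
}

total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent (assumed rounding).
relation_shares = {
    rel: round(100 * count / total)
    for rel, count in relation_counts.items()
}

for rel, count in relation_counts.items():
    print(f"{rel} ({count}; {relation_shares[rel]}% instances)")
```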
Parents of X nodes belong to 9 different parts of speech: NOUN (21; 29% instances), VERB (15; 21% instances), X (12; 17% instances), PROPN (7; 10% instances), ADJ (6; 8% instances), the artificial root, which has no tag (4; 6% instances), ADV (3; 4% instances), INTJ (3; 4% instances), PRON (1; 1% instances)
59 (82%) X nodes are leaves.
6 (8%) X nodes have one child.
4 (6%) X nodes have two children.
3 (4%) X nodes have three or more children.
The highest child degree of an X node is 4.
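The leaf and child counts above can be thought of as a bucketed count of dependents per X node. A minimal sketch for a single sentence, assuming 1-based HEAD indices with 0 for the root as in CoNLL-U; `child_degree_buckets` and the toy sentence are hypothetical:

```python
from collections import Counter


def child_degree_buckets(heads, upos, tag="X"):
    """Bucket the nodes carrying `tag` by their number of children.

    heads[i] is the 1-based head index of token i + 1 (0 means root);
    upos[i] is the POS tag of token i + 1.
    """
    # Count how many tokens point at each head.
    n_children = Counter(h for h in heads if h != 0)
    labels = {0: "leaf", 1: "one", 2: "two"}
    buckets = Counter()
    for idx, pos in enumerate(upos, start=1):
        if pos == tag:
            buckets[labels.get(n_children[idx], "three_or_more")] += 1
    return buckets


# Toy sentence: token 2 is the root; tokens 1, 3 and 4 are tagged X.
toy_buckets = child_degree_buckets(
    heads=[2, 0, 2, 3],
    upos=["X", "VERB", "X", "X"],
)
```

In the toy sentence, tokens 1 and 4 have no dependents and token 3 has one, so two X nodes are leaves and one has a single child.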
Children of X nodes are attached using 9 different relations: flat:foreign (12; 50% instances), punct (3; 13% instances), case (2; 8% instances), cc (2; 8% instances), amod (1; 4% instances), cop (1; 4% instances), dep (1; 4% instances), det (1; 4% instances), nsubj (1; 4% instances)
Children of X nodes belong to 9 different parts of speech: X (12; 50% instances), PUNCT (3; 13% instances), ADP (2; 8% instances), CCONJ (2; 8% instances), ADJ (1; 4% instances), ADV (1; 4% instances), AUX (1; 4% instances), DET (1; 4% instances), PRON (1; 4% instances)