home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-CHILDES: POS Tags: X

There are 36 X lemmas (1%), 52 X types (1%) and 164 X tokens (0%). Out of 17 observed tags, the rank of X is: 12 in number of lemmas, 11 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, doo, a, e, r, d, s, o, h, mm

The 10 most frequent X types: a, doo, e, r, d, s, hm, o, Mm, c

The 10 most frequent ambiguous lemmas: doo (X 16, NOUN 10, INTJ 2), a (DET 7218, NOUN 36, X 15, SYM 10, PRON 8, INTJ 6, PROPN 4, ADP 2, PUNCT 2), e (NOUN 21, X 15, PROPN 11, INTJ 2), r (X 10, NOUN 4, PROPN 1), d (PROPN 32, NOUN 20, X 8, INTJ 3), s (PART 209, NOUN 8, X 6, DET 1, PROPN 1), o (NOUN 14, X 5, PUNCT 3, INTJ 2, PROPN 1, SYM 1), h (NOUN 14, X 4, INTJ 2, DET 1, PROPN 1), mm (INTJ 58, PROPN 5, ADV 4, NOUN 4, X 4), ya (PRON 88, X 4, PROPN 2, INTJ 1)

The 10 most frequent ambiguous types: a (DET 5737, NOUN 32, PART 14, X 13, ADP 12, SYM 9, PRON 6, PROPN 4, INTJ 3, PUNCT 2, ADV 1, AUX 1), doo (X 12, NOUN 8, INTJ 2), e (NOUN 17, X 13, PROPN 8, INTJ 1), r (NOUN 11, X 10, PROPN 5, AUX 1), d (PROPN 22, NOUN 18, X 6, AUX 2, INTJ 1), s (PART 19, NOUN 8, X 5, AUX 2, PRON 2, PROPN 1), hm (INTJ 22, PROPN 6, X 5, NOUN 3, PRON 1), o (NOUN 14, X 5, PUNCT 3, ADP 1, INTJ 1, PROPN 1, SYM 1), Mm (INTJ 40, ADV 4, NOUN 4, X 4, PROPN 3), c (NOUN 35, PROPN 10, X 2, ADV 1, INTJ 1, VERB 1)

Morphology

The form / lemma ratio of X is 1.444444 (the average of all parts of speech is 1.232942).

The 1st highest number of forms (26) was observed with the lemma “_”: ’s, A, C, arf, be, beep, eeeh, eeh, f, ha, hm, ho, hop, huh, night, oh, out, p, pa, r, s, stairs, t, wee, whoo, woof.

The 2nd highest number of forms (1) was observed with the lemma “a”: a.

The 3rd highest number of forms (1) was observed with the lemma “b”: b.

X does not occur with any features.

Relations

X nodes are attached to their parents using 11 different relations: flat (56; 34% instances), goeswith (32; 20% instances), conj (28; 17% instances), root (16; 10% instances), discourse (10; 6% instances), nsubj (7; 4% instances), obj (4; 2% instances), reparandum (4; 2% instances), vocative (3; 2% instances), obl (2; 1% instances), xcomp (2; 1% instances)

Parents of X nodes belong to 14 different parts of speech: X (69; 42% instances), NOUN (19; 12% instances), INTJ (17; 10% instances), (16; 10% instances), VERB (12; 7% instances), PROPN (8; 5% instances), ADV (5; 3% instances), PRON (5; 3% instances), ADJ (3; 2% instances), AUX (3; 2% instances), ADP (2; 1% instances), DET (2; 1% instances), NUM (2; 1% instances), PART (1; 1% instances)

141 (86%) X nodes are leaves.

2 (1%) X nodes have one child.

3 (2%) X nodes have two children.

18 (11%) X nodes have three or more children.

The highest child degree of a X node is 17.

Children of X nodes are attached using 8 different relations: flat (49; 46% instances), conj (33; 31% instances), punct (16; 15% instances), case (2; 2% instances), cop (2; 2% instances), det (2; 2% instances), nmod:poss (1; 1% instances), nsubj:outer (1; 1% instances)

Children of X nodes belong to 8 different parts of speech: X (69; 65% instances), PUNCT (16; 15% instances), NOUN (7; 7% instances), PROPN (5; 5% instances), ADP (3; 3% instances), PRON (3; 3% instances), AUX (2; 2% instances), DET (1; 1% instances)