home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: X

There are 134 X lemmas (1%), 135 X types (1%) and 217 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: e.g., al., et, i.e., etc, etc., de, 1, 2, 3

The 10 most frequent X types: e.g., al., et, i.e., etc, de, etc., 1, 2, 3

The 10 most frequent ambiguous lemmas: de (PROPN 12, X 4), 1 (NUM 64, X 4), 2 (NUM 54, X 4), 3 (NUM 33, X 4), 4 (NUM 21, X 4), 5 (NUM 20, X 2), Montejo (PROPN 3, X 2), c. (X 2, ADP 1, ADV 1), paseo (X 2, PROPN 1), 6 (NUM 16, DET 1, X 1)

The 10 most frequent ambiguous types: de (PROPN 12, X 4), 1 (NUM 64, X 4), 2 (NUM 54, X 4), 3 (NUM 33, X 4), 4 (NUM 21, X 4), 5 (NUM 20, X 2), Montejo (PROPN 3, X 2), Paseo (X 2, PROPN 1), c. (X 2, ADP 1, ADV 1), 6 (NUM 16, DET 1, X 1)

Morphology

The form / lemma ratio of X is 1.007463 (the average of all parts of speech is 1.227660).

The 1st highest number of forms (2) was observed with the lemma “@ord@”: 4., 5..

The 2nd highest number of forms (2) was observed with the lemma “etc.”: etc, etc..

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

X does not occur with any features.

Relations

X nodes are attached to their parents using 18 different relations: compound (53; 24% instances), dep (32; 15% instances), conj (29; 13% instances), appos (20; 9% instances), advmod (17; 8% instances), nmod (16; 7% instances), obl (10; 5% instances), nsubj (9; 4% instances), cc (8; 4% instances), obj (5; 2% instances), root (4; 2% instances), advcl (3; 1% instances), case (3; 1% instances), xcomp (3; 1% instances), parataxis (2; 1% instances), amod (1; 0% instances), det (1; 0% instances), nmod:npmod (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: X (70; 32% instances), NOUN (46; 21% instances), VERB (45; 21% instances), PROPN (41; 19% instances), (4; 2% instances), NUM (3; 1% instances), CCONJ (2; 1% instances), INTJ (2; 1% instances), ADV (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)

107 (49%) X nodes are leaves.

44 (20%) X nodes have one child.

28 (13%) X nodes have two children.

38 (18%) X nodes have three or more children.

The highest child degree of a X node is 8.

Children of X nodes are attached using 21 different relations: punct (94; 37% instances), compound (52; 21% instances), case (26; 10% instances), cc (14; 6% instances), conj (10; 4% instances), nmod (9; 4% instances), appos (7; 3% instances), det (7; 3% instances), cop (5; 2% instances), dep (4; 2% instances), nmod:npmod (4; 2% instances), amod (3; 1% instances), mark (3; 1% instances), nmod:tmod (3; 1% instances), nsubj (3; 1% instances), acl:relcl (2; 1% instances), advmod (2; 1% instances), nummod (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (94; 37% instances), X (70; 28% instances), ADP (22; 9% instances), NOUN (17; 7% instances), CCONJ (8; 3% instances), PROPN (8; 3% instances), NUM (7; 3% instances), ADJ (6; 2% instances), AUX (6; 2% instances), DET (6; 2% instances), VERB (4; 2% instances), ADV (2; 1% instances), PART (2; 1% instances), SCONJ (1; 0% instances)