home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: X

There are 1116 X lemmas (3%), 1131 X types (2%) and 1797 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: ex, ‘s, ya, etc., c, x, d, i, mm, and

The 10 most frequent X types: ex, ‘s, ya, etc., C, x, i, mm, C., and

The 10 most frequent ambiguous lemmas: ex (X 43, PART 4, ADJ 3, NOUN 2), ’s (X 37, PROPN 2), ya (ADV 423, X 33), etc. (X 29, ADV 4), c (X 27, NOUN 6, PROPN 1), x (X 17, NUM 10, ADJ 4, PROPN 1), d (X 15, ADP 2, PROPN 2), i (NUM 53, CCONJ 20, X 15, ADJ 4, PRON 1, PROPN 1), mm (NOUN 20, X 15), and (X 14, CCONJ 12, PROPN 10)

The 10 most frequent ambiguous types: ex (X 42, PART 4, ADJ 3, NOUN 2), ’s (X 37, PROPN 2, AUX 1), ya (ADV 393, X 33), etc. (X 29, ADV 4), C (X 27, PROPN 8, NOUN 6), x (X 11, PROPN 1), i (CCONJ 20, X 6, PROPN 1), mm (NOUN 20, X 14), C. (X 14, PROPN 6, NOUN 1), and (X 13, CCONJ 11, PROPN 10)

Morphology

The form / lemma ratio of X is 1.013441 (the average of all parts of speech is 1.326443).

The 1st highest number of forms (2) was observed with the lemma “8vo”: 8vo, 8vos.

The 2nd highest number of forms (2) was observed with the lemma “_”: a, ª.

The 3rd highest number of forms (2) was observed with the lemma “ac”: AC, aC.

X occurs with 5 features: Number (616; 34% instances), Gender (506; 28% instances), Foreign (87; 5% instances), Person (83; 5% instances), VerbForm (34; 2% instances)

X occurs with 11 feature-value pairs: Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

X occurs with 27 feature combinations. The most frequent feature combination is _ (1052 tokens). Examples: ex, ya, C, etc., x, ‘s, C., i, B, Sub

Relations

X nodes are attached to their parents using 27 different relations: appos (392; 22% instances), dep (288; 16% instances), nmod (214; 12% instances), compound (203; 11% instances), conj (176; 10% instances), obl (120; 7% instances), amod (69; 4% instances), nsubj (63; 4% instances), case (58; 3% instances), cc (56; 3% instances), obj (56; 3% instances), flat (39; 2% instances), root (13; 1% instances), parataxis (8; 0% instances), acl:relcl (7; 0% instances), xcomp (7; 0% instances), fixed (6; 0% instances), acl (4; 0% instances), mark (4; 0% instances), obl:arg (3; 0% instances), advcl (2; 0% instances), ccomp (2; 0% instances), goeswith (2; 0% instances), obl:agent (2; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: NOUN (700; 39% instances), X (374; 21% instances), VERB (293; 16% instances), PROPN (262; 15% instances), ADJ (64; 4% instances), NUM (41; 2% instances), SYM (28; 2% instances), PRON (13; 1% instances), (13; 1% instances), ADV (5; 0% instances), CCONJ (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)

690 (38%) X nodes are leaves.

466 (26%) X nodes have one child.

307 (17%) X nodes have two children.

334 (19%) X nodes have three or more children.

The highest child degree of a X node is 10.

Children of X nodes are attached using 26 different relations: punct (679; 29% instances), case (331; 14% instances), det (213; 9% instances), compound (164; 7% instances), nmod (160; 7% instances), nummod (148; 6% instances), conj (124; 5% instances), appos (90; 4% instances), amod (88; 4% instances), cc (82; 4% instances), dep (81; 3% instances), acl:relcl (30; 1% instances), flat (30; 1% instances), advmod (20; 1% instances), cop (14; 1% instances), nsubj (14; 1% instances), acl (12; 1% instances), obj (12; 1% instances), mark (11; 0% instances), fixed (6; 0% instances), parataxis (6; 0% instances), advcl (4; 0% instances), expl:pv (4; 0% instances), aux (3; 0% instances), csubj (1; 0% instances), obl (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (679; 29% instances), X (374; 16% instances), ADP (317; 14% instances), DET (215; 9% instances), NUM (179; 8% instances), NOUN (177; 8% instances), PROPN (76; 3% instances), CCONJ (74; 3% instances), ADJ (73; 3% instances), VERB (54; 2% instances), SYM (44; 2% instances), ADV (23; 1% instances), AUX (17; 1% instances), PRON (16; 1% instances), SCONJ (9; 0% instances), PART (1; 0% instances)