home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: X

There are 1186 X lemmas (3%), 1206 X types (2%) and 1981 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: ex, hab, ‘s, etc., ya, c, ², x, d, i

The 10 most frequent X types: ex, hab, ‘s, etc., ya, C, ², x, i, mm

The 10 most frequent ambiguous lemmas: ex (X 43, PART 4, ADJ 3, NOUN 2, PROPN 1), hab (X 39, NOUN 25, VERB 3), ’s (X 37, PROPN 3), ya (ADV 422, X 33, PROPN 3), c (X 27, PROPN 9, NOUN 6), ² (SYM 302, X 19, NUM 16), x (X 17, NUM 10, PROPN 8), d (X 16, PROPN 4, ADP 2), i (NUM 52, PROPN 22, CCONJ 20, X 16), mm (NOUN 20, X 15)

The 10 most frequent ambiguous types: ex (X 42, PART 4, ADJ 3, NOUN 2), hab (X 39, NOUN 25, VERB 3), ’s (X 37, PROPN 3), ya (ADV 393, X 33), C (X 27, PROPN 8, NOUN 6), ² (SYM 302, X 19, NUM 16), x (X 11, PROPN 1), i (CCONJ 20, X 6, PROPN 1), mm (NOUN 20, X 14), C. (X 14, PROPN 6, NOUN 1)

Morphology

The form / lemma ratio of X is 1.016863 (the average of all parts of speech is 1.278515).

The 1st highest number of forms (3) was observed with the lemma “ser”: es, fue, ser.

The 2nd highest number of forms (2) was observed with the lemma “8vo”: 8vo, 8vos.

The 3rd highest number of forms (2) was observed with the lemma “ac”: AC, aC.

X occurs with 6 features: Number (670; 34% instances), Gender (542; 27% instances), Person (99; 5% instances), Foreign (51; 3% instances), VerbForm (42; 2% instances), Polarity (3; 0% instances)

X occurs with 12 feature-value pairs: Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

X occurs with 29 feature combinations. The most frequent feature combination is _ (1207 tokens). Examples: ex, hab, ya, C, etc., ², x, ‘s, C., i

Relations

X nodes are attached to their parents using 31 different relations: appos (405; 20% instances), dep (302; 15% instances), compound (234; 12% instances), nmod (218; 11% instances), conj (179; 9% instances), obl (145; 7% instances), amod (71; 4% instances), nsubj (67; 3% instances), case (63; 3% instances), cc (60; 3% instances), obj (56; 3% instances), punct (32; 2% instances), nummod (23; 1% instances), advmod (17; 1% instances), root (17; 1% instances), det (16; 1% instances), flat (16; 1% instances), parataxis (9; 0% instances), acl:relcl (8; 0% instances), aux (8; 0% instances), fixed (7; 0% instances), mark (6; 0% instances), cop (5; 0% instances), acl (4; 0% instances), advcl (3; 0% instances), iobj (3; 0% instances), ccomp (2; 0% instances), xcomp (2; 0% instances), aux:pass (1; 0% instances), csubj (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 14 different parts of speech: NOUN (756; 38% instances), X (398; 20% instances), VERB (346; 17% instances), PROPN (288; 15% instances), ADJ (67; 3% instances), NUM (43; 2% instances), SYM (38; 2% instances), (17; 1% instances), PRON (14; 1% instances), ADV (5; 0% instances), SCONJ (4; 0% instances), AUX (2; 0% instances), CCONJ (2; 0% instances), ADP (1; 0% instances)

928 (47%) X nodes are leaves.

333 (17%) X nodes have one child.

313 (16%) X nodes have two children.

407 (21%) X nodes have three or more children.

The highest child degree of a X node is 10.

Children of X nodes are attached using 27 different relations: punct (703; 28% instances), case (374; 15% instances), det (222; 9% instances), compound (195; 8% instances), nummod (195; 8% instances), nmod (182; 7% instances), conj (125; 5% instances), dep (112; 4% instances), appos (95; 4% instances), amod (87; 3% instances), cc (87; 3% instances), acl:relcl (31; 1% instances), advmod (24; 1% instances), cop (21; 1% instances), nsubj (16; 1% instances), obj (13; 1% instances), acl (11; 0% instances), mark (11; 0% instances), fixed (10; 0% instances), parataxis (8; 0% instances), flat (6; 0% instances), advcl (4; 0% instances), aux (4; 0% instances), iobj (4; 0% instances), aux:pass (2; 0% instances), csubj (2; 0% instances), nsubj:pass (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: PUNCT (701; 28% instances), X (398; 16% instances), ADP (359; 14% instances), DET (222; 9% instances), NUM (222; 9% instances), NOUN (205; 8% instances), PROPN (91; 4% instances), CCONJ (79; 3% instances), ADJ (76; 3% instances), SYM (64; 3% instances), VERB (61; 2% instances), ADV (22; 1% instances), AUX (20; 1% instances), PRON (16; 1% instances), SCONJ (9; 0% instances)