
This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: AUX

There are 24 AUX lemmas (0%), 160 AUX types (0%) and 10754 AUX tokens (2%). Out of 17 observed tags, the rank of AUX is: 14 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: ser, haber, poder, estar, deber, corresponder, querer, volver, soler, venir

The 10 most frequent AUX types: es, fue, ha, son, ser, eran, era, han, está, puede

The 10 most frequent ambiguous lemmas: ser (AUX 6901, VERB 963, NOUN 49, PROPN 25, X 7, CCONJ 1, DET 1, PART 1), haber (AUX 1860, VERB 372, PROPN 7, NOUN 5, X 2, ADV 1), poder (AUX 917, NOUN 102, VERB 18, PROPN 3, X 2), estar (AUX 857, VERB 422, X 2, PROPN 1), deber (AUX 187, VERB 152, NOUN 1, PROPN 1), corresponder (VERB 78, AUX 8), querer (VERB 183, AUX 3, PROPN 3, X 1), volver (VERB 193, AUX 3, PROPN 2), soler (VERB 65, AUX 2, PROPN 1), venir (VERB 90, AUX 2)

The 10 most frequent ambiguous types: es (AUX 2550, VERB 297, PROPN 3, X 3), fue (AUX 1172, VERB 107, PART 1, X 1), ha (AUX 766, NOUN 3, PROPN 1, X 1), son (AUX 509, VERB 54, NOUN 2, PROPN 1), ser (AUX 433, VERB 34, NOUN 26, X 1), eran (AUX 392, VERB 102), era (AUX 326, VERB 140, NOUN 12, PROPN 1), está (AUX 280, VERB 102, X 1), fueron (AUX 247, VERB 27), sido (AUX 241, VERB 13)
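The ambiguity lists above can be reproduced in spirit by tallying, for each lemma (or word form), how often it appears under each UPOS tag and keeping the entries that occur as AUX and as at least one other tag. The following is a minimal sketch under that assumption; the function name `ambiguous_entries` and the sample pairs are hypothetical and not taken from the treebank tooling.

```python
from collections import Counter, defaultdict

def ambiguous_entries(pairs, target="AUX"):
    """Return {key: Counter(upos -> count)} for keys tagged as `target`
    and as at least one other UPOS tag."""
    by_key = defaultdict(Counter)
    for key, upos in pairs:
        by_key[key][upos] += 1
    return {k: c for k, c in by_key.items() if target in c and len(c) > 1}

# Hypothetical (lemma, UPOS) pairs for illustration only.
sample_pairs = [
    ("ser", "AUX"), ("ser", "AUX"), ("ser", "VERB"),
    ("haber", "AUX"),
    ("casa", "NOUN"),
]

result = ambiguous_entries(sample_pairs)
for lemma, counts in result.items():
    # most_common() orders tags by frequency, as in the lists above.
    print(lemma, counts.most_common())
```

Passing (form, UPOS) pairs instead of (lemma, UPOS) pairs would give the corresponding list of ambiguous types.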

Morphology

The form / lemma ratio of AUX is 6.666667 (the average of all parts of speech is 1.278515).
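The reported ratio is consistent with dividing the number of AUX types by the number of AUX lemmas given in the summary above (160 types, 24 lemmas). A small sketch of that arithmetic follows; the function name is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# Counts taken from the AUX summary on this page.
ratio = form_lemma_ratio(160, 24)
print(f"{ratio:.6f}")  # 6.666667
```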

The highest number of forms (37) was observed with the lemma “poder”: Podeis, Podriaís, podamos, podemos, poder, podes, podido, podras, podre, podremos, podrá, podrán, podrás, podréis, podría, podríamos, podrían, podéis, podía, podíamos, podían, pude, pudiera, pudieran, pudieras, pudieron, pudiesen, pudimos, pudiéndo, pudo, pueda, puedan, puedas, puede, pueden, puedes, puedo.

The 2nd highest number of forms (30) was observed with the lemma “haber”: ha, habeis, haber, haberle, haberse, habiendo, habiéndo, habrá, habrán, habría, habrían, habéis, había, habíamos, habían, han, has, hay, haya, hayamos, hayan, hayas, he, hemos, hubiera, hubieran, hubiese, hubiesen, hubiéramos, hubo.

The 3rd highest number of forms (27) was observed with the lemma “estar”: esta, estaba, estabamos, estaban, estado, estamos, estan, estando, estar, estará, estarán, estaría, estarían, estarías, estas, este, estoy, estuve, estuviera, estuvieron, estuvimos, estuvo, está, estábamos, están, esté, estén.

AUX occurs with 6 features: VerbForm (10754; 100% instances), Number (9927; 92% instances), Tense (9627; 90% instances), Person (9583; 89% instances), Mood (9582; 89% instances), Gender (348; 3% instances)

AUX occurs with 18 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 37 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (4387 tokens). Examples: es, ha, está, puede, debe, hay, esta, fuera, podre, quiere
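In CoNLL-U files, morphological features are stored in the FEATS column as `Name=Value` pairs joined by `|`, with `_` for an empty set. The three feature counts above (features, feature-value pairs, feature combinations) could be derived from that column roughly as follows. This is a sketch only: `parse_feats` is a hypothetical helper and the sample strings are invented for illustration.

```python
from collections import Counter

def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values of AUX tokens, not drawn from the treebank.
sample = [
    "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin",
    "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin",
    "VerbForm=Inf",
    "Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part",
]

# Whole FEATS strings are the "feature combinations".
combinations = Counter(sample)
# Number of tokens carrying each feature, regardless of value.
features = Counter(name for feats in sample for name in parse_feats(feats))
# Distinct feature-value pairs.
pairs = {f"{k}={v}" for feats in sample for k, v in parse_feats(feats).items()}

print(combinations.most_common(1))
print(features.most_common())
print(sorted(pairs))
```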

Relations

AUX nodes are attached to their parents using 15 different relations: cop (5534; 51% instances), aux (3339; 31% instances), aux:pass (1847; 17% instances), acl:relcl (8; 0% instances), conj (8; 0% instances), parataxis (4; 0% instances), root (4; 0% instances), advcl (2; 0% instances), ccomp (2; 0% instances), dep (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), nsubj:pass (1; 0% instances), obl (1; 0% instances)

Parents of AUX nodes belong to 15 different parts of speech: VERB (4912; 46% instances), NOUN (3392; 32% instances), ADJ (1884; 18% instances), PRON (343; 3% instances), PROPN (107; 1% instances), DET (39; 0% instances), X (20; 0% instances), NUM (18; 0% instances), ADV (13; 0% instances), SYM (12; 0% instances), ADP (4; 0% instances), [no tag; the artificial root, matching the 4 root attachments above] (4; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), PUNCT (1; 0% instances)

10358 (96%) AUX nodes are leaves.

361 (3%) AUX nodes have one child.

12 (0%) AUX nodes have two children.

23 (0%) AUX nodes have three or more children.

The highest child degree of an AUX node is 8.
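The leaf and child-degree figures above count, for every AUX node, how many other nodes name it as their HEAD; the four reported groups add up to the 10754 AUX tokens (10358 + 361 + 12 + 23). A minimal sketch of such a count over CoNLL-U text is given below. The function name and the two sample sentences (including their annotation) are hypothetical and only illustrate the column layout.

```python
from collections import Counter

def child_degrees(conllu: str, upos: str = "AUX") -> Counter:
    """Map child degree -> number of nodes with the given UPOS tag."""
    hist = Counter()
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic word lines only (skip ranges like 1-2 and empty nodes like 1.1).
        rows = [r for r in rows if r[0].isdigit()]
        # Column 7 (index 6) is HEAD: count how often each ID is someone's head.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:  # column 4 (index 3) is UPOS
                hist[n_children[r[0]]] += 1
    return hist

# Hypothetical annotated sentences, for illustration only.
SAMPLE = "\n".join([
    "1\tLa\tel\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tcasa\tcasa\tNOUN\t_\t_\t4\tnsubj\t_\t_",
    "3\tes\tser\tAUX\t_\t_\t4\tcop\t_\t_",
    "4\tgrande\tgrande\tADJ\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
    "1\tEs\tser\tAUX\t_\t_\t0\troot\t_\t_",
    "2\tasí\tasí\tADV\t_\t_\t1\tadvmod\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

hist = child_degrees(SAMPLE)
print(sorted(hist.items()))   # [(0, 1), (2, 1)]
print(max(hist))              # highest child degree in the sample: 2

# Consistency check on the figures reported on this page.
assert 10358 + 361 + 12 + 23 == 10754
```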

Children of AUX nodes are attached using 19 different relations: advmod (242; 50% instances), fixed (49; 10% instances), iobj (47; 10% instances), mark (42; 9% instances), obl (28; 6% instances), nsubj (18; 4% instances), punct (17; 3% instances), obj (12; 2% instances), advcl (9; 2% instances), conj (9; 2% instances), cc (3; 1% instances), det (3; 1% instances), xcomp (3; 1% instances), amod (1; 0% instances), aux (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: ADV (242; 50% instances), PRON (62; 13% instances), CCONJ (50; 10% instances), ADP (38; 8% instances), NOUN (37; 8% instances), PUNCT (17; 3% instances), VERB (14; 3% instances), SCONJ (7; 1% instances), DET (4; 1% instances), PROPN (4; 1% instances), SYM (4; 1% instances), AUX (3; 1% instances), ADJ (2; 0% instances), X (2; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)