home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: AUX

There are 26 AUX lemmas (0%), 198 AUX types (0%) and 11006 AUX tokens (3%). Out of 17 observed tags, the rank of AUX is: 14 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: ser, haber, poder, estar, deber, ir, tener, corresponder, querer, volver

The 10 most frequent AUX types: es, fue, ha, son, ser, eran, era, han, está, puede

The 10 most frequent ambiguous lemmas: ser (AUX 6901, VERB 963, NOUN 49, PROPN 25, X 7, CCONJ 1, DET 1, PART 1), haber (AUX 1860, VERB 372, PROPN 7, NOUN 5, X 2, ADV 1), poder (AUX 917, NOUN 102, VERB 18, PROPN 3, X 2), estar (AUX 857, VERB 422, X 2, PROPN 1), deber (AUX 187, VERB 152, NOUN 1, PROPN 1), ir (VERB 174, AUX 138, PROPN 22, PART 2, DET 1), tener (VERB 1306, AUX 114, PROPN 1), corresponder (VERB 78, AUX 8), querer (VERB 183, AUX 3, PROPN 3, X 1), volver (VERB 193, AUX 3, PROPN 2)

The 10 most frequent ambiguous types: es (AUX 2550, VERB 297, PROPN 3, X 3), fue (AUX 1172, VERB 107, PART 1, X 1), ha (AUX 766, NOUN 3, PROPN 1, X 1), son (AUX 509, VERB 54, NOUN 2, PROPN 1), ser (AUX 433, VERB 34, NOUN 26, X 1), eran (AUX 392, VERB 102), era (AUX 326, VERB 140, NOUN 12, PROPN 1), está (AUX 280, VERB 102, X 1), fueron (AUX 247, VERB 27), sido (AUX 241, VERB 13)

Morphology

The form / lemma ratio of AUX is 7.615385 (the average of all parts of speech is 1.279343).

The 1st highest number of forms (37) was observed with the lemma “poder”: Podeis, Podriaís, podamos, podemos, poder, podes, podido, podras, podre, podremos, podrá, podrán, podrás, podréis, podría, podríamos, podrían, podéis, podía, podíamos, podían, pude, pudiera, pudieran, pudieras, pudieron, pudiesen, pudimos, pudiéndo, pudo, pueda, puedan, puedas, puede, pueden, puedes, puedo.

The 2nd highest number of forms (30) was observed with the lemma “haber”: ha, habeis, haber, haberle, haberse, habiendo, habiéndo, habrá, habrán, habría, habrían, habéis, había, habíamos, habían, han, has, hay, haya, hayamos, hayan, hayas, he, hemos, hubiera, hubieran, hubiese, hubiesen, hubiéramos, hubo.

The 3rd highest number of forms (27) was observed with the lemma “estar”: esta, estaba, estabamos, estaban, estado, estamos, estan, estando, estar, estará, estarán, estaría, estarían, estarías, estas, este, estoy, estuve, estuviera, estuvieron, estuvimos, estuvo, está, estábamos, están, esté, estén.

AUX occurs with 6 features: VerbForm (11006; 100% instances), Number (10155; 92% instances), Tense (9837; 89% instances), Person (9790; 89% instances), Mood (9789; 89% instances), Gender (369; 3% instances)

AUX occurs with 18 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 39 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (4442 tokens). Examples: es, ha, está, puede, debe, va, hay, tiene, esta, fuera

Relations

AUX nodes are attached to their parents using 15 different relations: cop (5534; 50% instances), aux (3588; 33% instances), aux:pass (1848; 17% instances), acl:relcl (9; 0% instances), conj (8; 0% instances), parataxis (4; 0% instances), root (4; 0% instances), advcl (3; 0% instances), ccomp (2; 0% instances), dep (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), nsubj:pass (1; 0% instances), obl (1; 0% instances)

Parents of AUX nodes belong to 15 different parts of speech: VERB (5137; 47% instances), NOUN (3397; 31% instances), ADJ (1902; 17% instances), PRON (344; 3% instances), PROPN (109; 1% instances), DET (39; 0% instances), X (20; 0% instances), NUM (19; 0% instances), ADV (13; 0% instances), SYM (12; 0% instances), ADP (4; 0% instances), (4; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), PUNCT (1; 0% instances)

10488 (95%) AUX nodes are leaves.

482 (4%) AUX nodes have one child.

12 (0%) AUX nodes have two children.

24 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 19 different relations: advmod (242; 40% instances), fixed (164; 27% instances), iobj (50; 8% instances), mark (43; 7% instances), obl (29; 5% instances), nsubj (19; 3% instances), punct (17; 3% instances), obj (13; 2% instances), advcl (9; 1% instances), conj (9; 1% instances), case (3; 0% instances), cc (3; 0% instances), det (3; 0% instances), xcomp (3; 0% instances), amod (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: ADV (243; 40% instances), CCONJ (110; 18% instances), ADP (95; 16% instances), PRON (66; 11% instances), NOUN (38; 6% instances), PUNCT (17; 3% instances), VERB (14; 2% instances), SCONJ (7; 1% instances), DET (4; 1% instances), PROPN (4; 1% instances), SYM (4; 1% instances), AUX (3; 0% instances), X (3; 0% instances), ADJ (2; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)