
This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-AnCora: POS Tags: AUX

There are 7 AUX lemmas (0%), 152 AUX types (0%) and 13565 AUX tokens (2%). Out of 17 observed tags, the rank of AUX is: 15 in number of lemmas, 7 in number of types and 11 in number of tokens.
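As a rough illustration of how these three counts could be derived from a CoNLL-U file, here is a minimal sketch. The function name `pos_counts` and the sample sentences are hypothetical and not taken from the treebank. Whether the script behind this page folds case when counting types is not stated here, so the sketch compares forms as written.

```python
def pos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences; '#' lines are comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])  # LEMMA column
            types.add(cols[1])   # FORM column, compared as written
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical two-sentence sample, written with spaces and converted to tabs.
ROWS = [
    "1 Ha haber AUX _ _ 3 aux _ _",
    "2 sido ser AUX _ _ 3 cop _ _",
    "3 difícil difícil ADJ _ _ 0 root _ _",
    "4 . . PUNCT _ _ 3 punct _ _",
    "",
    "1 Es ser AUX _ _ 2 cop _ _",
    "2 fácil fácil ADJ _ _ 0 root _ _",
    "3 . . PUNCT _ _ 2 punct _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS)

print(pos_counts(SAMPLE, "AUX"))  # (2, 3, 3)
```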

All 7 AUX lemmas, in order of frequency: ser, haber, estar, poder, deber, saber, querer

The 10 most frequent AUX types: es, ha, han, fue, ser, son, está, sido, puede, había

All 7 AUX lemmas are ambiguous, that is, they also occur with other tags: ser (AUX 6222, NOUN 37), haber (AUX 4181, VERB 693, NOUN 4), estar (AUX 1330, VERB 321, NOUN 1), poder (AUX 1283, NOUN 126, VERB 40, PROPN 1), deber (AUX 512, VERB 37, NOUN 11), saber (VERB 281, AUX 21, NOUN 8), querer (VERB 380, AUX 16, NOUN 1)

The 10 most frequent ambiguous types: es (AUX 2313, NOUN 47, PROPN 3, CCONJ 1), ha (AUX 2153, VERB 2), fue (AUX 713, VERB 17), ser (AUX 540, NOUN 23), son (AUX 485, NOUN 4), está (AUX 469, VERB 97, NOUN 1), puede (AUX 369, VERB 12, NOUN 1), había (AUX 354, VERB 65), era (AUX 300, NOUN 13, PROPN 1), haber (AUX 232, VERB 18, NOUN 4)
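The ambiguity lists above can be thought of as grouping tag counts by lemma (or by form) and keeping the entries that carry the target tag plus at least one other tag. The sketch below shows that idea on toy data. The helper name `ambiguous_items` and the input pairs are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, target):
    """Map each item seen with `target` and at least one other tag
    to its tag counts, most frequent tag first."""
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    return {
        item: tags.most_common()
        for item, tags in by_item.items()
        if target in tags and len(tags) > 1
    }


# Toy (lemma, UPOS) pairs, invented for illustration only.
pairs = [
    ("ser", "AUX"), ("ser", "AUX"), ("ser", "NOUN"),
    ("haber", "AUX"),
    ("libro", "NOUN"),
]
print(ambiguous_items(pairs, "AUX"))  # {'ser': [('AUX', 2), ('NOUN', 1)]}
```

Passing (form, UPOS) pairs instead of (lemma, UPOS) pairs gives the analogous list of ambiguous types.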

Morphology

The form / lemma ratio of AUX is 21.714286 (the average of all parts of speech is 1.505634).
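The ratio is consistent with dividing the number of AUX types by the number of AUX lemmas reported at the top of this page. A quick check, assuming that is how the figure is computed:

```python
aux_types = 152   # AUX types reported on this page
aux_lemmas = 7    # AUX lemmas reported on this page

ratio = aux_types / aux_lemmas
print(f"{ratio:.6f}")  # 21.714286
```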

The highest number of forms (32) was observed with the lemma “estar”: Estabas, estaba, estabais, estaban, estado, estamos, estando, estar, estaremos, estará, estarán, estaré, estaría, estaríamos, estarían, estemos, estoy, estuve, estuviera, estuvieran, estuvieron, estuviese, estuviesen, estuvimos, estuvo, está, estábamos, estáis, están, estás, esté, estén.

The 2nd highest number of forms (32) was observed with the lemma “ser”: Eramos, Serías, Sé, era, eran, eras, eres, es, fue, fuera, fueran, fuere, fueron, fuese, fuesen, fui, sea, seamos, sean, seas, ser, será, serán, seré, sería, serían, sido, siendo, somos, son, soy, éramos.

The 3rd highest number of forms (30) was observed with the lemma “haber”: ha, haber, habiendo, habremos, habrá, habrán, habrás, habré, habría, habríamos, habrían, habéis, había, habíamos, habían, habías, han, has, haya, hayamos, hayan, he, hemos, hubiera, hubieran, hubieron, hubiese, hubiesen, hubiéramos, hubo.

AUX occurs with 7 features: VerbForm (13565; 100% instances), Number (12499; 92% instances), Tense (12059; 89% instances), Mood (12018; 89% instances), Person (12018; 89% instances), Gender (481; 4% instances), Typo (1; 0% instances)

AUX occurs with 19 feature-value pairs: Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 44 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (5762 tokens). Examples: es, ha, está, puede, debe, quiere
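A feature combination such as the one above is the FEATS column of a CoNLL-U row: `Name=Value` pairs joined by `|`, with `_` standing for no features. A minimal sketch of splitting it into features follows. The helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}  # '_' marks a token with no features
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"
feats = parse_feats(combo)
print(sorted(feats))      # ['Mood', 'Number', 'Person', 'Tense', 'VerbForm']
print(feats["VerbForm"])  # Fin
```

Counting how many tokens have each key, each key-value pair, and each full string would yield the three kinds of totals listed in this section.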

Relations

AUX nodes are attached to their parents using 15 different relations: aux (7260; 54% instances), cop (5547; 41% instances), root (354; 3% instances), ccomp (115; 1% instances), advcl (90; 1% instances), conj (75; 1% instances), acl (66; 0% instances), xcomp (16; 0% instances), aux:pass (15; 0% instances), parataxis (13; 0% instances), csubj (10; 0% instances), acl:relcl (1; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), dep (1; 0% instances)
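The percentages appear to be each relation's count divided by the total number of AUX tokens, rounded to a whole percent. The check below uses the counts from the list above. The rounding rule is an assumption, although it reproduces the printed figures.

```python
# Relation counts copied from the list above.
aux_relations = {
    "aux": 7260, "cop": 5547, "root": 354, "ccomp": 115, "advcl": 90,
    "conj": 75, "acl": 66, "xcomp": 16, "aux:pass": 15, "parataxis": 13,
    "csubj": 10, "acl:relcl": 1, "advmod": 1, "appos": 1, "dep": 1,
}

total = sum(aux_relations.values())
shares = {rel: round(100 * n / total) for rel, n in aux_relations.items()}

print(total)                                         # 13565
print(shares["aux"], shares["cop"], shares["root"])  # 54 41 3
```

The total matches the 13565 AUX tokens reported at the top of the page, so every AUX token is accounted for by exactly one incoming relation.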

Parents of AUX nodes belong to 17 different parts of speech (counting the case where the AUX node is the root and has no parent): VERB (7123; 53% instances), NOUN (2665; 20% instances), ADJ (2641; 19% instances), no parent, i.e. root (354; 3% instances), PRON (322; 2% instances), PROPN (143; 1% instances), ADV (132; 1% instances), NUM (48; 0% instances), DET (46; 0% instances), AUX (38; 0% instances), ADP (30; 0% instances), SYM (12; 0% instances), CCONJ (6; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), X (1; 0% instances)

12815 (94%) AUX nodes are leaves.

37 (0%) AUX nodes have one child.

68 (1%) AUX nodes have two children.

645 (5%) AUX nodes have three or more children.

The highest child degree of an AUX node is 9.
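The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes degrees for one toy sentence and keeps only the AUX nodes, as this page does. The function name `child_degrees` and the sample heads and tags are hypothetical.

```python
from collections import Counter


def child_degrees(heads):
    """heads[i] is the 1-based HEAD of token i+1; 0 marks the root.
    Returns the number of children of each token, in token order."""
    degree = Counter(h for h in heads if h != 0)
    return [degree[i] for i in range(1, len(heads) + 1)]


# Toy sentence "Ha sido difícil ." with invented annotation.
heads = [3, 3, 0, 3]
upos = ["AUX", "AUX", "ADJ", "PUNCT"]

degrees = child_degrees(heads)
aux_degrees = [d for d, tag in zip(degrees, upos) if tag == "AUX"]

print(degrees)      # [0, 0, 3, 0]
print(aux_degrees)  # [0, 0]
```

In this toy sentence both AUX nodes are leaves, which is also the dominant case in the counts reported above.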

Children of AUX nodes are attached using 22 different relations: punct (762; 28% instances), ccomp (567; 21% instances), nsubj (477; 17% instances), mark (177; 6% instances), advmod (162; 6% instances), obl (112; 4% instances), csubj (110; 4% instances), cc (105; 4% instances), advcl (102; 4% instances), conj (77; 3% instances), det (19; 1% instances), obj (18; 1% instances), aux (15; 1% instances), xcomp (12; 0% instances), parataxis (9; 0% instances), amod (7; 0% instances), obl:arg (4; 0% instances), case (3; 0% instances), dep (2; 0% instances), expl:pv (2; 0% instances), fixed (1; 0% instances), obl:agent (1; 0% instances)

Children of AUX nodes belong to 15 different parts of speech: PUNCT (762; 28% instances), VERB (762; 28% instances), NOUN (439; 16% instances), SCONJ (152; 6% instances), PRON (136; 5% instances), ADV (130; 5% instances), CCONJ (114; 4% instances), ADJ (89; 3% instances), PROPN (52; 2% instances), ADP (47; 2% instances), AUX (38; 1% instances), DET (15; 1% instances), NUM (6; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)