home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: VERB

There are 3436 VERB lemmas (9%), 9479 VERB types (19%) and 36280 VERB tokens (8%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent VERB lemmas: tener, ser, hacer, encontrar, dar, estar, haber, decir, llegar, ir

The 10 most frequent VERB types: tiene, es, encuentra, hacer, hay, hace, tenía, tienen, era, ubicado

The 10 most frequent ambiguous lemmas: tener (VERB 1420, PROPN 1), ser (AUX 6901, VERB 963, NOUN 49, PROPN 25, X 7, CCONJ 1, DET 1, PART 1), hacer (VERB 822, NOUN 6, ADP 2), dar (VERB 465, ADP 9, PROPN 6, X 2), estar (AUX 857, VERB 422, X 2, PROPN 1), haber (AUX 1860, VERB 372, PROPN 7, NOUN 5, X 2, ADV 1), decir (VERB 358, PROPN 1), ir (VERB 312, PROPN 22, PART 2, DET 1), conocer (VERB 303, PROPN 1), ver (VERB 298, NOUN 9, PROPN 4, X 1)

The 10 most frequent ambiguous types: es (AUX 2550, VERB 297, PROPN 3, X 3), hacer (VERB 200, NOUN 4), hay (VERB 155, AUX 35, ADV 1, X 1), tenía (VERB 176, PROPN 1), era (AUX 326, VERB 140, NOUN 12, PROPN 1), ubicado (VERB 134, ADJ 15), fue (AUX 1172, VERB 107, PART 1, X 1), ver (VERB 126, NOUN 1, X 1), debido (VERB 97, AUX 1), conocido (VERB 113, ADJ 10, NOUN 4)

Morphology

The form / lemma ratio of VERB is 2.758731 (the average of all parts of speech is 1.278515).

The 1st highest number of forms (32) was observed with the lemma “tener”: Tendrian, tendremos, tendrá, tendrán, tendría, tendrían, tenemos, tener, tenga, tengamos, tengan, tengo, tengáis, tenia, tenidas, tenido, tenidos, teniendo, tenéis, tenía, teníamos, tenían, tiene, tienen, tienes, tuve, tuviera, tuvieron, tuviese, tuviesen, tuvimos, tuvo.

The 2nd highest number of forms (28) was observed with the lemma “dar”: Daban, da, daba, dabamos, dada, dadas, dado, dados, dan, dando, dar, dara, daran, daremos, dará, darán, daría, darían, den, di, diera, dieran, dieron, dimos, dio, doy, dándo, dé.

The 3rd highest number of forms (26) was observed with the lemma “hacer”: hace, hacemos, hacen, hacer, haciendo, hacía, hacían, haga, hagamos, hagan, hago, haremos, hará, harán, haría, harían, hecha, hechas, hecho, hechos, hice, hiciera, hicieran, hicieron, hicimos, hizo.

VERB occurs with 6 features: VerbForm (36280; 100% instances), Number (27878; 77% instances), Tense (22998; 63% instances), Person (20360; 56% instances), Mood (20358; 56% instances), Gender (7462; 21% instances)

VERB occurs with 19 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

VERB occurs with 57 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (6693 tokens). Examples: tiene, es, encuentra, hay, hace, está, cuenta, da, dice, quiere

Relations

VERB nodes are attached to their parents using 29 different relations: root (11852; 33% instances), advcl (6631; 18% instances), acl:relcl (4876; 13% instances), conj (4363; 12% instances), acl (2678; 7% instances), xcomp (1773; 5% instances), ccomp (1242; 3% instances), parataxis (1150; 3% instances), cop (719; 2% instances), csubj (617; 2% instances), fixed (85; 0% instances), mark (62; 0% instances), cc (55; 0% instances), appos (37; 0% instances), case (34; 0% instances), dep (22; 0% instances), flat (13; 0% instances), nmod (11; 0% instances), aux (10; 0% instances), obl (10; 0% instances), amod (8; 0% instances), nsubj (8; 0% instances), csubj:pass (6; 0% instances), obj (6; 0% instances), advmod (5; 0% instances), det (3; 0% instances), nsubj:pass (2; 0% instances), compound (1; 0% instances), iobj (1; 0% instances)

Parents of VERB nodes belong to 17 different parts of speech: VERB (12707; 35% instances), (11852; 33% instances), NOUN (8358; 23% instances), ADJ (1483; 4% instances), PROPN (874; 2% instances), PRON (622; 2% instances), ADV (86; 0% instances), DET (67; 0% instances), X (61; 0% instances), ADP (55; 0% instances), NUM (54; 0% instances), SYM (25; 0% instances), AUX (14; 0% instances), CCONJ (13; 0% instances), SCONJ (7; 0% instances), PART (1; 0% instances), PUNCT (1; 0% instances)

966 (3%) VERB nodes are leaves.

3842 (11%) VERB nodes have one child.

6789 (19%) VERB nodes have two children.

24683 (68%) VERB nodes have three or more children.

The highest child degree of a VERB node is 18.

Children of VERB nodes are attached using 30 different relations: obl (24864; 20% instances), punct (20653; 17% instances), obj (13449; 11% instances), nsubj (11615; 10% instances), mark (10465; 9% instances), iobj (7372; 6% instances), advmod (6363; 5% instances), advcl (5366; 4% instances), conj (4421; 4% instances), cc (4158; 3% instances), aux (3144; 3% instances), xcomp (2239; 2% instances), aux:pass (1744; 1% instances), ccomp (1522; 1% instances), parataxis (1140; 1% instances), nsubj:pass (1133; 1% instances), case (661; 1% instances), csubj (324; 0% instances), fixed (296; 0% instances), amod (259; 0% instances), det (227; 0% instances), dep (132; 0% instances), appos (64; 0% instances), cop (62; 0% instances), nummod (25; 0% instances), nmod (8; 0% instances), csubj:pass (7; 0% instances), compound (4; 0% instances), flat (2; 0% instances), acl:relcl (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (38524; 32% instances), PUNCT (20642; 17% instances), VERB (12707; 10% instances), PRON (11351; 9% instances), PROPN (7476; 6% instances), SCONJ (6862; 6% instances), ADV (6721; 6% instances), AUX (4912; 4% instances), CCONJ (4290; 4% instances), ADP (3939; 3% instances), NUM (1874; 2% instances), ADJ (1308; 1% instances), SYM (432; 0% instances), X (346; 0% instances), DET (303; 0% instances), INTJ (25; 0% instances), PART (8; 0% instances)