home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-Valico: POS Tags: VERB

There are 255 VERB lemmas (25%), 490 VERB types (34%) and 968 VERB tokens (14%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: fare, dire, leggere, vedere, avere, pensare, andare, portare, gridare, essere

The 10 most frequent VERB types: detto, visto, fatto, era, portava, aveva, leggendo, pensato, sentito, seduto

The 10 most frequent ambiguous lemmas: avere (AUX 298, VERB 33), essere (AUX 230, VERB 24), stare (AUX 24, VERB 9), volere (AUX 16, VERB 4)

The 10 most frequent ambiguous types: fatto (VERB 21, NOUN 4), era (AUX 69, VERB 12), aveva (AUX 21, VERB 13), letto (VERB 6, NOUN 1), stava (AUX 11, VERB 4), è (AUX 53, VERB 4), avevo (AUX 7, VERB 1), fa (VERB 3, ADV 1), ha (AUX 143, VERB 3), ho (AUX 62, VERB 3)

Morphology

The form / lemma ratio of VERB is 1.921569 (the average of all parts of speech is 1.391304).

The 1st highest number of forms (12) was observed with the lemma “fare”: fa, facendo, faceva, facevo, faciendo, fai, fanno, far, fare, farebbe, farò, fatto.

The 2nd highest number of forms (9) was observed with the lemma “andare”: andare, andata, andati, andato, andava, andò, va, vada, vai.

The 3rd highest number of forms (9) was observed with the lemma “avere”: Avrai, avere, aveste, aveva, avevano, avevo, avuto, ha, ho.

VERB occurs with 6 features: VerbForm (964; 100% instances), Number (744; 77% instances), Tense (744; 77% instances), Gender (384; 40% instances), Mood (360; 37% instances), Person (359; 37% instances)

VERB occurs with 19 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

VERB occurs with 28 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part (349 tokens). Examples: detto, visto, fatto, pensato, sentito, seduto, cominciato, andato, gridato, salvato

Relations

VERB nodes are attached to their parents using 10 different relations: root (324; 33% instances), conj (225; 23% instances), advcl (103; 11% instances), xcomp (82; 8% instances), acl:relcl (81; 8% instances), parataxis (60; 6% instances), ccomp (53; 5% instances), acl (27; 3% instances), csubj (12; 1% instances), discourse (1; 0% instances)

Parents of VERB nodes belong to 8 different parts of speech: VERB (470; 49% instances), (324; 33% instances), NOUN (110; 11% instances), ADJ (47; 5% instances), PRON (13; 1% instances), PROPN (2; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances)

10 (1%) VERB nodes are leaves.

69 (7%) VERB nodes have one child.

190 (20%) VERB nodes have two children.

699 (72%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 27 different relations: punct (564; 16% instances), aux (440; 12% instances), obj (437; 12% instances), obl (363; 10% instances), nsubj (329; 9% instances), advmod (241; 7% instances), conj (219; 6% instances), mark (206; 6% instances), cc (199; 6% instances), xcomp (113; 3% instances), expl (98; 3% instances), advcl (96; 3% instances), parataxis (78; 2% instances), ccomp (70; 2% instances), iobj (61; 2% instances), dislocated (8; 0% instances), expl:impers (7; 0% instances), discourse (6; 0% instances), csubj (4; 0% instances), vocative (4; 0% instances), aux:pass (2; 0% instances), cop (2; 0% instances), dep (2; 0% instances), obl:agent (2; 0% instances), acl:relcl (1; 0% instances), nsubj:pass (1; 0% instances), orphan (1; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (798; 22% instances), PUNCT (564; 16% instances), PRON (475; 13% instances), VERB (470; 13% instances), AUX (444; 12% instances), ADV (247; 7% instances), CCONJ (200; 6% instances), SCONJ (118; 3% instances), ADP (86; 2% instances), PROPN (77; 2% instances), ADJ (63; 2% instances), INTJ (5; 0% instances), NUM (4; 0% instances), DET (3; 0% instances)