home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: VERB

There are 4147 VERB lemmas (19%), 15927 VERB types (32%) and 40904 VERB tokens (15%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: būt, varēt, zināt, notikt, sākt, redzēt, teikt, izmantot, domāt, sacīt

The 10 most frequent VERB types: ir, var, varētu, nav, bija, nevar, iespējams, būs, teica, notiek

The 10 most frequent ambiguous lemmas: būt (AUX 6107, VERB 1759, PART 9), tikt (AUX 933, VERB 87), ievērot (VERB 81, ADV 1), nozīmēt (VERB 61, NOUN 1), tapt (VERB 56, AUX 2), izteikt (VERB 49, ADV 2), beigt (VERB 44, ADV 1), pamatot (VERB 33, ADV 1), klāt (ADV 46, VERB 10), rīt (ADV 9, VERB 4)

The 10 most frequent ambiguous types: ir (AUX 2977, VERB 666), var (VERB 529, PART 1), nav (AUX 632, VERB 219), bija (AUX 874, VERB 223), iespējams (VERB 97, ADJ 15), būs (AUX 220, VERB 89), stāsta (VERB 81, NOUN 5), uzskata (VERB 75, NOUN 2), nebija (AUX 91, VERB 59), nepieciešams (VERB 59, ADJ 3, ADV 2)

Morphology

The form / lemma ratio of VERB is 3.840608 (the average of all parts of speech is 2.233228).

The 1st highest number of forms (60) was observed with the lemma “būt”: Tr, bij, bija, bijis, biju, bijusi, bijušais, bijušajai, bijušajiem, bijušajā, bijušajām, bijušajās, bijušas, bijuši, bijušā, bijušās, bijām, būdams, būs, būsi, būt, būtu, būšot, būšu, esam, esat, esi, esmu, esot, esošais, esošajam, esošajiem, esošajos, esošajā, esošajām, esošajās, esošas, esošie, esošiem, esošo, esošos, esošu, esošus, esošā, esošās, esu, ir, jābūt, nav, nebija, nebijis, nebijušu, nebūdams, nebūs, nebūt, nebūtu, neesat, neesot, neesošajai, neesošām.

The 2nd highest number of forms (55) was observed with the lemma “redzēt”: Neredzamā, Redzēdama, jāredz, neredz, neredzam, neredzama, neredzamas, neredzamos, neredzams, neredzamu, neredzi, neredzot, neredzu, neredzēja, neredzēju, neredzējām, neredzēt, neredzētu, neredzēšu, redz, redzam, redzama, redzamajai, redzamajiem, redzamam, redzamas, redzami, redzamie, redzamo, redzamos, redzams, redzamākajiem, redzamām, redzamās, redzat, redzi, redziet, redzot, redzu, redzēja, redzējis, redzēju, redzējusi, redzējuši, redzējāt, redzēs, redzēsiet, redzēsim, redzēt, redzēta, redzēti, redzēto, redzēts, redzētu, redzētā.

The 3rd highest number of forms (48) was observed with the lemma “zināt”: Nezinādami, Nezināmais, Zināmākie, jāzina, kāzina, nez’, nezina, nezini, nezinot, nezinu, nezināja, nezināju, nezinām, nezināmas, nezināms, nezināmā, nezināmās, zin’, zina, zini, ziniet, zinot, zinošāks, zinu, zinādami, zināja, zināji, zinājis, zināju, zinājuši, zinājām, zinām, zināma, zināmais, zināmajai, zināmas, zināmi, zināmiem, zināmo, zināms, zināmu, zināmus, zināmā, zināmākajiem, zināmās, zinās, zināt, zinātu.

VERB occurs with 16 features: Polarity (40901; 100% instances), VerbForm (40901; 100% instances), Voice (32934; 81% instances), Tense (30365; 74% instances), Mood (22486; 55% instances), Person (20265; 50% instances), Evident (19912; 49% instances), Number (14236; 35% instances), Gender (10531; 26% instances), Case (10530; 26% instances), Definite (10405; 25% instances), Degree (10377; 25% instances), Aspect (8809; 22% instances), Reflex (6403; 16% instances), Typo (84; 0% instances), Abbr (3; 0% instances)

VERB occurs with 41 feature-value pairs: Abbr=Yes, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Case=Voc, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Evident=Fh, Evident=Nfh, Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Nec, Mood=Qot, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Reflex=Yes, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 470 feature combinations. The most frequent feature combination is Evident=Fh|Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (7002 tokens). Examples: ir, var, notiek, stāsta, saka, uzskata, liecina, šķiet, norāda, strādā

Relations

VERB nodes are attached to their parents using 21 different relations: root (13177; 32% instances), conj (7531; 18% instances), advcl (4117; 10% instances), acl (3751; 9% instances), ccomp (3561; 9% instances), xcomp (3288; 8% instances), amod (3266; 8% instances), parataxis (722; 2% instances), csubj (711; 2% instances), dep (167; 0% instances), discourse (149; 0% instances), csubj:pass (98; 0% instances), obj (80; 0% instances), nsubj (79; 0% instances), obl (74; 0% instances), iobj (66; 0% instances), orphan (29; 0% instances), nmod (24; 0% instances), nsubj:pass (10; 0% instances), reparandum (3; 0% instances), compound (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: VERB (18157; 44% instances), (13177; 32% instances), NOUN (7204; 18% instances), ADV (703; 2% instances), ADJ (673; 2% instances), PRON (579; 1% instances), PROPN (240; 1% instances), DET (53; 0% instances), NUM (46; 0% instances), X (26; 0% instances), AUX (19; 0% instances), SYM (17; 0% instances), PART (6; 0% instances), INTJ (2; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)

2907 (7%) VERB nodes are leaves.

5193 (13%) VERB nodes have one child.

5536 (14%) VERB nodes have two children.

27268 (67%) VERB nodes have three or more children.

The highest child degree of a VERB node is 16.

Children of VERB nodes are attached using 32 different relations: punct (31668; 24% instances), nsubj (17795; 14% instances), obl (15945; 12% instances), obj (11743; 9% instances), advmod (9740; 7% instances), conj (7465; 6% instances), iobj (5550; 4% instances), cc (5357; 4% instances), mark (4339; 3% instances), xcomp (4030; 3% instances), ccomp (3701; 3% instances), advcl (3631; 3% instances), discourse (2370; 2% instances), nsubj:pass (2007; 2% instances), aux (1693; 1% instances), aux:pass (1591; 1% instances), parataxis (908; 1% instances), cop (542; 0% instances), csubj (346; 0% instances), dep (232; 0% instances), case (123; 0% instances), nmod (121; 0% instances), csubj:pass (109; 0% instances), nummod (97; 0% instances), det (77; 0% instances), vocative (66; 0% instances), acl (57; 0% instances), amod (22; 0% instances), orphan (5; 0% instances), reparandum (2; 0% instances), compound (1; 0% instances), flat:name (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (37857; 29% instances), PUNCT (31668; 24% instances), VERB (18157; 14% instances), PRON (11296; 9% instances), ADV (10269; 8% instances), CCONJ (5340; 4% instances), PROPN (4348; 3% instances), SCONJ (4335; 3% instances), AUX (3845; 3% instances), PART (1789; 1% instances), ADJ (1402; 1% instances), NUM (404; 0% instances), X (195; 0% instances), SYM (185; 0% instances), ADP (98; 0% instances), INTJ (77; 0% instances), DET (69; 0% instances)