This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-Cairo: POS Tags: VERB

There are 28 VERB lemmas (26%), 31 VERB types (26%) and 32 VERB tokens (19%). Out of 13 observed tags, the rank of VERB is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: varēt, tikt, uzrakstīt, apgriezt, apskauties, atmest, atnākt, atstāt, atvērt, būt

The 10 most frequent VERB types: uzrakstīja, Nevarēja, apgriezt, apskāvās, atmest, atnākt, atstāja, atver, centās, domā

The 10 most frequent ambiguous lemmas: būt (AUX 3, VERB 1)

The 10 most frequent ambiguous types: (none observed)

Morphology

The form / lemma ratio of VERB is 1.107143 (the average of all parts of speech is 1.102804).
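The reported ratio is consistent with dividing the number of distinct VERB types (31) by the number of distinct VERB lemmas (28). A minimal sketch of that arithmetic, assuming this is how the figure is derived (the function name is illustrative, not part of any UD tool):

```python
# Counts taken from the VERB statistics on this page.
verb_types = 31   # distinct VERB word forms
verb_lemmas = 28  # distinct VERB lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    if n_lemmas == 0:
        raise ValueError("cannot compute a ratio with zero lemmas")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(verb_types, verb_lemmas)
print(f"{ratio:.6f}")  # 1.107143
```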

The highest number of forms (3) was observed with the lemma “varēt”: Nevarēja, nevarēju, varēsi.

The 2nd highest number of forms (2) was observed with the lemma “tikt”: tika, tikt.

The 3rd highest number of forms (1) was observed with the lemma “apgriezt”: apgriezt.

VERB occurs with 14 features: Polarity (32; 100% instances), VerbForm (32; 100% instances), Voice (25; 78% instances), Mood (24; 75% instances), Person (24; 75% instances), Tense (24; 75% instances), Evident (23; 72% instances), Number (7; 22% instances), Reflex (4; 13% instances), Aspect (1; 3% instances), Case (1; 3% instances), Definite (1; 3% instances), Degree (1; 3% instances), Gender (1; 3% instances)
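Each percentage above appears to be the feature count divided by the 32 VERB tokens, rounded to a whole number. Reflex (4 of 32, exactly 12.5%) is shown as 13%, which suggests halves are rounded up. The sketch below reproduces a subset of the figures under that assumption. The `percent` helper is hypothetical, and the rounding convention is inferred from the numbers rather than documented here.

```python
from decimal import Decimal, ROUND_HALF_UP

TOTAL_VERB_TOKENS = 32  # from this page: 32 VERB tokens

# A subset of the feature counts listed above.
feature_counts = {
    "Polarity": 32,
    "Voice": 25,
    "Mood": 24,
    "Evident": 23,
    "Number": 7,
    "Reflex": 4,
    "Aspect": 1,
}


def percent(count: int, total: int) -> int:
    """Whole-number percentage, rounding halves up (assumed convention)."""
    share = Decimal(count * 100) / Decimal(total)
    return int(share.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


for name, count in feature_counts.items():
    print(f"{name} ({count}; {percent(count, TOTAL_VERB_TOKENS)}% instances)")
```

Python's built-in `round(12.5)` returns 12 because it rounds halves to the nearest even number, so an explicit half-up rule is needed to match the 13% shown for Reflex.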

VERB occurs with 23 feature-value pairs: Aspect=Perf, Case=Nom, Definite=Ind, Degree=Pos, Evident=Fh, Gender=Fem, Mood=Imp, Mood=Ind, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Reflex=Yes, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 13 feature combinations. The most frequent feature combination is Evident=Fh|Mood=Ind|Person=3|Polarity=Pos|Tense=Past|VerbForm=Fin|Voice=Act (11 tokens). Examples: uzrakstīja, atstāja, ieguva, lika, nokrāsoja, nopirka, skrēja, tika, uzauga, vajadzēja
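A feature combination such as the one above follows the CoNLL-U FEATS convention: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. A minimal sketch of splitting such a string into a dictionary (the `parse_feats` helper is hypothetical, not a function from a UD library):

```python
def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a {feature: value} mapping."""
    if feats == "_":  # CoNLL-U marks an empty feature set with an underscore
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = (
    "Evident=Fh|Mood=Ind|Person=3|Polarity=Pos|Tense=Past|VerbForm=Fin|Voice=Act"
)
features = parse_feats(most_frequent)
print(features["Tense"], features["VerbForm"], len(features))  # Past Fin 7
```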

Relations

VERB nodes are attached to their parents using 7 different relations: root (17; 53% instances), xcomp (7; 22% instances), conj (3; 9% instances), ccomp (2; 6% instances), acl (1; 3% instances), advcl (1; 3% instances), csubj (1; 3% instances)

Parents of VERB nodes belong to 4 different categories: no parent, i.e. the VERB is the sentence root (17; 53% instances), VERB (13; 41% instances), NOUN (1; 3% instances), PROPN (1; 3% instances)

3 (9%) VERB nodes are leaves.

5 (16%) VERB nodes have one child.

1 (3%) VERB node has two children.

23 (72%) VERB nodes have three or more children.

The highest child degree of a VERB node is 6.
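The child-degree figures above can be derived by counting, for every VERB node, how many tokens name it as their head. The sketch below shows the idea on a hypothetical five-token sentence given as (id, head, upos) triples. The data and helper names are illustrative only and are not taken from UD_Latvian-Cairo.

```python
from collections import Counter

# Hypothetical toy sentence as (id, head, upos) triples; head 0 marks the root.
tokens = [
    (1, 2, "PRON"),
    (2, 0, "VERB"),
    (3, 2, "NOUN"),
    (4, 2, "VERB"),
    (5, 2, "PUNCT"),
]


def verb_child_degrees(tokens):
    """Map each VERB token id to the number of tokens attached to it."""
    children = Counter(head for _, head, _ in tokens)
    return {tid: children[tid] for tid, _, upos in tokens if upos == "VERB"}


def bucket(degree: int) -> str:
    """Group a child count into the four classes used on this page."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    return labels.get(degree, "three or more children")


degrees = verb_child_degrees(tokens)
buckets = Counter(bucket(d) for d in degrees.values())
print(degrees)                # {2: 4, 4: 0}
print(max(degrees.values()))  # 4
```

In the treebank itself the four classes contain 3, 5, 1 and 23 nodes, which together account for all 32 VERB tokens.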

Children of VERB nodes are attached using 15 different relations: punct (22; 24% instances), nsubj (18; 20% instances), obj (12; 13% instances), obl (8; 9% instances), xcomp (7; 8% instances), advmod (6; 7% instances), conj (5; 5% instances), cc (3; 3% instances), ccomp (2; 2% instances), iobj (2; 2% instances), mark (2; 2% instances), advcl (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), vocative (1; 1% instances)

Children of VERB nodes belong to 9 different parts of speech: PUNCT (22; 24% instances), NOUN (19; 21% instances), PRON (19; 21% instances), VERB (13; 14% instances), ADV (6; 7% instances), PROPN (6; 7% instances), CCONJ (3; 3% instances), SCONJ (2; 2% instances), PART (1; 1% instances)
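The two breakdowns of children, by relation and by part of speech, describe the same set of child nodes, so their counts should add up to the same total. A short consistency check using the figures above (the variable names are illustrative):

```python
# Counts of children of VERB nodes, copied from the two lists above.
children_by_relation = {
    "punct": 22, "nsubj": 18, "obj": 12, "obl": 8, "xcomp": 7,
    "advmod": 6, "conj": 5, "cc": 3, "ccomp": 2, "iobj": 2,
    "mark": 2, "advcl": 1, "csubj": 1, "discourse": 1, "vocative": 1,
}
children_by_pos = {
    "PUNCT": 22, "NOUN": 19, "PRON": 19, "VERB": 13, "ADV": 6,
    "PROPN": 6, "CCONJ": 3, "SCONJ": 2, "PART": 1,
}

total_by_relation = sum(children_by_relation.values())
total_by_pos = sum(children_by_pos.values())
print(total_by_relation, total_by_pos)  # 91 91
```

Both lists sum to 91 children, and the numbers of distinct keys (15 relations, 9 parts of speech) match the counts stated in the text.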