
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: VERB

There are 277 VERB lemmas (10%), 514 VERB types (10%) and 1841 VERB tokens (5%). Out of 15 observed tags, VERB ranks 4th in number of lemmas, 4th in number of types and 7th in number of tokens.

The 10 most frequent VERB lemmas: obsahovat, použít, moci, mít, účtovat, uvést, rozumět, sestavovat, vést, stanovit

The 10 most frequent VERB types: obsahuje, rozumí, může, uvede, mohou, použijí, stanoví, vést, musí, účtuje

The most frequent ambiguous lemmas (only one observed): stát (NOUN 40, VERB 7)

The most frequent ambiguous types (only four observed): delší (ADJ 15, VERB 4), koupí (NOUN 2, VERB 2), daní (NOUN 2, VERB 1), vlastní (ADJ 18, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.855596 (the average of all parts of speech is 1.723629).
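The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given above. A minimal sketch of that arithmetic, assuming this is how the figure is derived (the variable names are illustrative only):

```python
# Counts taken from the summary at the top of this page.
verb_types = 514
verb_lemmas = 277

# Assumption: the ratio is the number of distinct word forms (types) per lemma.
form_lemma_ratio = verb_types / verb_lemmas

print(f"{form_lemma_ratio:.6f}")  # 1.855596
```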

The highest number of forms (8) was observed with the lemma “účtovat”: neúčtovala, neúčtovat, neúčtuje, neúčtují, účtovala, účtovat, účtuje, účtují.

The second-highest number of forms (7) was observed with the lemma “moci”: mohl, mohla, mohlo, mohou, může, nemohou, nemůže.

The same number of forms (7) was observed with the lemma “mít”: mají, má, mít, měla, nemají, nemá, neměla.

VERB occurs with 11 features: Polarity (1841; 100% instances), VerbForm (1841; 100% instances), Number (1527; 83% instances), Tense (1527; 83% instances), Voice (1527; 83% instances), Mood (1411; 77% instances), Person (1411; 77% instances), Gender (116; 6% instances), Animacy (37; 2% instances), Aspect (1; 0% instances), Style (1; 0% instances)

VERB occurs with 22 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

VERB occurs with 17 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (819 tokens). Examples: obsahuje, rozumí, může, uvede, stanoví, účtuje, lze, musí, má, použije
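Counts of this kind can be reproduced from a CoNLL-U file by tallying the FEATS column of every token whose UPOS is VERB. The following is a rough sketch under stated assumptions, not the script that generated this page: the function name `verb_feature_stats` is hypothetical, and the three rows are illustrative tokens rather than a sentence from the treebank.

```python
from collections import Counter


def verb_feature_stats(conllu_text, upos="VERB"):
    """Return (token count, per-feature counts, per-combination counts)."""
    total = 0
    feature_names = Counter()
    combinations = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] != upos:
            continue
        total += 1
        feats = cols[5]
        combinations[feats] += 1
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            feature_names[pair.split("=", 1)[0]] += 1
    return total, feature_names, combinations


# Illustrative rows only (hypothetical, not taken from the treebank).
rows = [
    ["1", "zákon", "zákon", "NOUN", "_", "Case=Nom|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "obsahuje", "obsahovat", "VERB", "_",
     "Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act",
     "0", "root", "_", "_"],
    ["3", "účtovat", "účtovat", "VERB", "_", "Polarity=Pos|VerbForm=Inf", "2", "xcomp", "_", "_"],
]
sample = "\n".join("\t".join(row) for row in rows)

total, feature_names, combinations = verb_feature_stats(sample)
print(total, feature_names["Polarity"], feature_names["Mood"])
```

Dividing each entry of `feature_names` by `total` gives the kind of percentage shown in the feature list above.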

Relations

VERB nodes are attached to their parents using 13 different relations: root (727; 39% instances), acl:relcl (356; 19% instances), conj (252; 14% instances), xcomp (200; 11% instances), advcl (99; 5% instances), acl (68; 4% instances), csubj (55; 3% instances), parataxis (53; 3% instances), dep (15; 1% instances), ccomp (13; 1% instances), appos (1; 0% instances), csubj:pass (1; 0% instances), orphan (1; 0% instances)

Parents of VERB nodes belong to 8 different parts of speech: no parent, i.e. root (727; 39% instances), NOUN (475; 26% instances), VERB (475; 26% instances), ADJ (136; 7% instances), DET (15; 1% instances), X (10; 1% instances), AUX (2; 0% instances), ADV (1; 0% instances)
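The parent counts can be cross-checked against the other figures on this page: they should add up to the 1841 VERB tokens, and the first entry (727), which has no parent part of speech, should equal the number of root attachments. A small sketch of that check, with the dictionary key `"(no parent)"` chosen here purely as a label:

```python
# Parent part-of-speech counts as listed above.
parent_counts = {
    "(no parent)": 727,  # label assumed here; these nodes attach as root
    "NOUN": 475,
    "VERB": 475,
    "ADJ": 136,
    "DET": 15,
    "X": 10,
    "AUX": 2,
    "ADV": 1,
}
verb_tokens = 1841       # total VERB tokens reported on this page
root_attachments = 727   # VERB nodes attached with the root relation

total_parents = sum(parent_counts.values())
print(total_parents == verb_tokens)                      # True
print(parent_counts["(no parent)"] == root_attachments)  # True
```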

3 (0%) VERB nodes are leaves.

142 (8%) VERB nodes have one child.

198 (11%) VERB nodes have two children.

1498 (81%) VERB nodes have three or more children.

The highest child degree of a VERB node is 15.
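The percentages in the child-degree breakdown follow from the four counts above, which together cover all 1841 VERB tokens. A short sketch of the rounding, assuming the shares are rounded to the nearest whole percent:

```python
# Number of VERB nodes by number of children, as listed above.
degree_counts = {
    "leaves": 3,
    "one child": 142,
    "two children": 198,
    "three or more children": 1498,
}

total = sum(degree_counts.values())

# Assumption: shares are rounded to the nearest whole percent.
shares = {label: round(100 * count / total) for label, count in degree_counts.items()}

print(total)   # 1841
print(shares)  # {'leaves': 0, 'one child': 8, 'two children': 11, 'three or more children': 81}
```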

Children of VERB nodes are attached using 27 different relations: punct (1915; 26% instances), obl (1137; 16% instances), nsubj (871; 12% instances), obj (838; 11% instances), obl:arg (404; 6% instances), expl:pass (355; 5% instances), nsubj:pass (312; 4% instances), conj (261; 4% instances), advmod (239; 3% instances), cc (177; 2% instances), advcl (147; 2% instances), mark (145; 2% instances), xcomp (136; 2% instances), nmod (102; 1% instances), expl:pv (80; 1% instances), ccomp (55; 1% instances), csubj (37; 1% instances), parataxis (33; 0% instances), aux (24; 0% instances), dep (23; 0% instances), advmod:emph (3; 0% instances), csubj:pass (3; 0% instances), iobj (3; 0% instances), appos (2; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), det (1; 0% instances)

Children of VERB nodes belong to 12 different parts of speech: NOUN (3068; 42% instances), PUNCT (1915; 26% instances), PRON (528; 7% instances), VERB (475; 7% instances), DET (343; 5% instances), ADV (273; 4% instances), CCONJ (173; 2% instances), ADJ (159; 2% instances), X (159; 2% instances), SCONJ (145; 2% instances), NUM (36; 0% instances), AUX (31; 0% instances)