Treebank Statistics: UD_Czech-PDTC: POS Tags: VERB
There are 7436 VERB lemmas (8%), 32399 VERB types (17%) and 319942 VERB tokens (9%).
Out of 17 observed tags, the rank of VERB is: 5 in number of lemmas, 3 in number of types and 5 in number of tokens.
The 10 most frequent VERB lemmas: mít, moci, říci, muset, říkat, uvést, jít, chtít, stát, vědět
The 10 most frequent VERB types: má, řekl, říká, měl, měli, měla, může, mají, uvedla, uvedl
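The lemma, type and token counts above can be reproduced in outline from a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column tab-separated CoNLL-U layout (FORM in column 2, LEMMA in column 3, UPOS in column 4) and assuming that types are counted case-sensitively, which this page does not state. The sample sentences and the helper names `row` and `pos_counters` are hypothetical and not taken from the treebank.

```python
from collections import Counter

def row(*cols):
    """Join the ten CoNLL-U columns with tabs (illustrative helper)."""
    return "\t".join(cols)

# Hypothetical two-sentence sample, not taken from UD_Czech-PDTC.
SAMPLE = "\n".join([
    "# text = Má jít.",
    row("1", "Má", "mít", "VERB", "_", "_", "0", "root", "_", "_"),
    row("2", "jít", "jít", "VERB", "_", "_", "1", "xcomp", "_", "_"),
    row("3", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    "",
    "# text = Stát má stát.",
    row("1", "Stát", "stát", "NOUN", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "má", "mít", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "stát", "stát", "VERB", "_", "_", "2", "xcomp", "_", "_"),
    row("4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
])

def pos_counters(conllu_text, upos):
    """Return (lemma Counter, type Counter) for all words tagged `upos`."""
    lemmas, types = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            types[cols[1]] += 1   # FORM, case-sensitive (assumption)
            lemmas[cols[2]] += 1  # LEMMA
    return lemmas, types

lemmas, types = pos_counters(SAMPLE, "VERB")
print(len(lemmas), len(types), sum(types.values()))  # 3 4 4
print(lemmas.most_common(1))  # [('mít', 2)]
```

Keeping the counts in `Counter` objects makes the "most frequent" listings a single `most_common(10)` call on the same data.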
The 10 most frequent ambiguous lemmas: stát (NOUN 3056, VERB 3038), jet (VERB 1244, PROPN 6), vzrůst (VERB 960, NOUN 84), růst (NOUN 856, VERB 381), hnát (VERB 37, NOUN 1), drát (NOUN 49, VERB 6), pět (NUM 1420, VERB 3), nakupit (VERB 2, NOUN 1), obrat (NOUN 671, VERB 2), srůst (VERB 2, NOUN 1)
The 10 most frequent ambiguous types: má (VERB 4262, DET 49), stát (VERB 441, NOUN 428), vlastní (ADJ 855, VERB 380), myslí (VERB 197, NOUN 4), moci (NOUN 202, VERB 198), pomoci (NOUN 216, VERB 176), jet (VERB 122, PROPN 6), trvalo (VERB 82, NOUN 1), žil (VERB 81, NOUN 1), stálo (VERB 84, NOUN 5)
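An "ambiguous" lemma in the listings above is one that occurs with VERB and with at least one other tag. As a rough illustration, the sketch below groups (lemma, tag) pairs and keeps those lemmas; the function name `ambiguous_lemmas` and the toy counts are hypothetical, and the ranking used on this page is not reproduced.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(pairs, upos="VERB"):
    """Map each lemma seen with `upos` and another tag to its tag counts."""
    by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        by_lemma[lemma][tag] += 1
    return {
        lemma: tags.most_common()  # tags sorted by descending frequency
        for lemma, tags in by_lemma.items()
        if upos in tags and len(tags) > 1
    }

# Toy data with made-up counts (not the treebank figures).
pairs = (
    [("stát", "NOUN")] * 3 + [("stát", "VERB")] * 2
    + [("mít", "VERB")] * 4
    + [("pět", "NUM")] * 2 + [("pět", "VERB")]
)

for lemma, tags in ambiguous_lemmas(pairs).items():
    print(lemma, tags)
# stát [('NOUN', 3), ('VERB', 2)]
# pět [('NUM', 2), ('VERB', 1)]
```

The same function works for ambiguous types if word forms are passed in place of lemmas.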
Morphology
The form / lemma ratio of VERB is 4.357047 (the average of all parts of speech is 2.169184).
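The page does not define the ratio explicitly, but the reported value is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of the page. A quick check, treating that reading as an assumption:

```python
# Figures reported on this page for VERB.
verb_lemmas = 7436
verb_types = 32399

# Assumed definition: distinct word forms (types) per distinct lemma.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 4.357047
```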
The highest number of forms (43) was observed with the lemma “jít”: Poďme, jde, jdem, jdeme, jdete, jdeš, jdi, jdou, jdu, jděte, jít, nejde, nejdeme, nejdeš, nejdou, nejdu, nejít, nepůjde, nepůjdeme, nepůjdete, nepůjdeš, nepůjdou, nepůjdu, nešel, nešla, nešli, nešlo, nešly, pojď, pojďme, pojďte, půjde, půjdem, půjdeme, půjdete, půjdeš, půjdou, půjdu, šel, šla, šli, šlo, šly.
The 2nd highest number of forms (42) was observed with the lemma “stát”: Staňte, Stojíš, nestal, nestala, nestali, nestalo, nestaly, nestane, nestanou, nestojí, nestojím, nestojíme, nestojíte, nestál, nestála, nestáli, nestálo, nestály, nestát, stal, stala, stali, stalo, staly, stane, staneme, stanete, stanou, stanu, stoje, stojí, stojím, stojíme, stojíte, stál, stála, stáli, stálo, stály, stát, státi, stůj.
The 3rd highest number of forms (36) was observed with the lemma “mít”: Neměj, Nemějme, maje, maji, mají, majíce, mam, má, mám, máme, máte, máš, mít, míti, měj, mějme, mějte, měl, měla, měli, mělo, měly, nemaje, nemají, nemá, nemám, nemáme, nemáte, nemáš, nemít, nemějte, neměl, neměla, neměli, nemělo, neměly.
VERB occurs with 14 features: Aspect (319942; 100% instances), Polarity (319942; 100% instances), VerbForm (319942; 100% instances), Number (270743; 85% instances), Tense (268298; 84% instances), Voice (268298; 84% instances), Gender (144213; 45% instances), Person (126509; 40% instances), Mood (126508; 40% instances), Animacy (41972; 13% instances), Style (487; 0% instances), ExtPos (9; 0% instances), Abbr (5; 0% instances), Typo (4; 0% instances)
VERB occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Imp,Perf, Aspect=Perf, ExtPos=ADP, ExtPos=ADV, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Style=Expr, Style=Slng, Style=Vrnc, Style=Vulg, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act
VERB occurs with 174 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (50175 tokens).
Examples: má, říká, může, musí, jde, lze, znamená, očekává, chce, tvrdí
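A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, where a value may itself be a comma-separated set (for example `Aspect=Imp,Perf`). The sketch below splits such a string into a dictionary; `parse_feats` is an illustrative helper name, not part of any UD tool.

```python
def parse_feats(feats):
    """Split a FEATS string into {feature: [values]}; "_" means no features."""
    if feats == "_":
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, value = pair.split("=", 1)
        parsed[name] = value.split(",")  # multi-valued features become lists
    return parsed

combo = ("Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|"
         "Tense=Pres|VerbForm=Fin|Voice=Act")
feats = parse_feats(combo)
print(len(feats))       # 8
print(feats["Tense"])   # ['Pres']
print(parse_feats("Aspect=Imp,Perf")["Aspect"])  # ['Imp', 'Perf']
```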
Relations
VERB nodes are attached to their parents using 20 different relations: root (141844; 44% instances), conj (45147; 14% instances), xcomp (29403; 9% instances), acl:relcl (28951; 9% instances), ccomp (25003; 8% instances), advcl (22805; 7% instances), acl (11390; 4% instances), csubj (8373; 3% instances), parataxis (2768; 1% instances), appos (1631; 1% instances), dep (1311; 0% instances), csubj:pass (932; 0% instances), advcl:pred (152; 0% instances), orphan (130; 0% instances), fixed (71; 0% instances), discourse (10; 0% instances), case (9; 0% instances), advmod (6; 0% instances), mark (4; 0% instances), compound (2; 0% instances)
Parents of VERB nodes belong to 17 different parts of speech, counting the artificial root node as one of them: ROOT (141844; 44% instances), VERB (109891; 34% instances), NOUN (38713; 12% instances), ADJ (10822; 3% instances), DET (6985; 2% instances), ADV (4957; 2% instances), PROPN (2516; 1% instances), AUX (1659; 1% instances), PRON (861; 0% instances), PART (833; 0% instances), NUM (509; 0% instances), X (157; 0% instances), CCONJ (109; 0% instances), INTJ (49; 0% instances), SYM (23; 0% instances), ADP (12; 0% instances), SCONJ (2; 0% instances)
3024 (1%) VERB nodes are leaves.
21361 (7%) VERB nodes have one child.
36830 (12%) VERB nodes have two children.
258727 (81%) VERB nodes have three or more children.
The highest child degree of a VERB node is 19.
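As a sanity check, the four child-degree classes above should partition all 319942 VERB tokens, and the percentages should follow from rounding each share to a whole number. A short verification using only the figures on this page:

```python
# Child-degree counts for VERB nodes as reported above.
degree_counts = {
    "leaves": 3024,
    "one child": 21361,
    "two children": 36830,
    "three or more children": 258727,
}

total = sum(degree_counts.values())
shares = {name: round(100 * n / total) for name, n in degree_counts.items()}

print(total)   # 319942
print(shares)  # {'leaves': 1, 'one child': 7, 'two children': 12, 'three or more children': 81}
```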
Children of VERB nodes are attached using 37 different relations: punct (275548; 23% instances), nsubj (149258; 12% instances), obl (129337; 11% instances), obj (125668; 10% instances), advmod (81299; 7% instances), obl:arg (65677; 5% instances), mark (51147; 4% instances), aux (47851; 4% instances), conj (46450; 4% instances), expl:pv (44402; 4% instances), cc (42334; 4% instances), ccomp (33939; 3% instances), xcomp (30470; 3% instances), advmod:emph (23870; 2% instances), advcl (19633; 2% instances), expl:pass (9782; 1% instances), advcl:pred (7049; 1% instances), nsubj:pass (5342; 0% instances), dep (4383; 0% instances), csubj (4004; 0% instances), iobj (2220; 0% instances), parataxis (2155; 0% instances), appos (1612; 0% instances), csubj:pass (928; 0% instances), discourse (704; 0% instances), vocative (333; 0% instances), nmod (98; 0% instances), orphan (44; 0% instances), case (18; 0% instances), fixed (11; 0% instances), cop (7; 0% instances), compound (5; 0% instances), det (4; 0% instances), amod (3; 0% instances), acl (2; 0% instances), reparandum (2; 0% instances), acl:relcl (1; 0% instances)
Children of VERB nodes belong to 17 different parts of speech: NOUN (348897; 29% instances), PUNCT (275548; 23% instances), VERB (109891; 9% instances), PRON (102819; 9% instances), ADV (94404; 8% instances), SCONJ (50491; 4% instances), AUX (49140; 4% instances), DET (48620; 4% instances), CCONJ (42556; 4% instances), PROPN (32593; 3% instances), ADJ (20346; 2% instances), PART (18912; 2% instances), NUM (9155; 1% instances), X (1497; 0% instances), SYM (405; 0% instances), ADP (162; 0% instances), INTJ (154; 0% instances)
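The relation statistics in this section can be tallied per sentence from the HEAD and DEPREL columns. The following is a minimal sketch over a single hypothetical sentence, given as simplified (id, form, upos, head, deprel) tuples rather than full CoNLL-U rows; the function name `verb_relations` and the annotation of the sample are illustrative assumptions.

```python
from collections import Counter

# Hypothetical sentence "Stát má stát." with illustrative annotation.
SENTENCE = [
    (1, "Stát", "NOUN", 2, "nsubj"),
    (2, "má", "VERB", 0, "root"),
    (3, "stát", "VERB", 2, "xcomp"),
    (4, ".", "PUNCT", 2, "punct"),
]

def verb_relations(sentence):
    """Tally relations into and out of VERB nodes, plus child degrees."""
    upos_by_id = {tok_id: upos for tok_id, _, upos, _, _ in sentence}
    incoming = Counter()  # relations attaching VERB nodes to their parents
    outgoing = Counter()  # relations attaching children to VERB parents
    degree = Counter()    # number of children per VERB node id
    for tok_id, _, upos, head, deprel in sentence:
        if upos == "VERB":
            incoming[deprel] += 1
            degree[tok_id] += 0  # register the node so leaves show up with 0
        if upos_by_id.get(head) == "VERB":  # head 0 (root) has no tag
            outgoing[deprel] += 1
            degree[head] += 1
    return incoming, outgoing, degree

incoming, outgoing, degree = verb_relations(SENTENCE)
print(dict(incoming))  # {'root': 1, 'xcomp': 1}
print(dict(outgoing))  # {'nsubj': 1, 'xcomp': 1, 'punct': 1}
print(dict(degree))    # {2: 3, 3: 0}
```

Summing these counters over all sentences would give totals of the kind listed above; a VERB node with degree 0 is a leaf.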