This page pertains to UD version 2.

Treebank Statistics: UD_Kazakh-KTB: POS Tags: VERB

There are 421 VERB lemmas (16%), 1078 VERB types (23%) and 1587 VERB tokens (15%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: де, бол, ал, кел, біл, бер, көр, шық, тұр, тап

The 10 most frequent VERB types: деп, деді, алып, біледі, болды, бастап, келді, болған, өтті, береді

The 10 most frequent ambiguous lemmas: де (VERB 79, SCONJ 1, X 1), бол (AUX 123, VERB 51), ал (VERB 47, AUX 31, CCONJ 6, INTJ 2), кел (VERB 46, AUX 23), біл (VERB 40, AUX 1), бер (VERB 34, AUX 14), көр (VERB 26, AUX 3, NOUN 1), шық (VERB 26, AUX 5), тұр (VERB 24, AUX 20), баста (VERB 20, AUX 15, ADP 1)

The 10 most frequent ambiguous types: деп (VERB 42, X 1), алып (VERB 12, ADJ 1, AUX 1, X 1), болды (AUX 28, VERB 11), бастап (VERB 10, ADP 1), келді (VERB 10, AUX 4), болған (VERB 9, AUX 7), береді (VERB 8, AUX 4), шықты (VERB 8, AUX 2), басып (VERB 7, X 1), алды (AUX 6, VERB 6)

Morphology

The form / lemma ratio of VERB is 2.560570 (the average of all parts of speech is 1.743774).
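The ratio follows directly from the type and lemma counts given at the top of this page. A minimal check, assuming that "forms" here means distinct VERB word types (the variable names are illustrative):

```python
# Counts taken from the summary at the top of this page.
verb_lemmas = 421   # distinct VERB lemmas
verb_types = 1078   # distinct VERB word forms (types)

# Form / lemma ratio: average number of distinct forms per lemma.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # prints 2.560570
```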

The highest number of forms (23) was observed with the lemma “біл”: біл, білген, білгенге, білгенді, білгендік, білді, біледі, білейік, білесіз, білесің, білетін, білмедім, білмейді, білмеймін, білмек, білсе, білсең, білсін, білуге, білінбей, білінбейді, білінген, біліп.

The 2nd highest number of forms (19) was observed with the lemma “ал”: ала, алады, алам, алды, алдым, алдық, алмаса, алмастан, алсам екен, алу, алуды, алуы, алуға, алынады, алынды, алып, алыңыз, алған, алғанын.

The 3rd highest number of forms (18) was observed with the lemma “кел”: келген, келгенде, келгендей, келгендеріңіз, келгенді, келгенін, келді, келдік, келе, келеді, келмей, келмейді, келмеуі, келуге, келуі, келіп, келіпті, келісті.

VERB occurs with 13 features: VerbForm (1587; 100% instances), Tense (866; 55% instances), Mood (767; 48% instances), Number (758; 48% instances), Person (754; 48% instances), Aspect (704; 44% instances), Case (270; 17% instances), Voice (266; 17% instances), Polarity (91; 6% instances), Number[psor] (70; 4% instances), Person[psor] (70; 4% instances), Evident (20; 1% instances), Polite (16; 1% instances)

VERB occurs with 40 feature-value pairs: Aspect=Hab, Aspect=Imp, Aspect=Perf, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Evident=Fh, Mood=Cnd, Mood=Des, Mood=Imp, Mood=Ind, Mood=Opt, Mood=Pot, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Plur,Sing, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polite=Form, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, Voice=Pass, Voice=Rcp

VERB occurs with 162 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin (215 tokens). Examples: деді, болды, өтті, шықты, келді, алды, кетті, құрды, бастады, енді
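A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as in the FEATS column of a CoNLL-U file. The following is a small sketch of how such a string could be split into features; `parse_feats` is a hypothetical helper, not part of any UD tool, and it keeps multi-valued features such as `Number[psor]=Plur,Sing` as lists.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into {feature: [values]}.

    Hypothetical helper for illustration; "_" means "no features".
    """
    if feats == "_":
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, value = pair.split("=", 1)
        # A feature may carry several comma-separated values.
        parsed[name] = value.split(",")
    return parsed


most_frequent = parse_feats("Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin")
print(most_frequent["Tense"])                   # prints ['Past']
print(parse_feats("Number[psor]=Plur,Sing"))    # prints {'Number[psor]': ['Plur', 'Sing']}
```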

Relations

VERB nodes are attached to their parents using 16 different relations: root (740; 47% instances), advcl (336; 21% instances), conj (107; 7% instances), acl (95; 6% instances), ccomp (86; 5% instances), acl:relcl (84; 5% instances), csubj (69; 4% instances), parataxis (50; 3% instances), aux (6; 0% instances), xcomp (5; 0% instances), appos (2; 0% instances), obl (2; 0% instances), orphan (2; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances)
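The percentages in this list can be reproduced from the raw counts. The sketch below assumes that each percentage is the count divided by the total number of VERB tokens, rounded to the nearest integer; the dictionary simply restates the figures above.

```python
# Incoming relation counts for VERB nodes, copied from the list above.
relation_counts = {
    "root": 740, "advcl": 336, "conj": 107, "acl": 95, "ccomp": 86,
    "acl:relcl": 84, "csubj": 69, "parataxis": 50, "aux": 6, "xcomp": 5,
    "appos": 2, "obl": 2, "orphan": 2, "discourse": 1, "nmod": 1, "nsubj": 1,
}

# Every VERB token has exactly one incoming relation,
# so the counts should add up to the number of VERB tokens.
total = sum(relation_counts.values())
print(total)  # prints 1587

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
print(shares["root"], shares["advcl"], shares["conj"])  # prints 47 21 7
```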

Parents of VERB nodes belong to 7 different parts of speech (counting the artificial root, which has no tag, as one): [root] (740; 47% instances), VERB (529; 33% instances), NOUN (218; 14% instances), ADJ (81; 5% instances), PROPN (9; 1% instances), PRON (6; 0% instances), NUM (4; 0% instances)

89 (6%) VERB nodes are leaves.

292 (18%) VERB nodes have one child.

312 (20%) VERB nodes have two children.

894 (56%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.
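Child-degree figures like these can be derived from a CoNLL-U file by counting, for each VERB node, how many tokens name it as their head. The following is a rough sketch under stated assumptions: the function names are hypothetical, the two-sentence sample is a made-up toy and not treebank data, and the script that generated this page may work differently.

```python
from collections import Counter

# Toy CoNLL-U sample (invented for illustration, tab-separated columns).
SAMPLE = """\
1\tShe\tshe\tPRON\t_\t_\t2\tnsubj\t_\t_
2\treads\tread\tVERB\t_\t_\t0\troot\t_\t_
3\tbooks\tbook\tNOUN\t_\t_\t2\tobj\t_\t_
4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

1\tGo\tgo\tVERB\t_\t_\t0\troot\t_\t_
"""


def sentences(text):
    """Yield each sentence as a list of token rows (lists of columns)."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            if rows:
                yield rows
                rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Keep plain tokens; skip multiword ranges (1-2) and empty nodes (1.1).
            if cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows


def child_degrees(text, upos="VERB"):
    """Map child degree -> number of nodes with that UPOS tag."""
    degrees = Counter()
    for sent in sentences(text):
        # Column 7 (index 6) is HEAD: count how often each ID is a head.
        n_children = Counter(cols[6] for cols in sent)
        for cols in sent:
            if cols[3] == upos:
                degrees[n_children[cols[0]]] += 1
    return degrees


degrees = child_degrees(SAMPLE)
print(sorted(degrees.items()))  # prints [(0, 1), (3, 1)]
```

In the toy sample, "Go" is a leaf and "reads" has three children, so the highest child degree is 3.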

Children of VERB nodes are attached using 29 different relations: punct (1140; 26% instances), nsubj (734; 16% instances), obl (665; 15% instances), obj (514; 12% instances), advcl (315; 7% instances), aux (216; 5% instances), advmod (205; 5% instances), nmod (133; 3% instances), ccomp (117; 3% instances), conj (98; 2% instances), dep (77; 2% instances), cc (61; 1% instances), parataxis (46; 1% instances), discourse (28; 1% instances), csubj (24; 1% instances), case (22; 0% instances), xcomp (17; 0% instances), cop (16; 0% instances), compound:lvc (7; 0% instances), nmod:poss (6; 0% instances), vocative (6; 0% instances), iobj (4; 0% instances), amod (3; 0% instances), acl:relcl (2; 0% instances), mark (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), det (1; 0% instances), obl:own (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (1639; 37% instances), PUNCT (1140; 26% instances), VERB (529; 12% instances), PRON (287; 6% instances), AUX (227; 5% instances), PROPN (168; 4% instances), ADV (154; 3% instances), ADJ (105; 2% instances), X (77; 2% instances), CCONJ (50; 1% instances), ADP (22; 0% instances), NUM (20; 0% instances), PART (19; 0% instances), SCONJ (15; 0% instances), INTJ (9; 0% instances), DET (1; 0% instances)