home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kazakh-KTB: POS Tags: VERB

There are 422 VERB lemmas (16%), 1089 VERB types (24%) and 1643 VERB tokens (16%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: де, бол, ал, кел, біл, баста, кет, бер, шық, көр

The 10 most frequent VERB types: деп, бастады, кетті, деді, алып, біледі, болды, бастап, келді, шықты

The 10 most frequent ambiguous lemmas: де (VERB 79, SCONJ 1, X 1), бол (AUX 125, VERB 50), ал (VERB 47, AUX 31, CCONJ 6, INTJ 2), кел (VERB 46, AUX 23), баста (VERB 35, ADP 1), бер (VERB 33, AUX 15), көр (VERB 29, NOUN 1), тұр (VERB 26, AUX 18), бар (ADJ 37, VERB 20), айт (VERB 19, NOUN 1)

The 10 most frequent ambiguous types: деп (VERB 42, X 1), алып (VERB 12, ADJ 1, AUX 1, X 1), болды (AUX 28, VERB 11), бастап (VERB 10, ADP 1), келді (VERB 10, AUX 4), береді (VERB 8, AUX 4), болған (AUX 8, VERB 8), басып (VERB 7, X 1), тұрады (VERB 7, AUX 2), алды (AUX 6, VERB 6)

Morphology

The form / lemma ratio of VERB is 2.580569 (the average of all parts of speech is 1.747153).

The 1st highest number of forms (24) was observed with the lemma “біл”: біл, білген, білгенге, білгенді, білгендік, білді, біледі, білейік, білесіз, білесің, білетін, білмедім, білмейді, білмеймін, білмек, білсе, білсең, білсін, білу, білуге, білінбей, білінбейді, білінген, біліп.

The 2nd highest number of forms (19) was observed with the lemma “ал”: ала, алады, алам, алды, алдым, алдық, алмаса, алмастан, алсам екен, алу, алуды, алуы, алуға, алынады, алынды, алып, алыңыз, алған, алғанын.

The 3rd highest number of forms (18) was observed with the lemma “кел”: келген, келгенде, келгендей, келгендеріңіз, келгенді, келгенін, келді, келдік, келе, келеді, келмей, келмейді, келмеуі, келуге, келуі, келіп, келіпті, келісті.

VERB occurs with 13 features: VerbForm (1643; 100% instances), Tense (913; 56% instances), Mood (816; 50% instances), Number (807; 49% instances), Person (803; 49% instances), Aspect (714; 43% instances), Case (275; 17% instances), Voice (266; 16% instances), Polarity (94; 6% instances), Number[psor] (72; 4% instances), Person[psor] (72; 4% instances), Evident (23; 1% instances), Polite (16; 1% instances)

VERB occurs with 40 feature-value pairs: Aspect=Hab, Aspect=Imp, Aspect=Perf, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Evident=Fh, Mood=Cnd, Mood=Des, Mood=Imp, Mood=Ind, Mood=Opt, Mood=Pot, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Plur,Sing, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polite=Form, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, Voice=Pass, Voice=Rcp

VERB occurs with 164 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin (243 tokens). Examples: кетті, деді, бастады, болды, шықты, өтті, келді, алды, құрды, атанды

Relations

VERB nodes are attached to their parents using 16 different relations: root (744; 45% instances), advcl (337; 21% instances), conj (107; 7% instances), acl (96; 6% instances), ccomp (87; 5% instances), acl:relcl (83; 5% instances), csubj (69; 4% instances), xcomp (59; 4% instances), parataxis (50; 3% instances), appos (2; 0% instances), compound (2; 0% instances), obl (2; 0% instances), orphan (2; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances)

Parents of VERB nodes belong to 7 different parts of speech: (744; 45% instances), VERB (582; 35% instances), NOUN (218; 13% instances), ADJ (81; 5% instances), PROPN (9; 1% instances), PRON (5; 0% instances), NUM (4; 0% instances)

108 (7%) VERB nodes are leaves.

326 (20%) VERB nodes have one child.

314 (19%) VERB nodes have two children.

895 (54%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 29 different relations: punct (1144; 26% instances), nsubj (738; 16% instances), obl (683; 15% instances), obj (514; 11% instances), advcl (317; 7% instances), advmod (191; 4% instances), aux (160; 4% instances), nmod (130; 3% instances), ccomp (117; 3% instances), conj (97; 2% instances), xcomp (77; 2% instances), dep (76; 2% instances), cc (62; 1% instances), parataxis (46; 1% instances), discourse (28; 1% instances), csubj (24; 1% instances), case (22; 0% instances), cop (16; 0% instances), compound:lvc (7; 0% instances), nmod:poss (6; 0% instances), vocative (6; 0% instances), iobj (4; 0% instances), amod (3; 0% instances), compound (3; 0% instances), mark (3; 0% instances), acl:relcl (2; 0% instances), acl (1; 0% instances), det (1; 0% instances), obl:own (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (1644; 37% instances), PUNCT (1144; 26% instances), VERB (582; 13% instances), PRON (289; 6% instances), AUX (177; 4% instances), PROPN (169; 4% instances), ADV (155; 3% instances), ADJ (108; 2% instances), X (76; 2% instances), CCONJ (54; 1% instances), ADP (22; 0% instances), NUM (20; 0% instances), PART (19; 0% instances), INTJ (9; 0% instances), SCONJ (9; 0% instances), DET (2; 0% instances)