home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kazakh-KTB: POS Tags: ADV

There are 109 ADV lemmas (4%), 118 ADV types (3%) and 303 ADV tokens (3%). Out of 17 observed tags, the rank of ADV is: 6 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: да, қайда, бүгін, енді, қазір, ғана, тағы, қайдан, өте, тек

The 10 most frequent ADV types: да, де, қайда, бүгін, енді, тағы, ғана, қазір, қайдан, өте

The 10 most frequent ambiguous lemmas: да (ADV 66, CCONJ 15, SCONJ 2), тек (ADV 6, NOUN 2), қайта (ADV 6, X 1), қалай (ADV 6, X 1), аса (ADV 4, ADJ 1), бірге (ADV 4, ADP 1), ең (ADV 4, X 1), жақсы (ADJ 14, ADV 3, X 2), сайын (ADP 4, ADV 3), сондай-ақ (ADV 3, CCONJ 3)

The 10 most frequent ambiguous types: да (ADV 37, CCONJ 8, SCONJ 2), де (ADV 27, CCONJ 5, SCONJ 1, X 1), енді (ADV 4, VERB 3), тек (ADV 2, NOUN 1), қайта (ADV 6, VERB 2, X 1), қалай (ADV 5, X 1), аса (ADV 4, ADJ 1), бірге (ADV 4, ADP 1), ең (ADV 4, X 1), емес (AUX 12, ADV 3)

Morphology

The form / lemma ratio of ADV is 1.082569 (the average of all parts of speech is 1.743774).

The 1st highest number of forms (3) was observed with the lemma “да”: да, де, те.

The 2nd highest number of forms (2) was observed with the lemma “бері”: бергі, бері.

The 3rd highest number of forms (2) was observed with the lemma “бүгін”: Бүгінгі, бүгін.

ADV occurs with 3 features: PronType (32; 11% instances), Case (14; 5% instances), Degree (1; 0% instances)

ADV occurs with 3 feature-value pairs: Case=Nom, Degree=Cmp, PronType=Int

ADV occurs with 5 feature combinations. The most frequent feature combination is _ (257 tokens). Examples: да, де, бүгін, енді, тағы, ғана, қазір, өте, тек, қайта

Relations

ADV nodes are attached to their parents using 10 different relations: advmod (267; 88% instances), amod (13; 4% instances), root (9; 3% instances), ccomp (4; 1% instances), compound (3; 1% instances), discourse (3; 1% instances), cc (1; 0% instances), conj (1; 0% instances), nmod (1; 0% instances), parataxis (1; 0% instances)

Parents of ADV nodes belong to 11 different parts of speech: VERB (154; 51% instances), NOUN (61; 20% instances), ADJ (43; 14% instances), PRON (14; 5% instances), (9; 3% instances), ADV (7; 2% instances), NUM (5; 2% instances), CCONJ (3; 1% instances), PROPN (3; 1% instances), AUX (2; 1% instances), SCONJ (2; 1% instances)

260 (86%) ADV nodes are leaves.

23 (8%) ADV nodes have one child.

16 (5%) ADV nodes have two children.

4 (1%) ADV nodes have three or more children.

The highest child degree of a ADV node is 3.

Children of ADV nodes are attached using 10 different relations: punct (30; 45% instances), nsubj (13; 19% instances), dep (10; 15% instances), advmod (4; 6% instances), cop (4; 6% instances), compound (2; 3% instances), aux (1; 1% instances), conj (1; 1% instances), nmod (1; 1% instances), vocative (1; 1% instances)

Children of ADV nodes belong to 7 different parts of speech: PUNCT (30; 45% instances), X (10; 15% instances), ADV (7; 10% instances), NOUN (7; 10% instances), AUX (5; 7% instances), PROPN (5; 7% instances), PRON (3; 4% instances)