home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kazakh-KTB: POS Tags: CCONJ

There are 15 CCONJ lemmas (1%), 20 CCONJ types (0%) and 181 CCONJ tokens (2%). Out of 17 observed tags, the rank of CCONJ is: 13 in number of lemmas, 12 in number of types and 11 in number of tokens.

The 10 most frequent CCONJ lemmas: және, мен, бірақ, да, немесе, ал, не, сондай-ақ, әлде, яки

The 10 most frequent CCONJ types: және, мен, бірақ, немесе, пен, да, ал, де, Сондай-ақ, не

The 10 most frequent ambiguous lemmas: мен (PRON 47, CCONJ 44), да (ADV 66, CCONJ 15, SCONJ 2), ал (VERB 47, AUX 31, CCONJ 6, INTJ 2), не (PRON 21, DET 7, CCONJ 3, X 1), сондай-ақ (ADV 3, CCONJ 3), әрі (CCONJ 2, SCONJ 1)

The 10 most frequent ambiguous types: мен (CCONJ 34, PRON 7), да (ADV 37, CCONJ 8, SCONJ 2), ал (CCONJ 3, INTJ 1), де (ADV 27, CCONJ 5, SCONJ 1, X 1), не (PRON 12, DET 7, CCONJ 2, X 1), болмаса (AUX 3, CCONJ 1), те (ADV 2, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.333333 (the average of all parts of speech is 1.743774).

The 1st highest number of forms (4) was observed with the lemma “да”: да, де, та, те.

The 2nd highest number of forms (3) was observed with the lemma “мен”: бен, мен, пен.

The 3rd highest number of forms (1) was observed with the lemma “ал”: ал.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 2 different relations: cc (180; 99% instances), conj (1; 1% instances)

Parents of CCONJ nodes belong to 6 different parts of speech: NOUN (83; 46% instances), VERB (50; 28% instances), ADJ (31; 17% instances), PROPN (15; 8% instances), NUM (1; 1% instances), PRON (1; 1% instances)

169 (93%) CCONJ nodes are leaves.

12 (7%) CCONJ nodes have one child.

The highest child degree of a CCONJ node is 1.

Children of CCONJ nodes are attached using 3 different relations: punct (8; 67% instances), advmod (3; 25% instances), dep (1; 8% instances)

Children of CCONJ nodes belong to 3 different parts of speech: PUNCT (8; 67% instances), ADV (3; 25% instances), X (1; 8% instances)