home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: CCONJ

There are 28 CCONJ lemmas (0%), 32 CCONJ types (0%) and 8148 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: и, но, а, или, да, ни, то, однако, либо, причем

The 10 most frequent CCONJ types: и, но, а, или, да, ни, то, однако, либо, зато

The 10 most frequent ambiguous lemmas: и (CCONJ 4902, PART 547, X 4, NOUN 2), а (CCONJ 1095, INTJ 19, X 3, NOUN 1, PART 1, SCONJ 1), да (CCONJ 106, PART 106), ни (PART 106, CCONJ 92, VERB 1), то (PRON 464, PART 190, SCONJ 187, CCONJ 54, DET 1), однако (CCONJ 47, ADV 9), либо (CCONJ 43, PART 9), причем (CCONJ 20, ADV 3), зато (CCONJ 17, PART 4), плюс (NOUN 25, CCONJ 10, ADP 5)

The 10 most frequent ambiguous types: и (CCONJ 4395, PART 544, X 4, ADP 2, NOUN 2), а (CCONJ 694, INTJ 18, ADP 5, X 3, PART 1), да (CCONJ 68, PART 53), ни (PART 98, CCONJ 82, VERB 1), то (PART 190, SCONJ 178, PRON 174, CCONJ 49, DET 23, ADV 1), однако (CCONJ 13, ADV 8), либо (CCONJ 41, PART 9), зато (CCONJ 8, PART 4), причем (CCONJ 7, ADV 2), плюс (NOUN 15, CCONJ 6, ADP 3)

Morphology

The form / lemma ratio of CCONJ is 1.142857 (the average of all parts of speech is 1.875784).

The 1st highest number of forms (2) was observed with the lemma “и”: и, ин.

The 2nd highest number of forms (2) was observed with the lemma “или”: Иди, или.

The 3rd highest number of forms (2) was observed with the lemma “либо”: лбо, либо.

CCONJ occurs with 2 features: Polarity (91; 1% instances), Typo (4; 0% instances)

CCONJ occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (8053 tokens). Examples: и, но, а, или, да, то, однако, либо, зато, причем

Relations

CCONJ nodes are attached to their parents using 14 different relations: cc (8005; 98% instances), fixed (56; 1% instances), advmod (55; 1% instances), conj (9; 0% instances), root (8; 0% instances), orphan (4; 0% instances), mark (3; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), case (1; 0% instances), discourse (1; 0% instances), goeswith (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (3671; 45% instances), NOUN (2487; 31% instances), ADJ (956; 12% instances), ADV (331; 4% instances), PROPN (203; 2% instances), PRON (198; 2% instances), DET (85; 1% instances), NUM (61; 1% instances), PART (60; 1% instances), CCONJ (38; 0% instances), AUX (19; 0% instances), X (17; 0% instances), INTJ (8; 0% instances), (8; 0% instances), SYM (4; 0% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)

7968 (98%) CCONJ nodes are leaves.

166 (2%) CCONJ nodes have one child.

9 (0%) CCONJ nodes have two children.

5 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 6.

Children of CCONJ nodes are attached using 13 different relations: fixed (133; 65% instances), punct (50; 24% instances), goeswith (6; 3% instances), parataxis (4; 2% instances), cc (2; 1% instances), det (2; 1% instances), nummod (2; 1% instances), advcl (1; 0% instances), amod (1; 0% instances), conj (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 13 different parts of speech: PART (72; 35% instances), PUNCT (50; 24% instances), CCONJ (38; 19% instances), PRON (18; 9% instances), ADV (12; 6% instances), VERB (4; 2% instances), DET (3; 1% instances), NUM (3; 1% instances), ADJ (1; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances), NOUN (1; 0% instances), SYM (1; 0% instances)