home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: CCONJ

There are 27 CCONJ lemmas (0%), 32 CCONJ types (0%) and 8147 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: и, но, а, или, да, ни, то, однако, либо, причем

The 10 most frequent CCONJ types: и, но, а, или, да, ни, то, однако, либо, зато

The 10 most frequent ambiguous lemmas: и (CCONJ 4902, PART 547, X 4, NOUN 2), а (CCONJ 1095, INTJ 19, X 2, NOUN 1, PART 1, SCONJ 1), да (CCONJ 106, PART 105), ни (PART 104, CCONJ 90, VERB 1), то (PRON 464, SCONJ 185, PART 66, CCONJ 54, DET 1), однако (CCONJ 47, ADV 9), причем (CCONJ 20, ADV 3), зато (CCONJ 17, PART 4), плюс (NOUN 25, CCONJ 10, ADP 5), также (ADV 51, PART 16, CCONJ 8)

The 10 most frequent ambiguous types: и (CCONJ 4395, PART 544, X 4, ADP 2, NOUN 2), но (CCONJ 856, X 1), а (CCONJ 694, INTJ 18, ADP 5, X 3, PART 1), да (CCONJ 68, PART 52, INTJ 1), ни (PART 96, CCONJ 80, PRON 2, ADV 1, DET 1, VERB 1), то (SCONJ 176, PRON 174, X 127, PART 65, CCONJ 49, DET 23, ADV 1), однако (CCONJ 13, ADV 8), либо (CCONJ 41, X 9), зато (CCONJ 8, PART 4), причем (CCONJ 7, ADV 2)

Morphology

The form / lemma ratio of CCONJ is 1.185185 (the average of all parts of speech is 1.879397).

The 1st highest number of forms (2) was observed with the lemma “и”: и, ин.

The 2nd highest number of forms (2) was observed with the lemma “или”: Иди, или.

The 3rd highest number of forms (2) was observed with the lemma “либо”: лбо, либо.

CCONJ occurs with 2 features: Polarity (89; 1% instances), Typo (9; 0% instances)

CCONJ occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (8049 tokens). Examples: и, но, а, или, да, то, однако, либо, зато, причем

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (8007; 98% instances), fixed (56; 1% instances), advmod (55; 1% instances), conj (8; 0% instances), root (8; 0% instances), orphan (4; 0% instances), mark (3; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), case (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (3673; 45% instances), NOUN (2488; 31% instances), ADJ (954; 12% instances), ADV (330; 4% instances), PROPN (203; 2% instances), PRON (198; 2% instances), DET (85; 1% instances), NUM (61; 1% instances), PART (59; 1% instances), CCONJ (38; 0% instances), AUX (19; 0% instances), X (17; 0% instances), INTJ (8; 0% instances), (8; 0% instances), SYM (4; 0% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)

7968 (98%) CCONJ nodes are leaves.

166 (2%) CCONJ nodes have one child.

9 (0%) CCONJ nodes have two children.

4 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 6.

Children of CCONJ nodes are attached using 12 different relations: fixed (133; 66% instances), punct (49; 24% instances), goeswith (5; 2% instances), parataxis (4; 2% instances), cc (2; 1% instances), det (2; 1% instances), nummod (2; 1% instances), advcl (1; 0% instances), amod (1; 0% instances), conj (1; 0% instances), discourse (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 13 different parts of speech: PART (68; 34% instances), PUNCT (49; 24% instances), CCONJ (38; 19% instances), PRON (17; 8% instances), ADV (12; 6% instances), X (5; 2% instances), VERB (4; 2% instances), NUM (3; 1% instances), DET (2; 1% instances), ADJ (1; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)