home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: CCONJ

There are 35 CCONJ lemmas (0%), 40 CCONJ types (0%) and 73144 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: и, а, но, или, да, ни, то, однако, либо, зато

The 10 most frequent CCONJ types: и, а, но, или, да, ни, то, однако, либо, зато

The 10 most frequent ambiguous lemmas: и (CCONJ 47408, PART 5895, X 27, NOUN 26), а (CCONJ 10443, X 83, INTJ 74, NOUN 44, PART 31, ADP 1, SCONJ 1), но (CCONJ 7273, INTJ 1, X 1), да (CCONJ 1078, PART 677, X 4), ни (PART 1051, CCONJ 769, VERB 3, X 1), то (PRON 4713, SCONJ 990, PART 728, CCONJ 602, X 2, DET 1), однако (CCONJ 591, ADV 128), либо (CCONJ 190, PART 4), зато (CCONJ 170, PART 5), причем (CCONJ 158, ADV 9)

The 10 most frequent ambiguous types: и (CCONJ 42913, PART 5881, X 27, NOUN 19, ADP 2), а (CCONJ 6039, X 83, PART 29, NOUN 26, INTJ 25, ADP 6), но (CCONJ 3891, X 2, INTJ 1), да (CCONJ 548, PART 306, X 4, INTJ 1), ни (PART 954, CCONJ 707, VERB 3, PRON 2, ADV 1, DET 1, X 1), то (PRON 1530, SCONJ 973, PART 727, CCONJ 547, DET 421, X 130, ADV 1), однако (ADV 126, CCONJ 79), либо (CCONJ 185, X 10, PART 4), зато (CCONJ 58, PART 5), причем (CCONJ 96, ADV 4)

Morphology

The form / lemma ratio of CCONJ is 1.142857 (the average of all parts of speech is 2.706171).

The 1st highest number of forms (2) was observed with the lemma “и”: и, ин.

The 2nd highest number of forms (2) was observed with the lemma “или”: Иди, или.

The 3rd highest number of forms (2) was observed with the lemma “либо”: лбо, либо.

CCONJ occurs with 3 features: ExtPos (1515; 2% instances), Polarity (769; 1% instances), Typo (10; 0% instances)

CCONJ occurs with 8 feature-value pairs: ExtPos=ADJ, ExtPos=ADV, ExtPos=CCONJ, ExtPos=NOUN, ExtPos=PART, ExtPos=VERB, Polarity=Neg, Typo=Yes

CCONJ occurs with 10 feature combinations. The most frequent feature combination is _ (70851 tokens). Examples: и, а, но, или, да, однако, то, либо, зато, причем

Relations

CCONJ nodes are attached to their parents using 18 different relations: cc (72266; 99% instances), fixed (740; 1% instances), conj (53; 0% instances), root (28; 0% instances), parataxis:discourse (13; 0% instances), advmod (10; 0% instances), appos (9; 0% instances), parataxis (6; 0% instances), orphan (5; 0% instances), nsubj (3; 0% instances), discourse (2; 0% instances), mark (2; 0% instances), obl (2; 0% instances), advcl (1; 0% instances), case (1; 0% instances), dep (1; 0% instances), flat:name (1; 0% instances), nmod (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (32640; 45% instances), NOUN (22484; 31% instances), ADJ (8332; 11% instances), PROPN (2488; 3% instances), ADV (2413; 3% instances), PRON (1670; 2% instances), DET (1402; 2% instances), CCONJ (696; 1% instances), PART (348; 0% instances), NUM (280; 0% instances), X (260; 0% instances), INTJ (44; 0% instances), (28; 0% instances), AUX (27; 0% instances), ADP (25; 0% instances), SYM (6; 0% instances), SCONJ (1; 0% instances)

71525 (98%) CCONJ nodes are leaves.

1504 (2%) CCONJ nodes have one child.

94 (0%) CCONJ nodes have two children.

21 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 6.

Children of CCONJ nodes are attached using 15 different relations: fixed (1584; 89% instances), punct (140; 8% instances), parataxis (13; 1% instances), conj (11; 1% instances), goeswith (5; 0% instances), amod (2; 0% instances), case (2; 0% instances), cc (2; 0% instances), det (2; 0% instances), discourse (2; 0% instances), flat:name (2; 0% instances), nummod (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 15 different parts of speech: CCONJ (696; 39% instances), ADV (340; 19% instances), PART (319; 18% instances), PRON (237; 13% instances), PUNCT (140; 8% instances), VERB (12; 1% instances), DET (5; 0% instances), X (5; 0% instances), ADJ (4; 0% instances), ADP (3; 0% instances), NUM (3; 0% instances), INTJ (2; 0% instances), NOUN (2; 0% instances), PROPN (1; 0% instances), SYM (1; 0% instances)