This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: POS Tags: CCONJ

There are 29 CCONJ lemmas (0%), 30 CCONJ types (0%) and 24205 CCONJ tokens (5%). Out of 16 observed tags, the rank of CCONJ is: 12 in number of lemmas, 13 in number of types and 7 in number of tokens.
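As a rough illustration of how such lemma, type and token counts can be derived, the sketch below reads CoNLL-U text and counts distinct lemmas, distinct word forms and total tokens for one UPOS tag. The sample sentence, the `SAMPLE` constant and the `pos_counts` helper are hypothetical and not taken from UD_Czech-CAC. It also assumes that a "type" is a distinct word form without case folding, which may differ from how the page's figures were produced.

```python
# Hypothetical mini-sample in CoNLL-U format; NOT taken from UD_Czech-CAC.
SAMPLE = (
    "# text = pes a kočka nebo myš anebo pták\n"
    "1\tpes\tpes\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\ta\ta\tCCONJ\t_\t_\t3\tcc\t_\t_\n"
    "3\tkočka\tkočka\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "4\tnebo\tnebo\tCCONJ\t_\t_\t5\tcc\t_\t_\n"
    "5\tmyš\tmyš\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "6\tanebo\tnebo\tCCONJ\t_\t_\t7\tcc\t_\t_\n"
    "7\tpták\tpták\tNOUN\t_\t_\t1\tconj\t_\t_\n"
)


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])  # assumption: forms are not case-folded
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "CCONJ"))  # (2, 3, 3)
```
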

The 10 most frequent CCONJ lemmas: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

The 10 most frequent CCONJ types: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

The 10 most frequent ambiguous lemmas: a (CCONJ 15539, ADP 4), i (CCONJ 3420, ADJ 3, NOUN 1), tak (ADV 816, CCONJ 217), budit (CCONJ 9, VERB 5), proto (SCONJ 567, CCONJ 9), (PART 517, SCONJ 36, CCONJ 6), ovšem (PART 211, ADV 14, CCONJ 5), jak (ADV 751, SCONJ 15, CCONJ 2)

The 10 most frequent ambiguous types: a (CCONJ 15101, ADP 3), i (CCONJ 3266, ADJ 3), tak (ADV 681, CCONJ 212), buď (CCONJ 85, AUX 1), proto (SCONJ 364, CCONJ 7), (PART 502, SCONJ 32, CCONJ 6), ovšem (PART 190, ADV 12, CCONJ 5), jak (ADV 654, SCONJ 14, CCONJ 2), na (ADP 6587, CCONJ 1), ni (PRON 36, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.034483 (the average of all parts of speech is 2.186309).

The 1st highest number of forms (2) was observed with the lemma “a”: a, na.

The 2nd highest number of forms (2) was observed with the lemma “nebo”: anebo, nebo.

The 3rd highest number of forms (1) was observed with the lemma “ale”: ale.
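The ranking of lemmas by number of forms, and the form / lemma ratio, can be sketched as follows. The sample data and the `forms_by_lemma` helper are hypothetical. The ratio is computed here as the total number of distinct forms per lemma divided by the number of lemmas, which is an assumption about how the page's figure was obtained; it is at least consistent with the reported 30 types and 29 lemmas.

```python
from collections import defaultdict

# Hypothetical mini-sample in CoNLL-U format; NOT taken from UD_Czech-CAC.
SAMPLE = (
    "1\tpes\tpes\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\ta\ta\tCCONJ\t_\t_\t3\tcc\t_\t_\n"
    "3\tkočka\tkočka\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "4\tnebo\tnebo\tCCONJ\t_\t_\t5\tcc\t_\t_\n"
    "5\tmyš\tmyš\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "6\tanebo\tnebo\tCCONJ\t_\t_\t7\tcc\t_\t_\n"
    "7\tpták\tpták\tNOUN\t_\t_\t1\tconj\t_\t_\n"
)


def forms_by_lemma(conllu_text, upos):
    """Map each lemma of the given UPOS tag to the set of its word forms."""
    forms = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit() and cols[3] == upos:
            forms[cols[2]].add(cols[1])
    return forms


forms = forms_by_lemma(SAMPLE, "CCONJ")

# Rank lemmas by number of forms (descending), breaking ties alphabetically.
ranked = sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))
for lemma, lemma_forms in ranked:
    print(lemma, len(lemma_forms), sorted(lemma_forms))

ratio = sum(len(f) for f in forms.values()) / len(forms)
print(f"{ratio:.6f}")  # 1.500000 for the sample

# The figures reported on this page: 30 types over 29 lemmas.
print(f"{30 / 29:.6f}")  # 1.034483
```
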

CCONJ occurs with 4 features: Aspect (9; 0% instances), Foreign (3; 0% instances), ConjType (2; 0% instances), NameType (1; 0% instances)

CCONJ occurs with 4 feature-value pairs: Aspect=Imp, ConjType=Oper, Foreign=Yes, NameType=Com

CCONJ occurs with 5 feature combinations. The most frequent feature combination is _ (24191 tokens). Examples: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

Relations

CCONJ nodes are attached to their parents using 15 different relations: cc (21799; 90% instances), advmod:emph (2147; 9% instances), mark (196; 1% instances), advmod (24; 0% instances), nmod (10; 0% instances), root (10; 0% instances), dep (7; 0% instances), conj (4; 0% instances), case (2; 0% instances), cop (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), nsubj (1; 0% instances), nsubj:pass (1; 0% instances), orphan (1; 0% instances)
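The percentages above can be reproduced from the raw counts. The short check below copies the counts from the list and assumes that each share is rounded to the nearest whole percent, which is why relations with only a handful of instances show as 0%.

```python
# Counts copied from the relation list above.
relations = {
    "cc": 21799, "advmod:emph": 2147, "mark": 196, "advmod": 24,
    "nmod": 10, "root": 10, "dep": 7, "conj": 4, "case": 2,
    "cop": 1, "discourse": 1, "fixed": 1, "nsubj": 1,
    "nsubj:pass": 1, "orphan": 1,
}

total = sum(relations.values())
print(len(relations), total)  # 15 24205

# Assumption: shares are rounded to the nearest integer percent.
shares = {rel: round(100 * n / total) for rel, n in relations.items()}
for rel in ("cc", "advmod:emph", "mark", "advmod"):
    print(rel, shares[rel])
```

The total equals the 24205 CCONJ tokens reported at the top of the page, so every CCONJ token is accounted for by exactly one incoming relation.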

Parents of CCONJ nodes belong to 17 different parts of speech: NOUN (11788; 49% instances), VERB (5833; 24% instances), ADJ (3893; 16% instances), ADV (946; 4% instances), PROPN (603; 2% instances), DET (494; 2% instances), NUM (225; 1% instances), SYM (203; 1% instances), PRON (132; 1% instances), SCONJ (33; 0% instances), PART (18; 0% instances), AUX (11; 0% instances), (10; 0% instances), CCONJ (6; 0% instances), INTJ (5; 0% instances), ADP (3; 0% instances), PUNCT (2; 0% instances)

23987 (99%) CCONJ nodes are leaves.

190 (1%) CCONJ nodes have one child.

20 (0%) CCONJ nodes have two children.

8 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.
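A minimal sketch of how the child-degree distribution could be computed from the HEAD column of a CoNLL-U file is given below. The two sample sentences and the `child_degrees` helper are hypothetical and not taken from the treebank. The second sentence imitates a fixed multiword conjunction, in which a CCONJ node has one child.

```python
from collections import Counter

# Hypothetical two-sentence sample in CoNLL-U format; NOT from UD_Czech-CAC.
SAMPLE = (
    "1\tpes\tpes\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\ta\ta\tCCONJ\t_\t_\t3\tcc\t_\t_\n"
    "3\tkočka\tkočka\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "\n"
    "1\tpřišel\tpřijít\tVERB\t_\t_\t0\troot\t_\t_\n"
    "2\t,\t,\tPUNCT\t_\t_\t5\tpunct\t_\t_\n"
    "3\ti\ti\tCCONJ\t_\t_\t5\tmark\t_\t_\n"
    "4\tkdyž\tkdyž\tSCONJ\t_\t_\t3\tfixed\t_\t_\n"
    "5\tpršelo\tpršet\tVERB\t_\t_\t1\tadvcl\t_\t_\n"
)


def child_degrees(conllu_text, upos):
    """Map number of children -> number of nodes with that UPOS tag."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep only basic tokens; skip multiword ranges and empty nodes.
        rows = [r for r in rows if r[0].isdigit()]
        # Column 7 (index 6) is HEAD: count how often each ID is a head.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                degrees[n_children[r[0]]] += 1
    return degrees


print(child_degrees(SAMPLE, "CCONJ"))  # one leaf, one node with one child

# Consistency check on the figures reported above:
print(23987 + 190 + 20 + 8)  # 24205, the total number of CCONJ tokens
```
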

Children of CCONJ nodes are attached using 13 different relations: fixed (172; 67% instances), dep (14; 5% instances), punct (12; 5% instances), mark (11; 4% instances), amod (10; 4% instances), cc (8; 3% instances), nmod (8; 3% instances), advmod:emph (6; 2% instances), aux (5; 2% instances), obl (5; 2% instances), case (2; 1% instances), orphan (2; 1% instances), conj (1; 0% instances)

Children of CCONJ nodes belong to 15 different parts of speech: SCONJ (171; 67% instances), NOUN (16; 6% instances), ADJ (12; 5% instances), PUNCT (12; 5% instances), DET (8; 3% instances), ADV (7; 3% instances), CCONJ (6; 2% instances), PART (6; 2% instances), AUX (5; 2% instances), SYM (4; 2% instances), VERB (3; 1% instances), PRON (2; 1% instances), PROPN (2; 1% instances), ADP (1; 0% instances), NUM (1; 0% instances)