This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: CCONJ

There are 44 CCONJ lemmas (0%), 47 CCONJ types (0%) and 56857 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 13 in number of lemmas, 13 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: a, i, ale, však, nebo, ani, či, proto, až, ovšem

The 10 most frequent CCONJ types: a, i, ale, však, nebo, ani, či, proto, až, ovšem

The 10 most frequent ambiguous lemmas: a (CCONJ 32110, NOUN 133, ADJ 15, ADP 9, X 4), i (CCONJ 7804, NOUN 15, PROPN 2), proto (CCONJ 950, ADV 229), (PART 1384, CCONJ 639, SCONJ 139), ovšem (CCONJ 626, PART 42), tak (ADV 2354, CCONJ 389), jak (ADV 1801, SCONJ 399, CCONJ 52, PROPN 5), plus (NOUN 37, CCONJ 16), alias (CCONJ 6, NOUN 2), as (CCONJ 4, NOUN 2, SCONJ 2)

The 10 most frequent ambiguous types: a (CCONJ 31068, ADJ 183, NOUN 49, ADP 7), i (CCONJ 7339, NOUN 14, PROPN 2), proto (CCONJ 655, ADV 229), (PART 1295, CCONJ 639, SCONJ 111), ovšem (CCONJ 561, PART 42), tak (ADV 2201, CCONJ 358), buď (CCONJ 112, AUX 12, VERB 1), and (CCONJ 45, PROPN 2), jak (ADV 1350, SCONJ 222, CCONJ 50, PROPN 5), plus (NOUN 25, CCONJ 16)

Morphology

The form / lemma ratio of CCONJ is 1.068182 (the average of all parts of speech is 2.181849).
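The reported ratio matches the number of CCONJ types divided by the number of CCONJ lemmas given in the summary above (47 and 44). The following minimal sketch reproduces the arithmetic; it assumes that "forms" here means distinct word types, which is an interpretation of the figures rather than something the page states, and the variable names are illustrative only.

```python
# Counts copied from the summary line of this page (assumption: "forms" = distinct types).
cconj_types = 47
cconj_lemmas = 44

# Form / lemma ratio: average number of distinct forms per lemma.
form_lemma_ratio = cconj_types / cconj_lemmas
print(f"{form_lemma_ratio:.6f}")  # 1.068182
```

A ratio this close to 1 reflects that coordinating conjunctions are uninflected in Czech; the few lemmas with two forms are variants such as those listed below.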

The 1st highest number of forms (2) was observed with the lemma “krát”: krát, kráte.

The 2nd highest number of forms (2) was observed with the lemma “nebo”: neb, nebo.

The 3rd highest number of forms (2) was observed with the lemma “neboť”: neboť, ť.

CCONJ occurs with 5 features: Abbr (182; 0% instances), Foreign (80; 0% instances), ConjType (50; 0% instances), Style (4; 0% instances), NameType (1; 0% instances)

CCONJ occurs with 5 feature-value pairs: Abbr=Yes, ConjType=Oper, Foreign=Yes, NameType=Com, Style=Arch

CCONJ occurs with 7 feature combinations. The most frequent feature combination is _ (56542 tokens). Examples: a, i, ale, však, nebo, ani, či, proto, až, ovšem

Relations

CCONJ nodes are attached to their parents using 17 different relations: cc (49290; 87% instances), advmod:emph (6446; 11% instances), mark (452; 1% instances), advmod (449; 1% instances), nmod (52; 0% instances), flat:foreign (46; 0% instances), dep (34; 0% instances), root (29; 0% instances), conj (23; 0% instances), discourse (16; 0% instances), orphan (9; 0% instances), fixed (3; 0% instances), appos (2; 0% instances), nsubj (2; 0% instances), obj (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances)
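As a consistency check on the list above, the relation counts can be summed and converted to rounded percentages. This is only a sketch over the numbers printed on this page; the dictionary name and the rounding to whole percent are assumptions about how the page's figures were produced.

```python
# Relation counts copied from the list above (17 relations).
relation_counts = {
    "cc": 49290, "advmod:emph": 6446, "mark": 452, "advmod": 449,
    "nmod": 52, "flat:foreign": 46, "dep": 34, "root": 29, "conj": 23,
    "discourse": 16, "orphan": 9, "fixed": 3, "appos": 2, "nsubj": 2,
    "obj": 2, "advcl": 1, "amod": 1,
}

# The counts should add up to the total number of CCONJ tokens.
total = sum(relation_counts.values())
print(total)  # 56857

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
print(shares["cc"], shares["advmod:emph"])  # 87 11
```

The total agrees with the 56857 CCONJ tokens reported at the top of the page.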

Parents of CCONJ nodes belong to 17 different parts of speech: NOUN (22491; 40% instances), VERB (18202; 32% instances), ADJ (7500; 13% instances), PROPN (3935; 7% instances), ADV (1961; 3% instances), NUM (1162; 2% instances), DET (957; 2% instances), PRON (419; 1% instances), PART (92; 0% instances), CCONJ (38; 0% instances), (29; 0% instances), ADP (28; 0% instances), SYM (13; 0% instances), INTJ (11; 0% instances), SCONJ (11; 0% instances), AUX (5; 0% instances), PUNCT (3; 0% instances)

56248 (99%) CCONJ nodes are leaves.

493 (1%) CCONJ nodes have one child.

70 (0%) CCONJ nodes have two children.

46 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 7.

Children of CCONJ nodes are attached using 20 different relations: fixed (395; 49% instances), punct (81; 10% instances), advmod:emph (73; 9% instances), mark (44; 6% instances), aux (36; 5% instances), cc (35; 4% instances), nummod (28; 4% instances), conj (26; 3% instances), dep (26; 3% instances), flat:foreign (22; 3% instances), orphan (12; 2% instances), nmod (10; 1% instances), advcl (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), obl (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of CCONJ nodes belong to 13 different parts of speech: SCONJ (434; 54% instances), PUNCT (87; 11% instances), ADV (55; 7% instances), CCONJ (38; 5% instances), AUX (36; 5% instances), NOUN (28; 4% instances), NUM (28; 4% instances), DET (22; 3% instances), PART (21; 3% instances), VERB (19; 2% instances), PROPN (15; 2% instances), ADJ (14; 2% instances), PRON (3; 0% instances)