Treebank Statistics: UD_Czech-PDTC: POS Tags: CCONJ
There are 47 CCONJ lemmas (0%), 48 CCONJ types (0%) and 112632 CCONJ tokens (3%).
Out of 17 observed tags, the rank of CCONJ is: 12 in number of lemmas, 14 in number of types and 11 in number of tokens.
The 10 most frequent CCONJ lemmas: a, ale, i, nebo, však, takže, či, až, proto, ani
The 10 most frequent CCONJ types: a, ale, i, nebo, však, takže, či, až, proto, ani
The 10 most frequent ambiguous lemmas: a (CCONJ 67954, NOUN 51, X 33), ale (CCONJ 11351, PART 6), i (CCONJ 6678, PART 6206, NOUN 23, X 3, ADJ 1), však (CCONJ 4367, PART 38), až (PART 2708, CCONJ 1402, SCONJ 445), proto (CCONJ 1043, ADV 593, PART 1), ani (PART 1881, CCONJ 1013), totiž (CCONJ 972, PART 22), jako (SCONJ 7286, CCONJ 827, PART 21), sice (CCONJ 689, PART 46, ADV 12)
The 10 most frequent ambiguous types: a (CCONJ 65840, ADJ 181, NOUN 50, X 32), ale (CCONJ 9340, PART 4), i (PART 6200, CCONJ 5716, NOUN 23, X 3, ADJ 1), však (CCONJ 4353, PART 37), až (PART 2529, CCONJ 1400, SCONJ 360), proto (CCONJ 665, ADV 579, PART 1), ani (PART 1547, CCONJ 963), totiž (CCONJ 967, PART 20), jako (SCONJ 6809, CCONJ 823, PART 15), sice (CCONJ 622, PART 46, ADV 12)
- a
- ale
- i
- PART 6200: Stejný názor má i řada našich soukromých podnikatelů .
- CCONJ 5716: * Takové hodnocení je snad běžné u soukromníka , podnikatele i firmy .
- NOUN 23: U článku o Hondě Legend byla zaměněna fotografie s Roverem 214 i .
- X 3: Výše uvedený dopis byl přetištěn také rusky ve sborníku “ I . Kant - Traktaty i pisma “ ( vydalo nakl . Nauka r . 1980 ) .
- ADJ 1: Zdvojnásobení zisku očekává v tomto roce Koh - i - noor Hardtmuth České Budějovice .
- však
- až
- proto
- ani
- totiž
- jako
- sice
Morphology
The form / lemma ratio of CCONJ is 1.021277 (the average of all parts of speech is 2.169184).
The 1st highest number of forms (2) was observed with the lemma “krát”: krát, kráte.
The 2nd highest number of forms (1) was observed with the lemma “a”: a.
The 3rd highest number of forms (1) was observed with the lemma “ale”: ale.
CCONJ occurs with 4 features: ExtPos (1784; 2% instances), Abbr (221; 0% instances), ConjType (133; 0% instances), Style (4; 0% instances)
CCONJ occurs with 5 feature-value pairs: Abbr=Yes, ConjType=Oper, ExtPos=CCONJ, ExtPos=SCONJ, Style=Coll
CCONJ occurs with 6 feature combinations.
The most frequent feature combination is _ (110494 tokens).
Examples: a, ale, i, nebo, však, takže, či, až, proto, ani
Relations
CCONJ nodes are attached to their parents using 11 different relations: cc (107058; 95% instances), advmod:emph (3133; 3% instances), mark (1830; 2% instances), root (557; 0% instances), advmod (19; 0% instances), conj (18; 0% instances), nmod (10; 0% instances), appos (3; 0% instances), ccomp (2; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)
Parents of CCONJ nodes belong to 17 different parts of speech: VERB (42556; 38% instances), NOUN (38674; 34% instances), ADJ (13473; 12% instances), PROPN (5102; 5% instances), ADV (4156; 4% instances), NUM (3419; 3% instances), DET (1299; 1% instances), PRON (1072; 1% instances), AUX (927; 1% instances), X (784; 1% instances), (557; 0% instances), PART (485; 0% instances), SYM (65; 0% instances), ADP (34; 0% instances), INTJ (14; 0% instances), CCONJ (11; 0% instances), SCONJ (4; 0% instances)
110245 (98%) CCONJ nodes are leaves.
1793 (2%) CCONJ nodes have one child.
261 (0%) CCONJ nodes have two children.
333 (0%) CCONJ nodes have three or more children.
The highest child degree of a CCONJ node is 8.
Children of CCONJ nodes are attached using 22 different relations: fixed (1798; 52% instances), punct (694; 20% instances), dep (600; 17% instances), conj (154; 4% instances), advmod:emph (100; 3% instances), appos (47; 1% instances), compound (24; 1% instances), cc (9; 0% instances), mark (6; 0% instances), obl (6; 0% instances), cop (4; 0% instances), discourse (4; 0% instances), nsubj (4; 0% instances), nummod (4; 0% instances), advcl (3; 0% instances), advmod (3; 0% instances), case (3; 0% instances), aux (2; 0% instances), nmod (2; 0% instances), ccomp (1; 0% instances), obl:arg (1; 0% instances), parataxis (1; 0% instances)
Children of CCONJ nodes belong to 15 different parts of speech: SCONJ (1156; 33% instances), PART (739; 21% instances), PUNCT (694; 20% instances), NOUN (341; 10% instances), ADV (184; 5% instances), VERB (109; 3% instances), PRON (84; 2% instances), DET (43; 1% instances), NUM (36; 1% instances), ADJ (33; 1% instances), PROPN (27; 1% instances), CCONJ (11; 0% instances), AUX (7; 0% instances), ADP (3; 0% instances), INTJ (3; 0% instances)