Treebank Statistics: UD_Upper_Sorbian-UFAL: POS Tags: CCONJ
There are 9 CCONJ lemmas (0%), 9 CCONJ types (0%) and 423 CCONJ tokens (4%).
Out of 16 observed tags, the rank of CCONJ is: 14 in number of lemmas, 15 in number of types and 8 in number of tokens.
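The lemma, type and token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the sample rows are invented toy data, the helper names `row`, `read_words` and `pos_counts` are hypothetical, and it assumes that a "type" is a distinct word form exactly as written (no case folding).

```python
# Toy CoNLL-U fragment (invented for illustration, not taken from the treebank).
def row(i, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line; unused columns are '_'."""
    return "\t".join([str(i), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "foo", "foo", "NOUN", 0, "root"),
    row(2, "a", "a", "CCONJ", 3, "cc"),
    row(3, "bar", "bar", "NOUN", 1, "conj"),
    "",
    "# sent_id = toy-2",
    row(1, "foo", "foo", "NOUN", 0, "root"),
    row(2, "Abo", "abo", "CCONJ", 3, "cc"),
    row(3, "baz", "baz", "NOUN", 1, "conj"),
    row(4, "a", "a", "CCONJ", 5, "cc"),
    row(5, "bar", "bar", "NOUN", 1, "conj"),
])

def read_words(text):
    """Yield (form, lemma, upos) for every syntactic word in CoNLL-U text."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]

def pos_counts(text, upos):
    """Count distinct lemmas, distinct forms and tokens carrying the given tag."""
    lemmas, types, tokens, total = set(), set(), 0, 0
    for form, lemma, tag in read_words(text):
        total += 1
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return {"lemmas": len(lemmas), "types": len(types),
            "tokens": tokens, "share_of_tokens": tokens / total}

counts = pos_counts(SAMPLE, "CCONJ")
```

On the toy sample, `counts` holds 2 lemmas, 2 types and 3 tokens out of 8 words. Ranking a tag among all observed tags would repeat the same count for each tag and sort the results.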
The 9 CCONJ lemmas, most frequent first: a, abo, ale, hač, pak, wšak, et, tola, ani
The 9 CCONJ types, most frequent first: a, abo, ale, hač, pak, wšak, et, tola, ani
The most frequent ambiguous lemmas (occurrences per tag in parentheses): a (CCONJ 337, X 5), hač (SCONJ 17, ADV 12, CCONJ 7)
The most frequent ambiguous types (occurrences per tag in parentheses): a (CCONJ 337, X 4), hač (SCONJ 17, ADV 11, CCONJ 7)
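A lemma counts as ambiguous here when it is observed with more than one tag. The sketch below shows one way to find such lemmas and print them in the format used above. It rebuilds the observations from the reported counts for "a" and "hač"; the count for "abo" is a placeholder (the page does not report it), and the function names are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(rows):
    """rows: iterable of (lemma, upos). Return {lemma: Counter of tags}
    for every lemma observed with more than one tag."""
    by_lemma = defaultdict(Counter)
    for lemma, upos in rows:
        by_lemma[lemma][upos] += 1
    return {lemma: tags for lemma, tags in by_lemma.items() if len(tags) > 1}

def describe(lemma, tags):
    """Format one entry as 'lemma (TAG n, TAG n, ...)', most frequent tag first."""
    parts = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    return f"{lemma} ({parts})"

# Observations rebuilt from the counts reported above.
rows = (
    [("a", "CCONJ")] * 337 + [("a", "X")] * 5
    + [("hač", "SCONJ")] * 17 + [("hač", "ADV")] * 12 + [("hač", "CCONJ")] * 7
    + [("abo", "CCONJ")] * 3  # placeholder count: "abo" is only seen as CCONJ
)

ambiguous = ambiguous_lemmas(rows)
lines = [describe(lemma, tags) for lemma, tags in ambiguous.items()]
```

Because "abo" appears with a single tag, it is left out of `ambiguous`, while "a" and "hač" are kept.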
Morphology
The form / lemma ratio of CCONJ is 1.000000 (the average of all parts of speech is 1.419479).
Every CCONJ lemma has exactly one form, so no lemma stands out. Examples: “a”: a; “abo”: abo; “ale”: ale.
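The form / lemma ratio can be read as the average number of distinct forms per lemma. The sketch below works under that assumption, which the page itself does not spell out. The lemma-form pairs for "a", "abo" and "ale" come from the lines above; the other six pairs are inferred from the identical lemma and type lists. `form_lemma_ratio` is a hypothetical helper name.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """pairs: iterable of (lemma, form). Return distinct forms per lemma, averaged."""
    forms = defaultdict(set)
    for lemma, form in pairs:
        forms[lemma].add(form)
    return sum(len(f) for f in forms.values()) / len(forms)

# Nine CCONJ lemmas, each observed with a single form identical to the lemma.
CCONJ_LEMMAS = ["a", "abo", "ale", "hač", "pak", "wšak", "et", "tola", "ani"]
pairs = [(lemma, lemma) for lemma in CCONJ_LEMMAS]

ratio = form_lemma_ratio(pairs)   # 9 forms / 9 lemmas
formatted = f"{ratio:.6f}"        # same six-decimal format as the page
```

A ratio of exactly 1 means the class is uninflected in this data. One extra spelling of a single lemma (for example a capitalised variant counted as a separate form) would raise it to 10 / 9.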
CCONJ occurs with 2 features: ExtPos (22; 5% instances), Foreign (1; 0% instances)
CCONJ occurs with 2 feature-value pairs: ExtPos=CCONJ, Foreign=Yes
CCONJ occurs with 3 feature combinations.
The most frequent feature combination is _ (400 tokens).
Examples: a, abo, ale, hač, pak, wšak, tola, et, ani
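The three feature figures (features, feature-value pairs, feature combinations) can all be derived from the FEATS column. The sketch below is an illustration with a hypothetical `feature_stats` helper. Its input is rebuilt from the counts above: 400 empty values, 22 `ExtPos=CCONJ` and 1 `Foreign=Yes`, which is the only split of three combinations that adds up to 423 tokens.

```python
from collections import Counter

def feature_stats(feats_column):
    """feats_column: iterable of FEATS strings such as '_' or 'A=x|B=y'.
    Return counters for feature names, feature=value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        if feats == "_":
            continue  # '_' marks a token without any features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos

# FEATS values rebuilt from the reported counts (order is irrelevant).
feats_column = ["_"] * 400 + ["ExtPos=CCONJ"] * 22 + ["Foreign=Yes"] * 1

names, pairs, combos = feature_stats(feats_column)
total = sum(combos.values())
extpos_percent = round(100 * names["ExtPos"] / total)
```

Here `names` has 2 entries, `pairs` has 2 entries and `combos` has 3, and `extpos_percent` rounds 22 / 423 to the 5% shown above.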
Relations
CCONJ nodes are attached to their parents using 4 different relations: cc (420; 99% instances), advmod (1; 0% instances), dep:alt (1; 0% instances), fixed (1; 0% instances)
Parents of CCONJ nodes belong to 9 different parts of speech: NOUN (183; 43% instances), VERB (92; 22% instances), ADJ (57; 13% instances), PROPN (52; 12% instances), X (13; 3% instances), NUM (12; 3% instances), ADV (9; 2% instances), SYM (4; 1% instances), PRON (1; 0% instances)
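The relation and parent-tag distributions come from the HEAD and DEPREL columns. The sketch below is one possible way to collect them, using invented toy sentences and hypothetical helper names (`row`, `sentences`, `attachment_stats`, `report`). It assumes percentages are rounded to whole numbers, which reproduces the figures above.

```python
from collections import Counter

def row(i, form, upos, head, deprel):
    """Build one 10-column CoNLL-U line; the lemma is set equal to the form."""
    return "\t".join([str(i), form, form, upos, "_", "_", str(head), deprel, "_", "_"])

# Toy sentences (invented for illustration, not taken from the treebank).
SAMPLE = "\n".join([
    row(1, "foo", "NOUN", 0, "root"),
    row(2, "a", "CCONJ", 3, "cc"),
    row(3, "bar", "NOUN", 1, "conj"),
    "",
    row(1, "qux", "VERB", 0, "root"),
    row(2, "abo", "CCONJ", 3, "cc"),
    row(3, "quux", "VERB", 1, "conj"),
    row(4, "a", "CCONJ", 5, "cc"),
    row(5, "bar", "NOUN", 3, "obj"),
])

def sentences(text):
    """Yield each sentence as a dict: token id -> (upos, head id, deprel)."""
    sent = {}
    for line in text.splitlines() + [""]:
        if not line.strip():
            if sent:
                yield sent
            sent = {}
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip ranges (1-2) and empty nodes (1.1)
                sent[cols[0]] = (cols[3], cols[6], cols[7])

def attachment_stats(text, upos):
    """Count incoming relations and parent tags of all nodes with the given tag."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences(text):
        for tag, head, deprel in sent.values():
            if tag == upos:
                relations[deprel] += 1
                parent_tags[sent[head][0] if head in sent else "ROOT"] += 1
    return relations, parent_tags

def report(counter):
    """Format counts as 'key (n; p% instances)', most frequent first."""
    total = sum(counter.values())
    return ", ".join(f"{key} ({n}; {round(100 * n / total)}% instances)"
                     for key, n in counter.most_common())

relations, parent_tags = attachment_stats(SAMPLE, "CCONJ")
```

Applied to the reported relation counts, `report(Counter({"cc": 420, "advmod": 1, "dep:alt": 1, "fixed": 1}))` yields the same string as the relation line above.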
399 (94%) CCONJ nodes are leaves.
23 (5%) CCONJ nodes have one child.
1 (0%) CCONJ node has two children.
The highest child degree of a CCONJ node is 2.
Children of CCONJ nodes are attached using 2 different relations: fixed (22; 88% instances), punct (3; 12% instances)
Children of CCONJ nodes belong to 3 different parts of speech: ADV (20; 80% instances), PUNCT (3; 12% instances), PRON (2; 8% instances)
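The child-degree figures can be cross-checked against the child counts. The sketch below first computes a degree histogram for one invented toy sentence with a hypothetical `child_degrees` helper, then checks that the reported histogram (399 leaves, 23 nodes with one child, 1 node with two) accounts for all 423 tokens and for the 25 children listed by relation and by tag.

```python
from collections import Counter

def child_degrees(rows, upos):
    """rows: (id, upos, head) triples of ONE sentence (ids restart per sentence).
    Return Counter: number of children -> number of nodes with the given tag."""
    n_children = Counter(head for _id, _tag, head in rows)
    return Counter(n_children[node_id] for node_id, tag, _head in rows if tag == upos)

# Toy sentence (invented): token 2 is a CCONJ with one child, token 5 is a leaf.
TOY = [
    (1, "NOUN", 0),
    (2, "CCONJ", 4),
    (3, "ADV", 2),
    (4, "NOUN", 1),
    (5, "CCONJ", 6),
    (6, "NOUN", 1),
]
toy_hist = child_degrees(TOY, "CCONJ")

# Consistency check on the figures reported above.
REPORTED = {0: 399, 1: 23, 2: 1}                 # child degree -> number of CCONJ nodes
total_nodes = sum(REPORTED.values())             # should equal the CCONJ token count
total_children = sum(degree * n for degree, n in REPORTED.items())
children_by_relation = 22 + 3                    # fixed + punct
children_by_tag = 20 + 3 + 2                     # ADV + PUNCT + PRON
```

All three child totals come out at 25, and the 22 `fixed` children line up with the 22 tokens carrying `ExtPos=CCONJ`, which suggests that the non-leaf CCONJ nodes are mostly heads of fixed multiword conjunctions.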