home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: POS Tags: CCONJ

There are 19 CCONJ lemmas (0%), 21 CCONJ types (0%) and 13477 CCONJ tokens (4%). Out of 16 observed tags, the rank of CCONJ is: 15 in number of lemmas, 15 in number of types and 9 in number of tokens.

The 10 most frequent CCONJ lemmas: ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent

The 10 most frequent CCONJ types: ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent

The 10 most frequent ambiguous lemmas: või (CCONJ 1052, ADV 54, NOUN 4, AUX 3), aga (CCONJ 946, ADV 528), kui (SCONJ 2272, CCONJ 264, ADV 176), ega (CCONJ 226, ADV 78), vaid (ADV 343, CCONJ 180), ehk (CCONJ 173, ADV 63), kuni (ADP 73, SCONJ 64, ADV 63, CCONJ 47), & (SYM 14, CCONJ 8), nii (ADV 1040, CCONJ 4), e (NOUN 21, CCONJ 3)

The 10 most frequent ambiguous types: või (CCONJ 990, ADV 53, AUX 8, NOUN 2, VERB 2), aga (CCONJ 638, ADV 528), kuid (CCONJ 490, NOUN 6), kui (SCONJ 1653, CCONJ 264, ADV 144), ega (CCONJ 221, ADV 44), vaid (ADV 325, CCONJ 177), ehk (CCONJ 167, ADV 57), kuni (ADP 65, ADV 62, SCONJ 54, CCONJ 47), & (CCONJ 8, SYM 6), nii (ADV 835, CCONJ 4)

Morphology

The form / lemma ratio of CCONJ is 1.105263 (the average of all parts of speech is 1.912184).

The 1st highest number of forms (2) was observed with the lemma “aga”: A, aga.

The 2nd highest number of forms (2) was observed with the lemma “ja”: -ja, ja.

The 3rd highest number of forms (1) was observed with the lemma “&”: &.

CCONJ occurs with 3 features: Polarity (225; 2% instances), Abbr (11; 0% instances), Foreign (1; 0% instances)

CCONJ occurs with 3 feature-value pairs: Abbr=Yes, Foreign=Yes, Polarity=Neg

CCONJ occurs with 4 feature combinations. The most frequent feature combination is _ (13240 tokens). Examples: ja, ning, või, aga, kuid, kui, vaid, ehk, ent, kuni

Relations

CCONJ nodes are attached to their parents using 6 different relations: cc (13465; 100% instances), cc:preconj (6; 0% instances), mark (2; 0% instances), root (2; 0% instances), flat (1; 0% instances), nsubj:cop (1; 0% instances)

Parents of CCONJ nodes belong to 15 different parts of speech: NOUN (5354; 40% instances), VERB (4823; 36% instances), ADJ (1502; 11% instances), PROPN (926; 7% instances), ADV (432; 3% instances), PRON (228; 2% instances), NUM (167; 1% instances), DET (16; 0% instances), INTJ (6; 0% instances), SCONJ (6; 0% instances), SYM (5; 0% instances), X (5; 0% instances), ADP (4; 0% instances), (2; 0% instances), CCONJ (1; 0% instances)

13471 (100%) CCONJ nodes are leaves.

4 (0%) CCONJ nodes have one child.

1 (0%) CCONJ nodes have two children.

1 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 7 different relations: punct (5; 45% instances), advmod (1; 9% instances), amod (1; 9% instances), appos (1; 9% instances), cc (1; 9% instances), det (1; 9% instances), nummod (1; 9% instances)

Children of CCONJ nodes belong to 7 different parts of speech: PUNCT (5; 45% instances), ADJ (1; 9% instances), ADV (1; 9% instances), CCONJ (1; 9% instances), DET (1; 9% instances), NOUN (1; 9% instances), NUM (1; 9% instances)