This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home et/pos issue tracker

CONJ: coordinating conjunction

Definition

A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.
The following words are annotated as coordinated conjunctions CONJ in Estonian UD: aga “but”, ega “nor”, ehk “or”, elik “or”, ent “but”, ja “and”, kui “if, when”, kuid “but”, kuni “until”, ning “and”, vaid “but” või “or”. For subordinating conjunctions, see u-dep/sconj.


Treebank Statistics (UD_Estonian)

There are 14 CONJ lemmas (0%), 14 CONJ types (0%) and 8410 CONJ tokens (4%). Out of 15 observed tags, the rank of CONJ is: 14 in number of lemmas, 14 in number of types and 8 in number of tokens.

The 10 most frequent CONJ lemmas: ja, ning, aga, või, kuid, kui, ega, vaid, ehk, ent

The 10 most frequent CONJ types: ja, ning, aga, või, kuid, kui, ega, vaid, ehk, ent

The 10 most frequent ambiguous lemmas: aga (CONJ 697, ADV 277), või (CONJ 626, ADV 42, NOUN 4, AUX 3), kui (SCONJ 1504, CONJ 144, ADV 124), ega (CONJ 139, ADV 58), vaid (ADV 218, CONJ 121), ehk (CONJ 104, ADV 50), kuni (ADV 43, SCONJ 36, CONJ 27, ADP 22)

The 10 most frequent ambiguous types: aga (CONJ 477, ADV 277), või (CONJ 575, ADV 42, AUX 6, VERB 2, NOUN 2), kuid (CONJ 279, NOUN 3), kui (SCONJ 1080, CONJ 144, ADV 99), ega (CONJ 137, ADV 30), vaid (ADV 203, CONJ 118), ehk (CONJ 99, ADV 44), kuni (ADV 42, SCONJ 31, CONJ 27, ADP 18)

Morphology

The form / lemma ratio of CONJ is 1.000000 (the average of all parts of speech is 1.839644).

The 1st highest number of forms (2) was observed with the lemma “aga”: A, aga.

The 2nd highest number of forms (1) was observed with the lemma “CO”: ja.

The 3rd highest number of forms (1) was observed with the lemma “ega”: ega.

CONJ occurs with 1 features: Negative (138; 2% instances)

CONJ occurs with 1 feature-value pairs: Negative=Neg

CONJ occurs with 2 feature combinations. The most frequent feature combination is _ (8272 tokens). Examples: ja, ning, aga, või, kuid, kui, vaid, ehk, ent, kuni

Relations

CONJ nodes are attached to their parents using 5 different relations: cc (8398; 100% instances), cc:preconj (4; 0% instances), mark (3; 0% instances), root (3; 0% instances), foreign (2; 0% instances)

Parents of CONJ nodes belong to 14 different parts of speech: VERB (3489; 41% instances), NOUN (2947; 35% instances), ADJ (903; 11% instances), PROPN (631; 8% instances), ADV (183; 2% instances), PRON (129; 2% instances), NUM (79; 1% instances), ADP (24; 0% instances), INTJ (7; 0% instances), SCONJ (5; 0% instances), SYM (5; 0% instances), AUX (4; 0% instances), ROOT (3; 0% instances), X (1; 0% instances)

8388 (100%) CONJ nodes are leaves.

20 (0%) CONJ nodes have one child.

2 (0%) CONJ nodes have two children.

The highest child degree of a CONJ node is 2.

Children of CONJ nodes are attached using 5 different relations: punct (15; 63% instances), cc:preconj (4; 17% instances), foreign (3; 13% instances), advmod (1; 4% instances), mark (1; 4% instances)

Children of CONJ nodes belong to 5 different parts of speech: PUNCT (15; 63% instances), NOUN (4; 17% instances), PROPN (3; 13% instances), ADV (1; 4% instances), SCONJ (1; 4% instances)


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]