This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home sl/pos issue tracker

CONJ: coordinating conjunction

Definition

A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.

Examples

Conversion from JOS

All conjunctions with Type=coordinating become CONJ.


Treebank Statistics (UD_Slovenian)

There are 22 CONJ lemmas (0%), 25 CONJ types (0%) and 5701 CONJ tokens (4%). Out of 16 observed tags, the rank of CONJ is: 12 in number of lemmas, 13 in number of types and 9 in number of tokens.

The 10 most frequent CONJ lemmas: in, pa, ali, saj, ter, vendar, zato, namreč, a, toda

The 10 most frequent CONJ types: in, pa, ali, ter, saj, vendar, zato, namreč, a, toda

The 10 most frequent ambiguous lemmas: in (CONJ 3242, ADV 5, X 1), pa (CONJ 957, X 1), ali (CONJ 337, ADV 77), vendar (CONJ 140, ADV 5), zato (CONJ 119, ADV 64), a (CONJ 96, ADV 2, X 1), tako (ADV 368, CONJ 56), ne (PART 740, CONJ 22), niti (PART 49, CONJ 16), sicer (PART 82, CONJ 5)

The 10 most frequent ambiguous types: in (CONJ 3129, ADV 5, X 1), pa (CONJ 931, X 1), ali (CONJ 324, ADV 45), vendar (CONJ 91, ADV 5), zato (CONJ 67, ADV 60), a (CONJ 65, ADV 1), tako (ADV 296, CONJ 44, DET 8), ne (PART 664, CONJ 20), niti (PART 43, CONJ 15), sicer (PART 60, CONJ 5)

Morphology

The form / lemma ratio of CONJ is 1.136364 (the average of all parts of speech is 1.894262).

The 1st highest number of forms (2) was observed with the lemma “ali”: al, ali.

The 2nd highest number of forms (2) was observed with the lemma “saj”: saj, sej.

The 3rd highest number of forms (2) was observed with the lemma “tako”: tako, takó.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 6 different relations: cc (4588; 80% instances), advmod (1036; 18% instances), cc:preconj (61; 1% instances), mwe (14; 0% instances), conj (1; 0% instances), root (1; 0% instances)

Parents of CONJ nodes belong to 14 different parts of speech: VERB (2650; 46% instances), NOUN (1835; 32% instances), ADJ (806; 14% instances), PROPN (216; 4% instances), ADV (79; 1% instances), PRON (49; 1% instances), NUM (38; 1% instances), CONJ (8; 0% instances), PART (7; 0% instances), DET (5; 0% instances), ADP (3; 0% instances), X (3; 0% instances), AUX (1; 0% instances), ROOT (1; 0% instances)

5644 (99%) CONJ nodes are leaves.

53 (1%) CONJ nodes have one child.

3 (0%) CONJ nodes have two children.

1 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 3.

Children of CONJ nodes are attached using 4 different relations: mwe (53; 85% instances), punct (7; 11% instances), advmod (1; 2% instances), conj (1; 2% instances)

Children of CONJ nodes belong to 5 different parts of speech: SCONJ (25; 40% instances), PART (19; 31% instances), CONJ (8; 13% instances), PUNCT (7; 11% instances), ADV (3; 5% instances)


Treebank Statistics (UD_Slovenian-SST)

There are 23 CONJ lemmas (1%), 23 CONJ types (0%) and 1631 CONJ tokens (6%). Out of 16 observed tags, the rank of CONJ is: 13 in number of lemmas, 14 in number of types and 9 in number of tokens.

The 10 most frequent CONJ lemmas: pa, in, ali, ampak, saj, tako, oziroma, torej, sicer, vendarle

The 10 most frequent CONJ types: pa, in, ali, ampak, saj, tako, oziroma, torej, sicer, vendarle

The 10 most frequent ambiguous lemmas: in (CONJ 414, ADV 1, X 1), ali (CONJ 113, ADV 12), tako (ADV 233, CONJ 51), torej (CONJ 28, ADV 11), sicer (CONJ 15, PART 14), zato (ADV 20, CONJ 6), a (ADV 137, INTJ 16, NOUN 6, CONJ 3, X 1), ne (PART 787, CONJ 2), razen (ADJ 1, CONJ 1, ADP 1)

The 10 most frequent ambiguous types: in (CONJ 414, ADV 1, X 1), ali (CONJ 113, ADV 12), tako (ADV 233, CONJ 51, DET 9, PRON 2), torej (CONJ 28, ADV 11), sicer (CONJ 15, PART 14), zato (ADV 20, CONJ 6), a (ADV 137, INTJ 16, NOUN 6, X 4, CONJ 3), ne (PART 787, CONJ 2), razen (ADP 1, CONJ 1)

Morphology

The form / lemma ratio of CONJ is 1.000000 (the average of all parts of speech is 1.575031).

The 1st highest number of forms (1) was observed with the lemma “a”: a.

The 2nd highest number of forms (1) was observed with the lemma “aber”: aber.

The 3rd highest number of forms (1) was observed with the lemma “aj”: aj.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 10 different relations: cc (1069; 66% instances), advmod (444; 27% instances), mwe (43; 3% instances), reparandum (20; 1% instances), discourse (19; 1% instances), root (19; 1% instances), cc:preconj (13; 1% instances), parataxis (2; 0% instances), case (1; 0% instances), parataxis:restart (1; 0% instances)

Parents of CONJ nodes belong to 15 different parts of speech: VERB (935; 57% instances), NOUN (272; 17% instances), ADJ (134; 8% instances), PRON (68; 4% instances), CONJ (59; 4% instances), ADV (56; 3% instances), PROPN (45; 3% instances), ROOT (19; 1% instances), PART (15; 1% instances), NUM (12; 1% instances), X (7; 0% instances), AUX (5; 0% instances), ADP (2; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)

1497 (92%) CONJ nodes are leaves.

123 (8%) CONJ nodes have one child.

9 (1%) CONJ nodes have two children.

2 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 4.

Children of CONJ nodes are attached using 10 different relations: mwe (90; 60% instances), reparandum (28; 19% instances), discourse (13; 9% instances), discourse:filler (7; 5% instances), punct (5; 3% instances), cc (2; 1% instances), advcl (1; 1% instances), mark (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)

Children of CONJ nodes belong to 8 different parts of speech: CONJ (59; 40% instances), SCONJ (49; 33% instances), X (16; 11% instances), PART (12; 8% instances), INTJ (8; 5% instances), NOUN (2; 1% instances), VERB (2; 1% instances), PRON (1; 1% instances)


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]