This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cs/pos issue tracker

CONJ: coordinating conjunction

Definition

A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.

For subordinating conjunctions, see SCONJ.

Examples

References


Treebank Statistics (UD_Czech)

There are 45 CONJ lemmas (0%), 47 CONJ types (0%) and 56857 CONJ tokens (4%). Out of 17 observed tags, the rank of CONJ is: 13 in number of lemmas, 13 in number of types and 9 in number of tokens.

The 10 most frequent CONJ lemmas: a, i, ale, však, nebo, ani, či, proto, až, ovšem

The 10 most frequent CONJ types: a, i, ale, však, nebo, ani, či, proto, až, ovšem

The 10 most frequent ambiguous lemmas: a (CONJ 32110, NOUN 133, ADJ 15, ADP 9, X 4), i (CONJ 7804, NOUN 15, PROPN 2), proto (CONJ 950, ADV 229), (PART 1384, CONJ 639, SCONJ 139), ovšem (CONJ 626, PART 42), tak (ADV 2354, CONJ 389), jak (ADV 1801, SCONJ 399, CONJ 52, PROPN 5), plus (NOUN 37, CONJ 16), alias (CONJ 6, NOUN 2), as (CONJ 4, NOUN 2, SCONJ 2)

The 10 most frequent ambiguous types: a (CONJ 31068, ADJ 183, NOUN 49, ADP 7), i (CONJ 7339, NOUN 14, PROPN 2), proto (CONJ 655, ADV 229), (PART 1295, CONJ 639, SCONJ 111), ovšem (CONJ 561, PART 42), tak (ADV 2201, CONJ 358), buď (CONJ 112, VERB 13), and (CONJ 45, PROPN 2), jak (ADV 1350, SCONJ 222, CONJ 50, PROPN 5), plus (NOUN 25, CONJ 16)

Morphology

The form / lemma ratio of CONJ is 1.044444 (the average of all parts of speech is 2.195930).

The 1st highest number of forms (2) was observed with the lemma “krát”: krát, kráte.

The 2nd highest number of forms (2) was observed with the lemma “neboť”: neboť, ť.

The 3rd highest number of forms (1) was observed with the lemma “a”: a.

CONJ occurs with 5 features: cs-feat/Abbr (182; 0% instances), cs-feat/Foreign (80; 0% instances), cs-feat/ConjType (50; 0% instances), cs-feat/Style (4; 0% instances), cs-feat/NameType (1; 0% instances)

CONJ occurs with 5 feature-value pairs: Abbr=Yes, ConjType=Oper, Foreign=Foreign, NameType=Com, Style=Arch

CONJ occurs with 7 feature combinations. The most frequent feature combination is _ (56542 tokens). Examples: a, i, ale, však, nebo, ani, či, proto, až, ovšem

Relations

CONJ nodes are attached to their parents using 15 different relations: cs-dep/cc (49290; 87% instances), cs-dep/advmod:emph (6446; 11% instances), cs-dep/mark (452; 1% instances), cs-dep/advmod (449; 1% instances), cs-dep/foreign (51; 0% instances), cs-dep/nmod (49; 0% instances), cs-dep/dep (34; 0% instances), cs-dep/conj (32; 0% instances), cs-dep/root (29; 0% instances), cs-dep/discourse (16; 0% instances), cs-dep/mwe (3; 0% instances), cs-dep/appos (2; 0% instances), cs-dep/dobj (2; 0% instances), cs-dep/advcl (1; 0% instances), cs-dep/nsubj (1; 0% instances)

Parents of CONJ nodes belong to 17 different parts of speech: NOUN (22100; 39% instances), VERB (20074; 35% instances), ADJ (6266; 11% instances), PROPN (4267; 8% instances), ADV (1466; 3% instances), NUM (1238; 2% instances), PRON (1137; 2% instances), DET (128; 0% instances), PART (68; 0% instances), CONJ (34; 0% instances), ROOT (29; 0% instances), SCONJ (12; 0% instances), ADP (11; 0% instances), SYM (10; 0% instances), INTJ (9; 0% instances), AUX (5; 0% instances), PUNCT (3; 0% instances)

56267 (99%) CONJ nodes are leaves.

496 (1%) CONJ nodes have one child.

56 (0%) CONJ nodes have two children.

38 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 7.

Children of CONJ nodes are attached using 19 different relations: cs-dep/mwe (395; 52% instances), cs-dep/punct (73; 10% instances), cs-dep/advmod:emph (70; 9% instances), cs-dep/mark (44; 6% instances), cs-dep/aux (36; 5% instances), cs-dep/cc (29; 4% instances), cs-dep/conj (29; 4% instances), cs-dep/nummod (28; 4% instances), cs-dep/dep (26; 3% instances), cs-dep/foreign (12; 2% instances), cs-dep/nmod (4; 1% instances), cs-dep/advcl (3; 0% instances), cs-dep/neg (3; 0% instances), cs-dep/acl (2; 0% instances), cs-dep/advmod (2; 0% instances), cs-dep/appos (1; 0% instances), cs-dep/ccomp (1; 0% instances), cs-dep/parataxis (1; 0% instances), cs-dep/vocative (1; 0% instances)

Children of CONJ nodes belong to 12 different parts of speech: SCONJ (433; 57% instances), PUNCT (79; 10% instances), ADV (53; 7% instances), AUX (36; 5% instances), CONJ (34; 4% instances), NUM (28; 4% instances), PRON (23; 3% instances), NOUN (21; 3% instances), PART (21; 3% instances), VERB (17; 2% instances), ADJ (11; 1% instances), PROPN (4; 1% instances)


Treebank Statistics (UD_Czech-CAC)

There are 29 CONJ lemmas (0%), 30 CONJ types (0%) and 24205 CONJ tokens (5%). Out of 16 observed tags, the rank of CONJ is: 12 in number of lemmas, 13 in number of types and 8 in number of tokens.

The 10 most frequent CONJ lemmas: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

The 10 most frequent CONJ types: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

The 10 most frequent ambiguous lemmas: a (CONJ 15539, ADP 4), i (CONJ 3420, ADJ 3, NOUN 1), tak (ADV 816, CONJ 217), budit (CONJ 9, VERB 5), proto (SCONJ 567, CONJ 9), (PART 517, SCONJ 36, CONJ 6), ovšem (PART 211, ADV 14, CONJ 5), jak (ADV 751, SCONJ 15, CONJ 2)

The 10 most frequent ambiguous types: a (CONJ 15101, ADP 3), i (CONJ 3266, ADJ 3), tak (ADV 681, CONJ 212), buď (CONJ 85, VERB 1), proto (SCONJ 364, CONJ 7), (PART 502, SCONJ 32, CONJ 6), ovšem (PART 190, ADV 12, CONJ 5), jak (ADV 654, SCONJ 14, CONJ 2), na (ADP 6587, CONJ 1), ni (PRON 36, CONJ 1)

Morphology

The form / lemma ratio of CONJ is 1.034483 (the average of all parts of speech is 2.206260).

The 1st highest number of forms (2) was observed with the lemma “a”: a, na.

The 2nd highest number of forms (2) was observed with the lemma “nebo”: anebo, nebo.

The 3rd highest number of forms (1) was observed with the lemma “ale”: ale.

CONJ occurs with 4 features: cs-feat/Aspect (9; 0% instances), cs-feat/Foreign (3; 0% instances), cs-feat/ConjType (2; 0% instances), cs-feat/NameType (1; 0% instances)

CONJ occurs with 4 feature-value pairs: Aspect=Imp, ConjType=Oper, Foreign=Foreign, NameType=Com

CONJ occurs with 5 feature combinations. The most frequent feature combination is _ (24191 tokens). Examples: a, i, nebo, ale, však, ani, či, tak, neboť, jednak

Relations

CONJ nodes are attached to their parents using 14 different relations: cs-dep/cc (21799; 90% instances), cs-dep/advmod:emph (2147; 9% instances), cs-dep/mark (196; 1% instances), cs-dep/advmod (24; 0% instances), cs-dep/nmod (10; 0% instances), cs-dep/root (10; 0% instances), cs-dep/dep (7; 0% instances), cs-dep/conj (5; 0% instances), cs-dep/case (2; 0% instances), cs-dep/cop (1; 0% instances), cs-dep/discourse (1; 0% instances), cs-dep/mwe (1; 0% instances), cs-dep/nsubj (1; 0% instances), cs-dep/nsubjpass (1; 0% instances)

Parents of CONJ nodes belong to 17 different parts of speech: NOUN (11541; 48% instances), VERB (6815; 28% instances), ADJ (3379; 14% instances), ADV (683; 3% instances), PROPN (675; 3% instances), PRON (508; 2% instances), NUM (268; 1% instances), SYM (166; 1% instances), DET (91; 0% instances), SCONJ (34; 0% instances), PART (11; 0% instances), ROOT (10; 0% instances), AUX (9; 0% instances), CONJ (5; 0% instances), INTJ (5; 0% instances), ADP (3; 0% instances), PUNCT (2; 0% instances)

23988 (99%) CONJ nodes are leaves.

189 (1%) CONJ nodes have one child.

21 (0%) CONJ nodes have two children.

7 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 4.

Children of CONJ nodes are attached using 13 different relations: cs-dep/mwe (172; 68% instances), cs-dep/dep (14; 6% instances), cs-dep/nmod (12; 5% instances), cs-dep/mark (11; 4% instances), cs-dep/punct (11; 4% instances), cs-dep/amod (10; 4% instances), cs-dep/cc (8; 3% instances), cs-dep/advmod:emph (5; 2% instances), cs-dep/aux (5; 2% instances), cs-dep/case (2; 1% instances), cs-dep/advmod (1; 0% instances), cs-dep/conj (1; 0% instances), cs-dep/neg (1; 0% instances)

Children of CONJ nodes belong to 14 different parts of speech: SCONJ (171; 68% instances), NOUN (16; 6% instances), ADJ (12; 5% instances), PUNCT (11; 4% instances), PRON (10; 4% instances), ADV (7; 3% instances), PART (6; 2% instances), AUX (5; 2% instances), CONJ (5; 2% instances), SYM (4; 2% instances), PROPN (2; 1% instances), VERB (2; 1% instances), ADP (1; 0% instances), NUM (1; 0% instances)


Treebank Statistics (UD_Czech-CLTT)

There are 14 CONJ lemmas (1%), 14 CONJ types (0%) and 1875 CONJ tokens (5%). Out of 15 observed tags, the rank of CONJ is: 11 in number of lemmas, 13 in number of types and 6 in number of tokens.

The 10 most frequent CONJ lemmas: a, nebo, i, či, anebo, ani, ale, však, buď, avšak

The 10 most frequent CONJ types: a, nebo, i, či, anebo, ani, ale, však, buď, avšak

The 10 most frequent ambiguous lemmas: (PART 30, SCONJ 7, CONJ 1), tak (ADV 23, CONJ 1)

The 10 most frequent ambiguous types: (PART 30, SCONJ 7, CONJ 1), tak (ADV 23, CONJ 1)

Morphology

The form / lemma ratio of CONJ is 1.000000 (the average of all parts of speech is 1.764161).

The 1st highest number of forms (1) was observed with the lemma “a”: a.

The 2nd highest number of forms (1) was observed with the lemma “ale”: ale.

The 3rd highest number of forms (1) was observed with the lemma “anebo”: anebo.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 5 different relations: cs-dep/cc (1805; 96% instances), cs-dep/advmod:emph (64; 3% instances), cs-dep/nmod (4; 0% instances), cs-dep/dep (1; 0% instances), cs-dep/mark (1; 0% instances)

Parents of CONJ nodes belong to 7 different parts of speech: NOUN (1331; 71% instances), VERB (205; 11% instances), ADJ (191; 10% instances), X (62; 3% instances), PRON (40; 2% instances), NUM (24; 1% instances), ADV (22; 1% instances)

1875 (100%) CONJ nodes are leaves.

The highest child degree of a CONJ node is 0.


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]