This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home en/pos issue tracker

CONJ: coordinating conjunction

The English CONJ corresponds to PTB CC.


Treebank Statistics (UD_English)

There are 27 CONJ lemmas (0%), 27 CONJ types (0%) and 8121 CONJ tokens (3%). Out of 17 observed tags, the rank of CONJ is: 16 in number of lemmas, 16 in number of types and 11 in number of tokens.

The 10 most frequent CONJ lemmas: and, but, or, &, both, either, yet, nor, plus, not

The 10 most frequent CONJ types: and, but, or, &, both, either, yet, nor, plus, not

The 10 most frequent ambiguous lemmas: and (CONJ 6017, X 6, DET 5, ADP 2), but (CONJ 896, ADP 11, SCONJ 2, NOUN 1, ADV 1), or (CONJ 860, ADP 2, INTJ 1), & (CONJ 138, X 4), both (CONJ 58, DET 57, ADV 12), either (CONJ 35, ADV 12, DET 9), yet (ADV 50, CONJ 22), plus (CONJ 20, NOUN 3, ADV 1), not (PART 1942, ADV 188, CONJ 17), neither (CONJ 9, DET 2)

The 10 most frequent ambiguous types: and (CONJ 5839, X 6, DET 5, ADP 2), but (CONJ 741, ADP 10, SCONJ 2, NOUN 1, ADV 1), or (CONJ 840, ADP 2), & (CONJ 138, X 4), both (CONJ 51, DET 45, ADV 12), either (CONJ 32, ADV 12, DET 7), yet (ADV 49, CONJ 17), plus (CONJ 12, NOUN 3, ADV 1), not (PART 956, ADV 158, CONJ 15), neither (CONJ 5, DET 1)

Morphology

The form / lemma ratio of CONJ is 1.000000 (the average of all parts of speech is 1.173735).

The 1st highest number of forms (1) was observed with the lemma “&”: &.

The 2nd highest number of forms (1) was observed with the lemma “’n”: ‘n.

The 3rd highest number of forms (1) was observed with the lemma “+”: +.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 13 different relations: en-dep/cc (7993; 98% instances), en-dep/cc:preconj (98; 1% instances), en-dep/mwe (10; 0% instances), en-dep/conj (5; 0% instances), en-dep/neg (5; 0% instances), en-dep/mark (2; 0% instances), en-dep/root (2; 0% instances), en-dep/advmod (1; 0% instances), en-dep/case (1; 0% instances), en-dep/compound (1; 0% instances), en-dep/nmod (1; 0% instances), en-dep/remnant (1; 0% instances), en-dep/reparandum (1; 0% instances)

Parents of CONJ nodes belong to 18 different parts of speech: VERB (3374; 42% instances), NOUN (2393; 29% instances), ADJ (985; 12% instances), PROPN (871; 11% instances), ADV (165; 2% instances), PRON (122; 2% instances), NUM (86; 1% instances), CONJ (30; 0% instances), DET (24; 0% instances), SYM (20; 0% instances), SCONJ (12; 0% instances), X (11; 0% instances), AUX (7; 0% instances), PART (7; 0% instances), INTJ (6; 0% instances), ADP (4; 0% instances), PUNCT (2; 0% instances), ROOT (2; 0% instances)

8077 (99%) CONJ nodes are leaves.

12 (0%) CONJ nodes have one child.

29 (0%) CONJ nodes have two children.

3 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 4.

Children of CONJ nodes are attached using 10 different relations: en-dep/punct (35; 44% instances), en-dep/cc (29; 36% instances), en-dep/conj (5; 6% instances), en-dep/nmod (4; 5% instances), en-dep/advmod (2; 3% instances), en-dep/advcl (1; 1% instances), en-dep/case (1; 1% instances), en-dep/cc:preconj (1; 1% instances), en-dep/discourse (1; 1% instances), en-dep/neg (1; 1% instances)

Children of CONJ nodes belong to 10 different parts of speech: PUNCT (38; 48% instances), CONJ (30; 38% instances), NOUN (3; 4% instances), ADV (2; 3% instances), SYM (2; 3% instances), ADP (1; 1% instances), INTJ (1; 1% instances), PART (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)


Treebank Statistics (UD_English-ESL)

There are 1 CONJ lemmas (6%), 1 CONJ types (6%) and 3198 CONJ tokens (3%). Out of 17 observed tags, the rank of CONJ is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent CONJ lemmas: _

The 10 most frequent CONJ types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 15635, VERB 15080, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

The 10 most frequent ambiguous types: _ (NOUN 15635, VERB 15080, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

Morphology

The form / lemma ratio of CONJ is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 5 different relations: en-dep/cc (3172; 99% instances), en-dep/cc:preconj (17; 1% instances), en-dep/conj (6; 0% instances), en-dep/advmod (2; 0% instances), en-dep/mwe (1; 0% instances)

Parents of CONJ nodes belong to 12 different parts of speech: VERB (1577; 49% instances), NOUN (905; 28% instances), ADJ (469; 15% instances), PROPN (101; 3% instances), PRON (53; 2% instances), ADV (44; 1% instances), NUM (30; 1% instances), ADP (7; 0% instances), DET (4; 0% instances), SYM (3; 0% instances), X (3; 0% instances), INTJ (2; 0% instances)

3194 (100%) CONJ nodes are leaves.

4 (0%) CONJ nodes have one child.

The highest child degree of a CONJ node is 1.

Children of CONJ nodes are attached using 3 different relations: en-dep/punct (2; 50% instances), en-dep/conj (1; 25% instances), en-dep/dobj (1; 25% instances)

Children of CONJ nodes belong to 2 different parts of speech: NOUN (2; 50% instances), PUNCT (2; 50% instances)


Treebank Statistics (UD_English-LinES)

There are 1 CONJ lemmas (6%), 14 CONJ types (0%) and 2535 CONJ tokens (3%). Out of 17 observed tags, the rank of CONJ is: 5 in number of lemmas, 15 in number of types and 11 in number of tokens.

The 10 most frequent CONJ lemmas: _

The 10 most frequent CONJ types: and, but, or, both, nor, as, either, neither, for, only

The 10 most frequent ambiguous lemmas: _ (NOUN 14939, VERB 11076, PUNCT 10025, ADP 8281, DET 7865, PRON 7793, ADJ 5305, ADV 4610, AUX 3168, PROPN 2792, CONJ 2535, PART 2131, SCONJ 1512, NUM 581, INTJ 159, X 43, SYM 6)

The 10 most frequent ambiguous types: and (CONJ 1841, ADV 6), but (CONJ 202, ADP 15), both (CONJ 13, PRON 6, DET 3), as (ADP 219, SCONJ 176, ADV 73, CONJ 8), either (ADV 8, CONJ 3, PRON 2, DET 2), neither (CONJ 5, ADV 2, PRON 1), for (ADP 513, SCONJ 17, ADV 6, CONJ 1), only (ADV 79, ADJ 16, CONJ 3), except (ADP 5, CONJ 1), let (VERB 31, CONJ 1)

Morphology

The form / lemma ratio of CONJ is 14.000000 (the average of all parts of speech is 597.705882).

The 1st highest number of forms (14) was observed with the lemma “_”: &, and, as, both, but, either, except, for, let, neither, nor, only, or, rather.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 6 different relations: en-dep/cc (2505; 99% instances), en-dep/mwe (12; 0% instances), en-dep/advmod (8; 0% instances), en-dep/mark (8; 0% instances), en-dep/amod (1; 0% instances), en-dep/root (1; 0% instances)

Parents of CONJ nodes belong to 14 different parts of speech: VERB (1227; 48% instances), NOUN (751; 30% instances), ADJ (256; 10% instances), PROPN (121; 5% instances), ADV (68; 3% instances), PRON (43; 2% instances), NUM (31; 1% instances), AUX (19; 1% instances), ADP (9; 0% instances), DET (3; 0% instances), INTJ (2; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances), ROOT (1; 0% instances)

2499 (99%) CONJ nodes are leaves.

16 (1%) CONJ nodes have one child.

20 (1%) CONJ nodes have two children.

The highest child degree of a CONJ node is 2.

Children of CONJ nodes are attached using 6 different relations: en-dep/punct (31; 55% instances), en-dep/mwe (20; 36% instances), en-dep/nmod (2; 4% instances), en-dep/conj (1; 2% instances), en-dep/dobj (1; 2% instances), en-dep/neg (1; 2% instances)

Children of CONJ nodes belong to 6 different parts of speech: PUNCT (31; 55% instances), ADP (10; 18% instances), ADV (10; 18% instances), NOUN (3; 5% instances), PART (1; 2% instances), VERB (1; 2% instances)


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]