
CONJ: coordinating conjunction

A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.

For subordinating conjunctions, see SCONJ.

In correlative (paired) coordinating conjunctions such as sekä - että “both - and” and joko - tai “either - or”, both words are annotated CONJ (see also cc:preconj).
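As a rough illustration of the paired annotation, the sketch below encodes an invented phrase, sekä kissa että koira ("both a cat and a dog"), as simple tuples. The phrase is not taken from the treebank, and the head attachments are an assumption based on UD v1 conventions; the only point carried over from the text above is that both halves of the pair receive the CONJ tag.

```python
# Invented example phrase, not from the treebank:
# "sekä kissa että koira" ("both a cat and a dog").
# Each token is (id, form, upos, head, deprel).
# Head attachments are an assumption (UD v1 style: cc attaches to the first conjunct).
tokens = [
    (1, "sekä", "CONJ", 2, "cc:preconj"),
    (2, "kissa", "NOUN", 0, "root"),
    (3, "että", "CONJ", 2, "cc"),
    (4, "koira", "NOUN", 2, "conj"),
]

# Both members of the correlative pair carry the CONJ tag.
conj_forms = [form for _, form, upos, _, _ in tokens if upos == "CONJ"]
print(conj_forms)  # ['sekä', 'että']
```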

Examples


Treebank Statistics (UD_Finnish)

There are 35 CONJ lemmas (0%), 41 CONJ types (0%) and 7355 CONJ tokens (4%). Out of 15 observed tags, the rank of CONJ is: 12 in number of lemmas, 13 in number of types and 8 in number of tokens.
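The lemma, type and token counts above could be reproduced from a treebank file along the following lines. This is a minimal sketch, assuming the standard 10-column CoNLL-U layout and treating types as case-sensitive word forms; the function name `tag_counts` and the two-sentence sample are hypothetical and are not part of the tooling that generated this page.

```python
# Hypothetical two-sentence sample in CoNLL-U layout
# (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = (
    "1\tkissa\tkissa\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\tja\tja\tCONJ\t_\t_\t1\tcc\t_\t_\n"
    "3\tkoira\tkoira\tNOUN\t_\t_\t1\tconj\t_\t_\n"
    "\n"
    "1\tMut\tmutta\tCONJ\t_\tStyle=Coll\t2\tcc\t_\t_\n"
    "2\ttule\ttulla\tVERB\t_\t_\t0\troot\t_\t_\n"
)


def tag_counts(conllu: str, upos: str) -> tuple:
    """Return (number of lemmas, number of types, number of tokens) for one tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == upos:
            types.add(cols[1])   # word form, case-sensitive (an assumption)
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(types), tokens


print(tag_counts(SAMPLE, "CONJ"))  # (2, 2, 2)
```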

The 10 most frequent CONJ lemmas: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

The 10 most frequent CONJ types: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

The 10 most frequent ambiguous lemmas: ja (CONJ 5240, ADV 8), tai (CONJ 646, ADV 1), mutta (CONJ 578, ADV 2, SCONJ 1), eli (CONJ 123, ADV 1), vaan (CONJ 108, ADV 37), että (SCONJ 1297, CONJ 58, INTJ 1), vai (CONJ 52, ADV 2), kuin (SCONJ 428, ADV 49, CONJ 48), niin (ADV 357, CONJ 34, SCONJ 4, INTJ 2), mitä (ADV 14, CONJ 7, SCONJ 1)

The 10 most frequent ambiguous types: ja (CONJ 5154, ADV 8), mutta (CONJ 501, ADV 1), eli (CONJ 113, VERB 6), vaan (CONJ 103, ADV 31), että (SCONJ 1239, CONJ 58, INTJ 1), vai (CONJ 44, ADV 2), kuin (SCONJ 426, ADV 47, CONJ 46), niin (ADV 290, CONJ 34, INTJ 2, SCONJ 2), mut (CONJ 2, PRON 1), mitä (PRON 177, ADV 12, CONJ 4, SCONJ 1)

Morphology

The form / lemma ratio of CONJ is 1.171429 (the average of all parts of speech is 2.036755).
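The reported ratio is consistent with dividing the number of CONJ types by the number of CONJ lemmas given in the statistics (41 and 35 for UD_Finnish). The sketch below only checks that arithmetic; the function name is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# UD_Finnish: 41 CONJ types, 35 CONJ lemmas
print(f"{form_lemma_ratio(41, 35):.6f}")  # 1.171429
```

The same calculation with the UD_Finnish-FTB counts (25 types, 22 lemmas) gives 1.136364.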

The 1st highest number of forms (3) was observed with the lemma “mutta”: mut, mutt, mutta.

The 2nd highest number of forms (2) was observed with the lemma “joko”: joko, jokö.

The 3rd highest number of forms (2) was observed with the lemma “kuin”: ku, kuin.

CONJ occurs with 3 features: fi-feat/Style (14; 0% instances), fi-feat/Typo (2; 0% instances), fi-feat/Clitic (1; 0% instances)

CONJ occurs with 3 feature-value pairs: Clitic=Ko, Style=Coll, Typo=Yes

CONJ occurs with 4 feature combinations. The most frequent feature combination is _ (7338 tokens). Examples: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

Relations

CONJ nodes are attached to their parents using 7 different relations: fi-dep/cc (7192; 98% instances), fi-dep/cc:preconj (128; 2% instances), fi-dep/advmod (12; 0% instances), fi-dep/mark (11; 0% instances), fi-dep/mwe (8; 0% instances), fi-dep/root (3; 0% instances), fi-dep/compound:nn (1; 0% instances)

Parents of CONJ nodes belong to 13 different parts of speech: NOUN (2993; 41% instances), VERB (2787; 38% instances), ADJ (619; 8% instances), PROPN (601; 8% instances), ADV (149; 2% instances), NUM (88; 1% instances), PRON (79; 1% instances), SYM (13; 0% instances), X (12; 0% instances), ADP (5; 0% instances), CONJ (4; 0% instances), ROOT (3; 0% instances), AUX (2; 0% instances)

7324 (100%) CONJ nodes are leaves.

26 (0%) CONJ nodes have one child.

3 (0%) CONJ nodes have two children.

2 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 5.

Children of CONJ nodes are attached using 5 different relations: fi-dep/punct (19; 48% instances), fi-dep/mwe (16; 40% instances), fi-dep/nmod (3; 8% instances), fi-dep/compound:nn (1; 3% instances), fi-dep/nsubj (1; 3% instances)
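The percentages in these lists appear to be whole numbers with halves rounded up; this is inferred from the figures and is not stated on the page. Under that assumption, the sketch below reproduces the shares for the child relations and shows why they can add up to slightly more than 100%. The helper name `percent_half_up` is hypothetical.

```python
def percent_half_up(count: int, total: int) -> int:
    """Whole-number percentage, rounding halves up, using integer arithmetic only."""
    return (200 * count + total) // (2 * total)


# Counts of relations attaching children to CONJ nodes (UD_Finnish)
child_relations = {"punct": 19, "mwe": 16, "nmod": 3, "compound:nn": 1, "nsubj": 1}
total = sum(child_relations.values())  # 40 children in all

shares = {rel: percent_half_up(n, total) for rel, n in child_relations.items()}
print(shares)
# {'punct': 48, 'mwe': 40, 'nmod': 8, 'compound:nn': 3, 'nsubj': 3}
print(sum(shares.values()))  # 102, because each share is rounded separately
```

Python's built-in `round` rounds halves to the nearest even integer, so `round(2.5)` gives 2 rather than the 3 shown above; that is why the sketch uses explicit integer arithmetic.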

Children of CONJ nodes belong to 5 different parts of speech: PUNCT (19; 48% instances), ADV (13; 33% instances), CONJ (4; 10% instances), NOUN (3; 8% instances), PROPN (1; 3% instances)


Treebank Statistics (UD_Finnish-FTB)

There are 22 CONJ lemmas (0%), 25 CONJ types (0%) and 4824 CONJ tokens (3%). Out of 14 observed tags, the rank of CONJ is: 12 in number of lemmas, 13 in number of types and 8 in number of tokens.

The 10 most frequent CONJ lemmas: ja, mutta, tai, vai, vaan, sekä, kuin, eli, että, joko

The 10 most frequent CONJ types: ja, mutta, tai, vai, vaan, sekä, kuin, eli, että, mut

The 10 most frequent ambiguous lemmas: ja (CONJ 3231, ADV 6), mutta (CONJ 605, SCONJ 1), vaan (CONJ 126, ADV 87), kuin (SCONJ 614, CONJ 59), eli (CONJ 55, ADV 12), että (SCONJ 1779, CONJ 38, ADV 3), elikkä (CONJ 4, ADV 4), joskaan (CONJ 4, SCONJ 1), vaikka (SCONJ 207, ADV 72, CONJ 4), joskin (SCONJ 5, CONJ 2)

The 10 most frequent ambiguous types: ja (CONJ 3085, ADV 2), vaan (CONJ 126, ADV 74), kuin (SCONJ 584, CONJ 59), eli (CONJ 55, VERB 5), että (SCONJ 1410, CONJ 38, VERB 2), mut (CONJ 21, PRON 3), elikkä (CONJ 4, ADV 1), joskaan (CONJ 4, SCONJ 1), vaikka (SCONJ 137, ADV 64, CONJ 3), joskin (SCONJ 5, CONJ 2)

Morphology

The form / lemma ratio of CONJ is 1.136364 (the average of all parts of speech is 2.041153).

The 1st highest number of forms (3) was observed with the lemma “mutta”: mut, mutt, mutta.

The 2nd highest number of forms (2) was observed with the lemma “vaikka”: vaikk, vaikka.

The 3rd highest number of forms (1) was observed with the lemma “eli”: eli.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 5 different relations: fi-dep/cc (4756; 99% instances), fi-dep/advmod (23; 0% instances), fi-dep/dep (17; 0% instances), fi-dep/mark:comparator (15; 0% instances), fi-dep/conj (13; 0% instances)

Parents of CONJ nodes belong to 13 different parts of speech: VERB (1994; 41% instances), NOUN (1634; 34% instances), ADJ (522; 11% instances), PROPN (331; 7% instances), ADV (156; 3% instances), PRON (81; 2% instances), NUM (59; 1% instances), CONJ (12; 0% instances), ADP (11; 0% instances), SCONJ (10; 0% instances), DET (7; 0% instances), X (5; 0% instances), INTJ (2; 0% instances)

4704 (98%) CONJ nodes are leaves.

118 (2%) CONJ nodes have one child.

1 (0%) CONJ node has two children.

1 (0%) CONJ node has three or more children.

The highest child degree of a CONJ node is 3.

Children of CONJ nodes are attached using 3 different relations: fi-dep/punct (88; 72% instances), fi-dep/dep (23; 19% instances), fi-dep/conj (12; 10% instances)

Children of CONJ nodes belong to 5 different parts of speech: PUNCT (88; 72% instances), ADV (15; 12% instances), CONJ (12; 10% instances), NOUN (7; 6% instances), NUM (1; 1% instances)


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]