home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: CCONJ

There are 27 CCONJ lemmas (0%), 37 CCONJ types (0%) and 8657 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 14 in number of lemmas, 14 in number of types and 11 in number of tokens.

The 10 most frequent CCONJ lemmas: en, maar, of, want, tot, ofwel, doch, noch, respectievelijk, à

The 10 most frequent CCONJ types: en, maar, of, want, tot, ofwel, doch, noch, én, respectievelijk

The 10 most frequent ambiguous lemmas: en (CCONJ 7250, PROPN 56, X 1), maar (CCONJ 720, ADV 104), of (CCONJ 472, X 74, SCONJ 34, PROPN 22), want (CCONJ 63, X 2), tot (ADP 1037, CCONJ 29), respectievelijk (CCONJ 15, ADJ 6), à (X 10, CCONJ 9, PROPN 2, ADP 1), enzovoorts (CCONJ 8, ADV 1, X 1), & (PROPN 25, SYM 8, X 4, CCONJ 2), dus (ADV 111, CCONJ 1)

The 10 most frequent ambiguous types: en (CCONJ 7160, PROPN 56, DET 1, NUM 1, X 1), maar (CCONJ 616, ADV 102), of (CCONJ 462, X 74, SCONJ 31, PROPN 22), want (CCONJ 59, X 2), tot (ADP 987, CCONJ 29), respectievelijk (CCONJ 13, ADJ 3), à (X 10, CCONJ 8, PROPN 2, ADP 1), enz. (CCONJ 6, X 1), & (PROPN 25, SYM 8, X 4, CCONJ 2), dus (ADV 100, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.370370 (the average of all parts of speech is 1.223065).

The 1st highest number of forms (5) was observed with the lemma “en”: eb, een, en, èn, én.

The 2nd highest number of forms (2) was observed with the lemma “enzovoorts”: enz, enz..

The 3rd highest number of forms (2) was observed with the lemma “etcetera”: etc., etcetera.

CCONJ occurs with 1 features: ExtPos (13; 0% instances)

CCONJ occurs with 2 feature-value pairs: ExtPos=CCONJ, ExtPos=SCONJ

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (8644 tokens). Examples: en, maar, of, want, tot, ofwel, doch, noch, én, respectievelijk

Relations

CCONJ nodes are attached to their parents using 8 different relations: cc (8165; 94% instances), mark (216; 2% instances), flat (169; 2% instances), fixed (85; 1% instances), cc:preconj (19; 0% instances), amod (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Parents of CCONJ nodes belong to 13 different parts of speech: VERB (3050; 35% instances), NOUN (2933; 34% instances), PROPN (1348; 16% instances), ADJ (691; 8% instances), NUM (219; 3% instances), ADV (98; 1% instances), X (91; 1% instances), PRON (85; 1% instances), ADP (61; 1% instances), SYM (47; 1% instances), DET (28; 0% instances), AUX (3; 0% instances), CCONJ (3; 0% instances)

8609 (99%) CCONJ nodes are leaves.

44 (1%) CCONJ nodes have one child.

4 (0%) CCONJ nodes have two children.

The highest child degree of a CCONJ node is 2.

Children of CCONJ nodes are attached using 3 different relations: punct (34; 65% instances), fixed (17; 33% instances), conj (1; 2% instances)

Children of CCONJ nodes belong to 6 different parts of speech: PUNCT (34; 65% instances), ADV (8; 15% instances), ADJ (3; 6% instances), CCONJ (3; 6% instances), SYM (3; 6% instances), PRON (1; 2% instances)