home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: CCONJ

There are 32 CCONJ lemmas (0%), 43 CCONJ types (0%) and 8762 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 13 in number of types and 11 in number of tokens.

The 10 most frequent CCONJ lemmas: en, maar, of, zowel, want, tot, ofwel, doch, noch, respectievelijk

The 10 most frequent CCONJ types: en, maar, of, zowel, want, tot, ofwel, doch, noch, én

The 10 most frequent ambiguous lemmas: en (CCONJ 7250, PROPN 56, X 1), maar (CCONJ 720, ADV 104), of (CCONJ 472, X 74, SCONJ 34, PROPN 22), zowel (CCONJ 78, ADV 1), want (CCONJ 63, X 2), tot (ADP 1037, CCONJ 29), respectievelijk (CCONJ 15, ADJ 6), enzovoorts (CCONJ 9, X 1), niet (ADV 960, CCONJ 9), à (X 10, CCONJ 9, PROPN 2, ADP 1)

The 10 most frequent ambiguous types: en (CCONJ 7160, PROPN 56, DET 1, NUM 1, X 1), maar (CCONJ 616, ADV 102), of (CCONJ 462, X 74, SCONJ 31, PROPN 22), zowel (CCONJ 66, ADV 1), want (CCONJ 59, X 2), tot (ADP 987, CCONJ 29), respectievelijk (CCONJ 13, ADJ 3), niet (ADV 945, CCONJ 8), à (X 10, CCONJ 8, PROPN 2, ADP 1), dus (ADV 94, CCONJ 7)

Morphology

The form / lemma ratio of CCONJ is 1.343750 (the average of all parts of speech is 1.223065).

The 1st highest number of forms (5) was observed with the lemma “en”: eb, een, en, èn, én.

The 2nd highest number of forms (3) was observed with the lemma “enzovoorts”: enz, enz., enzovoorts.

The 3rd highest number of forms (2) was observed with the lemma “etcetera”: etc., etcetera.

CCONJ occurs with 1 features: ExtPos (11; 0% instances)

CCONJ occurs with 2 feature-value pairs: ExtPos=CCONJ, ExtPos=SCONJ

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (8751 tokens). Examples: en, maar, of, zowel, want, tot, ofwel, doch, noch, én

Relations

CCONJ nodes are attached to their parents using 9 different relations: cc (8187; 93% instances), mark (218; 2% instances), flat (177; 2% instances), cc:preconj (100; 1% instances), fixed (76; 1% instances), advcl (1; 0% instances), amod (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Parents of CCONJ nodes belong to 13 different parts of speech: VERB (3064; 35% instances), NOUN (2976; 34% instances), PROPN (1372; 16% instances), ADJ (711; 8% instances), NUM (220; 3% instances), ADV (95; 1% instances), X (95; 1% instances), PRON (85; 1% instances), ADP (62; 1% instances), SYM (47; 1% instances), DET (29; 0% instances), AUX (3; 0% instances), CCONJ (3; 0% instances)

8712 (99%) CCONJ nodes are leaves.

45 (1%) CCONJ nodes have one child.

5 (0%) CCONJ nodes have two children.

The highest child degree of a CCONJ node is 2.

Children of CCONJ nodes are attached using 4 different relations: punct (36; 65% instances), fixed (17; 31% instances), conj (1; 2% instances), parataxis (1; 2% instances)

Children of CCONJ nodes belong to 7 different parts of speech: PUNCT (36; 65% instances), ADV (8; 15% instances), ADJ (3; 5% instances), CCONJ (3; 5% instances), SYM (3; 5% instances), NOUN (1; 2% instances), PRON (1; 2% instances)