This page pertains to UD version 2.

Treebank Statistics: UD_Basque-BDT: POS Tags: CCONJ

There are 47 CCONJ lemmas (0%), 52 CCONJ types (0%) and 6187 CCONJ tokens (5%). Out of 16 observed tags, the rank of CCONJ is: 9 in number of lemmas, 11 in number of types and 6 in number of tokens.

The 10 most frequent CCONJ lemmas: eta, ere, baina, edo, berriz, arren, ordea, gainera, beraz, bestalde

The 10 most frequent CCONJ types: eta, ere, baina, edo, berriz, arren, ordea, gainera, beraz, bestalde

The 10 most frequent ambiguous lemmas: berriz (CCONJ 100, ADV 28), arren (CCONJ 90, INTJ 1, NOUN 1), gainera (CCONJ 80, ADV 16, ADP 9), zein (DET 46, CCONJ 35, ADV 1), orduan (ADV 29, CCONJ 16), alta (CCONJ 14, NOUN 1), ezik (CCONJ 13, ADV 1), ostera (CCONJ 12, ADV 4), alegia (CCONJ 11, NOUN 4), hots (NOUN 8, CCONJ 7)

The 10 most frequent ambiguous types: baina (CCONJ 444, VERB 1), berriz (CCONJ 100, ADV 27, ADJ 1), arren (CCONJ 90, INTJ 1, NOUN 1), gainera (CCONJ 36, ADP 9, ADV 5, NOUN 2), beraz (CCONJ 29, DET 1), aldiz (CCONJ 39, NOUN 39), zein (CCONJ 35, DET 32), orduan (ADV 21, NOUN 7, CCONJ 6), alta (CCONJ 1, NOUN 1), ezik (CCONJ 13, ADV 1)

Morphology

The form / lemma ratio of CCONJ is 1.106383 (the average of all parts of speech is 2.172787).
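The reported ratio is consistent with dividing the 52 CCONJ types by the 47 CCONJ lemmas given above. The sketch below reproduces that arithmetic and shows one way such a ratio could be computed from (form, lemma) pairs. The helper name `form_lemma_ratio` is hypothetical, and whether the original statistics fold case before counting forms is not stated on this page, so the helper counts forms exactly as given.

```python
def form_lemma_ratio(pairs):
    """Return (number of distinct forms) / (number of distinct lemmas).

    `pairs` is an iterable of (form, lemma) tuples. Forms are compared
    as-is; no case folding is applied (an assumption of this sketch).
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Figures reported on this page: 52 CCONJ types and 47 CCONJ lemmas.
reported_ratio = 52 / 47
print(f"{reported_ratio:.6f}")  # 1.106383

# Small illustration using forms listed on this page:
# the lemma "baina" has the forms "baina" and "bainan".
toy_pairs = [("eta", "eta"), ("baina", "baina"), ("bainan", "baina")]
toy_ratio = form_lemma_ratio(toy_pairs)  # 3 forms / 2 lemmas = 1.5
print(toy_ratio)
```

A ratio close to 1 means that most CCONJ lemmas occur in a single form, which fits the statement that CCONJ carries no morphological features here.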

The 1st highest number of forms (3) was observed with the lemma “eta”: ea, era, eta.

The 2nd highest number of forms (2) was observed with the lemma “baina”: baina, bainan.

The 3rd highest number of forms (2) was observed with the lemma “bestela”: Bertzela, bestela.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 19 different relations: cc (4620; 75% instances), advmod (1051; 17% instances), mark (148; 2% instances), fixed (132; 2% instances), advcl (82; 1% instances), obl (42; 1% instances), conj (38; 1% instances), dep (26; 0% instances), nmod (11; 0% instances), compound (10; 0% instances), aux (6; 0% instances), discourse (6; 0% instances), punct (5; 0% instances), flat (4; 0% instances), root (2; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)
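As a sanity check, the relation counts above can be summed and converted to percentages. The sketch below copies the counts from the list and assumes that percentages are rounded to the nearest integer, which matches the figures shown; the actual rounding rule used to generate the page is not documented here.

```python
# Relation counts copied from the list above.
relation_counts = {
    "cc": 4620, "advmod": 1051, "mark": 148, "fixed": 132, "advcl": 82,
    "obl": 42, "conj": 38, "dep": 26, "nmod": 11, "compound": 10,
    "aux": 6, "discourse": 6, "punct": 5, "flat": 4, "root": 2,
    "appos": 1, "ccomp": 1, "obj": 1, "xcomp": 1,
}

# The counts should add up to the number of CCONJ tokens (6187).
total = sum(relation_counts.values())

# Share of each relation, rounded to the nearest whole percent
# (assumed rounding rule).
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

The total matches the 6187 CCONJ tokens reported at the top of the page, so every CCONJ token is covered by exactly one incoming relation.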

Parents of CCONJ nodes belong to 15 different parts of speech: VERB (3657; 59% instances), NOUN (1117; 18% instances), PROPN (492; 8% instances), AUX (300; 5% instances), ADJ (234; 4% instances), ADV (176; 3% instances), NUM (111; 2% instances), CCONJ (37; 1% instances), DET (32; 1% instances), PUNCT (15; 0% instances), PRON (7; 0% instances), ADP (5; 0% instances), no tag, i.e. the artificial root node (2; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)

5795 (94%) CCONJ nodes are leaves.

202 (3%) CCONJ nodes have one child.

145 (2%) CCONJ nodes have two children.

45 (1%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 6.
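The child-degree figures above (5795 + 202 + 145 + 45 = 6187) could be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated the page: the function name `child_degrees` is hypothetical, the input is a toy fragment invented for illustration (not taken from UD_Basque-BDT), and multiword-token ranges and empty nodes are skipped on the assumption that they do not count as tree nodes.

```python
from collections import Counter

# Toy CoNLL-U fragment, invented for illustration only.
TOY_CONLLU = "\n".join([
    "# text = a eta b",
    "1\ta\ta\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\teta\teta\tCCONJ\t_\t_\t3\tcc\t_\t_",
    "3\tb\tb\tNOUN\t_\t_\t1\tconj\t_\t_",
    "",
])


def child_degrees(conllu_text, upos="CCONJ"):
    """Map child count -> number of nodes tagged `upos` with that many children."""
    degrees = Counter()
    sentence = []  # (id, upos, head) for each token of the current sentence
    # The extra empty string flushes the last sentence if the text
    # does not end with a blank line.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if not line.strip():
            if sentence:
                # Count how many tokens point to each head id.
                children = Counter(head for _, _, head in sentence)
                for tok_id, tag, _ in sentence:
                    if tag == upos:
                        degrees[children[tok_id]] += 1
                sentence = []
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees


toy_degrees = child_degrees(TOY_CONLLU)
print(dict(toy_degrees))  # {0: 1}: the single CCONJ node is a leaf
```

Applied to the full treebank, the counts for degree 0, 1, 2 and 3 or more would correspond to the four figures above, and the largest key would be the highest child degree.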

Children of CCONJ nodes are attached using 23 different relations: nmod (161; 25% instances), nsubj (99; 15% instances), punct (89; 14% instances), obj (66; 10% instances), conj (58; 9% instances), advmod (38; 6% instances), cc (30; 5% instances), advcl (22; 3% instances), compound (20; 3% instances), ccomp (12; 2% instances), iobj (7; 1% instances), fixed (6; 1% instances), xcomp (6; 1% instances), amod (5; 1% instances), dep (5; 1% instances), det (4; 1% instances), flat (4; 1% instances), discourse (3; 0% instances), csubj (2; 0% instances), aux (1; 0% instances), cop (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances)

Children of CCONJ nodes belong to 14 different parts of speech: NOUN (178; 28% instances), PUNCT (89; 14% instances), VERB (87; 14% instances), PROPN (73; 11% instances), PART (53; 8% instances), ADV (46; 7% instances), CCONJ (37; 6% instances), NUM (33; 5% instances), ADJ (19; 3% instances), DET (19; 3% instances), X (3; 0% instances), PRON (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)