home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Penn: POS Tags: CCONJ

There are 28 CCONJ lemmas (0%), 35 CCONJ types (0%) and 5883 CCONJ tokens (3%). Out of 15 observed tags, the rank of CCONJ is: 9 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent CCONJ lemmas: ve, da, de, ama, ancak, ile, veya, hem, ya, ila

The 10 most frequent CCONJ types: ve, da, de, ama, ancak, ile, veya, hem, ya, ila

The 10 most frequent ambiguous lemmas: de (VERB 640, CCONJ 636, NOUN 16, ADJ 9, ADV 6, PROPN 5), ancak (CCONJ 352, ADV 274), ile (CCONJ 342, ADP 163), hem (CCONJ 94, ADV 18), ya (CCONJ 84, INTJ 2), ne (PRON 85, ADV 49, CCONJ 38, ADJ 29, VERB 28), hatta (ADV 16, CCONJ 15), gerek (VERB 79, NOUN 68, ADJ 48, CCONJ 4), oysa (CCONJ 3, ADV 2)

The 10 most frequent ambiguous types: da (CCONJ 642, X 2, ADV 1), de (CCONJ 635, X 6, NOUN 2, ADV 1, VERB 1), ancak (ADV 82, CCONJ 64), ile (CCONJ 341, ADP 163, NOUN 2), hem (CCONJ 77, ADV 15), ya (CCONJ 76, INTJ 2), ne (PRON 51, ADV 38, CCONJ 28, ADJ 27), hatta (ADV 14, CCONJ 8), yoksa (CCONJ 8, VERB 2), gerek (CCONJ 4, NOUN 4)

Morphology

The form / lemma ratio of CCONJ is 1.250000 (the average of all parts of speech is 2.012465).

The 1st highest number of forms (4) was observed with the lemma “ve”: ,, eğer, ve, vesaire.

The 2nd highest number of forms (3) was observed with the lemma “da”: da, de, için.

The 3rd highest number of forms (3) was observed with the lemma “de”: da, de, karşılık.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 18 different relations: cc (3913; 67% instances), case (667; 11% instances), advmod (619; 11% instances), discourse (462; 8% instances), mark (118; 2% instances), fixed (43; 1% instances), advcl (12; 0% instances), flat (11; 0% instances), compound (8; 0% instances), conj (7; 0% instances), amod (4; 0% instances), ccomp (4; 0% instances), nmod (4; 0% instances), nsubj (3; 0% instances), root (3; 0% instances), obj (2; 0% instances), obl (2; 0% instances), list (1; 0% instances)

Parents of CCONJ nodes belong to 14 different parts of speech: NOUN (2550; 43% instances), VERB (1700; 29% instances), PROPN (567; 10% instances), ADJ (492; 8% instances), ADV (302; 5% instances), NUM (138; 2% instances), PRON (75; 1% instances), DET (29; 0% instances), X (11; 0% instances), AUX (5; 0% instances), ADP (4; 0% instances), CCONJ (4; 0% instances), INTJ (3; 0% instances), (3; 0% instances)

5841 (99%) CCONJ nodes are leaves.

32 (1%) CCONJ nodes have one child.

9 (0%) CCONJ nodes have two children.

1 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.

Children of CCONJ nodes are attached using 12 different relations: flat (10; 19% instances), punct (10; 19% instances), compound (9; 17% instances), fixed (8; 15% instances), nsubj (6; 11% instances), nmod (3; 6% instances), advcl (2; 4% instances), case (2; 4% instances), appos (1; 2% instances), cc (1; 2% instances), csubj (1; 2% instances), obj (1; 2% instances)

Children of CCONJ nodes belong to 10 different parts of speech: NOUN (15; 28% instances), PUNCT (10; 19% instances), ADV (8; 15% instances), PROPN (7; 13% instances), CCONJ (4; 7% instances), SCONJ (3; 6% instances), VERB (3; 6% instances), INTJ (2; 4% instances), ADP (1; 2% instances), PRON (1; 2% instances)