home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: CCONJ

There are 17 CCONJ lemmas (0%), 18 CCONJ types (0%) and 145 CCONJ tokens (1%). Out of 16 observed tags, the rank of CCONJ is: 11 in number of lemmas, 11 in number of types and 15 in number of tokens.

The 10 most frequent CCONJ lemmas: しかし, また, および, さらに, そして, 一方, それ, あるいは, だ, 及び

The 10 most frequent CCONJ types: しかし, また, および, さらに, そして, 一方, それ, あるいは, 及び, かつ

The 10 most frequent ambiguous lemmas: また (CCONJ 35, ADV 5), さらに (CCONJ 13, ADV 10), 一方 (CCONJ 10, NOUN 3), それ (PRON 68, CCONJ 4), だ (AUX 733, CCONJ 2), ところ (NOUN 10, CCONJ 1), に (ADP 982, SCONJ 27, CCONJ 1), よる (VERB 76, CCONJ 1), 従う (CCONJ 1, VERB 1)

The 10 most frequent ambiguous types: また (CCONJ 35, ADV 5), さらに (CCONJ 13, ADV 10), 一方 (CCONJ 10, NOUN 3), それ (PRON 68, CCONJ 4), 及び (CCONJ 2, VERB 1), だ (AUX 73, CCONJ 1), です (AUX 19, CCONJ 1), ところ (NOUN 10, CCONJ 1), に (ADP 982, AUX 152, SCONJ 27, CCONJ 1), よっ (VERB 17, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.058824 (the average of all parts of speech is 1.050009).

The 1st highest number of forms (2) was observed with the lemma “だ”: だ, です.

The 2nd highest number of forms (1) was observed with the lemma “あるいは”: あるいは.

The 3rd highest number of forms (1) was observed with the lemma “および”: および.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 1 different relations: cc (145; 100% instances)

Parents of CCONJ nodes belong to 5 different parts of speech: VERB (87; 60% instances), NOUN (47; 32% instances), ADJ (5; 3% instances), PROPN (5; 3% instances), ADV (1; 1% instances)

54 (37%) CCONJ nodes are leaves.

81 (56%) CCONJ nodes have one child.

8 (6%) CCONJ nodes have two children.

2 (1%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.

Children of CCONJ nodes are attached using 2 different relations: punct (70; 67% instances), fixed (34; 33% instances)

Children of CCONJ nodes belong to 4 different parts of speech: PUNCT (70; 67% instances), ADP (27; 26% instances), SCONJ (6; 6% instances), VERB (1; 1% instances)