Treebank Statistics: UD_Javanese-CSUI: POS Tags: CCONJ
There are 9 CCONJ lemmas (0%), 9 CCONJ types (0%) and 306 CCONJ tokens (2%).
Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 13 in number of tokens.
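The lemma, type and token counts and the per-tag ranks above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the sample sentence is invented, the function names (read_words, tag_counts, rank) are hypothetical, and it assumes word forms are compared exactly as written, which may differ from how the official statistics treat case.

```python
from collections import defaultdict

# Toy CoNLL-U fragment, invented for illustration (not taken from the treebank).
SAMPLE = (
    "# text = aku lan kowe\n"
    "1\taku\taku\tPRON\t_\t_\t0\troot\t_\t_\n"
    "2\tlan\tlan\tCCONJ\t_\tPolite=Infm\t3\tcc\t_\t_\n"
    "3\tkowe\tkowe\tPRON\t_\t_\t1\tconj\t_\t_\n"
)


def read_words(conllu_text):
    """Yield (form, lemma, upos) for every regular word line."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text):
    """Map each UPOS tag to (distinct lemmas, distinct types, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_words(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared as written
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(counts, tag, index):
    """1-based rank of `tag` by counts[tag][index]; ties keep first-seen order."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


counts = tag_counts(SAMPLE)
print(counts["CCONJ"], rank(counts, "CCONJ", 2))
```

Here index 0, 1 and 2 select the lemma, type and token count respectively. How the official statistics break ties between tags is not stated on this page, so the tie handling above is only one possible choice.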
The most frequent CCONJ lemmas (all 9): lan, utawa, nanging, sarta, karo, utawi, tur, saha, ugi
The most frequent CCONJ types (all 9): lan, utawa, nanging, sarta, karo, utawi, tur, saha, ugi
The most frequent ambiguous lemmas: nanging (CCONJ 33, ADV 11), sarta (CCONJ 10, NOUN 1), karo (ADP 40, SCONJ 11, CCONJ 5), ugi (ADV 7, CCONJ 1)
The most frequent ambiguous types: karo (ADP 40, SCONJ 9, CCONJ 5), ugi (ADV 7, CCONJ 1)
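An ambiguous lemma, in the sense used above, is one that occurs with more than one tag. A minimal sketch of that check, assuming the input is a sequence of (lemma, tag) pairs; the function name ambiguous_lemmas is hypothetical, and the count used for "lan" is a placeholder because this page gives only its tag:

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(pairs):
    """Return {lemma: Counter of tags} for lemmas seen with more than one tag."""
    tags_by_lemma = defaultdict(Counter)
    for lemma, upos in pairs:
        tags_by_lemma[lemma][upos] += 1
    return {lemma: tags for lemma, tags in tags_by_lemma.items() if len(tags) > 1}


# Counts for "nanging" and "ugi" are the ones reported on this page;
# the count for "lan" is a placeholder chosen only for the example.
pairs = (
    [("nanging", "CCONJ")] * 33 + [("nanging", "ADV")] * 11
    + [("ugi", "ADV")] * 7 + [("ugi", "CCONJ")] * 1
    + [("lan", "CCONJ")] * 3
)
result = ambiguous_lemmas(pairs)
for lemma, tags in result.items():
    print(lemma, tags.most_common())
```

The same function applies to ambiguous types if the pairs hold word forms instead of lemmas.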
Morphology
The form / lemma ratio of CCONJ is 1.000000 (the average of all parts of speech is 1.145928).
The highest number of forms per lemma is 1, so every CCONJ lemma has a single form; the first three listed are “karo” (karo), “lan” (lan) and “nanging” (nanging).
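The form / lemma ratio can be illustrated with a short sketch. It assumes the ratio is the number of distinct (lemma, form) combinations divided by the number of lemmas, which is consistent with the value 1.000000 reported here but is not confirmed by this page. It also assumes each CCONJ form is identical to its lemma, as the matching lemma and type lists suggest. The name form_lemma_ratio is hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(form_lemma_pairs):
    """Return (ratio, forms_by_lemma) for an iterable of (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in form_lemma_pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), forms_by_lemma


# The nine CCONJ lemmas listed on this page; assumption: form == lemma for each.
lemmas = ["lan", "utawa", "nanging", "sarta", "karo", "utawi", "tur", "saha", "ugi"]
ratio, forms = form_lemma_ratio((lemma, lemma) for lemma in lemmas)
print(f"{ratio:.6f}")  # prints 1.000000
print(max(len(f) for f in forms.values()))  # highest number of forms per lemma: 1
```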
CCONJ occurs with 1 feature: Polite (303; 99% of instances)
CCONJ occurs with 2 feature-value pairs: Polite=Form, Polite=Infm
CCONJ occurs with 3 feature combinations.
The most frequent feature combination is Polite=Infm (298 tokens).
Examples: lan, utawa, nanging, sarta, karo
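The feature counts above can be derived from the FEATS column of each CCONJ token. The sketch below is an illustration under stated assumptions: feature_stats is a hypothetical helper, and the split of the 306 tokens (298 Polite=Infm, 5 Polite=Form, 3 without features) is inferred from the totals on this page by assuming each token carries at most one Polite value; only the 298 and 303 figures are stated directly.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count features, feature-value pairs and full combinations from FEATS strings."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        if feats == "_":  # "_" marks a token without features
            continue
        for pair in feats.split("|"):
            name = pair.partition("=")[0]
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Inferred distribution (see the assumptions above), not reported as such on this page.
feats_values = ["Polite=Infm"] * 298 + ["Polite=Form"] * 5 + ["_"] * 3
features, pairs, combos = feature_stats(feats_values)
print(features["Polite"], len(pairs), len(combos))
print(combos.most_common(1))
```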
Relations
CCONJ nodes are attached to their parents using 1 relation: cc (306; 100% of instances)
Parents of CCONJ nodes belong to 9 different parts of speech: NOUN (106; 35% of instances), VERB (105; 34% of instances), ADJ (39; 13% of instances), PROPN (29; 9% of instances), X (12; 4% of instances), NUM (8; 3% of instances), PRON (4; 1% of instances), ADV (2; 1% of instances), SYM (1; 0% of instances)
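As an illustration, the parent part-of-speech distribution can be computed by looking up the tag of each CCONJ node's head. In this sketch the sentence representation (a list of (id, upos, head) tuples), the toy sentence and the function names are assumptions made for the example. The percentage check uses the counts reported above and assumes percentages are rounded to the nearest integer.

```python
from collections import Counter


def parent_upos_counts(sentence, tag):
    """Count the UPOS of the parent of every `tag` node in one sentence."""
    upos_by_id = {tok_id: upos for tok_id, upos, _ in sentence}
    return Counter(
        upos_by_id.get(head, "ROOT")  # head 0 has no token, report it as ROOT
        for _, upos, head in sentence
        if upos == tag
    )


def percentages(counts):
    """Rounded share of each key in the total."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


# Toy sentence (invented): token 2 is a CCONJ attached to token 3, a PRON.
toy = [(1, "PRON", 0), (2, "CCONJ", 3), (3, "PRON", 1)]
print(parent_upos_counts(toy, "CCONJ"))

# Parent counts as reported on this page.
reported = {"NOUN": 106, "VERB": 105, "ADJ": 39, "PROPN": 29, "X": 12,
            "NUM": 8, "PRON": 4, "ADV": 2, "SYM": 1}
print(sum(reported.values()), percentages(reported))
```

The reported counts sum to 306, the number of CCONJ tokens, and the rounded shares match the percentages listed above.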
305 (100%) CCONJ nodes are leaves.
1 (0%) CCONJ node has one child.
The highest child degree of a CCONJ node is 1.
Children of CCONJ nodes are attached using 1 relation: punct (1; 100% of instances)
Children of CCONJ nodes belong to 1 part of speech: PUNCT (1; 100% of instances)
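The leaf and child-degree figures can be obtained by counting how many tokens name each CCONJ node as their head. A minimal sketch, assuming the same (id, upos, head) tuple representation; both toy sentences and the name child_degrees are invented for illustration:

```python
from collections import Counter


def child_degrees(sentence, tag):
    """Return the number of children of each `tag` node in one sentence."""
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[tok_id] for tok_id, upos, _ in sentence if upos == tag]


# Toy sentences (invented): in the first the CCONJ is a leaf,
# in the second a PUNCT token is attached to it.
leaf_case = [(1, "PRON", 0), (2, "CCONJ", 3), (3, "PRON", 1)]
punct_case = [(1, "PRON", 0), (2, "CCONJ", 3), (3, "PRON", 1), (4, "PUNCT", 2)]

degrees = child_degrees(leaf_case, "CCONJ") + child_degrees(punct_case, "CCONJ")
print(degrees, max(degrees), degrees.count(0))
```

Nodes with degree 0 are the leaves, and the maximum of the list is the highest child degree.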