Treebank Statistics: UD_Romanian-RRT: POS Tags: CCONJ
There are 12 CCONJ lemmas (0%), 17 CCONJ types (0%) and 6930 CCONJ tokens (3%).
Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 14 in number of types and 10 in number of tokens.
The 10 most frequent CCONJ lemmas: și, sau, dar, însă, ci, ori, fie, deci, căci, insă
The 10 most frequent CCONJ types: și, sau, dar, însă, ci, și-, ori, fie, deci, căci
The 10 most frequent ambiguous lemmas: dar (CCONJ 357, NOUN 6), ori (CCONJ 37, NOUN 3)
The 10 most frequent ambiguous types: și (CCONJ 5155, PRON 22), și- (PRON 163, CCONJ 37), ori (NOUN 63, CCONJ 37), fie (AUX 199, CCONJ 30, VERB 10), ce (PRON 640, DET 27, ADV 3, CCONJ 1)
- și
- și-
- ori
- fie
- ce
- PRON 640: N- ați auzit ce -a zis după ce i- ați bușit mutra .
- DET 27: Auzi dumneata ce rușine pe biata Tudorița , după atâția ani de carieră !
- ADV 3: ’ Tatăl meu chiar obișnuia să joace , până ce și- a rupt șoldul mai devreme anul acesta . ‘
- CCONJ 1: Eu nu sunt înșălătoare , precum tu mă ocărăști , ce sunt dreaptă și toate pe dreptate fac .
Morphology
The form / lemma ratio of CCONJ is 1.416667 (the average of all parts of speech is 1.814866).
The 1st highest number of forms (4) was observed with the lemma “și”: Ș-, ș., și, și-.
The 2nd highest number of forms (2) was observed with the lemma “ci”: ce, ci.
The 3rd highest number of forms (2) was observed with the lemma “dar”: da’, dar.
CCONJ occurs with 4 features: Polarity (6929; 100% instances), ExtPos (9; 0% instances), Variant (3; 0% instances), Abbr (1; 0% instances)
CCONJ occurs with 5 feature-value pairs: Abbr=Yes, ExtPos=ADV, ExtPos=CCONJ, Polarity=Pos, Variant=Short
CCONJ occurs with 5 feature combinations.
The most frequent feature combination is Polarity=Pos (6917 tokens).
Examples: și, sau, dar, însă, ci, și-, ori, fie, deci, căci
Relations
CCONJ nodes are attached to their parents using 14 different relations: cc (6220; 90% instances), advmod (391; 6% instances), fixed (187; 3% instances), cc:preconj (67; 1% instances), conj (29; 0% instances), mark (13; 0% instances), compound (11; 0% instances), discourse (3; 0% instances), root (3; 0% instances), iobj (2; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), flat (1; 0% instances), obl (1; 0% instances)
Parents of CCONJ nodes belong to 16 different parts of speech: NOUN (2934; 42% instances), VERB (2407; 35% instances), ADJ (627; 9% instances), ADV (274; 4% instances), PROPN (218; 3% instances), NUM (178; 3% instances), PRON (147; 2% instances), ADP (62; 1% instances), DET (23; 0% instances), CCONJ (19; 0% instances), SCONJ (14; 0% instances), PART (11; 0% instances), AUX (8; 0% instances), (3; 0% instances), X (3; 0% instances), INTJ (2; 0% instances)
6886 (99%) CCONJ nodes are leaves.
13 (0%) CCONJ nodes have one child.
28 (0%) CCONJ nodes have two children.
3 (0%) CCONJ nodes have three or more children.
The highest child degree of a CCONJ node is 4.
Children of CCONJ nodes are attached using 9 different relations: punct (42; 53% instances), conj (18; 23% instances), fixed (11; 14% instances), discourse (3; 4% instances), advmod (1; 1% instances), amod (1; 1% instances), case (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)
Children of CCONJ nodes belong to 7 different parts of speech: PUNCT (42; 53% instances), CCONJ (19; 24% instances), ADV (11; 14% instances), PRON (3; 4% instances), VERB (2; 3% instances), ADP (1; 1% instances), NOUN (1; 1% instances)