home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: CCONJ

There are 12 CCONJ lemmas (0%), 17 CCONJ types (0%) and 6930 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 14 in number of types and 10 in number of tokens.

The 10 most frequent CCONJ lemmas: și, sau, dar, însă, ci, ori, fie, deci, căci, insă

The 10 most frequent CCONJ types: și, sau, dar, însă, ci, și-, ori, fie, deci, căci

The 10 most frequent ambiguous lemmas: dar (CCONJ 357, NOUN 6), ori (CCONJ 37, NOUN 3)

The 10 most frequent ambiguous types: și (CCONJ 5155, PRON 22), și- (PRON 163, CCONJ 37), ori (NOUN 63, CCONJ 37), fie (AUX 199, CCONJ 30, VERB 10), ce (PRON 640, DET 27, ADV 3, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.416667 (the average of all parts of speech is 1.814866).

The 1st highest number of forms (4) was observed with the lemma “și”: Ș-, ș., și, și-.

The 2nd highest number of forms (2) was observed with the lemma “ci”: ce, ci.

The 3rd highest number of forms (2) was observed with the lemma “dar”: da’, dar.

CCONJ occurs with 4 features: Polarity (6929; 100% instances), ExtPos (9; 0% instances), Variant (3; 0% instances), Abbr (1; 0% instances)

CCONJ occurs with 5 feature-value pairs: Abbr=Yes, ExtPos=ADV, ExtPos=CCONJ, Polarity=Pos, Variant=Short

CCONJ occurs with 5 feature combinations. The most frequent feature combination is Polarity=Pos (6917 tokens). Examples: și, sau, dar, însă, ci, și-, ori, fie, deci, căci

Relations

CCONJ nodes are attached to their parents using 14 different relations: cc (6220; 90% instances), advmod (391; 6% instances), fixed (187; 3% instances), cc:preconj (67; 1% instances), conj (29; 0% instances), mark (13; 0% instances), compound (11; 0% instances), discourse (3; 0% instances), root (3; 0% instances), iobj (2; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), flat (1; 0% instances), obl (1; 0% instances)

Parents of CCONJ nodes belong to 16 different parts of speech: NOUN (2934; 42% instances), VERB (2407; 35% instances), ADJ (627; 9% instances), ADV (274; 4% instances), PROPN (218; 3% instances), NUM (178; 3% instances), PRON (147; 2% instances), ADP (62; 1% instances), DET (23; 0% instances), CCONJ (19; 0% instances), SCONJ (14; 0% instances), PART (11; 0% instances), AUX (8; 0% instances), (3; 0% instances), X (3; 0% instances), INTJ (2; 0% instances)

6886 (99%) CCONJ nodes are leaves.

13 (0%) CCONJ nodes have one child.

28 (0%) CCONJ nodes have two children.

3 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.

Children of CCONJ nodes are attached using 9 different relations: punct (42; 53% instances), conj (18; 23% instances), fixed (11; 14% instances), discourse (3; 4% instances), advmod (1; 1% instances), amod (1; 1% instances), case (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)

Children of CCONJ nodes belong to 7 different parts of speech: PUNCT (42; 53% instances), CCONJ (19; 24% instances), ADV (11; 14% instances), PRON (3; 4% instances), VERB (2; 3% instances), ADP (1; 1% instances), NOUN (1; 1% instances)