home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: CCONJ

There are 12 CCONJ lemmas (0%), 17 CCONJ types (0%) and 6930 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 14 in number of types and 10 in number of tokens.

The 10 most frequent CCONJ lemmas: și, sau, dar, însă, ci, ori, fie, deci, căci, insă

The 10 most frequent CCONJ types: și, sau, dar, însă, ci, și-, ori, fie, deci, căci

The 10 most frequent ambiguous lemmas: dar (CCONJ 357, NOUN 6), ori (CCONJ 37, NOUN 3)

The 10 most frequent ambiguous types: și (CCONJ 5155, PRON 22), și- (PRON 163, CCONJ 37), ori (NOUN 63, CCONJ 37), fie (AUX 198, CCONJ 30, VERB 11), ce (PRON 640, DET 27, ADV 3, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 1.416667 (the average of all parts of speech is 1.814756).

The 1st highest number of forms (4) was observed with the lemma “și”: Ș-, ș., și, și-.

The 2nd highest number of forms (2) was observed with the lemma “ci”: ce, ci.

The 3rd highest number of forms (2) was observed with the lemma “dar”: da’, dar.

CCONJ occurs with 3 features: Polarity (6929; 100% instances), Variant (3; 0% instances), Abbr (1; 0% instances)

CCONJ occurs with 3 feature-value pairs: Abbr=Yes, Polarity=Pos, Variant=Short

CCONJ occurs with 3 feature combinations. The most frequent feature combination is Polarity=Pos (6926 tokens). Examples: și, sau, dar, însă, ci, și-, ori, fie, deci, căci

Relations

CCONJ nodes are attached to their parents using 14 different relations: cc (6215; 90% instances), advmod (387; 6% instances), fixed (195; 3% instances), cc:preconj (66; 1% instances), conj (30; 0% instances), mark (16; 0% instances), compound (9; 0% instances), root (4; 0% instances), discourse (2; 0% instances), iobj (2; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), flat (1; 0% instances), obl (1; 0% instances)

Parents of CCONJ nodes belong to 16 different parts of speech: NOUN (2930; 42% instances), VERB (2404; 35% instances), ADJ (628; 9% instances), ADV (271; 4% instances), PROPN (218; 3% instances), NUM (178; 3% instances), PRON (148; 2% instances), ADP (67; 1% instances), DET (23; 0% instances), CCONJ (21; 0% instances), SCONJ (14; 0% instances), PART (11; 0% instances), AUX (8; 0% instances), (4; 0% instances), X (3; 0% instances), INTJ (2; 0% instances)

6880 (99%) CCONJ nodes are leaves.

18 (0%) CCONJ nodes have one child.

28 (0%) CCONJ nodes have two children.

4 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 4.

Children of CCONJ nodes are attached using 9 different relations: punct (42; 48% instances), conj (18; 21% instances), fixed (18; 21% instances), discourse (4; 5% instances), advmod (1; 1% instances), amod (1; 1% instances), case (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)

Children of CCONJ nodes belong to 8 different parts of speech: PUNCT (42; 48% instances), CCONJ (21; 24% instances), ADV (14; 16% instances), PRON (3; 3% instances), ADP (2; 2% instances), SCONJ (2; 2% instances), VERB (2; 2% instances), NOUN (1; 1% instances)