home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: CCONJ

There are 10 CCONJ lemmas (0%), 11 CCONJ types (0%) and 5034 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 14 in number of types and 8 in number of tokens.

The 10 most frequent CCONJ lemmas: și, sau, dar, însă, fie, deci, ci, ori, încă, or

The 10 most frequent CCONJ types: și, sau, dar, însă, fie, deci, ci, ori, si, încă

The 10 most frequent ambiguous lemmas: ci (CCONJ 23, NOUN 3), ori (CCONJ 11, NOUN 2), încă (ADV 48, CCONJ 2), or (CCONJ 1, X 1)

The 10 most frequent ambiguous types: fie (AUX 70, CCONJ 45), ori (NOUN 85, CCONJ 11), încă (ADV 46, CCONJ 2), or (CCONJ 1, X 1)

Morphology

The form / lemma ratio of CCONJ is 1.100000 (the average of all parts of speech is 1.666462).

The 1st highest number of forms (2) was observed with the lemma “și”: si, și.

The 2nd highest number of forms (1) was observed with the lemma “ci”: ci.

The 3rd highest number of forms (1) was observed with the lemma “dar”: dar.

CCONJ occurs with 1 features: Polarity (5034; 100% instances)

CCONJ occurs with 1 feature-value pairs: Polarity=Pos

CCONJ occurs with 1 feature combinations. The most frequent feature combination is Polarity=Pos (5034 tokens). Examples: și, sau, dar, însă, fie, deci, ci, ori, si, încă

Relations

CCONJ nodes are attached to their parents using 8 different relations: cc (4453; 88% instances), fixed (274; 5% instances), advmod (254; 5% instances), conj (30; 1% instances), cc:preconj (18; 0% instances), mark (3; 0% instances), compound (1; 0% instances), nmod (1; 0% instances)

Parents of CCONJ nodes belong to 13 different parts of speech: NOUN (2974; 59% instances), VERB (716; 14% instances), ADJ (611; 12% instances), ADV (348; 7% instances), NUM (146; 3% instances), PRON (73; 1% instances), X (54; 1% instances), CCONJ (31; 1% instances), ADP (29; 1% instances), PROPN (28; 1% instances), DET (14; 0% instances), PART (6; 0% instances), AUX (4; 0% instances)

4985 (99%) CCONJ nodes are leaves.

14 (0%) CCONJ nodes have one child.

34 (1%) CCONJ nodes have two children.

1 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 3.

Children of CCONJ nodes are attached using 5 different relations: punct (48; 56% instances), conj (30; 35% instances), fixed (5; 6% instances), case (1; 1% instances), nummod (1; 1% instances)

Children of CCONJ nodes belong to 5 different parts of speech: PUNCT (48; 56% instances), CCONJ (31; 36% instances), ADV (3; 4% instances), ADP (2; 2% instances), NUM (1; 1% instances)