home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-Nonstandard: POS Tags: CCONJ

There are 40 CCONJ lemmas (0%), 67 CCONJ types (0%) and 32957 CCONJ tokens (6%). Out of 16 observed tags, the rank of CCONJ is: 11 in number of lemmas, 12 in number of types and 7 in number of tokens.

The 10 most frequent CCONJ lemmas: și, iar, ci, sau, dar, nici, deci, ori, ce, însă

The 10 most frequent CCONJ types: și, şi, iară, ce, iar, sau, dar, ș-, deci, au

The 10 most frequent ambiguous lemmas: și (CCONJ 24473, ADV 3128, PRON 14, SCONJ 9), iar (CCONJ 3568, ADV 405, SCONJ 1), ci (CCONJ 1587, PRON 6, DET 3, SCONJ 1), sau (CCONJ 1022, ADV 24), dar (CCONJ 754, NOUN 259, ADV 25, SCONJ 2), nici (ADV 603, CCONJ 586), deci (CCONJ 544, ADV 1), ori (CCONJ 141, ADV 55, DET 7, NOUN 1), ce (PRON 4667, DET 385, CCONJ 72, ADV 3, ADP 2), însă (CCONJ 72, ADV 9, NOUN 1, SCONJ 1)

The 10 most frequent ambiguous types: și (CCONJ 13522, ADV 2325, PRON 168, SCONJ 2), şi (CCONJ 2042, ADV 719, PRON 6), iară (CCONJ 794, ADV 184, SCONJ 1), ce (PRON 3876, CCONJ 898, DET 347, ADP 2, ADV 1, SCONJ 1), iar (CCONJ 318, ADV 192), sau (CCONJ 687, ADV 17), dar (CCONJ 80, NOUN 75, ADV 18), ș- (CCONJ 326, PRON 177, ADV 23, VERB 1), au (AUX 8701, CCONJ 191, VERB 167, INTJ 72, DET 15, ADV 7, PRON 2), ş- (CCONJ 64, PRON 59)

Morphology

The form / lemma ratio of CCONJ is 1.675000 (the average of all parts of speech is 2.491875).

The 1st highest number of forms (17) was observed with the lemma “și”: $i, -Și, S, Si-, i, s-, si, Şî, ş-, şi, şi-, Șî, ș, ș-, și, și-, șâ.

The 2nd highest number of forms (6) was observed with the lemma “deci”: Deci-, Decii, Dice, deacii, dece, deci.

The 3rd highest number of forms (6) was observed with the lemma “nici”: Nice-, Nîci, nece, neci, nice, nici.

CCONJ occurs with 2 features: Polarity (32957; 100% instances), Compound (1081; 3% instances)

CCONJ occurs with 3 feature-value pairs: Compound=Yes, Polarity=Neg, Polarity=Pos

CCONJ occurs with 4 feature combinations. The most frequent feature combination is Polarity=Pos (31748 tokens). Examples: și, şi, iară, ce, iar, sau, dar, ș-, au, ş-

Relations

CCONJ nodes are attached to their parents using 17 different relations: cc (32480; 99% instances), advmod (223; 1% instances), cc:preconj (162; 0% instances), mark (41; 0% instances), fixed (18; 0% instances), compound (6; 0% instances), nmod (5; 0% instances), case (3; 0% instances), nsubj (3; 0% instances), obl (3; 0% instances), xcomp (3; 0% instances), discourse (2; 0% instances), iobj (2; 0% instances), obj (2; 0% instances), root (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances)

Parents of CCONJ nodes belong to 16 different parts of speech: VERB (22127; 67% instances), NOUN (7241; 22% instances), PROPN (1065; 3% instances), ADJ (934; 3% instances), PRON (772; 2% instances), ADV (514; 2% instances), NUM (151; 0% instances), AUX (58; 0% instances), DET (35; 0% instances), INTJ (35; 0% instances), ADP (17; 0% instances), CCONJ (2; 0% instances), (2; 0% instances), X (2; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)

32923 (100%) CCONJ nodes are leaves.

28 (0%) CCONJ nodes have one child.

4 (0%) CCONJ nodes have two children.

2 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 13 different relations: punct (16; 36% instances), fixed (9; 20% instances), conj (5; 11% instances), case (3; 7% instances), compound (2; 5% instances), nummod (2; 5% instances), advcl (1; 2% instances), aux (1; 2% instances), cc (1; 2% instances), cop (1; 2% instances), det (1; 2% instances), nmod (1; 2% instances), nsubj (1; 2% instances)

Children of CCONJ nodes belong to 11 different parts of speech: PUNCT (16; 36% instances), ADP (7; 16% instances), NOUN (4; 9% instances), NUM (4; 9% instances), AUX (3; 7% instances), VERB (3; 7% instances), CCONJ (2; 5% instances), PART (2; 5% instances), ADV (1; 2% instances), DET (1; 2% instances), SCONJ (1; 2% instances)