home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: SCONJ

There are 12 SCONJ lemmas (0%), 15 SCONJ types (0%) and 2201 SCONJ tokens (1%). Out of 16 observed tags, the rank of SCONJ is: 14 in number of lemmas, 15 in number of types and 14 in number of tokens.

The 10 most frequent SCONJ lemmas: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât

The 10 most frequent SCONJ types: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât

The 10 most frequent ambiguous lemmas: ca (ADP 451, SCONJ 216, ADV 67), până (ADP 170, SCONJ 129), fără (ADP 215, SCONJ 22), de (ADP 9438, SCONJ 8, X 1), (PART 2409, SCONJ 1)

The 10 most frequent ambiguous types: ca (ADP 450, SCONJ 216, ADV 25), până (ADP 160, SCONJ 111), fără (ADP 196, SCONJ 20), c- (SCONJ 8, ADP 2), de (ADP 9207, SCONJ 4, X 1), de- (ADP 58, SCONJ 2, ADV 1)

Morphology

The form / lemma ratio of SCONJ is 1.250000 (the average of all parts of speech is 1.814756).

The 1st highest number of forms (2) was observed with the lemma “că”: c-, că.

The 2nd highest number of forms (2) was observed with the lemma “dacă”: dac-, dacă.

The 3rd highest number of forms (2) was observed with the lemma “de”: de, de-.

SCONJ occurs with 2 features: Polarity (2201; 100% instances), Variant (11; 0% instances)

SCONJ occurs with 2 feature-value pairs: Polarity=Pos, Variant=Short

SCONJ occurs with 2 feature combinations. The most frequent feature combination is Polarity=Pos (2190 tokens). Examples: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât

Relations

SCONJ nodes are attached to their parents using 10 different relations: mark (1881; 85% instances), fixed (207; 9% instances), case (83; 4% instances), advmod (23; 1% instances), advcl (2; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)

Parents of SCONJ nodes belong to 12 different parts of speech: VERB (1623; 74% instances), NOUN (166; 8% instances), ADV (136; 6% instances), ADJ (130; 6% instances), ADP (112; 5% instances), NUM (13; 1% instances), PRON (11; 0% instances), PROPN (4; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances), (1; 0% instances)

2081 (95%) SCONJ nodes are leaves.

110 (5%) SCONJ nodes have one child.

10 (0%) SCONJ nodes have two children.

The highest child degree of a SCONJ node is 2.

Children of SCONJ nodes are attached using 8 different relations: fixed (118; 91% instances), advmod (5; 4% instances), conj (2; 2% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), discourse (1; 1% instances), punct (1; 1% instances)

Children of SCONJ nodes belong to 8 different parts of speech: ADP (89; 68% instances), CCONJ (14; 11% instances), ADV (9; 7% instances), NOUN (8; 6% instances), PART (4; 3% instances), VERB (3; 2% instances), SCONJ (2; 2% instances), PUNCT (1; 1% instances)