home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: SCONJ

There are 12 SCONJ lemmas (0%), 15 SCONJ types (0%) and 2200 SCONJ tokens (1%). Out of 16 observed tags, the rank of SCONJ is: 14 in number of lemmas, 15 in number of types and 14 in number of tokens.

The 10 most frequent SCONJ lemmas: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât

The 10 most frequent SCONJ types: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât

The 10 most frequent ambiguous lemmas: ca (ADP 451, SCONJ 216, ADV 67), până (ADP 171, SCONJ 128), fără (ADP 215, SCONJ 22), de (ADP 9437, SCONJ 8, PRON 1, X 1), (PART 2409, SCONJ 1)

The 10 most frequent ambiguous types: ca (ADP 450, SCONJ 216, ADV 25), până (ADP 160, SCONJ 111), fără (ADP 196, SCONJ 20), c- (SCONJ 8, ADP 2), de (ADP 9206, SCONJ 4, PRON 1, X 1), de- (ADP 58, SCONJ 2, ADV 1)

Morphology

The form / lemma ratio of SCONJ is 1.250000 (the average of all parts of speech is 1.814866).

The 1st highest number of forms (2) was observed with the lemma “că”: c-, că.

The 2nd highest number of forms (2) was observed with the lemma “dacă”: dac-, dacă.

The 3rd highest number of forms (2) was observed with the lemma “de”: de, de-.

SCONJ occurs with 3 features: Polarity (2200; 100% instances), ExtPos (106; 5% instances), Variant (11; 1% instances)

SCONJ occurs with 5 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ, Polarity=Pos, Variant=Short

SCONJ occurs with 5 feature combinations. The most frequent feature combination is Polarity=Pos (2083 tokens). Examples: că, dacă, ca, încât, deoarece, până, deși, fiindcă, fără, întrucât

Relations

SCONJ nodes are attached to their parents using 10 different relations: mark (1882; 86% instances), fixed (203; 9% instances), case (87; 4% instances), advmod (21; 1% instances), advcl (2; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)

Parents of SCONJ nodes belong to 12 different parts of speech: VERB (1621; 74% instances), NOUN (167; 8% instances), ADV (137; 6% instances), ADJ (131; 6% instances), ADP (110; 5% instances), NUM (13; 1% instances), PRON (11; 1% instances), PROPN (4; 0% instances), AUX (2; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances), (1; 0% instances)

2084 (95%) SCONJ nodes are leaves.

106 (5%) SCONJ nodes have one child.

10 (0%) SCONJ nodes have two children.

The highest child degree of a SCONJ node is 2.

Children of SCONJ nodes are attached using 8 different relations: fixed (114; 90% instances), advmod (5; 4% instances), conj (2; 2% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), discourse (1; 1% instances), punct (1; 1% instances)

Children of SCONJ nodes belong to 8 different parts of speech: ADP (88; 70% instances), CCONJ (14; 11% instances), NOUN (8; 6% instances), ADV (6; 5% instances), PART (4; 3% instances), VERB (3; 2% instances), SCONJ (2; 2% instances), PUNCT (1; 1% instances)