home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: SCONJ

There are 11 SCONJ lemmas (0%), 14 SCONJ types (0%) and 1985 SCONJ tokens (1%). Out of 16 observed tags, the rank of SCONJ is: 14 in number of lemmas, 15 in number of types and 14 in number of tokens.

The 10 most frequent SCONJ lemmas: că, dacă, până, încât, deoarece, deși, fără, fiindcă, întrucât, de

The 10 most frequent SCONJ types: că, dacă, până, încât, deoarece, deși, fără, fiindcă, întrucât, c-

The 10 most frequent ambiguous lemmas: până (ADP 171, SCONJ 128), fără (ADP 215, SCONJ 22), de (ADP 9438, SCONJ 9, PRON 1, X 1), (PART 2409, SCONJ 1)

The 10 most frequent ambiguous types: până (ADP 160, SCONJ 111), fără (ADP 196, SCONJ 20), c- (SCONJ 8, ADP 2), de (ADP 9199, SCONJ 6, PRON 1, X 1), de- (ADP 67, SCONJ 1)

Morphology

The form / lemma ratio of SCONJ is 1.272727 (the average of all parts of speech is 1.819791).

The 1st highest number of forms (2) was observed with the lemma “că”: c-, că.

The 2nd highest number of forms (2) was observed with the lemma “dacă”: dac-, dacă.

The 3rd highest number of forms (2) was observed with the lemma “de”: de, de-.

SCONJ occurs with 3 features: Polarity (1985; 100% instances), ExtPos (106; 5% instances), Variant (10; 1% instances)

SCONJ occurs with 5 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ, Polarity=Pos, Variant=Short

SCONJ occurs with 5 feature combinations. The most frequent feature combination is Polarity=Pos (1869 tokens). Examples: că, dacă, încât, deoarece, până, deși, fiindcă, fără, întrucât, de

Relations

SCONJ nodes are attached to their parents using 10 different relations: mark (1710; 86% instances), fixed (160; 8% instances), case (87; 4% instances), advmod (21; 1% instances), advcl (2; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)

Parents of SCONJ nodes belong to 12 different parts of speech: VERB (1463; 74% instances), NOUN (162; 8% instances), ADV (130; 7% instances), ADJ (124; 6% instances), ADP (74; 4% instances), NUM (13; 1% instances), PRON (11; 1% instances), PROPN (3; 0% instances), AUX (2; 0% instances), PART (1; 0% instances), (1; 0% instances), SCONJ (1; 0% instances)

1869 (94%) SCONJ nodes are leaves.

106 (5%) SCONJ nodes have one child.

10 (1%) SCONJ nodes have two children.

The highest child degree of a SCONJ node is 2.

Children of SCONJ nodes are attached using 8 different relations: fixed (114; 90% instances), advmod (5; 4% instances), conj (2; 2% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), discourse (1; 1% instances), punct (1; 1% instances)

Children of SCONJ nodes belong to 8 different parts of speech: ADP (88; 70% instances), CCONJ (14; 11% instances), NOUN (8; 6% instances), ADV (7; 6% instances), PART (4; 3% instances), VERB (3; 2% instances), PUNCT (1; 1% instances), SCONJ (1; 1% instances)