Treebank Statistics: UD_Romanian-RRT: POS Tags: SCONJ
There are 12 SCONJ lemmas (0%), 15 SCONJ types (0%) and 2200 SCONJ tokens (1%).
Out of 16 observed tags, the rank of SCONJ is: 14 in number of lemmas, 15 in number of types and 14 in number of tokens.
The 10 most frequent SCONJ lemmas: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât
The 10 most frequent SCONJ types: că, dacă, ca, până, încât, deoarece, deși, fără, fiindcă, întrucât
The 10 most frequent ambiguous lemmas: ca (ADP 451, SCONJ 216, ADV 67), până (ADP 171, SCONJ 128), fără (ADP 215, SCONJ 22), de (ADP 9437, SCONJ 8, PRON 1, X 1), să (PART 2409, SCONJ 1)
The 10 most frequent ambiguous types: ca (ADP 450, SCONJ 216, ADV 25), până (ADP 160, SCONJ 111), fără (ADP 196, SCONJ 20), c- (SCONJ 8, ADP 2), de (ADP 9206, SCONJ 4, PRON 1, X 1), de- (ADP 58, SCONJ 2, ADV 1)
- ca
- până
- fără
- c-
- de
- ADP 9206: Textul de dedesubt suna : FRATELE CEL MARE ESTE CU OCHII PE TINE .
- SCONJ 4: Că maica de ar putea și din astea m- ar scăpa .
- PRON 1: Ăl mai mare era Oprică al lui coana Mărita , una de da în cărți și făcea de dragoste la fetele nemăritate .
- X 1: Astfel , în numărul 35 / 1934 , este publicată o fișă biografică Céline , alături de câteva impresii despre „ o carte dantescă ” , Voyage au bout de la nuit .
- de-
Morphology
The form / lemma ratio of SCONJ is 1.250000 (the average of all parts of speech is 1.814866).
The 1st highest number of forms (2) was observed with the lemma “că”: c-, că.
The 2nd highest number of forms (2) was observed with the lemma “dacă”: dac-, dacă.
The 3rd highest number of forms (2) was observed with the lemma “de”: de, de-.
SCONJ occurs with 3 features: Polarity (2200; 100% instances), ExtPos (106; 5% instances), Variant (11; 1% instances)
SCONJ occurs with 5 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ, Polarity=Pos, Variant=Short
SCONJ occurs with 5 feature combinations.
The most frequent feature combination is Polarity=Pos (2083 tokens).
Examples: că, dacă, ca, încât, deoarece, până, deși, fiindcă, fără, întrucât
Relations
SCONJ nodes are attached to their parents using 10 different relations: mark (1882; 86% instances), fixed (203; 9% instances), case (87; 4% instances), advmod (21; 1% instances), advcl (2; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)
Parents of SCONJ nodes belong to 12 different parts of speech: VERB (1621; 74% instances), NOUN (167; 8% instances), ADV (137; 6% instances), ADJ (131; 6% instances), ADP (110; 5% instances), NUM (13; 1% instances), PRON (11; 1% instances), PROPN (4; 0% instances), AUX (2; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances), (1; 0% instances)
2084 (95%) SCONJ nodes are leaves.
106 (5%) SCONJ nodes have one child.
10 (0%) SCONJ nodes have two children.
The highest child degree of a SCONJ node is 2.
Children of SCONJ nodes are attached using 8 different relations: fixed (114; 90% instances), advmod (5; 4% instances), conj (2; 2% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), discourse (1; 1% instances), punct (1; 1% instances)
Children of SCONJ nodes belong to 8 different parts of speech: ADP (88; 70% instances), CCONJ (14; 11% instances), NOUN (8; 6% instances), ADV (6; 5% instances), PART (4; 3% instances), VERB (3; 2% instances), SCONJ (2; 2% instances), PUNCT (1; 1% instances)