home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: POS Tags: SCONJ

There are 1 SCONJ lemmas (5%), 22 SCONJ types (0%) and 326 SCONJ tokens (2%). Out of 16 observed tags, the rank of SCONJ is: 13 in number of lemmas, 13 in number of types and 13 in number of tokens.

The 10 most frequent SCONJ lemmas: _

The 10 most frequent SCONJ types: dass, als, um, wenn, nachdem, weil, ob, indem, wie, bevor

The 10 most frequent ambiguous lemmas: _ (NOUN 4261, PUNCT 2767, DET 2515, VERB 1913, ADP 1715, ADJ 1387, PROPN 1219, PRON 1185, ADV 1139, AUX 950, CCONJ 743, NUM 352, SCONJ 326, PART 144, X 31, SYM 22)

The 10 most frequent ambiguous types: als (CCONJ 83, SCONJ 36), um (SCONJ 37, ADP 27), wenn (SCONJ 19, CCONJ 1), ob (SCONJ 10, CCONJ 2), wie (CCONJ 33, ADV 14, SCONJ 7), während (ADP 13, SCONJ 4), da (SCONJ 5, ADV 3), ohne (ADP 6, SCONJ 5), damit (ADV 9, SCONJ 3), bis (ADP 22, CCONJ 5, ADV 2, SCONJ 2)

Morphology

The form / lemma ratio of SCONJ is 22.000000 (the average of all parts of speech is 307.454545).

The 1st highest number of forms (22) was observed with the lemma “_”: Anstatt, If, Sobald, als, außer, bevor, bis, da, damit, dass, indem, nachdem, ob, obwohl, ohne, sodass, um, weil, wenn, wie, währen, während.

SCONJ does not occur with any features.

Relations

SCONJ nodes are attached to their parents using 4 different relations: mark (311; 95% instances), case (13; 4% instances), acl (1; 0% instances), fixed (1; 0% instances)

Parents of SCONJ nodes belong to 8 different parts of speech: VERB (268; 82% instances), NOUN (29; 9% instances), ADJ (23; 7% instances), X (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances)

319 (98%) SCONJ nodes are leaves.

7 (2%) SCONJ nodes have one child.

The highest child degree of a SCONJ node is 1.

Children of SCONJ nodes are attached using 3 different relations: advmod (4; 57% instances), punct (2; 29% instances), conj (1; 14% instances)

Children of SCONJ nodes belong to 2 different parts of speech: ADV (5; 71% instances), PUNCT (2; 29% instances)