home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: SCONJ

There are 43 SCONJ lemmas (2%), 46 SCONJ types (2%) and 792 SCONJ tokens (4%). Out of 16 observed tags, the rank of SCONJ is: 10 in number of lemmas, 12 in number of types and 9 in number of tokens.

The 10 most frequent SCONJ lemmas: tu, ɗan, yâːn, séː, ɗa, hár, ín, kóː, dón, dòmín

The 10 most frequent SCONJ types: tu, ɗan, yâːn, ɗa, séː, dón, hár, ín, kóː, dùmín

The 10 most frequent ambiguous lemmas: tu (SCONJ 300, VERB 64, ADP 18), ɗan (SCONJ 260, ADV 54, ADP 53, ADJ 1), yâːn (SCONJ 56, PRON 4), séː (ADV 138, SCONJ 20, ADP 10, CCONJ 9), ɗa (ADP 56, ADV 36, SCONJ 18, PART 14, AUX 7), hár (ADP 19, SCONJ 14, ADV 7, CCONJ 5, X 3), kóː (CCONJ 92, PART 13, SCONJ 12, DET 5, ADV 3, ADP 1, X 1), dón (SCONJ 11, ADP 2), dòːmín (SCONJ 8, ADP 1), dùmín (SCONJ 6, ADV 2)

The 10 most frequent ambiguous types: tu (SCONJ 300, VERB 45, ADP 18), ɗan (SCONJ 247, ADP 52, ADV 40, ADJ 1), yâːn (SCONJ 58, PRON 37), ɗa (ADP 55, ADV 49, SCONJ 29, PART 12, AUX 7), séː (ADV 81, SCONJ 16, ADP 9, CCONJ 3), dón (SCONJ 14, ADP 2), hár (ADP 18, SCONJ 14, ADV 7, CCONJ 5, X 3), kóː (CCONJ 92, SCONJ 12, PART 10, DET 5, ADV 3, ADP 1, X 1), dùmín (SCONJ 9, ADV 2), kàmán (ADP 9, SCONJ 6)

Morphology

The form / lemma ratio of SCONJ is 1.069767 (the average of all parts of speech is 1.692524).

The 1st highest number of forms (4) was observed with the lemma “ɗan”: ɗa, ɗam, ɗan, ɗaŋ.

The 2nd highest number of forms (3) was observed with the lemma “dòːmín”: dòːmín, dón, dùmín.

The 3rd highest number of forms (2) was observed with the lemma “domin”: domin, d~.

SCONJ occurs with 3 features: ExtPos (15; 2% instances), Foreign (11; 1% instances), Mood (3; 0% instances)

SCONJ occurs with 3 feature-value pairs: ExtPos=SCONJ, Foreign=Yes, Mood=Irr

SCONJ occurs with 5 feature combinations. The most frequent feature combination is _ (765 tokens). Examples: tu, ɗan, yâːn, ɗa, dón, hár, séː, ín, kóː, dùmín

Relations

SCONJ nodes are attached to their parents using 15 different relations: mark (638; 81% instances), case (45; 6% instances), dep (41; 5% instances), discourse (26; 3% instances), fixed (19; 2% instances), root (6; 1% instances), reparandum (5; 1% instances), acl:relcl (3; 0% instances), ccomp (3; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), conj (1; 0% instances), dislocated (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of SCONJ nodes belong to 14 different parts of speech: VERB (613; 77% instances), NOUN (55; 7% instances), PRON (30; 4% instances), INTJ (20; 3% instances), AUX (16; 2% instances), SCONJ (15; 2% instances), ADP (7; 1% instances), ADV (7; 1% instances), PROPN (7; 1% instances), PART (6; 1% instances), (6; 1% instances), X (5; 1% instances), ADJ (4; 1% instances), DET (1; 0% instances)

760 (96%) SCONJ nodes are leaves.

20 (3%) SCONJ nodes have one child.

5 (1%) SCONJ nodes have two children.

7 (1%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 6.

Children of SCONJ nodes are attached using 13 different relations: fixed (15; 28% instances), punct (13; 24% instances), dep (6; 11% instances), discourse (5; 9% instances), flat:foreign (5; 9% instances), acl:relcl (2; 4% instances), nsubj (2; 4% instances), advmod (1; 2% instances), ccomp (1; 2% instances), conj (1; 2% instances), cop (1; 2% instances), dislocated (1; 2% instances), reparandum (1; 2% instances)

Children of SCONJ nodes belong to 13 different parts of speech: SCONJ (15; 28% instances), PUNCT (13; 24% instances), X (7; 13% instances), PRON (4; 7% instances), PART (3; 6% instances), VERB (3; 6% instances), CCONJ (2; 4% instances), INTJ (2; 4% instances), ADP (1; 2% instances), ADV (1; 2% instances), AUX (1; 2% instances), NOUN (1; 2% instances), PROPN (1; 2% instances)