home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: SCONJ

There are 31 SCONJ lemmas (0%), 44 SCONJ types (0%) and 7995 SCONJ tokens (4%). Out of 16 observed tags, the rank of SCONJ is: 11 in number of lemmas, 11 in number of types and 6 in number of tokens.

The 10 most frequent SCONJ lemmas: て, の, が, と, ば, に, ながら, から, 為, けれど

The 10 most frequent SCONJ types: て, の, が, と, で, ば, に, ながら, から, ため

The 10 most frequent ambiguous lemmas: の (ADP 8882, SCONJ 880, PART 2), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), に (ADP 6429, SCONJ 137, CCONJ 2), から (ADP 985, SCONJ 71), 為 (NOUN 304, SCONJ 63), 物 (NOUN 287, SCONJ 38), も (ADP 1844, SCONJ 33), で (ADP 2601, CCONJ 26, SCONJ 16), 上 (NOUN 105, SCONJ 16)

The 10 most frequent ambiguous types: て (SCONJ 5109, AUX 47), の (ADP 8880, SCONJ 835, PART 2), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), で (ADP 2600, AUX 1641, SCONJ 163, CCONJ 24, VERB 2), に (ADP 6428, AUX 808, SCONJ 137, CCONJ 3), ながら (SCONJ 76, NOUN 3), から (ADP 985, SCONJ 71), ため (NOUN 280, SCONJ 61), し (AUX 2933, VERB 417, SCONJ 54)

Morphology

The form / lemma ratio of SCONJ is 1.419355 (the average of all parts of speech is 1.115220).

The 1st highest number of forms (4) was observed with the lemma “て”: ちゃ, って, て, で.

The 2nd highest number of forms (3) was observed with the lemma “言う”: いえ, いっ, 言え.

The 3rd highest number of forms (3) was observed with the lemma “関わる”: かかわら, 拘ら, 関わら.

SCONJ occurs with 1 features: Polarity (13; 0% instances)

SCONJ occurs with 1 feature-value pairs: Polarity=Neg

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (7982 tokens). Examples: て, の, が, と, で, ば, に, ながら, から, ため

Relations

SCONJ nodes are attached to their parents using 5 different relations: mark (6601; 83% instances), fixed (1363; 17% instances), obl (13; 0% instances), nmod (10; 0% instances), dep (8; 0% instances)

Parents of SCONJ nodes belong to 12 different parts of speech: VERB (5809; 73% instances), ADP (1117; 14% instances), NOUN (448; 6% instances), ADJ (311; 4% instances), SCONJ (183; 2% instances), CCONJ (47; 1% instances), ADV (44; 1% instances), AUX (12; 0% instances), PRON (10; 0% instances), PROPN (9; 0% instances), SYM (4; 0% instances), NUM (1; 0% instances)

4476 (56%) SCONJ nodes are leaves.

3420 (43%) SCONJ nodes have one child.

60 (1%) SCONJ nodes have two children.

39 (0%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 4.

Children of SCONJ nodes are attached using 7 different relations: fixed (3623; 99% instances), advcl (19; 1% instances), punct (9; 0% instances), obl (3; 0% instances), acl (2; 0% instances), nmod (2; 0% instances), advmod (1; 0% instances)

Children of SCONJ nodes belong to 8 different parts of speech: VERB (2848; 78% instances), AUX (582; 16% instances), SCONJ (183; 5% instances), ADP (29; 1% instances), PUNCT (9; 0% instances), NOUN (5; 0% instances), ADJ (2; 0% instances), ADV (1; 0% instances)