home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: SCONJ

There are 28 SCONJ lemmas (1%), 30 SCONJ types (1%) and 1135 SCONJ tokens (4%). Out of 16 observed tags, the rank of SCONJ is: 7 in number of lemmas, 8 in number of types and 7 in number of tokens.

The 10 most frequent SCONJ lemmas: て, ない, が, の, と, ぬ, に, ば, ため, で

The 10 most frequent SCONJ types: て, が, の, ない, と, なかっ, に, ず, ば, ため

The 10 most frequent ambiguous lemmas: ない (SCONJ 115, ADJ 38, AUX 32), が (ADP 596, SCONJ 85), の (ADP 1670, SCONJ 83), と (ADP 549, SCONJ 38), に (ADP 982, SCONJ 27, CCONJ 1), ため (NOUN 48, SCONJ 19), で (ADP 312, SCONJ 19), から (ADP 100, SCONJ 7), も (ADP 149, SCONJ 6), かかわる (SCONJ 5, VERB 1)

The 10 most frequent ambiguous types: が (ADP 596, SCONJ 85), の (ADP 1670, SCONJ 83), ない (SCONJ 75, ADJ 20, AUX 15), と (ADP 549, SCONJ 38), なかっ (SCONJ 30, ADJ 5, AUX 4), に (ADP 982, AUX 152, SCONJ 27, CCONJ 1), ため (NOUN 48, SCONJ 19), で (ADP 312, AUX 254, SCONJ 19), から (ADP 100, SCONJ 7), し (AUX 489, VERB 52, SCONJ 7)

Morphology

The form / lemma ratio of SCONJ is 1.071429 (the average of all parts of speech is 1.050009).

The 1st highest number of forms (4) was observed with the lemma “ない”: ない, なかっ, なく, なけれ.

The 2nd highest number of forms (2) was observed with the lemma “ぬ”: ず, ん.

The 3rd highest number of forms (1) was observed with the lemma “いう”: いっ.

SCONJ occurs with 1 features: Polarity (150; 13% instances)

SCONJ occurs with 1 feature-value pairs: Polarity=Neg

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (985 tokens). Examples: て, が, の, と, に, ば, ため, で, ながら, から

Relations

SCONJ nodes are attached to their parents using 4 different relations: mark (880; 78% instances), fixed (253; 22% instances), compound (1; 0% instances), dep (1; 0% instances)

Parents of SCONJ nodes belong to 11 different parts of speech: VERB (815; 72% instances), ADP (175; 15% instances), SCONJ (60; 5% instances), ADJ (32; 3% instances), NOUN (30; 3% instances), CCONJ (6; 1% instances), PART (6; 1% instances), PRON (4; 0% instances), ADV (3; 0% instances), AUX (3; 0% instances), PROPN (1; 0% instances)

653 (58%) SCONJ nodes are leaves.

459 (40%) SCONJ nodes have one child.

5 (0%) SCONJ nodes have two children.

18 (2%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 3.

Children of SCONJ nodes are attached using 3 different relations: fixed (521; 100% instances), obl (1; 0% instances), punct (1; 0% instances)

Children of SCONJ nodes belong to 5 different parts of speech: VERB (414; 79% instances), SCONJ (60; 11% instances), AUX (42; 8% instances), ADP (6; 1% instances), PUNCT (1; 0% instances)