home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: SCONJ

There are 22 SCONJ lemmas (0%), 25 SCONJ types (0%) and 990 SCONJ tokens (3%). Out of 16 observed tags, the rank of SCONJ is: 9 in number of lemmas, 8 in number of types and 7 in number of tokens.

The 10 most frequent SCONJ lemmas: て, が, の, と, に, ば, 為, ながら, から, ず

The 10 most frequent SCONJ types: て, が, の, と, に, ば, ため, で, ながら, から

The 10 most frequent ambiguous lemmas: が (ADP 596, SCONJ 85), の (ADP 1670, SCONJ 84), と (ADP 549, SCONJ 38), に (ADP 982, SCONJ 27, CCONJ 1), 為 (NOUN 49, SCONJ 19), から (ADP 100, SCONJ 7), ず (AUX 29, SCONJ 6), も (ADP 149, SCONJ 6), 関わる (SCONJ 6, VERB 4), 物 (NOUN 40, SCONJ 3)

The 10 most frequent ambiguous types: が (ADP 596, SCONJ 85), の (ADP 1670, SCONJ 83), と (ADP 549, SCONJ 38), に (ADP 982, AUX 152, SCONJ 27, CCONJ 1), ため (NOUN 48, SCONJ 19), で (ADP 312, AUX 254, SCONJ 19), から (ADP 100, SCONJ 7), し (AUX 489, VERB 52, SCONJ 7), ず (AUX 19, SCONJ 6), も (ADP 149, SCONJ 6)

Morphology

The form / lemma ratio of SCONJ is 1.136364 (the average of all parts of speech is 1.068660).

The 1st highest number of forms (2) was observed with the lemma “て”: て, で.

The 2nd highest number of forms (2) was observed with the lemma “の”: の, ん.

The 3rd highest number of forms (2) was observed with the lemma “言う”: いっ, 言え.

SCONJ occurs with 1 features: Polarity (6; 1% instances)

SCONJ occurs with 1 feature-value pairs: Polarity=Neg

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (984 tokens). Examples: て, が, の, と, に, ば, ため, で, ながら, から

Relations

SCONJ nodes are attached to their parents using 4 different relations: mark (761; 77% instances), fixed (227; 23% instances), compound (1; 0% instances), dep (1; 0% instances)

Parents of SCONJ nodes belong to 10 different parts of speech: VERB (698; 71% instances), ADP (169; 17% instances), SCONJ (43; 4% instances), ADJ (32; 3% instances), NOUN (28; 3% instances), AUX (7; 1% instances), CCONJ (5; 1% instances), PRON (4; 0% instances), ADV (3; 0% instances), PROPN (1; 0% instances)

515 (52%) SCONJ nodes are leaves.

459 (46%) SCONJ nodes have one child.

5 (1%) SCONJ nodes have two children.

11 (1%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 3.

Children of SCONJ nodes are attached using 3 different relations: fixed (500; 100% instances), obl (1; 0% instances), punct (1; 0% instances)

Children of SCONJ nodes belong to 5 different parts of speech: VERB (407; 81% instances), AUX (45; 9% instances), SCONJ (43; 9% instances), ADP (6; 1% instances), PUNCT (1; 0% instances)