Treebank Statistics: UD_Japanese-GSD: POS Tags: SCONJ
There are 31 SCONJ
lemmas (0%), 44 SCONJ
types (0%) and 7995 SCONJ
tokens (4%).
Out of 16 observed tags, the rank of SCONJ
is: 11 in number of lemmas, 11 in number of types and 6 in number of tokens.
The 10 most frequent SCONJ
lemmas: て, の, が, と, ば, に, ながら, から, 為, けれど
The 10 most frequent SCONJ
types: て, の, が, と, で, ば, に, ながら, から, ため
The 10 most frequent ambiguous lemmas: の (ADP 8882, SCONJ 880, PART 2), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), に (ADP 6429, SCONJ 137, CCONJ 2), から (ADP 985, SCONJ 71), 為 (NOUN 304, SCONJ 63), 物 (NOUN 287, SCONJ 38), も (ADP 1844, SCONJ 33), で (ADP 2601, CCONJ 26, SCONJ 16), 上 (NOUN 105, SCONJ 16)
The 10 most frequent ambiguous types: て (SCONJ 5109, AUX 47), の (ADP 8880, SCONJ 835, PART 2), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), で (ADP 2600, AUX 1641, SCONJ 163, CCONJ 24, VERB 2), に (ADP 6428, AUX 808, SCONJ 137, CCONJ 3), ながら (SCONJ 76, NOUN 3), から (ADP 985, SCONJ 71), ため (NOUN 280, SCONJ 61), し (AUX 2933, VERB 417, SCONJ 54)
- て
- の
- が
- と
- で
- に
- ながら
- から
- ため
- し
Morphology
The form / lemma ratio of SCONJ
is 1.419355 (the average of all parts of speech is 1.115220).
The 1st highest number of forms (4) was observed with the lemma “て”: ちゃ, って, て, で.
The 2nd highest number of forms (3) was observed with the lemma “言う”: いえ, いっ, 言え.
The 3rd highest number of forms (3) was observed with the lemma “関わる”: かかわら, 拘ら, 関わら.
SCONJ
occurs with 1 features: Polarity (13; 0% instances)
SCONJ
occurs with 1 feature-value pairs: Polarity=Neg
SCONJ
occurs with 2 feature combinations.
The most frequent feature combination is _
(7982 tokens).
Examples: て, の, が, と, で, ば, に, ながら, から, ため
Relations
SCONJ
nodes are attached to their parents using 5 different relations: mark (6601; 83% instances), fixed (1369; 17% instances), obl (13; 0% instances), nmod (10; 0% instances), dep (2; 0% instances)
Parents of SCONJ
nodes belong to 12 different parts of speech: VERB (5799; 73% instances), ADP (1121; 14% instances), NOUN (449; 6% instances), ADJ (309; 4% instances), SCONJ (187; 2% instances), CCONJ (47; 1% instances), ADV (44; 1% instances), AUX (14; 0% instances), PRON (11; 0% instances), PROPN (9; 0% instances), SYM (4; 0% instances), NUM (1; 0% instances)
4440 (56%) SCONJ
nodes are leaves.
3458 (43%) SCONJ
nodes have one child.
58 (1%) SCONJ
nodes have two children.
39 (0%) SCONJ
nodes have three or more children.
The highest child degree of a SCONJ
node is 4.
Children of SCONJ
nodes are attached using 5 different relations: fixed (3663; 99% instances), advcl (18; 0% instances), punct (8; 0% instances), acl (2; 0% instances), nmod (2; 0% instances)
Children of SCONJ
nodes belong to 7 different parts of speech: VERB (2883; 78% instances), AUX (582; 16% instances), SCONJ (187; 5% instances), ADP (29; 1% instances), PUNCT (8; 0% instances), ADJ (2; 0% instances), NOUN (2; 0% instances)