Treebank Statistics: UD_Japanese-BCCWJ: POS Tags: SCONJ
There are 1 SCONJ
lemmas (6%), 1 SCONJ
types (6%) and 56288 SCONJ
tokens (4%).
Out of 17 observed tags, the rank of SCONJ
is: 14 in number of lemmas, 14 in number of types and 6 in number of tokens.
The 10 most frequent SCONJ
lemmas: _
The 10 most frequent SCONJ
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 366690, ADP 251140, PUNCT 146577, VERB 132574, AUX 122189, SCONJ 56288, NUM 38937, PROPN 35938, ADJ 26812, SYM 19199, ADV 18943, PART 14834, PRON 11340, DET 6057, CCONJ 5110, INTJ 915, X 360)
The 10 most frequent ambiguous types: _ (NOUN 366690, ADP 251140, PUNCT 146577, VERB 132574, AUX 122189, SCONJ 56288, NUM 38937, PROPN 35938, ADJ 26812, SYM 19199, ADV 18943, PART 14834, PRON 11340, DET 6057, CCONJ 5110, INTJ 915, X 360)
- _
- NOUN 366690: _ _ _ _ _ _ _ _ _ _ _ _
- ADP 251140: _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 146577: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 132574: _ _ _ _ _ _ _ _ _ _ _ _
- AUX 122189: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 56288: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 38937: _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 35938: _ _ _ _ _ _ _ _ _ _ _
- ADJ 26812: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SYM 19199: _ _ _ _ _ _ _ _ _ _
- ADV 18943: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 14834: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 11340: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 6057: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 5110: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 915: _ _
- X 360: _
Morphology
The form / lemma ratio of SCONJ
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
SCONJ
occurs with 1 features: Polarity (57; 0% instances)
SCONJ
occurs with 1 feature-value pairs: Polarity=Neg
SCONJ
occurs with 2 feature combinations.
The most frequent feature combination is _
(56231 tokens).
Examples: _
Relations
SCONJ
nodes are attached to their parents using 5 different relations: mark (48368; 86% instances), fixed (7808; 14% instances), dep (106; 0% instances), obl (5; 0% instances), nmod (1; 0% instances)
Parents of SCONJ
nodes belong to 16 different parts of speech: VERB (41919; 74% instances), ADP (5970; 11% instances), NOUN (3391; 6% instances), ADJ (2500; 4% instances), SCONJ (1190; 2% instances), ADV (387; 1% instances), AUX (321; 1% instances), CCONJ (305; 1% instances), PRON (182; 0% instances), PROPN (78; 0% instances), X (23; 0% instances), NUM (18; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), SYM (1; 0% instances)
33370 (59%) SCONJ
nodes are leaves.
21899 (39%) SCONJ
nodes have one child.
603 (1%) SCONJ
nodes have two children.
416 (1%) SCONJ
nodes have three or more children.
The highest child degree of a SCONJ
node is 6.
Children of SCONJ
nodes are attached using 8 different relations: fixed (24336; 100% instances), punct (8; 0% instances), advcl (5; 0% instances), nsubj (5; 0% instances), amod (3; 0% instances), advmod (1; 0% instances), det (1; 0% instances), obl (1; 0% instances)
Children of SCONJ
nodes belong to 10 different parts of speech: VERB (15931; 65% instances), AUX (6812; 28% instances), SCONJ (1190; 5% instances), ADP (406; 2% instances), PUNCT (8; 0% instances), NOUN (7; 0% instances), ADJ (3; 0% instances), ADV (1; 0% instances), DET (1; 0% instances), PRON (1; 0% instances)