Treebank Statistics: UD_Shanghainese-ShUD: POS Tags: SCONJ
There are 31 SCONJ lemmas (2%), 31 SCONJ types (2%) and 352 SCONJ tokens (4%).
Out of 15 observed tags, the rank of SCONJ is: 13 in number of lemmas, 13 in number of types and 8 in number of tokens.
The 10 most frequent SCONJ lemmas: 就, 呃, 还, 侪, 也, 啊, 又, 去, 还是, 但是
The 10 most frequent SCONJ types: 就, 呃, 还, 侪, 也, 啊, 又, 去, 还是, 但是
The 10 most frequent ambiguous lemmas: 就 (SCONJ 77, ADP 2), 呃 (PART 148, SCONJ 58, PUNCT 2, NOUN 1), 侪 (SCONJ 34, ADV 2), 也 (SCONJ 25, CCONJ 1), 啊 (PART 79, SCONJ 22, INTJ 2, ADJ 1, AUX 1), 又 (SCONJ 19, CCONJ 3), 去 (VERB 14, SCONJ 11, ADP 1), 还是 (SCONJ 10, AUX 3, PART 2, CCONJ 1), 的 (SCONJ 5, PART 3), 之 (SCONJ 2, AUX 1)
The 10 most frequent ambiguous types: 就 (SCONJ 77, ADP 2), 呃 (PART 148, SCONJ 58, PUNCT 2, NOUN 1), 侪 (SCONJ 34, ADV 2), 也 (SCONJ 25, CCONJ 1), 啊 (PART 79, SCONJ 22, INTJ 2, ADJ 1, AUX 1), 又 (SCONJ 19, CCONJ 3), 去 (VERB 14, SCONJ 11, ADP 1), 还是 (SCONJ 10, AUX 3, PART 2, CCONJ 1), 的 (SCONJ 5, PART 3), 之 (SCONJ 2, AUX 1)
- 就
- 呃
- 侪
- 也
- 啊
- 又
- 去
- 还是
- 的
- 之
Morphology
The form / lemma ratio of SCONJ is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “不过”: 不过.
The 2nd highest number of forms (1) was observed with the lemma “之”: 之.
The 3rd highest number of forms (1) was observed with the lemma “也”: 也.
SCONJ does not occur with any features.
Relations
SCONJ nodes are attached to their parents using 3 different relations: mark (350; 99% instances), case (1; 0% instances), parataxis (1; 0% instances)
Parents of SCONJ nodes belong to 6 different parts of speech: VERB (266; 76% instances), ADJ (67; 19% instances), AUX (9; 3% instances), PRON (5; 1% instances), NOUN (4; 1% instances), ADV (1; 0% instances)
351 (100%) SCONJ nodes are leaves.
0 (0%) SCONJ nodes have one child.
1 (0%) SCONJ nodes have two children.
The highest child degree of a SCONJ node is 2.
Children of SCONJ nodes are attached using 2 different relations: discourse (1; 50% instances), punct (1; 50% instances)
Children of SCONJ nodes belong to 2 different parts of speech: PART (1; 50% instances), PUNCT (1; 50% instances)