Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: SCONJ
There are 53 SCONJ lemmas (0%), 53 SCONJ types (0%) and 5202 SCONJ tokens (4%).
Out of 16 observed tags, the rank of SCONJ is: 11 in number of lemmas, 12 in number of types and 8 in number of tokens.
The 10 most frequent SCONJ lemmas: 的、 也、 而、 并、 但、 所、 都、 则、 就、 还
The 10 most frequent SCONJ types: 的、 也、 而、 并、 但、 所、 都、 则、 就、 还
The 10 most frequent ambiguous lemmas: 的 (PART 3233, SCONJ 2404), 也 (SCONJ 325, CCONJ 7), 而 (SCONJ 305, CCONJ 13, ADV 1), 并 (SCONJ 281, ADV 30, CCONJ 16), 所 (SCONJ 153, NOUN 25, PART 25), 都 (SCONJ 151, NOUN 1, PART 1), 则 (SCONJ 112, NOUN 3), 就 (SCONJ 109, ADP 7), 来 (SCONJ 98, VERB 45, ADP 17, ADV 1), 以 (ADP 136, VERB 106, SCONJ 80)
The 10 most frequent ambiguous types: 的 (PART 3233, SCONJ 2404), 也 (SCONJ 325, CCONJ 7), 而 (SCONJ 305, CCONJ 13, ADV 1), 并 (SCONJ 281, ADV 30, CCONJ 16), 所 (SCONJ 153, NOUN 25, PART 25), 都 (SCONJ 151, NOUN 1, PART 1), 则 (SCONJ 112, NOUN 3), 就 (SCONJ 109, ADP 7), 来 (SCONJ 98, VERB 45, ADP 17, ADV 1), 以 (ADP 136, VERB 106, SCONJ 80)
- 的
- 也
- 而
- 并
- 所
- 都
- 则
- 就
- 来
- 以
Morphology
The form / lemma ratio of SCONJ is 1.000000 (the average of all parts of speech is 1.004572).
The 1st highest number of forms (1) was observed with the lemma “不过”: 不过.
The 2nd highest number of forms (1) was observed with the lemma “且”: 且.
The 3rd highest number of forms (1) was observed with the lemma “乃”: 乃.
SCONJ does not occur with any features.
Relations
SCONJ nodes are attached to their parents using 3 different relations: mark (2672; 51% instances), mark:rel (2426; 47% instances), mark:adv (104; 2% instances)
Parents of SCONJ nodes belong to 10 different parts of speech: VERB (4168; 80% instances), ADJ (897; 17% instances), NOUN (86; 2% instances), PART (24; 0% instances), ADV (11; 0% instances), ADP (5; 0% instances), AUX (4; 0% instances), PROPN (4; 0% instances), NUM (2; 0% instances), PRON (1; 0% instances)
5147 (99%) SCONJ nodes are leaves.
55 (1%) SCONJ nodes have one child.
The highest child degree of a SCONJ node is 1.
Children of SCONJ nodes are attached using 1 different relations: punct (55; 100% instances)
Children of SCONJ nodes belong to 1 different parts of speech: PUNCT (55; 100% instances)