Treebank Statistics: UD_Chinese-Beginner: POS Tags: PART
There are 26 PART
lemmas (1%), 26 PART
types (1%) and 1427 PART
tokens (7%).
Out of 15 observed tags, the rank of PART
is: 9 in number of lemmas, 9 in number of types and 6 in number of tokens.
The 10 most frequent PART
lemmas: 的、 了、 吗、 吧、 得、 呢、 地、 啊、 什么的、 之
The 10 most frequent PART
types: 的、 了、 吗、 吧、 得、 呢、 地、 啊、 什么的、 之
The 10 most frequent ambiguous lemmas: 了 (PART 421, AUX 156, VERB 1), 得 (PART 61, AUX 12, ADV 2, VERB 1), 地 (PART 22, NOUN 1), 之 (PART 7, NOUN 2), 等 (VERB 10, PART 6, ADP 1, SCONJ 1), 之后 (PART 3, ADV 1), 等等 (PART 3, VERB 1), 之前 (ADV 3, PART 2), 之类 (NOUN 8, PART 1), 它 (NOUN 3, PRON 2, PART 1)
The 10 most frequent ambiguous types: 了 (PART 421, AUX 156, VERB 1), 得 (PART 61, AUX 12, ADV 2, VERB 1), 地 (PART 22, NOUN 1), 之 (PART 7, NOUN 2), 等 (VERB 10, PART 6, ADP 1, SCONJ 1), 之后 (PART 3, ADV 1), 等等 (PART 3, VERB 1), 之前 (ADV 3, PART 2), 之类 (NOUN 8, PART 1), 它 (NOUN 3, PRON 2, PART 1)
- 了
- 得
- 地
- 之
- 等
- 之后
- 等等
- 之前
- 之类
- 它
Morphology
The form / lemma ratio of PART
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “之”: 之.
The 2nd highest number of forms (1) was observed with the lemma “之前”: 之前.
The 3rd highest number of forms (1) was observed with the lemma “之后”: 之后.
PART
occurs with 3 features: Case (460; 32% instances), Number (2; 0% instances), Aspect (1; 0% instances)
PART
occurs with 3 feature-value pairs: Aspect=Perf
, Case=Gen
, Number=Plur
PART
occurs with 4 feature combinations.
The most frequent feature combination is _
(964 tokens).
Examples: 了、 吗、 吧、 的、 得、 呢、 地、 啊、 之、 等
Relations
PART
nodes are attached to their parents using 10 different relations: discourse (449; 31% instances), case (415; 29% instances), discourse:sp (346; 24% instances), mark (138; 10% instances), advmod (32; 2% instances), dep (19; 1% instances), conj (18; 1% instances), obl:tmod (5; 0% instances), cc (4; 0% instances), flat (1; 0% instances)
Parents of PART
nodes belong to 10 different parts of speech: VERB (758; 53% instances), ADJ (275; 19% instances), NOUN (183; 13% instances), PRON (137; 10% instances), PROPN (26; 2% instances), ADV (19; 1% instances), DET (16; 1% instances), AUX (10; 1% instances), NUM (2; 0% instances), PART (1; 0% instances)
1414 (99%) PART
nodes are leaves.
8 (1%) PART
nodes have one child.
3 (0%) PART
nodes have two children.
2 (0%) PART
nodes have three or more children.
The highest child degree of a PART
node is 5.
Children of PART
nodes are attached using 8 different relations: advcl (7; 32% instances), punct (4; 18% instances), obl:arg (3; 14% instances), cop (2; 9% instances), csubj (2; 9% instances), obl (2; 9% instances), clf (1; 5% instances), conj (1; 5% instances)
Children of PART
nodes belong to 7 different parts of speech: VERB (7; 32% instances), NOUN (5; 23% instances), PUNCT (4; 18% instances), ADJ (2; 9% instances), AUX (2; 9% instances), PART (1; 5% instances), PRON (1; 5% instances)