home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese: POS Tags: PART

There are 578 PART lemmas (3%), 578 PART types (3%) and 13172 PART tokens (11%). Out of 15 observed tags, the rank of PART is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: 的、 了、 之、 人、 大、 者、 市、 區、 會、 軍

The 10 most frequent PART types: 的、 了、 之、 人、 大、 者、 市、 區、 會、 軍

The 10 most frequent ambiguous lemmas: 的 (PART 5503, X 134), 了 (PART 765, X 43, VERB 2), 之 (PART 247, PRON 23, X 1), 人 (NOUN 385, PART 240, VERB 1), 大 (PART 163, ADJ 31, ADV 21, PROPN 3, NOUN 1), 者 (PART 156, NOUN 13), 市 (PART 148, NOUN 20, PROPN 1), 區 (PART 143, NOUN 6), 會 (AUX 200, PART 137, NOUN 3), 軍 (PART 134, NOUN 19)

The 10 most frequent ambiguous types: 的 (PART 5503, X 134), 了 (PART 765, X 43, VERB 2), 之 (PART 247, PRON 23, X 1), 人 (NOUN 365, PART 240, VERB 1), 大 (PART 163, ADJ 31, ADV 21, PROPN 3, NOUN 1), 者 (PART 156, NOUN 13), 市 (PART 148, NOUN 20, PROPN 1), 區 (PART 143, NOUN 6), 會 (AUX 200, PART 137, NOUN 3), 軍 (PART 134, NOUN 19)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.000266).

The 1st highest number of forms (1) was observed with the lemma “不”: 不.

The 2nd highest number of forms (1) was observed with the lemma “中”: 中.

The 3rd highest number of forms (1) was observed with the lemma “主”: 主.

PART occurs with 3 features: Case (3283; 25% instances), Aspect (956; 7% instances), Number (33; 0% instances)

PART occurs with 4 feature-value pairs: Aspect=Perf, Aspect=Prog, Case=Gen, Number=Plur

PART occurs with 5 feature combinations. The most frequent feature combination is _ (8900 tokens). Examples: 的、 人、 大、 者、 市、 區、 會、 軍、 省、 家

Relations

PART nodes are attached to their parents using 26 different relations: case:dec (3283; 25% instances), mark:relcl (2428; 18% instances), nmod (1459; 11% instances), nsubj (1263; 10% instances), obj (1083; 8% instances), case:aspect (955; 7% instances), case:pref (734; 6% instances), conj (524; 4% instances), det (429; 3% instances), obl (232; 2% instances), case:suff (142; 1% instances), appos (133; 1% instances), mark:advb (103; 1% instances), root (100; 1% instances), dep (85; 1% instances), acl (53; 0% instances), ccomp (34; 0% instances), advmod (27; 0% instances), nsubj:pass (27; 0% instances), mark:comp (25; 0% instances), acl:relcl (11; 0% instances), nmod:tmod (11; 0% instances), iobj (10; 0% instances), xcomp (10; 0% instances), amod (8; 0% instances), csubj (3; 0% instances)

Parents of PART nodes belong to 13 different parts of speech: VERB (5204; 40% instances), NOUN (4169; 32% instances), PART (1460; 11% instances), ADJ (899; 7% instances), PROPN (723; 5% instances), PRON (258; 2% instances), ADP (196; 1% instances), (100; 1% instances), NUM (77; 1% instances), X (57; 0% instances), DET (27; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)

7537 (57%) PART nodes are leaves.

2533 (19%) PART nodes have one child.

1545 (12%) PART nodes have two children.

1557 (12%) PART nodes have three or more children.

The highest child degree of a PART node is 28.

Children of PART nodes are attached using 31 different relations: case:suff (5523; 46% instances), nmod (2248; 19% instances), punct (856; 7% instances), conj (467; 4% instances), case:dec (429; 4% instances), det (414; 3% instances), acl (277; 2% instances), case (272; 2% instances), cc (256; 2% instances), cop (247; 2% instances), amod (206; 2% instances), acl:relcl (200; 2% instances), nsubj (200; 2% instances), appos (159; 1% instances), nummod (70; 1% instances), dep (68; 1% instances), case:pref (65; 1% instances), advmod (27; 0% instances), mark (13; 0% instances), mark:relcl (12; 0% instances), nmod:tmod (12; 0% instances), dislocated (9; 0% instances), obj (8; 0% instances), aux (4; 0% instances), csubj (4; 0% instances), xcomp (4; 0% instances), ccomp (3; 0% instances), case:aspect (2; 0% instances), discourse (2; 0% instances), advcl (1; 0% instances), aux:caus (1; 0% instances)

Children of PART nodes belong to 15 different parts of speech: NOUN (3726; 31% instances), PROPN (2815; 23% instances), VERB (1566; 13% instances), PART (1460; 12% instances), PUNCT (855; 7% instances), ADP (401; 3% instances), ADJ (347; 3% instances), CCONJ (256; 2% instances), AUX (251; 2% instances), X (118; 1% instances), NUM (93; 1% instances), DET (79; 1% instances), PRON (58; 0% instances), ADV (33; 0% instances), SYM (1; 0% instances)