home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: PART

There are 1 PART lemmas (7%), 29 PART types (0%) and 1881 PART tokens (9%). Out of 15 observed tags, the rank of PART is: 9 in number of lemmas, 12 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: _

The 10 most frequent PART types: 的、 了、 著、 地、 之、 過、 得、 嗎、 人、 者

The 10 most frequent ambiguous lemmas: _ (NOUN 5410, VERB 3467, PUNCT 2902, PART 1881, PROPN 1361, ADP 1288, ADV 1283, NUM 873, PRON 710, ADJ 650, AUX 618, DET 355, X 306, CCONJ 283, SCONJ 28)

The 10 most frequent ambiguous types: 的 (PART 1361, X 1), 地 (PART 22, NOUN 2), 之 (PART 21, PRON 6), 過 (PART 18, AUX 1, VERB 1), 得 (PART 9, VERB 3), 人 (NOUN 91, PART 3), 者 (NOUN 2, PART 2), 區 (NOUN 2, PART 1), 家 (NOUN 9, PART 1), 法 (PROPN 2, PART 1)

Morphology

The form / lemma ratio of PART is 29.000000 (the average of all parts of speech is 388.466667).

The 1st highest number of forms (29) was observed with the lemma “_”: 之, 了, 人, 區, 呢, 嗎, 地, 家, 得, 河, 法, 的, 瞭, 緣, 罪, 者, 肺, 舟, 著, 處, 號, 街, 賽, 過, 配, 鎊, 體, 點, 黨.

PART occurs with 2 features: Case (709; 38% instances), Aspect (398; 21% instances)

PART occurs with 3 feature-value pairs: Aspect=Perf, Aspect=Prog, Case=Gen

PART occurs with 4 feature combinations. The most frequent feature combination is _ (774 tokens). Examples: 的、 了、 地、 得、 之、 嗎、 人、 者、 黨、 區

Relations

PART nodes are attached to their parents using 14 different relations: case (709; 38% instances), mark:relcl (626; 33% instances), aux (398; 21% instances), discourse:sp (87; 5% instances), mark:adv (22; 1% instances), mark:prt (16; 1% instances), obj (8; 0% instances), appos (4; 0% instances), compound (3; 0% instances), conj (3; 0% instances), nsubj (2; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)

Parents of PART nodes belong to 9 different parts of speech: VERB (933; 50% instances), NOUN (366; 19% instances), ADJ (245; 13% instances), PROPN (139; 7% instances), PRON (108; 6% instances), ADP (63; 3% instances), X (15; 1% instances), DET (6; 0% instances), NUM (6; 0% instances)

1859 (99%) PART nodes are leaves.

11 (1%) PART nodes have one child.

4 (0%) PART nodes have two children.

7 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 13 different relations: dep (11; 25% instances), compound (10; 23% instances), punct (6; 14% instances), nummod (3; 7% instances), case (2; 5% instances), case:loc (2; 5% instances), cc (2; 5% instances), conj (2; 5% instances), nsubj (2; 5% instances), advmod (1; 2% instances), clf (1; 2% instances), cop (1; 2% instances), nmod (1; 2% instances)

Children of PART nodes belong to 10 different parts of speech: NOUN (19; 43% instances), PUNCT (6; 14% instances), ADP (4; 9% instances), PROPN (4; 9% instances), VERB (3; 7% instances), CCONJ (2; 5% instances), NUM (2; 5% instances), X (2; 5% instances), AUX (1; 2% instances), PRON (1; 2% instances)