Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: PART
There are 574 PART
lemmas (3%), 574 PART
types (3%) and 9882 PART
tokens (8%).
Out of 16 observed tags, the rank of PART
is: 7 in number of lemmas, 7 in number of types and 5 in number of tokens.
The 10 most frequent PART
lemmas: 的、 人、 之、 大、 者、 市、 区、 会、 军、 省
The 10 most frequent PART
types: 的、 人、 之、 大、 者、 市、 区、 会、 军、 省
The 10 most frequent ambiguous lemmas: 的 (PART 3232, SCONJ 2405), 人 (NOUN 385, PART 240, VERB 1), 之 (PART 186, SCONJ 62, PRON 23), 大 (PART 163, ADJ 26, ADV 21, ADP 5, PROPN 3, NOUN 1), 者 (PART 156, NOUN 13), 市 (PART 148, NOUN 20, PROPN 1), 区 (PART 143, NOUN 6), 会 (AUX 224, PART 137, NOUN 3), 军 (PART 134, NOUN 19), 省 (PART 132, NOUN 5)
The 10 most frequent ambiguous types: 的 (PART 3232, SCONJ 2405), 人 (NOUN 365, PART 240, VERB 1), 之 (PART 186, SCONJ 62, PRON 23), 大 (PART 163, ADJ 26, ADV 21, ADP 5, PROPN 3, NOUN 1), 者 (PART 156, NOUN 13), 市 (PART 148, NOUN 20, PROPN 1), 区 (PART 143, NOUN 6), 会 (AUX 200, PART 137, NOUN 3), 军 (PART 134, NOUN 19), 省 (PART 132, NOUN 5)
- 的
- 人
- 之
- 大
- PART 163: 当年 为 培养 “ 天下 最恶 的 人 ” , 说服 十 大 恶人 饶过 小鱼儿 。
- ADJ 26: 1956 年 , 首次 推行 的 许多 改革 措施 在 卡达尔 当政 期间 虽 依然 保留 , 但 在 外交 上 却 没有 任何 大 的 改变 。
- ADV 21: 大 多数 的 加长 型 礼车 则是 租车 公司 的 财产 。
- ADP 5: 船上 最为 奢华 之 处 是 头等 舱 的 大 楼梯 , 位 于 第一 和 第二 烟囱 之间 。
- PROPN 3: 民国 二 年 ( 1913 年 ) , 山西 省 政府 接管 大 朔 中 学校 , 将 校名 改 为 山西 省 立 第三 中学 。
- NOUN 1: 许多 士兵 拥护 他 的 观点 , 声势 之 大 , 甚至 惊动 了 执政 官 贝尔苏斯 。
- 者
- 市
- 区
- 会
- 军
- 省
Morphology
The form / lemma ratio of PART
is 1.000000 (the average of all parts of speech is 1.004660).
The 1st highest number of forms (1) was observed with the lemma “不”: 不.
The 2nd highest number of forms (1) was observed with the lemma “业”: 业.
The 3rd highest number of forms (1) was observed with the lemma “中”: 中.
PART
occurs with 4 features: Case (3285; 33% instances), Number (33; 0% instances), PartType (8; 0% instances), Aspect (2; 0% instances)
PART
occurs with 4 feature-value pairs: Aspect=Perf
, Case=Gen
, Number=Plur
, PartType=Int
PART
occurs with 5 feature combinations.
The most frequent feature combination is _
(6554 tokens).
Examples: 人、 大、 者、 市、 区、 会、 军、 的、 省、 家
Relations
PART
nodes are attached to their parents using 25 different relations: case (4019; 41% instances), nmod (1890; 19% instances), nsubj (1295; 13% instances), obj (1065; 11% instances), conj (526; 5% instances), obl (232; 2% instances), discourse (187; 2% instances), compound (142; 1% instances), root (102; 1% instances), appos (92; 1% instances), parataxis (71; 1% instances), advcl (53; 1% instances), ccomp (33; 0% instances), nsubj:pass (28; 0% instances), advmod (26; 0% instances), compound:ext (25; 0% instances), obl:patient (21; 0% instances), xcomp (16; 0% instances), acl (11; 0% instances), nmod:tmod (11; 0% instances), acl:relcl (10; 0% instances), iobj (10; 0% instances), discourse:sp (8; 0% instances), amod (6; 0% instances), csubj (3; 0% instances)
Parents of PART
nodes belong to 12 different parts of speech: NOUN (4379; 44% instances), VERB (2728; 28% instances), PART (1442; 15% instances), PROPN (658; 7% instances), PRON (259; 3% instances), ADJ (149; 2% instances), (102; 1% instances), NUM (77; 1% instances), DET (63; 1% instances), X (21; 0% instances), ADP (3; 0% instances), ADV (1; 0% instances)
4246 (43%) PART
nodes are leaves.
2455 (25%) PART
nodes have one child.
1547 (16%) PART
nodes have two children.
1634 (17%) PART
nodes have three or more children.
The highest child degree of a PART
node is 29.
Children of PART
nodes are attached using 29 different relations: compound (5522; 45% instances), nmod (2546; 21% instances), punct (979; 8% instances), case (920; 8% instances), conj (470; 4% instances), cc (259; 2% instances), cop (242; 2% instances), amod (212; 2% instances), acl:relcl (199; 2% instances), nsubj (197; 2% instances), appos (196; 2% instances), acl (150; 1% instances), det (102; 1% instances), nummod (79; 1% instances), parataxis (67; 1% instances), advmod (21; 0% instances), mark (13; 0% instances), mark:rel (12; 0% instances), nmod:tmod (10; 0% instances), dislocated (9; 0% instances), obj (7; 0% instances), aux (6; 0% instances), obl (6; 0% instances), advcl (4; 0% instances), csubj (4; 0% instances), ccomp (3; 0% instances), xcomp (3; 0% instances), discourse (2; 0% instances), obl:patient (1; 0% instances)
Children of PART
nodes belong to 15 different parts of speech: NOUN (3706; 30% instances), PROPN (2867; 23% instances), VERB (1572; 13% instances), PART (1442; 12% instances), PUNCT (979; 8% instances), ADP (421; 3% instances), ADJ (347; 3% instances), CCONJ (259; 2% instances), AUX (248; 2% instances), X (118; 1% instances), NUM (96; 1% instances), DET (83; 1% instances), PRON (58; 0% instances), SCONJ (24; 0% instances), ADV (21; 0% instances)