Treebank Statistics: UD_Shanghainese-ShUD: POS Tags: PART
There are 74 PART lemmas (4%), 74 PART types (4%) and 907 PART tokens (11%).
Out of 15 observed tags, the rank of PART is: 5 in number of lemmas, 5 in number of types and 5 in number of tokens.
The 10 most frequent PART lemmas: 了, 伐, 呃, 啊, 吗, 啦, 呀, 呢, 哦, 勒
The 10 most frequent PART types: 了, 伐, 呃, 啊, 吗, 啦, 呀, 呢, 哦, 勒
The 10 most frequent ambiguous lemmas: 了 (PART 213, AUX 76, SCONJ 2, VERB 2), 伐 (PART 179, ADV 83, AUX 3, VERB 3), 呃 (PART 148, SCONJ 58, PUNCT 2, NOUN 1), 啊 (PART 79, SCONJ 22, INTJ 2, ADJ 1, AUX 1), 啦 (PART 32, VERB 1), 呀 (PART 31, ADJ 1), 呢 (PART 25, INTJ 1), 哦 (PART 17, INTJ 11, ADJ 2, PRON 2), 勒 (PART 14, AUX 8, VERB 5, ADV 4, ADP 1), 吧 (PART 11, NOUN 1, PUNCT 1)
The 10 most frequent ambiguous types: 了 (PART 213, AUX 76, SCONJ 2, VERB 2), 伐 (PART 179, ADV 83, AUX 3, VERB 3), 呃 (PART 148, SCONJ 58, PUNCT 2, NOUN 1), 啊 (PART 79, SCONJ 22, INTJ 2, ADJ 1, AUX 1), 啦 (PART 32, VERB 1), 呀 (PART 31, ADJ 1), 呢 (PART 25, INTJ 1), 哦 (PART 17, INTJ 11, ADJ 2, PRON 2), 勒 (PART 14, AUX 8, VERB 5, ADV 4, ADP 1), 吧 (PART 11, NOUN 1, PUNCT 1)
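The ambiguity counts above can be reproduced by tallying the UPOS tags each lemma occurs with. The following is a minimal sketch, assuming the treebank is available as CoNLL-U text; the helper names `row`, `upos_by_lemma` and `ambiguous_for` are hypothetical, and the `TOY` sample is invented for illustration rather than taken from ShUD.

```python
from collections import Counter, defaultdict


def row(idx, form, upos, head, deprel):
    """Build one 10-column CoNLL-U line (lemma is set equal to the form)."""
    return "\t".join(
        [str(idx), form, form, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Invented toy data, NOT sentences from the treebank.
TOY = "\n".join([
    "# text = toy sentence 1",
    row(1, "吃", "VERB", 0, "root"),
    row(2, "了", "AUX", 1, "aux"),
    row(3, "了", "PART", 1, "discourse"),
    "",
    "# text = toy sentence 2",
    row(1, "好", "ADJ", 0, "root"),
    row(2, "伐", "PART", 1, "discourse"),
    "",
])


def upos_by_lemma(conllu_text):
    """Map each lemma to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        lemma, upos = cols[2], cols[3]
        counts[lemma][upos] += 1
    return counts


def ambiguous_for(counts, tag):
    """Lemmas seen with `tag` and at least one other tag, most frequent first."""
    amb = [(lemma, c) for lemma, c in counts.items() if tag in c and len(c) > 1]
    return sorted(amb, key=lambda item: -item[1][tag])


counts = upos_by_lemma(TOY)
for lemma, tags in ambiguous_for(counts, "PART"):
    print(lemma, dict(tags))
```

In the toy sample only 了 is ambiguous, because 伐 occurs with a single tag; the same tally keyed on column 2 (the form) instead of column 3 (the lemma) would give the ambiguous types.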
Morphology
The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “么”: 么.
The 2nd highest number of forms (1) was observed with the lemma “乖”: 乖.
The 3rd highest number of forms (1) was observed with the lemma “了”: 了.
PART does not occur with any features.
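As a rough illustration of the form / lemma ratio, the sketch below assumes that the ratio is the number of distinct forms divided by the number of distinct lemmas. That definition is an assumption, although it agrees with the 74 types and 74 lemmas reported for PART. The function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Three PART lemmas named above; each occurs with a single form.
part_pairs = [("么", "么"), ("乖", "乖"), ("了", "了")]
ratio = form_lemma_ratio(part_pairs)
print(f"{ratio:.6f}")
```

A ratio of exactly 1 means that no lemma has more than one surface form, which is expected for a treebank without inflectional morphology.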
Relations
PART nodes are attached to their parents using 14 different relations: discourse (766; 84% instances), case (80; 9% instances), mark (16; 2% instances), nsubj (13; 1% instances), obj (12; 1% instances), compound (5; 1% instances), nmod (4; 0% instances), obl (3; 0% instances), parataxis (2; 0% instances), root (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances), clf (1; 0% instances), flat (1; 0% instances)
Parents of PART nodes belong to 12 different parts of speech: VERB (679; 75% instances), ADJ (97; 11% instances), PRON (61; 7% instances), NOUN (35; 4% instances), AUX (12; 1% instances), ADV (11; 1% instances), DET (3; 0% instances), PROPN (3; 0% instances), ADP (2; 0% instances), no parent, i.e. the PART node is the root (2; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)
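The relation percentages above are rounded shares of the 907 PART tokens. The following sketch re-derives them from the listed counts; the variable names are illustrative only.

```python
# Relation counts for PART nodes, copied from the list above.
relation_counts = {
    "discourse": 766, "case": 80, "mark": 16, "nsubj": 13, "obj": 12,
    "compound": 5, "nmod": 4, "obl": 3, "parataxis": 2, "root": 2,
    "acl": 1, "appos": 1, "clf": 1, "flat": 1,
}

total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

The counts sum to the 907 PART tokens, and rounding explains why a relation with 5 instances is shown as 1% while one with 4 instances is shown as 0%.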
861 (95%) PART nodes are leaves.
35 (4%) PART nodes have one child.
7 (1%) PART nodes have two children.
4 (0%) PART nodes have three or more children.
The highest child degree of a PART node is 4.
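The leaf and child-degree figures above come from counting, for every PART node, how many other nodes in the same sentence name it as their head. Below is a minimal sketch under the assumption that the input is CoNLL-U text; `row` and `child_degrees` are hypothetical helper names, and the `TOY` sentence is invented, not drawn from ShUD.

```python
from collections import Counter


def row(idx, form, upos, head, deprel):
    """Build one 10-column CoNLL-U line (lemma is set equal to the form)."""
    return "\t".join(
        [str(idx), form, form, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Invented toy sentence, NOT from the treebank.
TOY = "\n".join([
    "# text = toy sentence",
    row(1, "吃", "VERB", 0, "root"),
    row(2, "了", "PART", 1, "discourse"),
    row(3, "伐", "PART", 1, "discourse"),
    row(4, "？", "PUNCT", 3, "punct"),
    "",
])


def child_degrees(conllu_text, tag):
    """Return a Counter mapping child count -> number of `tag` nodes."""
    degrees = Counter()
    sentence = []  # (id, upos, head) triples of the current sentence

    def flush():
        # Count how often each node id is used as a head in this sentence.
        n_children = Counter(head for _, _, head in sentence)
        for node_id, upos, _ in sentence:
            if upos == tag:
                degrees[n_children[node_id]] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends a sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append((int(cols[0]), cols[3], int(cols[6])))
    flush()  # handle a final sentence without a trailing blank line
    return degrees


degrees = child_degrees(TOY, "PART")
print(dict(degrees))
```

In the toy sentence one PART node is a leaf and the other has a single punctuation child. On the real data the same tally would be expected to give 861, 35, 7 and 4 nodes for zero, one, two, and three or more children, which together account for all 907 PART tokens.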
Children of PART nodes are attached using 10 different relations: compound (32; 51% instances), punct (17; 27% instances), case (5; 8% instances), nsubj (3; 5% instances), acl (1; 2% instances), appos (1; 2% instances), det (1; 2% instances), discourse (1; 2% instances), nmod (1; 2% instances), xcomp (1; 2% instances)
Children of PART nodes belong to 8 different parts of speech: PUNCT (17; 27% instances), NOUN (14; 22% instances), PRON (10; 16% instances), ADJ (7; 11% instances), VERB (7; 11% instances), ADP (5; 8% instances), PROPN (2; 3% instances), INTJ (1; 2% instances)