Treebank Statistics: UD_Shanghainese-ShUD: POS Tags: NOUN
There are 390 NOUN lemmas (22%), 390 NOUN types (22%) and 929 NOUN tokens (11%).
Out of 15 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.
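The lemma, type and token counts and the per-tag ranks can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that produced this page: the helper names (`to_conllu`, `read_words`, `pos_summary`, `rank`) and the toy rows are hypothetical, and it assumes that a "type" is a distinct word form with no case folding.

```python
from collections import defaultdict

# Toy rows (form, lemma, UPOS), invented for illustration; not from ShUD.
TOY_ROWS = [
    ("dogs", "dog", "NOUN"), ("chase", "chase", "VERB"),
    ("cats", "cat", "NOUN"), (".", ".", "PUNCT"),
    ("the", "the", "DET"), ("dog", "dog", "NOUN"),
    ("sleeps", "sleep", "VERB"),
]

def to_conllu(rows):
    """Render rows as minimal 10-column CoNLL-U lines (unused columns are "_")."""
    return "\n".join(
        "\t".join([str(i), form, lemma, upos, "_", "_", "_", "_", "_", "_"])
        for i, (form, lemma, upos) in enumerate(rows, start=1)
    )

def read_words(conllu_text):
    """Yield (form, lemma, upos) for every syntactic word."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges (1-2) and empty nodes (1.1)
            continue
        yield cols[1], cols[2], cols[3]

def pos_summary(conllu_text):
    """Map each UPOS tag to (distinct lemmas, distinct types, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_words(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(summary, upos, column):
    """1-based rank of `upos` by column 0 (lemmas), 1 (types) or 2 (tokens)."""
    ordered = sorted(summary, key=lambda u: summary[u][column], reverse=True)
    return ordered.index(upos) + 1

summary = pos_summary(to_conllu(TOY_ROWS))
print(summary["NOUN"], rank(summary, "NOUN", 2))
```

On real data, `conllu_text` would be the contents of the treebank's `.conllu` file read with UTF-8 encoding.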
The 10 most frequent NOUN lemmas: 电话, 今朝, 辰光, 现在, 明朝, 拧, 老婆, 事体, 个, 闲话
The 10 most frequent NOUN types: 电话, 今朝, 辰光, 现在, 明朝, 拧, 老婆, 事体, 个, 闲话
The 10 most frequent ambiguous lemmas: 明朝 (NOUN 27, PROPN 3), 拧 (NOUN 22, PART 2), 事体 (NOUN 20, PART 1, VERB 1), 闲话 (NOUN 15, PART 4, SCONJ 1), 意思 (NOUN 13, VERB 2), 只 (NOUN 12, ADV 2), 宝贝 (NOUN 7, PROPN 5, VERB 1), 心 (NOUN 7, PART 3), 一道 (NOUN 6, ADV 3), 前头 (NOUN 6, ADV 2, ADP 1)
The 10 most frequent ambiguous types: 明朝 (NOUN 27, PROPN 3), 拧 (NOUN 22, PART 2), 事体 (NOUN 20, PART 1, VERB 1), 闲话 (NOUN 15, PART 4, SCONJ 1), 意思 (NOUN 13, VERB 2), 只 (NOUN 12, ADV 2), 宝贝 (NOUN 7, PROPN 5, VERB 1), 心 (NOUN 7, PART 3), 一道 (NOUN 6, ADV 3), 前头 (NOUN 6, ADV 2, ADP 1)
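A lemma counts as ambiguous here when it occurs with more than one tag. As a sketch of how such a list could be derived (the function name `ambiguous_lemmas` is hypothetical, and the ordering by NOUN frequency is an assumption read off the list above), using two of the reported entries plus one unambiguous lemma with an invented count:

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(pairs, upos="NOUN", top=10):
    """Return lemmas seen with `upos` and at least one other tag.

    `pairs` is an iterable of (lemma, tag). Results are ordered by how often
    the lemma carries `upos` (assumed ordering), each with its tag counts.
    """
    by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        by_lemma[lemma][tag] += 1
    hits = [
        (lemma, tags) for lemma, tags in by_lemma.items()
        if upos in tags and len(tags) > 1
    ]
    hits.sort(key=lambda item: item[1][upos], reverse=True)
    return [(lemma, tags.most_common()) for lemma, tags in hits[:top]]

# Counts for 明朝 and 意思 come from the list above; the count for 电话 is invented.
pairs = (
    [("意思", "NOUN")] * 13 + [("意思", "VERB")] * 2
    + [("明朝", "NOUN")] * 27 + [("明朝", "PROPN")] * 3
    + [("电话", "NOUN")] * 5
)
for lemma, tags in ambiguous_lemmas(pairs):
    print(lemma, ", ".join(f"{tag} {n}" for tag, n in tags))
```

Passing (form, tag) pairs instead of (lemma, tag) pairs gives the corresponding list for types.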
Morphology
The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “一切”: 一切.
The 2nd highest number of forms (1) was observed with the lemma “一声”: 一声.
The 3rd highest number of forms (1) was observed with the lemma “一年”: 一年.
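The ratio and the ranking above can be sketched as follows. This assumes the form / lemma ratio is the average number of distinct forms per lemma, which agrees with the 390 types and 390 lemmas reported for NOUN, and that ties are broken by lemma order, as the three lemmas listed suggest. The function names are hypothetical.

```python
from collections import defaultdict

def forms_by_lemma(pairs):
    """Group distinct forms under each lemma; `pairs` holds (form, lemma)."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_by_lemma(pairs)
    return sum(len(f) for f in forms.values()) / len(forms)

def richest_lemmas(pairs, top=3):
    """Lemmas with the most forms; ties fall back to lemma order (assumed)."""
    forms = forms_by_lemma(pairs)
    ordered = sorted(forms, key=lambda lemma: (-len(forms[lemma]), lemma))
    return [(lemma, sorted(forms[lemma])) for lemma in ordered[:top]]

# The three lemmas named above, each seen with a single form.
pairs = [("一年", "一年"), ("一切", "一切"), ("一声", "一声"), ("一切", "一切")]
print(f"{form_lemma_ratio(pairs):.6f}")
print(richest_lemmas(pairs))
```

A ratio of exactly 1 means no lemma has more than one surface form, so every lemma ties at one form and the ranking reduces to lemma order.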
NOUN does not occur with any features.
Relations
NOUN nodes are attached to their parents using 21 different relations: obj (337; 36% instances), nmod (217; 23% instances), nsubj (147; 16% instances), vocative (57; 6% instances), obl (49; 5% instances), root (34; 4% instances), clf (25; 3% instances), compound (17; 2% instances), dislocated (8; 1% instances), ccomp (7; 1% instances), parataxis (7; 1% instances), acl (6; 1% instances), xcomp (5; 1% instances), appos (4; 0% instances), advcl (2; 0% instances), conj (2; 0% instances), amod (1; 0% instances), csubj (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)
Parents of NOUN nodes belong to 10 different parts of speech: VERB (676; 73% instances), NOUN (99; 11% instances), ADJ (64; 7% instances), [no parent, i.e. root] (34; 4% instances), NUM (19; 2% instances), PART (14; 2% instances), PRON (11; 1% instances), AUX (8; 1% instances), ADV (2; 0% instances), PROPN (2; 0% instances)
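Both distributions above come from the HEAD and DEPREL columns of the CoNLL-U file. A minimal sketch of the counting, with hypothetical helper names and an invented two-sentence sample: a node attached as root has head 0 and no parent, so it is counted under an empty parent tag, which is consistent with the 34 root attachments matching the 34 parentless nodes.

```python
from collections import Counter

def row(idx, form, upos, head, deprel):
    """One minimal CoNLL-U line; lemma, XPOS, FEATS, DEPS, MISC are left as "_"."""
    return "\t".join([str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"])

# Toy two-sentence sample, invented for illustration; not from ShUD.
TOY = "\n".join([
    row(1, "dogs", "NOUN", 2, "nsubj"),
    row(2, "chase", "VERB", 0, "root"),
    row(3, "cats", "NOUN", 2, "obj"),
    "",
    row(1, "rain", "NOUN", 0, "root"),
])

def read_sentences(conllu_text):
    """Yield one dict per sentence: word id -> (upos, head id, deprel)."""
    sent = {}
    for line in conllu_text.splitlines():
        if not line.strip():          # blank line closes a sentence
            if sent:
                yield sent
            sent = {}
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():     # skip ranges (1-2) and empty nodes (1.1)
                sent[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
    if sent:
        yield sent

def attachment_stats(conllu_text, upos="NOUN"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in read_sentences(conllu_text):
        for tag, head, deprel in sent.values():
            if tag == upos:
                relations[deprel] += 1
                # A root node has head 0 and therefore no parent tag.
                parent_pos[sent[head][0] if head else ""] += 1
    return relations, parent_pos

def percent(count, total):
    """Share of `count` in `total` as a whole-number percentage."""
    return round(100 * count / total)

relations, parent_pos = attachment_stats(TOY)
print(relations.most_common(), parent_pos.most_common())
```

For example, `percent(337, 929)` gives 36, the share reported for `obj`, and `percent(4, 929)` gives 0, which is why low-frequency relations show as 0%.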
511 (55%) NOUN nodes are leaves.
299 (32%) NOUN nodes have one child.
64 (7%) NOUN nodes have two children.
55 (6%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 7.
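The child-degree figures above count, for each NOUN node, how many other nodes name it as their head. A sketch under the assumption that every dependent counts, punctuation included; the function names and the toy sentence are hypothetical.

```python
from collections import Counter

def child_degrees(sentence, upos="NOUN"):
    """Number of children of each node tagged `upos`.

    `sentence` is a list of (id, upos, head) triples; head 0 marks the root.
    """
    n_children = Counter(head for _, _, head in sentence if head != 0)
    return [n_children[node_id] for node_id, tag, _ in sentence if tag == upos]

def degree_summary(sentences, upos="NOUN"):
    """Bucket degrees as 0, 1, 2 and 3+ and track the highest degree seen."""
    buckets, highest = Counter(), 0
    for sentence in sentences:
        for degree in child_degrees(sentence, upos):
            buckets[min(degree, 3)] += 1   # key 3 stands for "three or more"
            highest = max(highest, degree)
    return buckets, highest

# Toy sentence "the big dog chased cats", invented for illustration; not from ShUD.
toy = [[(1, "DET", 3), (2, "ADJ", 3), (3, "NOUN", 4), (4, "VERB", 0), (5, "NOUN", 4)]]
buckets, highest = degree_summary(toy)
print(dict(buckets), highest)
```

The four buckets partition the NOUN tokens, which matches the reported figures: 511 + 299 + 64 + 55 = 929.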
Children of NOUN nodes are attached using 24 different relations: nmod (142; 22% instances), punct (98; 15% instances), det (95; 14% instances), nummod (64; 10% instances), case (63; 10% instances), amod (40; 6% instances), acl (29; 4% instances), cop (26; 4% instances), nsubj (20; 3% instances), discourse (17; 3% instances), aux (8; 1% instances), parataxis (8; 1% instances), clf (7; 1% instances), compound (7; 1% instances), advmod (6; 1% instances), csubj (6; 1% instances), appos (5; 1% instances), conj (4; 1% instances), advcl (3; 0% instances), mark (3; 0% instances), vocative (3; 0% instances), cc (1; 0% instances), flat (1; 0% instances), reparandum (1; 0% instances)
Children of NOUN nodes belong to 15 different parts of speech: PRON (135; 21% instances), NOUN (99; 15% instances), PUNCT (98; 15% instances), NUM (65; 10% instances), ADJ (43; 7% instances), ADP (40; 6% instances), VERB (40; 6% instances), AUX (37; 6% instances), PART (35; 5% instances), DET (34; 5% instances), PROPN (12; 2% instances), ADV (7; 1% instances), INTJ (7; 1% instances), SCONJ (4; 1% instances), CCONJ (1; 0% instances)