Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Chinese-HK: POS Tags: `NOUN`

There are 508 NOUN lemmas (30%), 508 NOUN types (29%) and 1766 NOUN tokens (18%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: 議員、主席、個、問題、會議、人、規則、立法會、程序、現在

The 10 most frequent NOUN types: 議員、主席、個、問題、會議、人、規則、立法會、程序、現在

The 10 most frequent ambiguous lemmas: 個 (NOUN 54, NUM 1, PART 1), 選舉 (NOUN 24, VERB 24), 決定 (NOUN 22, VERB 7), 宣誓 (VERB 18, NOUN 14), 點 (NOUN 9, ADV 1), 澄清 (NOUN 7, VERB 7), 工作 (NOUN 6, VERB 2), 投票 (VERB 8, NOUN 6), 提問 (NOUN 5, VERB 2), 規定 (NOUN 5, VERB 2)

The 10 most frequent ambiguous types: 個 (NOUN 54, NUM 1, PART 1), 選舉 (NOUN 24, VERB 24), 決定 (NOUN 22, VERB 7), 宣誓 (VERB 18, NOUN 14), 點 (NOUN 9, ADV 1), 澄清 (NOUN 7, VERB 7), 工作 (NOUN 6, VERB 2), 投票 (VERB 8, NOUN 6), 提問 (NOUN 5, VERB 2), 規定 (NOUN 5, VERB 2)

個
- NOUN 54: 我要十個一元！
- NUM 1: 我只有一個女，沒兒子。
- PART 1: 但就是每天也聽個不停。
選舉
- NOUN 24: 我剛才說過，你的提問祇能夠涉及選舉過程或與選舉有關的事情。
- VERB 24: 而第二部分是選舉立法會主席。
決定
- NOUN 22: 面對這些法律觀點或風險，你認為今天是否不適宜就選舉立法會主席作出決定？
- VERB 7: 如果大家認為梁君彥議員的答覆未能回應大家的提問，大家可以決定尋求 ……
宣誓
- VERB 18: 第二，站在你身後的數位議員，剛才在宣誓時遇到一些障礙。
- NOUN 14: 第一部分是剛才大家已完成的宣誓。
點
- NOUN 9: 豪仔今晚點來吃飯嗎？
- ADV 1: 快點！
澄清
- NOUN 7: 要求候選人就某些方面作出澄清。
- VERB 7: 不過，無論當事人如何澄清及解釋，在這個會議上，我不能保證令大家滿意。
工作
- NOUN 6: 起初是來唱歌的，漸漸就變成工作。
- VERB 2: 那我來到香港都希望找到我喜歡的行業工作。
投票
- VERB 8: 你是否也可要求選民自行決定是否投票給梁天琦？
- NOUN 6: 根據會議規則，我現在祇能主持投票的過程。
提問
- NOUN 5: 現在請毛孟靜議員提問。
- VERB 2: 而關注此事的議員亦可出去聆聽他的闡述，以及可以向他提問。
規定
- NOUN 5: 不過，根據《議事規則》附表一第六及七段的規定，由擔任議員時間最長的我來主持會議。
- VERB 2: 而《議事規則》祇規定議員要完整讀出誓詞。

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (1) was observed with the lemma “Declaration_of_Renunciation_of_British_Citizenship”: Declaration_of_Renunciation_of_British_Citizenship.

The 2nd highest number of forms (1) was observed with the lemma “MP3”: MP3.

The 3rd highest number of forms (1) was observed with the lemma “SET”: SET.

NOUN occurs with 1 features: NounType (270; 15% instances)

NOUN occurs with 1 feature-value pairs: NounType=Clf

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (1496 tokens). Examples: 議員、主席、問題、會議、人、規則、立法會、程序、現在、選舉

Relations

NOUN nodes are attached to their parents using 28 different relations: obj (603; 34% instances), nsubj (227; 13% instances), obl (151; 9% instances), compound (145; 8% instances), clf (126; 7% instances), obl:tmod (99; 6% instances), root (86; 5% instances), nmod (68; 4% instances), conj (65; 4% instances), flat (44; 2% instances), vocative (40; 2% instances), obj:periph (24; 1% instances), parataxis (19; 1% instances), advcl (11; 1% instances), appos (11; 1% instances), case:loc (9; 1% instances), ccomp (9; 1% instances), obl:patient (6; 0% instances), amod (5; 0% instances), compound:vo (5; 0% instances), dislocated (3; 0% instances), xcomp (3; 0% instances), acl (2; 0% instances), case (1; 0% instances), compound:dir (1; 0% instances), nsubj:pass (1; 0% instances), obl:agent (1; 0% instances), reparandum (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (1111; 63% instances), NOUN (334; 19% instances), (86; 5% instances), NUM (60; 3% instances), PROPN (51; 3% instances), ADJ (48; 3% instances), DET (43; 2% instances), PRON (15; 1% instances), ADP (10; 1% instances), ADV (5; 0% instances), AUX (2; 0% instances), SCONJ (1; 0% instances)

692 (39%) NOUN nodes are leaves.

586 (33%) NOUN nodes have one child.

270 (15%) NOUN nodes have two children.

218 (12%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.

Children of NOUN nodes are attached using 31 different relations: punct (319; 16% instances), det (245; 12% instances), case (217; 11% instances), compound (167; 8% instances), acl (165; 8% instances), nmod (165; 8% instances), nummod (156; 8% instances), amod (124; 6% instances), conj (74; 4% instances), advmod (64; 3% instances), cop (63; 3% instances), nsubj (46; 2% instances), case:loc (34; 2% instances), cc (29; 1% instances), advcl (19; 1% instances), discourse:sp (17; 1% instances), appos (14; 1% instances), parataxis (12; 1% instances), clf (11; 1% instances), aux (7; 0% instances), mark (6; 0% instances), obl:tmod (6; 0% instances), obl (5; 0% instances), flat (4; 0% instances), csubj (2; 0% instances), discourse (2; 0% instances), mark:rel (2; 0% instances), vocative (2; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), obj (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (334; 17% instances), PUNCT (319; 16% instances), DET (247; 12% instances), VERB (197; 10% instances), ADP (175; 9% instances), NUM (163; 8% instances), ADJ (126; 6% instances), PRON (117; 6% instances), PART (92; 5% instances), AUX (71; 4% instances), ADV (64; 3% instances), PROPN (41; 2% instances), CCONJ (29; 1% instances), SCONJ (3; 0% instances), INTJ (2; 0% instances)

Treebank Statistics: UD_Chinese-HK: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Chinese-HK: POS Tags: `NOUN`