Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: `NOUN`

There are 18540 NOUN lemmas (64%), 18790 NOUN types (60%) and 35040 NOUN tokens (23%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: 事, 物, 為, 後, 他, 様, 中, 人, 時, 場合

The 10 most frequent NOUN types: こと, ため, もの, 後, よう, 人, 中, 他, 場合, お店

The 10 most frequent ambiguous lemmas: 後 (NOUN 177, ADV 11), 様 (AUX 257, NOUN 137), 中 (NOUN 127, ADV 1), 現在 (NOUN 76, ADV 27), 所 (NOUN 60, ADV 1), 前 (NOUN 46, ADV 1), 一部 (NOUN 44, ADV 2), 必要 (NOUN 40, ADJ 35), 結果 (NOUN 40, ADV 4), 全て (NOUN 36, ADV 22)

The 10 most frequent ambiguous types: 後 (NOUN 177, ADV 1), よう (AUX 256, NOUN 128), 中 (NOUN 115, ADV 1), 現在 (NOUN 76, ADV 27), 多く (ADJ 54, NOUN 51), 感じ (NOUN 51, VERB 33), 一部 (NOUN 44, ADV 2), 前 (NOUN 44, ADV 1), 必要 (NOUN 40, ADJ 35), 結果 (NOUN 40, ADV 4)

後
- NOUN 177: これは後に本居宣長によって、現在と同じような位置に訂正された。
- ADV 1: 翌3年 6月 22日、伴の潜伏する大安寺村の慶雲庵に村松藩の捕吏が殺到、伴は捕吏の一人を板戸越しに刺した後相手方の怯んだ隙に自刃して果てたという。
よう
- AUX 256: これらが順に収縮することで食物を胃に送り出すような動きをする。
- NOUN 128: 以上の操作を再帰的に繰り返すと以下のような決定木が出力される。
中
- NOUN 115: イーアスの中にある映画館なので、駐車場が広いのも Good 。
- ADV 1: 看護師不足が問題となる中、高梁市が看護師を目指す学生を対象にした看護師養成奨学金制度をつくり、 4月から奨学生の募集を始める。
現在
- NOUN 76: 現在は空き名跡となっている。
- ADV 27: 現在、その名称はアトレティコの下部組織の名前として残っている。
多く
- ADJ 54: この時代から、日本列島に人類が住んだ遺跡や遺物が多く発見されている。
- NOUN 51: 低公害車はそれほど多くはないが、ハイブリッドバスや CNGバスが導入されている。
感じ
- NOUN 51: やはり、日常的に利用されることが多い業者だけあって、非常に慣れた感じはあります。
- VERB 33: 価格に見合う満足感を感じます。
一部
- NOUN 44: 一部を抜粋します。
- ADV 2: 事件が朝刊各版の締め切り間際に立て続けに起こったため、各新聞は配達先によって記事内容が一部異なっている。
前
- NOUN 44: 先を歩いていたミニスカートの女性が NKプリントの前で自動車にはね飛ばされる。
- ADV 1: 若い講師に不信感を抱かないように前もってそう説明しているようだ。
必要
- NOUN 40: しばらくの完全休養とリハビリが必要とのことでした。
- ADJ 35: なお、「 NTTコム」というと、別会社の「 NTTコミュニケーションズ」を指すので、注意が必要である。
結果
- NOUN 40: 戦闘の経過や結果はテキストで表現されるため、変わりゆく戦況に一喜一憂しながらお楽しみいただけます。
- ADV 4: 結果、他社で物件を契約しました。

Morphology

The form / lemma ratio of NOUN is 1.013484 (the average of all parts of speech is 1.095294).

The 1st highest number of forms (6) was observed with the lemma “_”: かっちゃ, セインツ, ドゥーナダン, ポストペイ, リアドロ, レーベンズ.

The 2nd highest number of forms (4) was observed with the lemma “出し”: だし, ダシ, 出し, 出汁.

The 3rd highest number of forms (4) was observed with the lemma “子供達”: 子どもたち, 子ども達, 子供たち, 子供達.

NOUN occurs with 1 features: Polarity (3; 0% instances)

NOUN occurs with 1 feature-value pairs: Polarity=Neg

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (35037 tokens). Examples: こと, ため, もの, 後, よう, 人, 中, 他, 場合, お店

Relations

NOUN nodes are attached to their parents using 12 different relations: obl (9837; 28% instances), nmod (8921; 25% instances), nsubj (6541; 19% instances), obj (4791; 14% instances), root (2090; 6% instances), compound (825; 2% instances), advcl (621; 2% instances), conj (488; 1% instances), nsubj:outer (463; 1% instances), acl (411; 1% instances), ccomp (36; 0% instances), iobj (16; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (18843; 54% instances), NOUN (10856; 31% instances), (2090; 6% instances), ADJ (1753; 5% instances), NUM (767; 2% instances), PROPN (591; 2% instances), ADV (87; 0% instances), PRON (37; 0% instances), AUX (8; 0% instances), INTJ (3; 0% instances), SCONJ (2; 0% instances), SYM (2; 0% instances), X (1; 0% instances)

1090 (3%) NOUN nodes are leaves.

14555 (42%) NOUN nodes have one child.

12834 (37%) NOUN nodes have two children.

6561 (19%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 16.

Children of NOUN nodes are attached using 25 different relations: case (31522; 50% instances), nmod (10360; 16% instances), punct (7231; 11% instances), acl (6497; 10% instances), aux (1203; 2% instances), cop (1158; 2% instances), nsubj (1081; 2% instances), det (960; 2% instances), compound (764; 1% instances), obl (675; 1% instances), conj (441; 1% instances), advmod (336; 1% instances), mark (335; 1% instances), obj (285; 0% instances), cc (185; 0% instances), amod (161; 0% instances), nummod (115; 0% instances), csubj (97; 0% instances), advcl (66; 0% instances), nsubj:outer (60; 0% instances), dep (26; 0% instances), discourse (4; 0% instances), csubj:outer (3; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (31522; 50% instances), NOUN (10856; 17% instances), PUNCT (7231; 11% instances), VERB (5040; 8% instances), AUX (2361; 4% instances), ADJ (1733; 3% instances), PROPN (1699; 3% instances), DET (960; 2% instances), NUM (918; 1% instances), ADV (338; 1% instances), PRON (327; 1% instances), SCONJ (271; 0% instances), CCONJ (185; 0% instances), PART (65; 0% instances), SYM (55; 0% instances), INTJ (4; 0% instances), X (2; 0% instances)

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: `NOUN`