
This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: PROPN

There are 4327 PROPN lemmas (21%), 4327 PROPN types (18%) and 7141 PROPN tokens (4%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.
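As a rough illustration of how such counts could be derived, the sketch below reads CoNLL-U text and counts distinct lemmas, distinct word forms (types) and tokens for one UPOS tag. It assumes that "lemmas" means distinct LEMMA values and "types" means distinct FORM values; the function name `upos_counts` and the toy sentences are invented for this example and are not taken from UD_Japanese-GSD or from the script that generated this page.

```python
def upos_counts(conllu_text, tag):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # CoNLL-U word lines have 10 columns; IDs such as "1-2" (multiword
        # tokens) or "1.1" (empty nodes) are skipped in this sketch.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        if upos == tag:
            lemmas.add(lemma)
            forms.add(form)
            tokens += 1
    return len(lemmas), len(forms), tokens


# Toy data invented for illustration (hypothetical annotation).
ROWS = [
    ("1", "日本", "日本", "PROPN", "_", "_", "3", "nmod", "_", "_"),
    ("2", "の", "の", "ADP", "_", "_", "1", "case", "_", "_"),
    ("3", "首都", "首都", "NOUN", "_", "_", "5", "nsubj", "_", "_"),
    ("4", "は", "は", "ADP", "_", "_", "3", "case", "_", "_"),
    ("5", "東京", "東京", "PROPN", "_", "_", "0", "root", "_", "_"),
    (),  # blank line separates sentences
    ("1", "日本", "日本", "PROPN", "_", "_", "3", "obl", "_", "_"),
    ("2", "に", "に", "ADP", "_", "_", "1", "case", "_", "_"),
    ("3", "行く", "行く", "VERB", "_", "_", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS) + "\n"

print(upos_counts(SAMPLE, "PROPN"))
```

On the toy data this prints `(2, 2, 3)`: two distinct lemmas, two distinct forms and three tokens, since 日本 occurs twice.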

The 10 most frequent PROPN lemmas: 日本, 東京, アメリカ, 中国, 韓国, ドイツ, 大阪, フランス, イギリス, 台湾

The 10 most frequent PROPN types: 日本, 東京, アメリカ, 中国, 韓国, ドイツ, 大阪, フランス, イギリス, 台湾

The 10 most frequent ambiguous lemmas: 東北 (PROPN 12, NOUN 1), 大分 (PROPN 11, ADV 2), カール (PROPN 10, NOUN 2), ジャック (PROPN 7, NOUN 1), 文 (NOUN 12, PROPN 5), 王子 (NOUN 6, PROPN 5), タイ (NOUN 6, PROPN 4), 岸 (PROPN 4, NOUN 1), 房 (PROPN 4, NOUN 1), 金 (NOUN 52, PROPN 4)

The 10 most frequent ambiguous types: オウム (PROPN 17, NOUN 2), 東北 (PROPN 12, NOUN 1), カール (PROPN 10, NOUN 2), ジャック (PROPN 7, NOUN 1), 文 (NOUN 12, PROPN 5), 王子 (NOUN 6, PROPN 5), タイ (NOUN 6, PROPN 4), 岸 (PROPN 4, NOUN 1), 房 (PROPN 4, NOUN 1), 金 (NOUN 49, PROPN 4)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.115220).

The 1st highest number of forms (1) was observed with the lemma “48”: 48.

The 2nd highest number of forms (1) was observed with the lemma “ABC”: ABC.

The 3rd highest number of forms (1) was observed with the lemma “AFP”: AFP.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 9 different relations: compound (4159; 58% instances), nmod (1307; 18% instances), obl (719; 10% instances), nsubj (675; 9% instances), obj (189; 3% instances), root (62; 1% instances), advcl (14; 0% instances), acl (8; 0% instances), dislocated (8; 0% instances)

Parents of PROPN nodes belong to 8 different categories (7 parts of speech, plus root nodes, which have no parent): NOUN (4621; 65% instances), VERB (1400; 20% instances), PROPN (978; 14% instances), no parent, i.e. root (62; 1% instances), ADJ (53; 1% instances), NUM (17; 0% instances), ADV (8; 0% instances), PRON (2; 0% instances)
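The two distributions above (incoming relation and part of speech of the parent) can be tallied in one pass over the trees. The following is a minimal sketch, assuming each sentence has already been read into a list of `(id, upos, head, deprel)` tuples; `attachment_stats`, the `"(no parent)"` placeholder and the toy sentences are hypothetical names and data chosen for illustration. A head of 0 marks the root, which has no parent word, so it receives the placeholder label.

```python
from collections import Counter


def attachment_stats(sentences, tag):
    """Count incoming relations and parent UPOS tags for nodes with `tag`."""
    relations, parent_pos = Counter(), Counter()
    for sentence in sentences:
        # Map each word ID to its UPOS so a head ID can be resolved to a tag.
        upos_by_id = {tok_id: upos for tok_id, upos, _head, _rel in sentence}
        for _tok_id, upos, head, deprel in sentence:
            if upos != tag:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root and is not in the map.
            parent_pos[upos_by_id.get(head, "(no parent)")] += 1
    return relations, parent_pos


# Toy trees invented for illustration: (id, upos, head, deprel).
SENTENCES = [
    [(1, "PROPN", 3, "nmod"), (2, "ADP", 1, "case"), (3, "NOUN", 5, "nsubj"),
     (4, "ADP", 3, "case"), (5, "PROPN", 0, "root")],
    [(1, "PROPN", 3, "obl"), (2, "ADP", 1, "case"), (3, "VERB", 0, "root")],
]

relations, parent_pos = attachment_stats(SENTENCES, "PROPN")
total = sum(relations.values())
for rel, n in relations.most_common():
    # Mirror the "relation (count; percent instances)" layout used on this page.
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

Both counters sum to the number of PROPN tokens, which matches the page: the relation counts and the parent counts each add up to 7141.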

4172 (58%) PROPN nodes are leaves.

1435 (20%) PROPN nodes have one child.

707 (10%) PROPN nodes have two children.

827 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 13.
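The child-degree figures above (leaves, one child, two children, three or more, and the maximum) could be computed along the following lines. This is a sketch under the assumption that sentences are available as lists of `(id, upos, head)` tuples; `child_degree_stats` and the toy trees are hypothetical and only illustrate the counting.

```python
from collections import Counter


def child_degree_stats(sentences, tag):
    """Bucket nodes with `tag` by number of children (0, 1, 2, 3 = three or more)."""
    buckets, highest = Counter(), 0
    for sentence in sentences:
        # Number of children per head ID within this sentence.
        n_children = Counter(head for _tok_id, _upos, head in sentence)
        for tok_id, upos, _head in sentence:
            if upos != tag:
                continue
            degree = n_children[tok_id]  # 0 for leaves
            buckets[min(degree, 3)] += 1
            highest = max(highest, degree)
    return buckets, highest


# Toy trees invented for illustration: (id, upos, head).
SENTENCES = [
    [(1, "PROPN", 3), (2, "ADP", 1), (3, "NOUN", 5), (4, "ADP", 3),
     (5, "PROPN", 0), (6, "AUX", 5), (7, "PUNCT", 5)],
    [(1, "PROPN", 2), (2, "VERB", 0)],
]

buckets, highest = child_degree_stats(SENTENCES, "PROPN")
print(dict(buckets), highest)
```

The four buckets partition the PROPN tokens, so their counts should add up to the token total, as they do on this page (4172 + 1435 + 707 + 827 = 7141).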

Children of PROPN nodes are attached using 19 different relations: case (2944; 48% instances), compound (1537; 25% instances), nmod (657; 11% instances), punct (593; 10% instances), acl (227; 4% instances), nsubj (56; 1% instances), cop (42; 1% instances), mark (17; 0% instances), cc (12; 0% instances), obl (10; 0% instances), advmod (9; 0% instances), aux (9; 0% instances), dep (9; 0% instances), det (9; 0% instances), csubj (7; 0% instances), advcl (5; 0% instances), nummod (3; 0% instances), amod (2; 0% instances), obj (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (2944; 48% instances), PROPN (978; 16% instances), NOUN (916; 15% instances), PUNCT (593; 10% instances), SYM (390; 6% instances), VERB (190; 3% instances), AUX (51; 1% instances), NUM (21; 0% instances), ADJ (12; 0% instances), CCONJ (12; 0% instances), ADV (10; 0% instances), DET (9; 0% instances), SCONJ (9; 0% instances), PART (8; 0% instances), PRON (6; 0% instances)