home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUDLUW: POS Tags: PROPN

There are 716 PROPN lemmas (13%), 716 PROPN types (12%) and 999 PROPN tokens (4%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: 中国, 米国, 英国, フランス, ロシア, ギリシャ, アルバニア, スペイン, アメリカ, イギリス

The 10 most frequent PROPN types: 中国, 米国, 英国, フランス, ロシア, ギリシャ, アルバニア, スペイン, アメリカ, イギリス

The 10 most frequent ambiguous lemmas: 北海 (NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: 北海 (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.079832).

The 1st highest number of forms (1) was observed with the lemma “AKP”: AKP.

The 2nd highest number of forms (1) was observed with the lemma “Aoun”: Aoun.

The 3rd highest number of forms (1) was observed with the lemma “Apple”: Apple.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 9 different relations: nmod (384; 38% instances), nsubj (299; 30% instances), obl (206; 21% instances), obj (59; 6% instances), compound (38; 4% instances), nsubj:outer (6; 1% instances), root (5; 1% instances), acl (1; 0% instances), advcl (1; 0% instances)

Parents of PROPN nodes belong to 7 different parts of speech: VERB (507; 51% instances), NOUN (365; 37% instances), PROPN (111; 11% instances), ADJ (7; 1% instances), (5; 1% instances), ADV (2; 0% instances), NUM (2; 0% instances)

47 (5%) PROPN nodes are leaves.

591 (59%) PROPN nodes have one child.

254 (25%) PROPN nodes have two children.

107 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 15.

Children of PROPN nodes are attached using 15 different relations: case (958; 64% instances), punct (234; 16% instances), nmod (141; 9% instances), acl (61; 4% instances), obl (36; 2% instances), compound (27; 2% instances), nsubj (9; 1% instances), aux (8; 1% instances), cc (8; 1% instances), advmod (5; 0% instances), cop (3; 0% instances), det (3; 0% instances), nummod (3; 0% instances), advcl (2; 0% instances), mark (1; 0% instances)

Children of PROPN nodes belong to 13 different parts of speech: ADP (958; 64% instances), PUNCT (234; 16% instances), PROPN (111; 7% instances), NOUN (108; 7% instances), VERB (40; 3% instances), AUX (11; 1% instances), ADJ (9; 1% instances), CCONJ (8; 1% instances), NUM (7; 0% instances), ADV (5; 0% instances), PRON (4; 0% instances), DET (3; 0% instances), SCONJ (1; 0% instances)