
This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: ADP

There are 34 ADP lemmas (0%), 46 ADP types (0%) and 41864 ADP tokens (22%). Out of 16 observed tags, the rank of ADP is: 10 in number of lemmas, 10 in number of types and 2 in number of tokens.
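Counts of this kind can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page. It assumes that "types" means distinct word forms exactly as written and that multiword-token ranges and empty nodes are skipped. The function name `upos_counts` and the three-token `SAMPLE` sentence are made up for illustration and are not taken from UD_Japanese-GSD.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

# Hypothetical sample sentence, for illustration only.
SAMPLE = "\n".join([
    "# text = 猫の目",
    "1\t猫\t猫\tNOUN\t_\t_\t3\tnmod\t_\t_",
    "2\tの\tの\tADP\t_\t_\t1\tcase\t_\t_",
    "3\t目\t目\tNOUN\t_\t_\t0\troot\t_\t_",
])

counts = upos_counts(SAMPLE)
print(counts["ADP"])
```

Sets are used for lemmas and types so that repeated occurrences are counted once, while tokens are a plain running total.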

The 10 most frequent ADP lemmas: の, に, は, を, が, と, で, も, から, や

The 10 most frequent ADP types: の, に, は, を, が, と, で, も, から, や

The 10 most frequent ambiguous lemmas: の (ADP 8882, SCONJ 880, PART 2), に (ADP 6429, SCONJ 137, CCONJ 2), は (ADP 5542, SCONJ 5), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), で (ADP 2601, CCONJ 26, SCONJ 16), も (ADP 1844, SCONJ 33), から (ADP 985, SCONJ 71), や (ADP 610, AUX 2, PART 1), など (ADP 453, PART 78)

The 10 most frequent ambiguous types: の (ADP 8880, SCONJ 835, PART 2), に (ADP 6428, AUX 808, SCONJ 137, CCONJ 3), は (ADP 5541, SCONJ 5), が (ADP 4117, SCONJ 784, CCONJ 2), と (ADP 3846, SCONJ 270), で (ADP 2600, AUX 1641, SCONJ 163, CCONJ 24, VERB 2), も (ADP 1844, SCONJ 33), から (ADP 985, SCONJ 71), や (ADP 610, NOUN 3, AUX 1, PART 1), など (ADP 450, PART 78)
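An "ambiguous" lemma or type here is one that occurs with ADP and with at least one other tag. A rough sketch of how such a list could be derived from (item, tag) pairs follows. The helper name `ambiguous_items` and the toy input are hypothetical, and the sort order (by frequency under the tag of interest) is an assumption about how the page ranks its entries.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, tag):
    """Return items seen with `tag` and at least one other tag.

    pairs: iterable of (item, upos), where item is a lemma or a word form.
    Result: list of (item, Counter of tags), most frequent under `tag` first.
    """
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    ambiguous = {i: c for i, c in by_item.items() if tag in c and len(c) > 1}
    return sorted(ambiguous.items(), key=lambda kv: -kv[1][tag])

# Toy input, not real treebank counts.
toy_pairs = (
    [("の", "ADP")] * 3 + [("の", "SCONJ")]
    + [("を", "ADP")] * 2
    + [("が", "ADP"), ("が", "SCONJ")]
)

ranked = ambiguous_items(toy_pairs, "ADP")
for item, tag_counts in ranked:
    print(item, dict(tag_counts))
```

In the toy input, を is left out because it only ever appears as ADP.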

Morphology

The form / lemma ratio of ADP is 1.352941 (the average of all parts of speech is 1.115220).
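The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given earlier on this page. That reading is an inference from the figures rather than a documented definition, and the variable names below are illustrative:

```python
adp_types = 46   # distinct ADP word forms reported on this page
adp_lemmas = 34  # distinct ADP lemmas reported on this page

# Assumed definition: forms per lemma = types / lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")  # prints 1.352941
```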

The 1st highest number of forms (3) was observed with the lemma “くらい”: くらい, ぐらい, 位.

The 2nd highest number of forms (3) was observed with the lemma “など”: など, 抔, 等.

The 3rd highest number of forms (3) was observed with the lemma “の”: の, ノ, 之.

ADP does not occur with any features.

Relations

ADP nodes are attached to their parents using 2 different relations: case (41285; 99% of instances), fixed (579; 1% of instances)

Parents of ADP nodes belong to 15 different parts of speech: NOUN (35083; 84% of instances), PROPN (2944; 7% of instances), VERB (1499; 4% of instances), PRON (1022; 2% of instances), ADJ (487; 1% of instances), ADV (316; 1% of instances), NUM (179; 0% of instances), AUX (160; 0% of instances), CCONJ (99; 0% of instances), SCONJ (29; 0% of instances), ADP (20; 0% of instances), PART (15; 0% of instances), INTJ (4; 0% of instances), SYM (4; 0% of instances), DET (3; 0% of instances)

39997 (96%) ADP nodes are leaves.

670 (2%) ADP nodes have one child.

1174 (3%) ADP nodes have two children.

23 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 5.
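As a quick consistency check, the four child-degree buckets can be summed and converted to whole-number percentages. This sketch only uses figures stated on this page. The dictionary keys are illustrative labels, and rounding to the nearest integer is an assumption that matches the reported percentages.

```python
total_adp_tokens = 41864
degree_buckets = {"0": 39997, "1": 670, "2": 1174, "3+": 23}

# The buckets should partition all ADP tokens.
assert sum(degree_buckets.values()) == total_adp_tokens

# Whole-number percentages, rounded to the nearest integer.
shares = {k: round(100 * v / total_adp_tokens) for k, v in degree_buckets.items()}
print(shares)  # prints {'0': 96, '1': 2, '2': 3, '3+': 0}
```

The rounded shares add up to 101 rather than 100, which is an effect of rounding each bucket separately, not an error in the counts.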

Children of ADP nodes are attached using 2 different relations: fixed (3093; 100% of instances), punct (1; 0% of instances)

Children of ADP nodes belong to 7 different parts of speech: VERB (1378; 45% of instances), SCONJ (1121; 36% of instances), AUX (562; 18% of instances), ADP (20; 1% of instances), ADJ (11; 0% of instances), NOUN (1; 0% of instances), PUNCT (1; 0% of instances)