home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: AUX

There are 26 AUX lemmas (1%), 55 AUX types (1%) and 3320 AUX tokens (12%). Out of 16 observed tags, the rank of AUX is: 7 in number of lemmas, 7 in number of types and 3 in number of tokens.

The 10 most frequent AUX lemmas: た, 為る, だ, れる, ます, ない, られる, 出来る, 様, せる

The 10 most frequent AUX types: た, し, で, れ, な, さ, する, に, ない, だ

The 10 most frequent ambiguous lemmas: た (AUX 940, SCONJ 1), 為る (AUX 862, VERB 72, SCONJ 2), だ (AUX 703, CCONJ 1), 出来る (AUX 44, VERB 14), 様 (AUX 42, NOUN 22), せる (AUX 32, SCONJ 1), です (AUX 30, CCONJ 1), ず (AUX 29, SCONJ 6), 無い (ADJ 43, AUX 27), そう (ADV 3, AUX 2)

The 10 most frequent ambiguous types: し (AUX 489, VERB 52, SCONJ 7), で (ADP 312, AUX 254, SCONJ 19), さ (AUX 182, PART 14, VERB 5), する (AUX 181, VERB 12), に (ADP 982, AUX 152, SCONJ 27, CCONJ 1), ない (AUX 90, ADJ 20), だ (AUX 73, CCONJ 1), よう (AUX 42, NOUN 22), なかっ (AUX 34, ADJ 5), できる (AUX 25, VERB 6)

Morphology

The form / lemma ratio of AUX is 2.115385 (the average of all parts of speech is 1.068660).

The 1st highest number of forms (7) was observed with the lemma “だ”: だ, だっ, だろう, で, な, なら, に.

The 2nd highest number of forms (6) was observed with the lemma “為る”: さ, し, しよう, す, する, せ.

The 3rd highest number of forms (4) was observed with the lemma “た”: た, たら, たろう, だ.

AUX occurs with 1 features: Polarity (144; 4% instances)

AUX occurs with 1 feature-value pairs: Polarity=Neg

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (3176 tokens). Examples: た, し, で, れ, な, さ, する, に, だ, ます

Relations

AUX nodes are attached to their parents using 8 different relations: aux (2764; 83% instances), cop (347; 10% instances), fixed (202; 6% instances), acl (2; 0% instances), root (2; 0% instances), appos (1; 0% instances), dep (1; 0% instances), nmod (1; 0% instances)

Parents of AUX nodes belong to 12 different parts of speech: VERB (2373; 71% instances), NOUN (471; 14% instances), ADJ (285; 9% instances), ADP (77; 2% instances), SCONJ (45; 1% instances), AUX (36; 1% instances), ADV (9; 0% instances), PRON (8; 0% instances), PROPN (7; 0% instances), PART (6; 0% instances), (2; 0% instances), CCONJ (1; 0% instances)

3113 (94%) AUX nodes are leaves.

181 (5%) AUX nodes have one child.

13 (0%) AUX nodes have two children.

13 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 7.

Children of AUX nodes are attached using 7 different relations: fixed (242; 95% instances), punct (4; 2% instances), dep (3; 1% instances), compound (2; 1% instances), nmod (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances)

Children of AUX nodes belong to 6 different parts of speech: VERB (185; 73% instances), AUX (36; 14% instances), ADP (19; 7% instances), SCONJ (7; 3% instances), NOUN (4; 2% instances), PUNCT (4; 2% instances)