home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: AUX

There are 44 AUX lemmas (0%), 131 AUX types (1%) and 21158 AUX tokens (11%). Out of 16 observed tags, the rank of AUX is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.

The 10 most frequent AUX lemmas: た, 為る, だ, れる, ます, です, ない, られる, ず, 様

The 10 most frequent AUX types: た, し, で, れ, さ, する, な, に, ます, です

The 10 most frequent ambiguous lemmas: た (AUX 5321, SCONJ 10), 為る (AUX 5115, VERB 675, SCONJ 12, CCONJ 3), だ (AUX 4004, CCONJ 31), です (AUX 828, CCONJ 2), ず (AUX 320, SCONJ 13), 様 (AUX 257, NOUN 172), 出来る (AUX 237, VERB 91), 無い (ADJ 325, AUX 188), そう (AUX 57, ADV 30), 頂く (AUX 52, VERB 15)

The 10 most frequent ambiguous types: た (AUX 5186, SCONJ 10), し (AUX 2933, VERB 417, SCONJ 54), で (ADP 2600, AUX 1641, SCONJ 163, CCONJ 24, VERB 2), さ (AUX 1095, VERB 98, PART 90), する (AUX 1029, VERB 144, CCONJ 3), な (AUX 974, PART 28, CCONJ 2), に (ADP 6428, AUX 808, SCONJ 137, CCONJ 3), です (AUX 652, CCONJ 2), ない (AUX 562, ADJ 161), だ (AUX 372, CCONJ 28)

Morphology

The form / lemma ratio of AUX is 2.977273 (the average of all parts of speech is 1.115220).

The 1st highest number of forms (10) was observed with the lemma “頂く”: いただい, いただき, いただく, いただけ, いただける, 頂い, 頂き, 頂く, 頂け, 頂ける.

The 2nd highest number of forms (9) was observed with the lemma “だ”: じゃ, だ, だっ, だろ, だろう, で, な, なら, に.

The 3rd highest number of forms (8) was observed with the lemma “為る”: さ, し, しよう, す, する, すれ, せ, せよ.

AUX occurs with 1 features: Polarity (946; 4% instances)

AUX occurs with 1 feature-value pairs: Polarity=Neg

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (20212 tokens). Examples: た, し, で, れ, さ, する, な, に, ます, です

Relations

AUX nodes are attached to their parents using 5 different relations: aux (17235; 81% instances), cop (2441; 12% instances), fixed (1467; 7% instances), acl (8; 0% instances), root (7; 0% instances)

Parents of AUX nodes belong to 13 different parts of speech: VERB (14646; 69% instances), NOUN (3200; 15% instances), ADJ (1707; 8% instances), SCONJ (582; 3% instances), ADP (562; 3% instances), AUX (207; 1% instances), ADV (110; 1% instances), PROPN (53; 0% instances), PRON (40; 0% instances), PART (29; 0% instances), NUM (12; 0% instances), (7; 0% instances), CCONJ (3; 0% instances)

20188 (95%) AUX nodes are leaves.

790 (4%) AUX nodes have one child.

140 (1%) AUX nodes have two children.

40 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 8 different relations: fixed (1196; 98% instances), nmod (7; 1% instances), punct (7; 1% instances), advcl (2; 0% instances), advmod (2; 0% instances), nsubj (2; 0% instances), mark (1; 0% instances), obl (1; 0% instances)

Children of AUX nodes belong to 8 different parts of speech: VERB (817; 67% instances), AUX (207; 17% instances), ADP (160; 13% instances), SCONJ (14; 1% instances), NOUN (10; 1% instances), PUNCT (7; 1% instances), ADV (2; 0% instances), PART (1; 0% instances)