home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: AUX

There are 79 AUX lemmas (0%), 269 AUX types (1%) and 18393 AUX tokens (12%). Out of 17 observed tags, the rank of AUX is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.

The 10 most frequent AUX lemmas: た, だ, ている, れる, ます, である, です, ない, られる, のだ

The 10 most frequent AUX types: た, れ, ている, な, てい, に, ます, です, である, まし

The 10 most frequent ambiguous lemmas: ている (AUX 2072, VERB 4), 様 (AUX 257, NOUN 137), そう (AUX 57, ADV 30), つう (ADP 18, AUX 7, PART 6), たり (PART 78, ADP 4, AUX 4), や (ADP 608, AUX 2, PART 1)

The 10 most frequent ambiguous types: ている (AUX 1193, VERB 2), な (AUX 973, PART 28), てい (AUX 814, VERB 2), に (ADP 5333, AUX 804, SCONJ 32, X 1), ない (AUX 411, ADJ 156), で (ADP 2583, AUX 403, SCONJ 62, VERB 2, CCONJ 1), よう (AUX 256, NOUN 128), せ (AUX 127, VERB 6), ん (AUX 110, SCONJ 3), なかっ (AUX 101, ADJ 26)

Morphology

The form / lemma ratio of AUX is 3.405063 (the average of all parts of speech is 1.095294).

The 1st highest number of forms (13) was observed with the lemma “ていく”: ていか, ていき, ていく, ていけ, ていける, ていこう, ていっ, てゆか, てゆく, て行き, て行く, でいく, でいっ.

The 2nd highest number of forms (11) was observed with the lemma “てもらう”: てもらい, てもらう, てもらえ, てもらえる, てもらおう, てもらっ, て貰い, て貰え, て貰っ, でもらい, でもらっ.

The 3rd highest number of forms (10) was observed with the lemma “ていただく”: ていただい, ていただき, ていただく, ていただけ, ていただける, て頂い, て頂き, て頂く, て頂け, て頂ける.

AUX occurs with 1 features: Polarity (863; 5% instances)

AUX occurs with 1 feature-value pairs: Polarity=Neg

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (17530 tokens). Examples: た, れ, ている, な, てい, に, ます, です, である, まし

Relations

AUX nodes are attached to their parents using 4 different relations: aux (17151; 93% instances), cop (1212; 7% instances), fixed (22; 0% instances), root (8; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: VERB (13545; 74% instances), NOUN (2361; 13% instances), ADJ (2194; 12% instances), NUM (106; 1% instances), PROPN (74; 0% instances), ADV (47; 0% instances), PRON (35; 0% instances), AUX (22; 0% instances), (8; 0% instances), INTJ (1; 0% instances)

18369 (100%) AUX nodes are leaves.

9 (0%) AUX nodes have one child.

3 (0%) AUX nodes have two children.

12 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 8 different relations: fixed (28; 43% instances), punct (21; 32% instances), nmod (7; 11% instances), advcl (3; 5% instances), advmod (2; 3% instances), nsubj (2; 3% instances), mark (1; 2% instances), obl (1; 2% instances)

Children of AUX nodes belong to 10 different parts of speech: AUX (22; 34% instances), PUNCT (21; 32% instances), NOUN (8; 12% instances), SCONJ (4; 6% instances), VERB (3; 5% instances), ADP (2; 3% instances), ADV (2; 3% instances), NUM (1; 2% instances), PART (1; 2% instances), PROPN (1; 2% instances)