home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Javanese-CSUI: POS Tags: AUX

There are 1 AUX lemmas (6%), 20 AUX types (0%) and 340 AUX tokens (2%). Out of 17 observed tags, the rank of AUX is: 4 in number of lemmas, 12 in number of types and 11 in number of tokens.

The 10 most frequent AUX lemmas: _

The 10 most frequent AUX types: wis, bisa, yaiku, kudu, wus, isa, arep, lagi, bakal, inggih

The 10 most frequent ambiguous lemmas: _ (NOUN 2867, PUNCT 2233, VERB 1952, PROPN 1565, PRON 961, ADV 798, ADP 748, ADJ 736, DET 700, NUM 362, AUX 340, SCONJ 314, CCONJ 306, PART 234, X 183, INTJ 32, SYM 12)

The 10 most frequent ambiguous types: isa (AUX 19, ADJ 1), bakal (AUX 13, NOUN 1), kena (AUX 4, VERB 3)

Morphology

The form / lemma ratio of AUX is 20.000000 (the average of all parts of speech is 238.352941).

The 1st highest number of forms (20) was observed with the lemma “_”: arep, badhe, bakal, bakale, bisa, inggih, isa, kedah, kena, kudu, lagi, mesthi, nggih, saged, sampun, sida, wes, wis, wus, yaiku.

AUX occurs with 2 features: Polite (338; 99% instances), Abbr (19; 6% instances)

AUX occurs with 3 feature-value pairs: Abbr=Yes, Polite=Form, Polite=Infm

AUX occurs with 4 feature combinations. The most frequent feature combination is Polite=Infm (288 tokens). Examples: wis, bisa, yaiku, kudu, wus, arep, lagi, bakal, isa, kena

Relations

AUX nodes are attached to their parents using 2 different relations: aux (301; 89% instances), cop (39; 11% instances)

Parents of AUX nodes belong to 7 different parts of speech: VERB (245; 72% instances), ADJ (38; 11% instances), NOUN (36; 11% instances), PROPN (14; 4% instances), X (4; 1% instances), ADV (2; 1% instances), PRON (1; 0% instances)

328 (96%) AUX nodes are leaves.

12 (4%) AUX nodes have one child.

The highest child degree of a AUX node is 1.

Children of AUX nodes are attached using 2 different relations: fixed (8; 67% instances), punct (4; 33% instances)

Children of AUX nodes belong to 2 different parts of speech: DET (8; 67% instances), PUNCT (4; 33% instances)