
This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-GSD: POS Tags: AUX

There are 3 AUX lemmas (0%), 3 AUX types (0%) and 1056 AUX tokens (1%). Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 16 in number of types and 13 in number of tokens.
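The three counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function names and the toy sample are hypothetical, and it assumes that types are distinct word forms compared case-sensitively.

```python
def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of a CoNLL-U string."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared case-sensitively
            tokens += 1
    return len(lemmas), len(types), tokens


# Toy sample for illustration only; it is not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Jakarta adalah kota",
    "1\tJakarta\tJakarta\tPROPN\t_\t_\t3\tnsubj\t_\t_",
    "2\tadalah\tadalah\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\tkota\tkota\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# text = Itu ialah Budi",
    "1\tItu\titu\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tialah\tialah\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\tBudi\tBudi\tPROPN\t_\t_\t0\troot\t_\t_",
    "",
    "# text = Dia adalah guru",
    "1\tDia\tdia\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tadalah\tadalah\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\tguru\tguru\tNOUN\t_\t_\t0\troot\t_\t_",
])

aux_lemmas, aux_types, aux_tokens = pos_counts(SAMPLE, "AUX")
```

On the toy sample, `adalah` occurs twice and `ialah` once, so the function reports two lemmas, two types and three tokens for AUX.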

The 3 AUX lemmas, most frequent first: adalah, ialah, rata

The 3 AUX types, most frequent first: adalah, ialah, rata

The 10 most frequent ambiguous lemmas: adalah (AUX 1018, VERB 7), rata (ADV 11, ADJ 10, NOUN 2, AUX 1)

The 10 most frequent ambiguous types: adalah (AUX 1013, VERB 7), rata (ADJ 10, ADV 10, NOUN 2, AUX 1)

Morphology

The form / lemma ratio of AUX is 1.000000 (the average of all parts of speech is 1.045328).

The 1st highest number of forms (1) was observed with the lemma “adalah”: adalah.

The 2nd highest number of forms (1) was observed with the lemma “ialah”: ialah.

The 3rd highest number of forms (1) was observed with the lemma “rata”: rata.
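The form / lemma ratio can be sketched as follows, assuming it is defined as the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. That definition is an assumption that matches the value 1.000000 reported for AUX (three forms over three lemmas); the function name and the toy tokens are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(tokens, upos=None):
    """Average number of distinct forms per lemma.

    `tokens` is an iterable of (form, lemma, upos) triples. If `upos` is
    given, only tokens with that tag are considered.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if upos is None or tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        raise ValueError("no tokens with the requested tag")
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Toy tokens for illustration only; the VERB rows are not taken from this page.
TOY_TOKENS = [
    ("adalah", "adalah", "AUX"),
    ("ialah", "ialah", "AUX"),
    ("rata", "rata", "AUX"),
    ("adalah", "adalah", "AUX"),
    ("makan", "makan", "VERB"),
    ("memakan", "makan", "VERB"),
]

aux_ratio = form_lemma_ratio(TOY_TOKENS, "AUX")
verb_ratio = form_lemma_ratio(TOY_TOKENS, "VERB")
```

Each AUX lemma here has exactly one form, so the AUX ratio is 1.0, while the toy VERB lemma with two forms gives 2.0.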

AUX occurs with 2 features: Degree (1; 0% instances), Number (1; 0% instances)

AUX occurs with 2 feature-value pairs: Degree=Pos, Number=Sing

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (1055 tokens). Examples: adalah, ialah

Relations

AUX nodes are attached to their parents using 2 different relations: cop (1055; 100% instances), advmod (1; 0% instances)

Parents of AUX nodes belong to 7 different parts of speech: NOUN (832; 79% instances), PROPN (140; 13% instances), VERB (41; 4% instances), ADJ (20; 2% instances), NUM (13; 1% instances), PRON (9; 1% instances), DET (1; 0% instances)
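The relation and parent-tag breakdowns above can be derived from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below is one way to do it, under the assumption that percentages are rounded to the nearest whole percent (which agrees with the figures listed here); all function names and the toy sample are hypothetical.

```python
from collections import Counter


def parse_sentences(conllu_text):
    """Return a list of sentences; each maps token id -> (upos, head, deprel)."""
    sentences, current = [], {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges and empty nodes
            continue
        current[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
    return sentences


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sent in parse_sentences(conllu_text):
        for tag, head, deprel in sent.values():
            if tag != upos:
                continue
            relations[deprel] += 1
            if head != 0:  # head 0 is the artificial root, which has no tag
                parent_tags[sent[head][0]] += 1
    return relations, parent_tags


def percent(count, total):
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)


# Toy sample for illustration only; it is not taken from the treebank.
SAMPLE = "\n".join([
    "1\tJakarta\tJakarta\tPROPN\t_\t_\t3\tnsubj\t_\t_",
    "2\tadalah\tadalah\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\tkota\tkota\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "1\tItu\titu\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tialah\tialah\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\tBudi\tBudi\tPROPN\t_\t_\t0\troot\t_\t_",
])

relations, parent_tags = attachment_stats(SAMPLE, "AUX")
```

Applied to the counts on this page, `percent(1055, 1056)` gives 100 and `percent(1, 1056)` gives 0, which is why a relation with a single instance is shown as 0%.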

1055 (100%) AUX nodes are leaves.

0 (0%) AUX nodes have one child.

1 (0%) AUX node has two children.

The highest child degree of an AUX node is 2.
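The leaf and child-degree figures amount to a histogram of how many dependents each AUX node has. A minimal sketch of that computation for a single tree follows; the function name and both toy sentences are hypothetical, and the second sentence only mimics the shape of a node with two children.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Map number of children -> how many `upos` nodes have that many.

    `sentence` is a list of (id, upos, head) triples for one dependency tree.
    """
    # Count how often each token id appears as the head of another token.
    children = Counter(head for _, _, head in sentence)
    # Look up the child count of every node with the requested tag;
    # ids that never occur as a head count as 0, i.e. leaves.
    return Counter(children[idx] for idx, tag, _ in sentence if tag == upos)


# Toy trees for illustration only; they are not taken from the treebank.
COPULA_SENTENCE = [(1, "PROPN", 3), (2, "AUX", 3), (3, "NOUN", 0)]
TWO_CHILD_SENTENCE = [(1, "ADV", 2), (2, "AUX", 0), (3, "PUNCT", 2)]

leaf_histogram = child_degree_histogram(COPULA_SENTENCE, "AUX")
two_child_histogram = child_degree_histogram(TWO_CHILD_SENTENCE, "AUX")
```

Summing such histograms over all sentences and taking the largest key would give the highest child degree.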

Children of AUX nodes are attached using 2 different relations: advmod (1; 50% instances), punct (1; 50% instances)

Children of AUX nodes belong to 2 different parts of speech: ADV (1; 50% instances), PUNCT (1; 50% instances)