home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: AUX

There are 13 AUX lemmas (0%), 13 AUX types (0%) and 1282 AUX tokens (2%). Out of 17 observed tags, the rank of AUX is: 15 in number of lemmas, 15 in number of types and 11 in number of tokens.

The 10 most frequent AUX lemmas: là, được, phải, bị, muốn, có thể, cần, nên, không thể, chưa thể

The 10 most frequent AUX types: là, được, phải, bị, muốn, có thể, cần, nên, không thể, chưa thể

The 10 most frequent ambiguous lemmas: (AUX 497, SCONJ 89, CCONJ 7, PART 3), được (AUX 251, ADV 205, VERB 26, ADJ 1, PART 1), phải (AUX 226, ADJ 16, VERB 10, ADV 1), bị (AUX 174, VERB 5), muốn (AUX 45, VERB 30), có thể (AUX 40, ADV 32, ADJ 10), cần (AUX 26, VERB 14), nên (SCONJ 66, AUX 10, VERB 5, ADV 2), không thể (ADV 36, AUX 9, ADJ 4), chưa thể (ADV 1, AUX 1)

The 10 most frequent ambiguous types: (AUX 496, SCONJ 89, CCONJ 7, PART 3), được (AUX 249, ADV 205, VERB 26, ADJ 1, PART 1), phải (AUX 221, ADJ 16, VERB 9, ADV 1), bị (AUX 172, VERB 5), muốn (AUX 39, VERB 28), có thể (AUX 40, ADV 28, ADJ 10), cần (AUX 24, VERB 14), nên (SCONJ 65, AUX 10, VERB 5, ADV 2), không thể (ADV 35, AUX 9, ADJ 4), chưa thể (ADV 1, AUX 1)

Morphology

The form / lemma ratio of AUX is 1.000000 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (1) was observed with the lemma “bị”: bị.

The 2nd highest number of forms (1) was observed with the lemma “chưa thể”: chưa thể.

The 3rd highest number of forms (1) was observed with the lemma “chắc chắn”: chắc chắn.

AUX does not occur with any features.

Relations

AUX nodes are attached to their parents using 12 different relations: cop (481; 38% instances), aux (413; 32% instances), aux:pass (366; 29% instances), discourse (11; 1% instances), acl:subj (3; 0% instances), root (2; 0% instances), acl:tmod (1; 0% instances), compound (1; 0% instances), conj (1; 0% instances), dep (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of AUX nodes belong to 11 different parts of speech: VERB (782; 61% instances), NOUN (376; 29% instances), ADJ (51; 4% instances), PROPN (38; 3% instances), PRON (18; 1% instances), NUM (5; 0% instances), X (5; 0% instances), ADP (2; 0% instances), ADV (2; 0% instances), (2; 0% instances), PART (1; 0% instances)

1273 (99%) AUX nodes are leaves.

5 (0%) AUX nodes have one child.

2 (0%) AUX nodes have two children.

2 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 8 different relations: obj (7; 41% instances), punct (4; 24% instances), advcl (1; 6% instances), ccomp (1; 6% instances), conj (1; 6% instances), mark (1; 6% instances), nsubj (1; 6% instances), nsubj:pass (1; 6% instances)

Children of AUX nodes belong to 5 different parts of speech: NOUN (8; 47% instances), PUNCT (4; 24% instances), VERB (3; 18% instances), PRON (1; 6% instances), SCONJ (1; 6% instances)