home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: ADV

There are 177 ADV lemmas (2%), 177 ADV types (2%) and 4144 ADV tokens (7%). Out of 17 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: không, đã, cũng, được, lại, chỉ, sẽ, đang, rất, vẫn

The 10 most frequent ADV types: không, đã, cũng, được, lại, chỉ, sẽ, đang, rất, vẫn

The 10 most frequent ambiguous lemmas: không (ADV 592, NOUN 2), đã (ADV 455, ADJ 2), cũng (ADV 321, ADJ 2, SCONJ 2), được (AUX 251, ADV 205, VERB 26, ADJ 1, PART 1), lại (ADV 197, VERB 70), chỉ (ADV 171, NOUN 5, VERB 5), mới (ADV 106, ADJ 31), ra (VERB 172, ADV 70, ADP 9), còn (ADV 69, VERB 54, SCONJ 35), vừa (ADV 67, ADJ 1)

The 10 most frequent ambiguous types: không (ADV 555, NOUN 2), đã (ADV 447, ADJ 2), cũng (ADV 312, ADJ 2, SCONJ 1), được (AUX 249, ADV 205, VERB 26, ADJ 1, PART 1), lại (ADV 196, VERB 69), chỉ (ADV 154, NOUN 5, VERB 5), mới (ADV 100, ADJ 31), ra (VERB 169, ADV 70, ADP 9), còn (ADV 68, VERB 54, SCONJ 20), vừa (ADV 60, ADJ 1)

Morphology

The form / lemma ratio of ADV is 1.000000 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (1) was observed with the lemma “biết bao”: biết bao.

The 2nd highest number of forms (1) was observed with the lemma “bất cứ”: bất cứ.

The 3rd highest number of forms (1) was observed with the lemma “bất kể”: bất kể.

ADV does not occur with any features.

Relations

ADV nodes are attached to their parents using 32 different relations: advmod (3132; 76% instances), advmod:neg (688; 17% instances), compound:prt (75; 2% instances), compound:dir (74; 2% instances), compound:svc (44; 1% instances), advmod:adj (21; 1% instances), advmod:dir (18; 0% instances), discourse (16; 0% instances), case (10; 0% instances), xcomp (10; 0% instances), root (8; 0% instances), obl:tmod (7; 0% instances), compound (6; 0% instances), conj (4; 0% instances), advcl (3; 0% instances), mark (3; 0% instances), acl:subj (2; 0% instances), advcl:objective (2; 0% instances), compound:redup (2; 0% instances), fixed (2; 0% instances), flat (2; 0% instances), flat:redup (2; 0% instances), obj (2; 0% instances), obl (2; 0% instances), xcomp:dir (2; 0% instances), amod (1; 0% instances), appos:nmod (1; 0% instances), compound:adj (1; 0% instances), compound:apr (1; 0% instances), compound:atov (1; 0% instances), nmod (1; 0% instances), obl:adv (1; 0% instances)

Parents of ADV nodes belong to 14 different parts of speech: VERB (3114; 75% instances), ADJ (574; 14% instances), NOUN (303; 7% instances), PRON (52; 1% instances), ADV (36; 1% instances), NUM (19; 0% instances), ADP (13; 0% instances), PROPN (10; 0% instances), (8; 0% instances), X (8; 0% instances), SCONJ (3; 0% instances), PART (2; 0% instances), DET (1; 0% instances), SYM (1; 0% instances)

4060 (98%) ADV nodes are leaves.

68 (2%) ADV nodes have one child.

9 (0%) ADV nodes have two children.

7 (0%) ADV nodes have three or more children.

The highest child degree of a ADV node is 7.

Children of ADV nodes are attached using 28 different relations: punct (20; 16% instances), advmod (14; 11% instances), advmod:neg (13; 11% instances), compound (11; 9% instances), cc (9; 7% instances), mark (7; 6% instances), xcomp (7; 6% instances), fixed (4; 3% instances), nsubj (4; 3% instances), advcl (3; 2% instances), conj (3; 2% instances), obj (3; 2% instances), obl (3; 2% instances), xcomp:adj (3; 2% instances), aux (2; 2% instances), flat (2; 2% instances), obl:adv (2; 2% instances), obl:comp (2; 2% instances), case (1; 1% instances), ccomp (1; 1% instances), clf (1; 1% instances), compound:apr (1; 1% instances), compound:atov (1; 1% instances), compound:svc (1; 1% instances), csubj (1; 1% instances), mark:pcomp (1; 1% instances), obl:tmod (1; 1% instances), parataxis (1; 1% instances)

Children of ADV nodes belong to 11 different parts of speech: ADV (36; 30% instances), PUNCT (20; 16% instances), VERB (17; 14% instances), NOUN (16; 13% instances), SCONJ (10; 8% instances), CCONJ (9; 7% instances), ADJ (7; 6% instances), ADP (2; 2% instances), AUX (2; 2% instances), NUM (2; 2% instances), PROPN (1; 1% instances)