
This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: ADJ

There are 1058 ADJ lemmas (14%), 1058 ADJ types (14%) and 3380 ADJ tokens (6%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent ADJ lemmas: nhiều, hơn, gần, khác, lớn, cùng, nhỏ, cao, đúng, đầu tiên

The 10 most frequent ADJ types: nhiều, hơn, gần, khác, lớn, cùng, nhỏ, cao, đúng, đầu tiên

The 10 most frequent ambiguous lemmas: nhiều (ADJ 173, DET 1), hơn (ADJ 85, ADV 30), gần (ADJ 84, ADV 4), khác (ADJ 84, NOUN 1), lớn (ADJ 55, VERB 10), cùng (ADJ 49, SCONJ 27, ADV 7, CCONJ 6, NOUN 3, ADP 2), nhỏ (ADJ 39, NOUN 3), cao (ADJ 38, NOUN 1), đầu tiên (ADJ 34, NOUN 5), mới (ADV 106, ADJ 31)

The 10 most frequent ambiguous types: hơn (ADJ 82, ADV 30), gần (ADJ 75, ADV 4), khác (ADJ 83, NOUN 1), lớn (ADJ 55, VERB 10), cùng (ADJ 46, SCONJ 26, ADV 7, CCONJ 6, NOUN 3, ADP 2), nhỏ (ADJ 39, NOUN 3), cao (ADJ 38, NOUN 1), đầu tiên (ADJ 34, NOUN 5), mới (ADV 100, ADJ 31), đủ (ADJ 30, VERB 2)

Morphology

The form / lemma ratio of ADJ is 1.000000 (the average of all parts of speech is 1.001997).
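As a rough illustration of how such a ratio could be computed, the sketch below counts distinct forms per lemma for one UPOS tag in CoNLL-U text and divides by the number of distinct lemmas. The exact definition used to generate this page is not stated here, so this formula is an assumption. The function names and the two-sentence `SAMPLE` are hypothetical and are not taken from the treebank.

```python
from collections import defaultdict

# Hypothetical toy CoNLL-U input (not real treebank data), 10 tab-separated columns.
SAMPLE = """\
1\tnhà\tnhà\tNOUN\t_\t_\t3\tnsubj\t_\t_
2\trất\trất\tADV\t_\t_\t3\tadvmod\t_\t_
3\tlớn\tlớn\tADJ\t_\t_\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_

1\tsách\tsách\tNOUN\t_\t_\t0\troot\t_\t_
2\tmới\tmới\tADJ\t_\t_\t1\tamod\t_\t_
"""


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def form_lemma_ratio(conllu_text, upos):
    """Distinct (lemma, form) pairs divided by distinct lemmas for one tag.

    Assumed definition; returns None when the tag does not occur.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return None
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


print(form_lemma_ratio(SAMPLE, "ADJ"))  # 1.0
```

A ratio of exactly 1.0, as reported for ADJ above, means every lemma was observed with a single form.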

The 1st highest number of forms (1) was observed with the lemma “an toàn”: an toàn.

The 2nd highest number of forms (1) was observed with the lemma “anh hùng”: anh hùng.

The 3rd highest number of forms (1) was observed with the lemma “ban đầu”: ban đầu.

ADJ does not occur with any features.

Relations

ADJ nodes are attached to their parents using 46 different relations: amod (1104; 33% instances), advmod:adj (564; 17% instances), xcomp (491; 15% instances), root (252; 7% instances), conj (244; 7% instances), acl:subj (151; 4% instances), advcl (100; 3% instances), compound:amod (96; 3% instances), ccomp (53; 2% instances), advmod (49; 1% instances), compound (29; 1% instances), parataxis (27; 1% instances), compound:adj (22; 1% instances), obl:tmod (22; 1% instances), obl (21; 1% instances), csubj:asubj (20; 1% instances), acl:tonp (18; 1% instances), nmod (15; 0% instances), obj (15; 0% instances), acl:tmod (9; 0% instances), advcl:objective (8; 0% instances), obl:comp (8; 0% instances), acl (7; 0% instances), appos (6; 0% instances), csubj (6; 0% instances), discourse (5; 0% instances), obl:adj (5; 0% instances), xcomp:adj (5; 0% instances), compound:atov (4; 0% instances), advmod:neg (3; 0% instances), appos:nmod (3; 0% instances), compound:verbnoun (3; 0% instances), compound:dir (2; 0% instances), acl:relcl (1; 0% instances), compound:apr (1; 0% instances), compound:prt (1; 0% instances), compound:vmod (1; 0% instances), compound:z (1; 0% instances), csubj:vsubj (1; 0% instances), dislocated (1; 0% instances), flat (1; 0% instances), flat:name (1; 0% instances), nmod:poss (1; 0% instances), nsubj (1; 0% instances), obl:about (1; 0% instances), obl:with (1; 0% instances)

Parents of ADJ nodes fall into 12 categories (11 parts of speech plus the artificial root, for ADJ nodes attached via the root relation): NOUN (1598; 47% instances), VERB (1153; 34% instances), ADJ (270; 8% instances), root (252; 7% instances), NUM (67; 2% instances), PROPN (12; 0% instances), ADV (7; 0% instances), PRON (7; 0% instances), ADP (6; 0% instances), X (5; 0% instances), DET (2; 0% instances), SCONJ (1; 0% instances)
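Counts like the two lists above could be gathered with a sketch along these lines, which tallies the incoming relation and the parent's UPOS tag for every node with a given tag. It assumes plain CoNLL-U input with numeric HEAD values; nodes whose head is 0 are bucketed under a `ROOT` label, which is a choice made for this illustration. `attachment_counts` and `SAMPLE` are hypothetical names, and the sample sentences are not treebank data.

```python
from collections import Counter

# Hypothetical toy CoNLL-U input (not real treebank data).
SAMPLE = """\
1\tnhà\tnhà\tNOUN\t_\t_\t3\tnsubj\t_\t_
2\trất\trất\tADV\t_\t_\t3\tadvmod\t_\t_
3\tlớn\tlớn\tADJ\t_\t_\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_

1\tsách\tsách\tNOUN\t_\t_\t0\troot\t_\t_
2\tmới\tmới\tADJ\t_\t_\t1\tamod\t_\t_
"""


def attachment_counts(conllu_text, upos):
    """Return (relation counts, parent-tag counts) for nodes tagged `upos`."""
    deprels, parent_tags = Counter(), Counter()
    # Blank lines separate sentences in CoNLL-U.
    for block in conllu_text.strip().split("\n\n"):
        tokens = {}  # token id -> (upos, head id, deprel)
        for line in block.splitlines():
            cols = line.split("\t")
            # Comment lines and multiword/empty-node lines are skipped here.
            if len(cols) == 10 and cols[0].isdigit():
                tokens[cols[0]] = (cols[3], cols[6], cols[7])
        for tag, head, deprel in tokens.values():
            if tag == upos:
                deprels[deprel] += 1
                # Head "0" has no token entry, so it is counted as ROOT.
                parent = tokens[head][0] if head in tokens else "ROOT"
                parent_tags[parent] += 1
    return deprels, parent_tags


deprels, parents = attachment_counts(SAMPLE, "ADJ")
total = sum(deprels.values())
for rel, n in deprels.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```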

2080 (62%) ADJ nodes are leaves.

564 (17%) ADJ nodes have one child.

267 (8%) ADJ nodes have two children.

469 (14%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 10.
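The leaf, one-child, two-children and three-or-more breakdown above could be reproduced with a sketch like the following, which counts the children of each node per sentence and buckets nodes of one UPOS tag by child count. It assumes CoNLL-U input with numeric HEAD values. `child_degree_histogram`, `max_child_degree` and `SAMPLE` are hypothetical names, and the sample is not treebank data.

```python
from collections import Counter

# Hypothetical toy CoNLL-U input (not real treebank data).
SAMPLE = """\
1\tnhà\tnhà\tNOUN\t_\t_\t3\tnsubj\t_\t_
2\trất\trất\tADV\t_\t_\t3\tadvmod\t_\t_
3\tlớn\tlớn\tADJ\t_\t_\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_

1\tsách\tsách\tNOUN\t_\t_\t0\troot\t_\t_
2\tmới\tmới\tADJ\t_\t_\t1\tamod\t_\t_
"""


def sentences(conllu_text):
    """Yield each sentence as a list of (id, upos, head) tuples."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        cols = line.split("\t")
        # Keeps regular tokens only; assumes HEAD is an integer.
        if len(cols) == 10 and cols[0].isdigit():
            sent.append((int(cols[0]), cols[3], int(cols[6])))
    if sent:
        yield sent


def child_degrees(conllu_text, upos):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for sent in sentences(conllu_text):
        n_children = Counter(head for _, _, head in sent)
        degrees.extend(n_children[tok_id] for tok_id, tag, _ in sent if tag == upos)
    return degrees


def child_degree_histogram(conllu_text, upos):
    """Bucket nodes into 0, 1, 2 and 3 (meaning three or more) children."""
    return Counter(min(d, 3) for d in child_degrees(conllu_text, upos))


def max_child_degree(conllu_text, upos):
    """Highest child degree among nodes tagged `upos`, or None if absent."""
    return max(child_degrees(conllu_text, upos), default=None)


print(child_degree_histogram(SAMPLE, "ADJ"))
print(max_child_degree(SAMPLE, "ADJ"))  # 3
```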

Children of ADJ nodes are attached using 57 different relations: punct (617; 20% instances), advmod (484; 15% instances), nsubj (357; 11% instances), conj (239; 8% instances), obl:adj (127; 4% instances), xcomp:adj (121; 4% instances), obl (113; 4% instances), obj (111; 4% instances), mark (101; 3% instances), advmod:neg (95; 3% instances), cc (91; 3% instances), advcl (77; 2% instances), obl:tmod (69; 2% instances), advmod:adj (67; 2% instances), discourse (59; 2% instances), parataxis (37; 1% instances), xcomp (36; 1% instances), case (34; 1% instances), obl:comp (30; 1% instances), compound:adj (23; 1% instances), aux (22; 1% instances), compound:atov (22; 1% instances), nmod (19; 1% instances), det:pmod (18; 1% instances), amod (17; 1% instances), cop (17; 1% instances), obl:with (17; 1% instances), compound (15; 0% instances), aux:pass (12; 0% instances), csubj:vsubj (11; 0% instances), advcl:objective (9; 0% instances), nsubj:pass (9; 0% instances), mark:pcomp (8; 0% instances), obl:about (8; 0% instances), ccomp (6; 0% instances), compound:vmod (6; 0% instances), csubj (5; 0% instances), det (5; 0% instances), clf:det (4; 0% instances), nummod (4; 0% instances), appos:nmod (3; 0% instances), dislocated (3; 0% instances), fixed (3; 0% instances), acl:subj (2; 0% instances), nmod:poss (2; 0% instances), obl:iobj (2; 0% instances), acl (1; 0% instances), clf (1; 0% instances), compound:apr (1; 0% instances), compound:verbnoun (1; 0% instances), compound:z (1; 0% instances), csubj:asubj (1; 0% instances), flat (1; 0% instances), nsubj:nn (1; 0% instances), obl:adv (1; 0% instances), obl:agent (1; 0% instances), vocative (1; 0% instances)

Children of ADJ nodes belong to 17 different parts of speech: NOUN (784; 25% instances), PUNCT (617; 20% instances), ADV (574; 18% instances), VERB (387; 12% instances), ADJ (270; 9% instances), SCONJ (124; 4% instances), PRON (71; 2% instances), PROPN (66; 2% instances), CCONJ (65; 2% instances), ADP (61; 2% instances), AUX (51; 2% instances), PART (50; 2% instances), NUM (12; 0% instances), X (8; 0% instances), DET (4; 0% instances), INTJ (2; 0% instances), SYM (2; 0% instances)