This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: X

There are 70 X lemmas (1%), 70 X types (1%) and 116 X tokens (0%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: vì sao, ra sao, chìm xuồng, như vậy, tại sao, có lẽ, như thế nào, nhất là, y, có vẻ

The 10 most frequent X types: vì sao, ra sao, chìm xuồng, như vậy, tại sao, Có lẽ, Y, như thế nào, nhất là, Hơn nữa

The 10 most frequent ambiguous lemmas: ra sao (X 6, VERB 1), chìm xuồng (X 5, VERB 1), như vậy (SCONJ 13, X 5, ADV 1), tại sao (X 5, NOUN 1, PRON 1), có lẽ (ADV 12, X 4), như thế nào (X 3, NOUN 1), nhất là (PART 3, X 3, SCONJ 2, ADV 1), y (X 3, PROPN 1), có vẻ (X 2, ADJ 1, ADV 1, AUX 1, VERB 1), hơn nữa (SCONJ 6, X 2, ADV 1)

The 10 most frequent ambiguous types: ra sao (X 6, VERB 1), chìm xuồng (X 5, VERB 1), như vậy (X 4, SCONJ 2, ADV 1), Có lẽ (ADV 5, X 4), Y (X 3, PROPN 1), như thế nào (X 3, NOUN 1), nhất là (PART 2, SCONJ 2, ADV 1, X 1), Hơn nữa (SCONJ 5, X 2), Mặt khác (SCONJ 2, X 2), có vẻ (X 2, ADJ 1, ADV 1, AUX 1, VERB 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (1) was observed with the lemma “an”: an.

The 2nd highest number of forms (1) was observed with the lemma “and”: and.

The 3rd highest number of forms (1) was observed with the lemma “c ó lẽ”: c ó lẽ.
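
As a rough illustration of how a form / lemma ratio like the one above could be recomputed from a CoNLL-U file, here is a minimal sketch. It assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, and that forms are compared case-sensitively; the official statistics script may differ on both points. The function names and the `SAMPLE` rows are hypothetical and are not taken from UD_Vietnamese-VTB.

```python
from collections import defaultdict

def forms_per_lemma(conllu_text, upos):
    """Map each lemma tagged `upos` to the set of word forms seen with it."""
    forms = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            forms[cols[2]].add(cols[1])  # LEMMA -> set of FORMs
    return forms

def form_lemma_ratio(forms):
    """Average number of distinct forms per lemma (assumed definition)."""
    return sum(len(f) for f in forms.values()) / len(forms)

# Hypothetical toy data, only to show the input shape.
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "Y", "y", "X", "_", "_", "0", "root", "_", "_"),
    ("2", "y", "y", "X", "_", "_", "1", "conj", "_", "_"),
    ("3", "and", "and", "X", "_", "_", "1", "flat:foreign", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
])

ratio = form_lemma_ratio(forms_per_lemma(SAMPLE, "X"))
```

In the toy data the lemma "y" has two forms and "and" has one, so the ratio is above 1. A value of exactly 1.000000, as reported for X here, means every X lemma was seen with a single form.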

X does not occur with any features.

Relations

X nodes are attached to their parents using 24 different relations: obl (31; 27% instances), discourse (18; 16% instances), root (9; 8% instances), compound (8; 7% instances), mark (6; 5% instances), nmod (6; 5% instances), obj (6; 5% instances), flat:foreign (5; 4% instances), conj (4; 3% instances), advcl (3; 3% instances), acl:subj (2; 2% instances), amod (2; 2% instances), cc (2; 2% instances), ccomp (2; 2% instances), compound:z (2; 2% instances), xcomp (2; 2% instances), acl (1; 1% instances), acl:tonp (1; 1% instances), appos:nmod (1; 1% instances), case (1; 1% instances), dislocated (1; 1% instances), nsubj (1; 1% instances), obl:tmod (1; 1% instances), parataxis (1; 1% instances)

Parents of X nodes fall into 6 different categories (5 parts of speech, plus the artificial root, which has no tag): VERB (62; 53% instances), NOUN (31; 27% instances), [root] (9; 8% instances), ADJ (8; 7% instances), X (5; 4% instances), PROPN (1; 1% instances)

87 (75%) X nodes are leaves.

11 (9%) X nodes have one child.

6 (5%) X nodes have two children.

12 (10%) X nodes have three or more children.

The highest child degree of an X node is 10.
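
The leaf and child counts above can be reproduced along the following lines. This is a minimal sketch, assuming the input is CoNLL-U text with sentences separated by blank lines and that a node's child degree is the number of tokens whose HEAD column points to it. The function name `child_degree_buckets` and the `SAMPLE` sentences are hypothetical.

```python
from collections import Counter

def child_degree_buckets(conllu_text, upos):
    """Count nodes tagged `upos` by number of children: 0, 1, 2, or 3 (= 3 or more)."""
    buckets = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep regular tokens only (no multiword ranges or empty nodes).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # HEAD ids are local to a sentence, so count children per block.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                buckets[min(n_children[r[0]], 3)] += 1
    return buckets

def _sentence(rows):
    return "\n".join("\t".join(r) for r in rows)

# Hypothetical toy data: two short sentences.
SAMPLE = "\n\n".join([
    _sentence([
        ("1", "a", "a", "X", "_", "_", "0", "root", "_", "_"),
        ("2", ",", ",", "PUNCT", "_", "_", "1", "punct", "_", "_"),
        ("3", "b", "b", "X", "_", "_", "1", "conj", "_", "_"),
    ]),
    _sentence([
        ("1", "c", "c", "VERB", "_", "_", "0", "root", "_", "_"),
        ("2", "d", "d", "X", "_", "_", "1", "obl", "_", "_"),
    ]),
])

buckets = child_degree_buckets(SAMPLE, "X")
```

Dividing each bucket by the total number of X tokens gives shares comparable to the percentages reported above (for example, 87 leaves out of 116 tokens is 75%).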

Children of X nodes are attached using 21 different relations: punct (32; 37% instances), nsubj (8; 9% instances), flat:foreign (7; 8% instances), advmod (6; 7% instances), advcl (3; 3% instances), advmod:adj (3; 3% instances), aux:pass (3; 3% instances), discourse (3; 3% instances), nsubj:pass (3; 3% instances), xcomp (3; 3% instances), advmod:neg (2; 2% instances), ccomp (2; 2% instances), cop (2; 2% instances), det:pmod (2; 2% instances), advcl:objective (1; 1% instances), conj (1; 1% instances), csubj (1; 1% instances), mark (1; 1% instances), nmod:poss (1; 1% instances), obl (1; 1% instances), obl:comp (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (32; 37% instances), NOUN (10; 12% instances), ADV (8; 9% instances), VERB (8; 9% instances), ADJ (5; 6% instances), AUX (5; 6% instances), PROPN (5; 6% instances), X (5; 6% instances), PRON (4; 5% instances), PART (2; 2% instances), SCONJ (2; 2% instances)