home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: DET

There are 37 DET lemmas (0%), 37 DET types (0%) and 969 DET tokens (2%). Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: những, các, mọi, mấy, mỗi, một số, vài, tất cả, cả, một

The 10 most frequent DET types: những, các, mọi, mấy, mỗi, một số, vài, tất cả, cả, một

The 10 most frequent ambiguous lemmas: mọi (DET 52, NUM 1), mấy (DET 38, NUM 3, PART 1), mỗi (DET 36, NUM 29, PART 1), tất cả (PRON 25, DET 13, NOUN 1), cả (PRON 79, PART 17, DET 11, ADJ 1, ADV 1), một (NUM 535, DET 7, ADV 1), từng (ADV 35, PRON 15, DET 5, NOUN 1), chút (DET 4, NOUN 1), đôi (NUM 21, DET 4, NOUN 1), phần lớn (DET 3, NOUN 3)

The 10 most frequent ambiguous types: mọi (DET 43, NUM 1), mấy (DET 32, NUM 2, PART 1), mỗi (DET 28, NUM 24, PART 1), tất cả (PRON 21, DET 11, NOUN 1), cả (PRON 65, PART 17, DET 10, ADJ 1, ADV 1), một (NUM 496, DET 6, ADV 1), từng (ADV 35, PRON 15, DET 5, NOUN 1), chút (DET 4, NOUN 1), đôi (NUM 19, DET 4, NOUN 1), phần lớn (DET 2, NOUN 2)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (1) was observed with the lemma “bao”: bao.

The 2nd highest number of forms (1) was observed with the lemma “bao nhiêu”: bao nhiêu.

The 3rd highest number of forms (1) was observed with the lemma “bấy nhiêu”: Bấy nhiêu.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 11 different relations: det (956; 99% instances), nmod (3; 0% instances), obj (2; 0% instances), advmod (1; 0% instances), advmod:adj (1; 0% instances), clf:det (1; 0% instances), compound (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), nsubj (1; 0% instances), obl:tmod (1; 0% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (935; 96% instances), PRON (9; 1% instances), NUM (7; 1% instances), VERB (7; 1% instances), PROPN (6; 1% instances), ADJ (4; 0% instances), ADP (1; 0% instances)

890 (92%) DET nodes are leaves.

75 (8%) DET nodes have one child.

4 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 8 different relations: clf (68; 82% instances), case (3; 4% instances), clf:det (3; 4% instances), compound (3; 4% instances), fixed (2; 2% instances), nummod (2; 2% instances), advmod:adj (1; 1% instances), amod (1; 1% instances)

Children of DET nodes belong to 5 different parts of speech: NOUN (75; 90% instances), ADP (3; 4% instances), ADJ (2; 2% instances), NUM (2; 2% instances), ADV (1; 1% instances)