
This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: DET

There are 17 DET lemmas (2%), 17 DET types (2%) and 84 DET tokens (4%). Out of 15 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 7 in number of tokens.
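As a rough illustration of how lemma, type and token counts like these can be derived from a CoNLL-U file, here is a minimal sketch. The sample fragment and the helper names `read_tokens` and `upos_summary` are hypothetical and not taken from the treebank or its tooling. The sketch also assumes that types are compared case-sensitively, which this page does not state.

```python
# Hypothetical CoNLL-U fragment (10 tab-separated columns per token line).
# It is illustrative only and is not copied from UD_Vietnamese-TueCL.
SAMPLE = "\n".join([
    "# text = những người này",
    "1\tnhững\tnhững\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tngười\tngười\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tnày\tnày\tDET\t_\tPronType=Dem\t2\tdet\t_\t_",
    "",
])


def read_tokens(conllu_text):
    """Return the column lists of regular token lines, skipping comments and blanks."""
    rows = []
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only plain integer IDs (skips multiword ranges and empty nodes).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        rows.append(cols)
    return rows


def upos_summary(rows, upos):
    """Count distinct lemmas, distinct word forms (types) and tokens for one UPOS tag."""
    selected = [r for r in rows if r[3] == upos]
    return {
        "lemmas": len({r[2] for r in selected}),  # column 3: LEMMA
        "types": len({r[1] for r in selected}),   # column 2: FORM (case-sensitive here)
        "tokens": len(selected),
    }


summary = upos_summary(read_tokens(SAMPLE), "DET")
print(summary)
```

Run on the full treebank file instead of `SAMPLE`, the same function would be expected to yield counts comparable to the ones reported above, provided the counting conventions match.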

The 10 most frequent DET lemmas: những, này, các, đó, bất cứ, mọi, cả, một vài, cả hai, tất cả

The 10 most frequent DET types: những, này, các, đó, bất cứ, mọi, cả, một vài, cả hai, tất cả

The 10 most frequent ambiguous lemmas: đó (DET 10, PRON 10), ai (PRON 6, DET 1), kia (DET 1, PRON 1), nay (ADV 1, DET 1), từng (ADV 1, DET 1)

The 10 most frequent ambiguous types: đó (DET 10, PRON 8), ai (PRON 6, DET 1), kia (DET 1, PRON 1), nay (ADV 1, DET 1), từng (ADV 1, DET 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “ai”: ai.

The 2nd highest number of forms (1) was observed with the lemma “bất cứ”: bất cứ.

The 3rd highest number of forms (1) was observed with the lemma “các”: các.

DET occurs with 3 features: PronType (36; 43% of instances), Number (12; 14% of instances), Deixis (11; 13% of instances)

DET occurs with 7 feature-value pairs: Deixis=Prox, Deixis=Remt, Number=Plur, PronType=Dem, PronType=Ind, PronType=Int, PronType=Tot

DET occurs with 8 feature combinations. The most frequent feature combination is _ (no features; 36 tokens). Examples: những, bất cứ, mọi, cả, mấy, từng

Relations

DET nodes are attached to their parents using 4 different relations: det (81; 96% of instances), compound (1; 1% of instances), nmod:poss (1; 1% of instances), nsubj (1; 1% of instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (78; 93% of instances), PRON (6; 7% of instances)
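The percentages in these distributions can be checked from the raw counts. The sketch below assumes each share is the count divided by the total number of DET tokens, rounded to the nearest whole percent. The page does not state its rounding rule, so that is an assumption, although it reproduces the figures listed here. The helper name `share_table` is hypothetical.

```python
def share_table(counts):
    """Map each label to (count, percentage of the total rounded to a whole number)."""
    total = sum(counts.values())
    return {label: (n, round(100 * n / total)) for label, n in counts.items()}


# Counts as reported on this page for DET nodes.
relations = {"det": 81, "compound": 1, "nmod:poss": 1, "nsubj": 1}
parents = {"NOUN": 78, "PRON": 6}

relation_shares = share_table(relations)
parent_shares = share_table(parents)

for label, (n, pct) in relation_shares.items():
    print(f"{label}: {n} ({pct}%)")
```

Because each share is rounded independently, the relation percentages sum to 99 rather than 100.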

81 (96%) DET nodes are leaves.

3 (4%) DET nodes have one child.

The highest child degree of a DET node is 1.
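The leaf and child-degree figures count how many nodes name a given DET node as their head. A minimal sketch of that computation follows. The sentence is a hypothetical dependency tree written as tuples, not data from the treebank, and `child_degrees` is an illustrative helper name.

```python
from collections import Counter

# Hypothetical sentence as (id, head, upos, deprel) tuples; head 0 marks the root.
SENTENCE = [
    (1, 2, "DET", "det"),
    (2, 0, "NOUN", "root"),
    (3, 4, "NOUN", "clf"),
    (4, 2, "DET", "det"),
]


def child_degrees(sentence, upos):
    """Return {node id: number of children} for every node tagged with `upos`."""
    # Each occurrence of an id in the head column is one child of that node.
    n_children = Counter(head for _, head, _, _ in sentence)
    return {tok_id: n_children[tok_id] for tok_id, _, tag, _ in sentence if tag == upos}


degrees = child_degrees(SENTENCE, "DET")
leaves = sum(1 for d in degrees.values() if d == 0)
highest = max(degrees.values())
print(degrees, leaves, highest)
```

In this toy tree, node 1 is a leaf and node 4 has one child, so the highest child degree of a DET node is 1.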

Children of DET nodes are attached using 1 relation: clf (3; 100% of instances)

Children of DET nodes belong to 1 part of speech: NOUN (3; 100% of instances)