home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: PART

There are 15 PART lemmas (2%), 15 PART types (2%) and 19 PART tokens (1%). Out of 15 observed tags, the rank of PART is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent PART lemmas: vậy, đấy, bộ, chứ, hề, mà, nhé, phi, phải, ra

The 10 most frequent PART types: vậy, đấy, Bộ, Mà, chứ, hề, nhé, phi, phải, ra

The 10 most frequent ambiguous lemmas: (PRON 12, PART 1), phải (AUX 11, ADJ 3, PART 1), ra (VERB 2, ADP 1, PART 1), đâu (PRON 2, PART 1)

The 10 most frequent ambiguous types: phải (AUX 11, ADJ 3, PART 1), ra (VERB 2, ADP 1, PART 1), đâu (PART 1, PRON 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “bộ”: Bộ.

The 2nd highest number of forms (1) was observed with the lemma “chứ”: chứ.

The 3rd highest number of forms (1) was observed with the lemma “hề”: hề.

PART occurs with 3 features: Polarity (3; 16% instances), Number (1; 5% instances), Typo (1; 5% instances)

PART occurs with 4 feature-value pairs: Number=Plur, Polarity=Neg, Polarity=Pos, Typo=Yes

PART occurs with 5 feature combinations. The most frequent feature combination is _ (14 tokens). Examples: vậy, đấy, Bộ, Mà, chứ, nhé, ra, thì, à, đâu

Relations

PART nodes are attached to their parents using 4 different relations: discourse (15; 79% instances), compound (2; 11% instances), advmod (1; 5% instances), compound:prt (1; 5% instances)

Parents of PART nodes belong to 4 different parts of speech: VERB (11; 58% instances), NOUN (5; 26% instances), PRON (2; 11% instances), ADJ (1; 5% instances)

19 (100%) PART nodes are leaves.

The highest child degree of a PART node is 0.