Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: ADV
There are 67 ADV lemmas (9%), 67 ADV types (9%) and 187 ADV tokens (10%).
Out of 15 observed tags, the rank of ADV is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent ADV lemmas: không, đã, sẽ, chỉ, khi, cũng, nhất, đang, đều, hãy
The 10 most frequent ADV types: không, đã, sẽ, chỉ, khi, cũng, nhất, đang, đều, hãy
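Counts like the ones above can be reproduced from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` is hypothetical, and it assumes that types are counted as case-sensitive word forms and that multiword-token ranges and empty nodes are skipped.

```python
from collections import Counter


def upos_counts(conllu_text, upos="ADV"):
    """Return (n_lemmas, n_types, n_tokens, top_lemmas) for one UPOS tag.

    Assumption: a "type" is a distinct, case-sensitive value of the FORM column.
    """
    lemmas, forms = Counter(), Counter()
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (e.g. 1-2)
        # and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            forms[cols[1]] += 1   # column 2 is FORM
            lemmas[cols[2]] += 1  # column 3 is LEMMA
    n_tokens = sum(forms.values())
    return len(lemmas), len(forms), n_tokens, lemmas.most_common(10)
```

Calling `upos_counts(open(path, encoding="utf-8").read())` on a local copy of the treebank file (the path is yours to supply) returns the three counts plus the ten most frequent lemmas.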
The 10 most frequent ambiguous lemmas: không (ADV 26, INTJ 1), khi (ADV 9, ADP 2, SCONJ 1), còn (ADV 3, VERB 3), lại (ADV 3, NOUN 1), rồi (ADV 3, INTJ 1), dù sao (ADV 1, INTJ 1), giờ (ADV 1, NOUN 1), hay (CCONJ 2, ADV 1), hoàn toàn (ADJ 1, ADV 1), hơn (ADP 6, ADV 1)
The 10 most frequent ambiguous types: không (ADV 25, INTJ 1), khi (ADV 7, ADP 2, SCONJ 1), còn (ADV 3, VERB 3), lại (ADV 3, NOUN 1), Dù sao (ADV 1, INTJ 1), hay (CCONJ 2, ADV 1), hoàn toàn (ADJ 1, ADV 1), hơn (ADP 6, ADV 1), mới (ADJ 3, ADV 1), nay (ADV 1, DET 1)
- khi
  - ADV 7: Ý tôi là , khi còn là những bé gái , chúng ta bắt đầu một cách mạnh mẽ và đầy quyết tâm – “ Yeah , ai bảo đấy ? “
  - ADP 2: Cậu phục vụ đất nước khi mới 19 tuổi vì gì chứ ?
  - SCONJ 1: Bạn phải đến bác sĩ tâm lý hay nhà tâm thần học , thanh toán mười đô la và được chữa trị , cũng giống như khi bạn có một vết cắt trên tay .
Morphology
The form / lemma ratio of ADV is 1.000000 (the average of all parts of speech is 1.000000).
Every ADV lemma has exactly one form, so the highest number of forms per lemma is 1. Examples: “bao giờ”, “bây giờ”, “chút xíu”.
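The form / lemma ratio can be sketched as follows. This is an illustration under stated assumptions rather than the page's actual computation: the ratio is taken to be the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, forms are lowercased before comparison, and the function name `form_lemma_ratio` and the `(form, lemma, upos)` row layout are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(rows, upos="ADV"):
    """Average number of distinct forms per lemma for one UPOS tag.

    rows: iterable of (form, lemma, upos) tuples.
    Assumption: forms are compared case-insensitively.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in rows:
        if tag == upos:
            forms_by_lemma[lemma].add(form.lower())
    if not forms_by_lemma:
        return 0.0  # no tokens with this tag
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)
```

A ratio of exactly 1 means that no lemma was observed with more than one form, which is expected for an isolating language such as Vietnamese.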
ADV occurs with 5 features: AdvType (86; 46% instances), Tense (45; 24% instances), Polarity (34; 18% instances), Mood (5; 3% instances), Abbr (1; 1% instances)
ADV occurs with 12 feature-value pairs: Abbr=Yes, AdvType=Cau, AdvType=Deg, AdvType=Loc, AdvType=Man, AdvType=Mod, AdvType=Tim, Mood=Imp, Polarity=Neg, Tense=Fut, Tense=Past, Tense=Pres
ADV occurs with 14 feature combinations.
The most frequent feature combination is _ (no features; 69 tokens).
Examples: khi, cũng, đều, ngay, còn, lại, thực sự, ngay cả, rồi, thực ra
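The three feature counts above (features, feature-value pairs, feature combinations) differ only in how the FEATS column is split. The sketch below illustrates this; the function name `feature_stats` is hypothetical, and it assumes the standard CoNLL-U encoding in which pairs are joined by `|` and `_` marks an empty feature set.

```python
from collections import Counter


def feature_stats(feats_column):
    """Count feature names, feature=value pairs and whole combinations.

    feats_column: iterable of FEATS strings, e.g. "AdvType=Tim|Tense=Past" or "_".
    """
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the whole string is one combination, "_" included
        if feats == "_":
            continue  # no features on this token
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos
```

Under this reading, `len(names)`, `len(pairs)` and `len(combos)` correspond to the 5 features, 12 feature-value pairs and 14 combinations reported for ADV, and `combos["_"]` to the 69 tokens without features.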
Relations
ADV nodes are attached to their parents using 7 different relations: advmod (180; 96% instances), obl (2; 1% instances), advcl (1; 1% instances), case (1; 1% instances), nsubj (1; 1% instances), obl:tmod (1; 1% instances), root (1; 1% instances)
Parents of ADV nodes belong to 7 different parts of speech: VERB (121; 65% instances), ADJ (26; 14% instances), NOUN (25; 13% instances), PRON (8; 4% instances), ADV (5; 3% instances), PROPN (1; 1% instances), [root, no parent] (1; 1% instances)
174 (93%) ADV nodes are leaves.
11 (6%) ADV nodes have one child.
1 (1%) ADV node has two children.
1 (1%) ADV node has three or more children.
The highest child degree of an ADV node is 4.
Children of ADV nodes are attached using 8 different relations: advmod (5; 29% instances), case (4; 24% instances), obl (3; 18% instances), advcl (1; 6% instances), cop (1; 6% instances), nsubj:outer (1; 6% instances), obl:tmod (1; 6% instances), punct (1; 6% instances)
Children of ADV nodes belong to 7 different parts of speech: ADV (5; 29% instances), ADP (4; 24% instances), PRON (4; 24% instances), AUX (1; 6% instances), NOUN (1; 6% instances), PUNCT (1; 6% instances), VERB (1; 6% instances)
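The child-degree figures in this section can be derived from the HEAD column alone. The sketch below is one possible way to do it, not the page's own method; the function name `child_degree_histogram` and the `(id, head, upos, deprel)` tuple layout are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="ADV"):
    """Map number of children -> number of nodes with that many children.

    sentences: list of sentences, each a list of (id, head, upos, deprel)
    tuples with integer id and head (head 0 is the artificial root).
    """
    hist = Counter()
    for sent in sentences:
        # How many nodes in this sentence point to each head id.
        n_children = Counter(head for _id, head, _tag, _rel in sent)
        for node_id, _head, tag, _rel in sent:
            if tag == upos:
                hist[n_children[node_id]] += 1  # missing key counts as 0
    return hist
```

As a consistency check on the reported numbers: 11 nodes with one child, 1 with two and 1 with four give 11 + 2 + 4 = 17 children, which matches the totals of the two child distributions above.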