NUM

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

home vi/pos issue tracker

`NUM`: numeral

This document is a placeholder for the language-specific documentation for NUM.

Treebank Statistics (UD_Vietnamese)

There are 223 NUM lemmas (4%), 223 NUM types (4%) and 1300 NUM tokens (3%). Out of 13 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent NUM lemmas: một, hai, ba, mỗi, 2, 10, năm, 20, 5, bốn

The 10 most frequent NUM types: một, hai, ba, mỗi, 2, 10, năm, 20, 5, bốn

The 10 most frequent ambiguous lemmas: một (NUM 354, X 1), mỗi (NUM 20, DET 18), năm (NOUN 111, NUM 17), đôi (NUM 14, DET 1), triệu (NUM 11, NOUN 2), tỉ (NUM 9, NOUN 2), nhất (X 19, ADJ 18, NUM 7, NOUN 2), nửa (NUM 6, DET 2), tư (NUM 5, ADJ 3), ngàn (NUM 3, NOUN 1)

The 10 most frequent ambiguous types: một (NUM 354, X 1), mỗi (NUM 20, DET 18), năm (NOUN 111, NUM 17), đôi (NUM 14, DET 1), triệu (NUM 11, NOUN 2), tỉ (NUM 9, NOUN 2), nhất (X 19, ADJ 18, NUM 7, NOUN 2), nửa (NUM 6, DET 2), tư (NUM 5, ADJ 3), ngàn (NUM 3, NOUN 1)

một
- NUM 354: Đó như một lời nhắn_nhủ : chú hổ Lâm_Nhi là “ tài_sản “ của mỗi người .
- X 1: Còn ông Chương cũng không_thể rời Hùng bởi mỗi ngày một hốt_hoảng trước sự tấn_công của kẻ địch .
mỗi
- NUM 20: Đó như một lời nhắn_nhủ : chú hổ Lâm_Nhi là “ tài_sản “ của mỗi người .
- DET 18: Như_vậy mỗi tổ đào mỗi bên 5 m .
năm
- NOUN 111: Ba năm trước , Lâm_Nhi về mới có 30 kg .
- NUM 17: mẹ anh là cụ G . , có năm con .
đôi
- NUM 14: Những đôi uyên_ương ngày_xưa giờ này người còn người mất .
- DET 1: Những đứa trẻ chạy tung_tăng trong con xóm nhỏ gọi nhau í_ới bằng tiếng Việt , đôi lúc chúng lại pha tiếng Lào .
triệu
- NUM 11: giá nấu thuê 5 - 10 triệu đồng / nồi tùy ở thân_sơ và cam_kết chủ thợ .
- NOUN 2: Đấy là phòng giá cao nhất : 3,_5 triệu / tháng .
tỉ
- NUM 9: Nhiều người sang qua sang lại mà lời cả tỉ “ … .
- NOUN 2: TP_._HCM : vay 500 tỉ đồng triển_khai gấp các dự_án cấp nước .
nhất
- X 19: Và nguy_hiểm nhất vẫn là vấn_đề qui_hoạch .
- ADJ 18: Chúng_tôi hỏi : “ Chị hãi nhất con vật nào ở vườn thú này ? “ .
- NUM 7: johor ngày thứ nhất .
- NOUN 2: cuộc thứ nhất hết 5 phút , gọi trong giờ_hành_chính và từ số máy có đông người ngồi xung_quanh .
nửa
- NUM 6: Nhưng chưa đến nửa tuần thì ông Chương đã vội_vã cầu_cứu Hùng .
- DET 2: Viện bị bại_liệt nửa người .
tư
- NUM 5: Đa_số học_sinh của trường hiện_nay là thế_hệ thứ tư , thứ năm .
- ADJ 3: Thắng gần như không hỏi gì về đời tư của Lan .
ngàn
- NUM 3: Diện_tích tăng lên , sản_lượng khoai hằng năm mình thu về cả ngàn tấn .
- NOUN 1: có hàng ngàn nạn_nhân như_vậy ở Đà_Nẵng .

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “%”: %.

The 2nd highest number of forms (1) was observed with the lemma “1”: 1.

The 3rd highest number of forms (1) was observed with the lemma “1,_5”: 1,_5.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (1172; 90% instances), conj (47; 4% instances), compound (37; 3% instances), dep (19; 1% instances), nsubj (11; 1% instances), dobj (6; 0% instances), parataxis (3; 0% instances), root (3; 0% instances), amod (1; 0% instances), nmod (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (1211; 93% instances), NUM (40; 3% instances), VERB (34; 3% instances), PROPN (6; 0% instances), ADJ (3; 0% instances), PUNCT (3; 0% instances), ROOT (3; 0% instances)

1234 (95%) NUM nodes are leaves.

45 (3%) NUM nodes have one child.

9 (1%) NUM nodes have two children.

12 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 17 different relations: compound (39; 37% instances), punct (23; 22% instances), dep (10; 9% instances), det (8; 8% instances), advmod (3; 3% instances), amod (3; 3% instances), case (3; 3% instances), neg (3; 3% instances), nummod (3; 3% instances), conj (2; 2% instances), nmod (2; 2% instances), xcomp (2; 2% instances), cc (1; 1% instances), cop (1; 1% instances), discourse (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (40; 38% instances), PUNCT (24; 23% instances), NOUN (16; 15% instances), PROPN (8; 8% instances), ADJ (6; 6% instances), X (5; 5% instances), ADP (3; 3% instances), VERB (2; 2% instances), CONJ (1; 1% instances), PART (1; 1% instances)

NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]

NUM: numeral

Treebank Statistics (UD_Vietnamese)

Morphology

Relations

`NUM`: numeral