This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NOUN: noun

This document is a placeholder for the language-specific documentation for NOUN.


Treebank Statistics (UD_Vietnamese)

There are 2561 NOUN lemmas (42%), 2561 NOUN types (42%) and 13951 NOUN tokens (32%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
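Counts of this kind can be reproduced from a CoNLL-U file. The sketch below is a minimal illustration, not the script that generated this page: the names `read_tokens` and `pos_counts` are hypothetical, the sample sentences are toy data rather than treebank content, and it assumes that each percentage is the tag's share of the per-tag totals summed over all tags.

```python
from collections import defaultdict

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) < 4 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]

def pos_counts(conllu_text, upos):
    """Return {"lemmas"|"types"|"tokens": (count, percent)} for one tag."""
    lemmas, types = defaultdict(set), defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, tag in read_tokens(conllu_text):
        lemmas[tag].add(lemma)
        types[tag].add(form)
        tokens[tag] += 1

    def share(part, whole):
        return round(100 * part / whole) if whole else 0

    n_lemmas = len(lemmas.get(upos, ()))
    n_types = len(types.get(upos, ()))
    n_tokens = tokens.get(upos, 0)
    return {
        # Assumption: totals are sums of per-tag distinct counts.
        "lemmas": (n_lemmas, share(n_lemmas, sum(len(s) for s in lemmas.values()))),
        "types": (n_types, share(n_types, sum(len(s) for s in types.values()))),
        "tokens": (n_tokens, share(n_tokens, sum(tokens.values()))),
    }

# Toy input (not taken from UD_Vietnamese).
SAMPLE = "\n".join([
    "# sent_id = 1",
    "1\tngười\tngười\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tđi\tđi\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tnhà\tnhà\tNOUN\t_\t_\t2\tdobj\t_\t_",
    "",
    "1\tngười\tngười\tNOUN\t_\t_\t0\troot\t_\t_",
])

result = pos_counts(SAMPLE, "NOUN")
print(result)
```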

The 10 most frequent NOUN lemmas: người, ông, anh, nhà, bà, khi, con, ngày, hùng, năm

The 10 most frequent NOUN types: người, ông, anh, nhà, bà, khi, con, ngày, hùng, năm

The 10 most frequent ambiguous lemmas: ông (NOUN 348, PROPN 12), anh (NOUN 188, PROPN 1), (NOUN 156, PROPN 1), con (NOUN 144, ADJ 3), ngày (NOUN 114, X 1), năm (NOUN 111, NUM 17), thám_tử (NOUN 94, VERB 1), lần (NOUN 84, VERB 5), nhau (NOUN 76, PROPN 2), cái (NOUN 68, PART 4)

The 10 most frequent ambiguous types: ông (NOUN 348, PROPN 12), anh (NOUN 188, PROPN 1), (NOUN 156, PROPN 1), con (NOUN 144, ADJ 3), ngày (NOUN 114, X 1), năm (NOUN 111, NUM 17), thám_tử (NOUN 94, VERB 1), lần (NOUN 84, VERB 5), nhau (NOUN 76, PROPN 2), cái (NOUN 68, PART 4)

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.000000).
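As a rough illustration of this ratio, the sketch below assumes it is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas; that definition and the function name `form_lemma_ratio` are assumptions, not taken from the generator of this page.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) for the tokens of one part of speech."""
    distinct = set(pairs)
    lemmas = {lemma for _, lemma in distinct}
    # Assumed definition: distinct (form, lemma) pairs per distinct lemma.
    return len(distinct) / len(lemmas) if lemmas else 0.0

# Every form is its own lemma, so the ratio is exactly 1.
uninflected = [("người", "người"), ("nhà", "nhà"), ("người", "người")]
# Toy inflected data for contrast: "cat" has two forms, "dog" has one.
inflected = [("cats", "cat"), ("cat", "cat"), ("dog", "dog")]

print(form_lemma_ratio(uninflected), form_lemma_ratio(inflected))
```

Under this definition a ratio of exactly 1 means that no lemma was observed with more than one form, which matches the equal lemma and type counts reported above.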

No NOUN lemma was observed with more than one form. The first three lemmas listed, each with a single form, are “%” (%), “&” (&) and “12_-_2003” (12_-_2003).

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 22 different relations: dobj (3767; 27% instances), compound (3504; 25% instances), nsubj (3029; 22% instances), nmod (2292; 16% instances), conj (535; 4% instances), root (400; 3% instances), advcl (181; 1% instances), appos (78; 1% instances), ccomp (58; 0% instances), xcomp (26; 0% instances), parataxis (24; 0% instances), dep (17; 0% instances), iobj (11; 0% instances), list (7; 0% instances), advmod (6; 0% instances), punct (5; 0% instances), auxpass (3; 0% instances), amod (2; 0% instances), mark (2; 0% instances), vocative (2; 0% instances), case (1; 0% instances), nummod (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (7370; 53% instances), NOUN (5156; 37% instances), ADJ (750; 5% instances), ROOT (400; 3% instances), ADP (172; 1% instances), PROPN (40; 0% instances), PUNCT (25; 0% instances), NUM (16; 0% instances), X (16; 0% instances), CONJ (5; 0% instances), DET (1; 0% instances)
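The two distributions above can be tallied in one pass over the dependency trees. This is a minimal sketch under stated assumptions: sentences are given as lists of hypothetical `(id, head, upos, deprel)` tuples, head 0 is reported as the pseudo-tag ROOT (consistent with the 400 root attachments and 400 ROOT parents listed above), and percentages are rounded to whole numbers.

```python
from collections import Counter

def attachment_stats(sentences, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences:
        tag_of = {tok_id: tag for tok_id, _, tag, _ in sent}
        tag_of[0] = "ROOT"  # artificial root node
        for _, head, tag, deprel in sent:
            if tag == upos:
                relations[deprel] += 1
                parent_tags[tag_of[head]] += 1
    return relations, parent_tags

def with_percent(counter):
    """List (key, count, rounded percent) in descending order of count."""
    total = sum(counter.values())
    return [(key, n, round(100 * n / total)) for key, n in counter.most_common()]

# Toy trees (not taken from UD_Vietnamese).
trees = [
    [(1, 2, "NOUN", "nsubj"), (2, 0, "VERB", "root"),
     (3, 2, "NOUN", "dobj"), (4, 3, "NOUN", "compound")],
    [(1, 0, "NOUN", "root")],
]
relations, parent_tags = attachment_stats(trees, "NOUN")
print(with_percent(parent_tags))
```

Rounding to whole percentages explains entries such as "0%" above: a relation seen 58 times out of 13951 rounds down to zero.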

6029 (43%) NOUN nodes are leaves.

3966 (28%) NOUN nodes have one child.

2089 (15%) NOUN nodes have two children.

1867 (13%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 15.
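The leaf and child-count figures above can be derived by counting, for each node, how many tokens name it as their head. The sketch below assumes sentences are lists of hypothetical `(id, head, upos)` tuples and groups degrees of three or more into a single bucket, mirroring the four categories reported above; `child_degree_buckets` is an illustrative name.

```python
from collections import Counter

def child_degree_buckets(sentences, upos):
    """Return ({0, 1, 2, 3+ bucket: node count}, highest child degree)."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node id within this sentence.
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag != upos:
                continue
            degree = n_children[tok_id]  # 0 for leaves
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1  # key 3 means "three or more"
    return dict(buckets), max_degree

# Toy tree (not taken from UD_Vietnamese): node 3 has one child, node 1 none.
toy = [[(1, 2, "NOUN"), (2, 0, "VERB"), (3, 2, "NOUN"), (4, 3, "ADJ")]]
buckets, highest = child_degree_buckets(toy, "NOUN")
print(buckets, highest)
```

As a consistency check on the figures above, the four buckets sum to the token count: 6029 + 3966 + 2089 + 1867 = 13951.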

Children of NOUN nodes are attached using 28 different relations: compound (3470; 22% instances), case (1873; 12% instances), xcomp (1554; 10% instances), det (1432; 9% instances), punct (1396; 9% instances), amod (1159; 7% instances), nummod (1159; 7% instances), nmod (880; 6% instances), conj (697; 4% instances), cc (335; 2% instances), cop (322; 2% instances), advmod (291; 2% instances), nsubj (272; 2% instances), ccomp (241; 2% instances), discourse (107; 1% instances), appos (81; 1% instances), advcl (73; 0% instances), dep (66; 0% instances), parataxis (38; 0% instances), neg (34; 0% instances), mark (31; 0% instances), csubj (22; 0% instances), auxpass (21; 0% instances), aux (20; 0% instances), dobj (12; 0% instances), list (7; 0% instances), iobj (4; 0% instances), vocative (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (5156; 33% instances), VERB (2279; 15% instances), ADP (1879; 12% instances), PUNCT (1400; 9% instances), ADJ (1257; 8% instances), NUM (1211; 8% instances), PROPN (939; 6% instances), DET (673; 4% instances), X (347; 2% instances), SCONJ (196; 1% instances), CONJ (169; 1% instances), PART (88; 1% instances), INTJ (4; 0% instances)


NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]