NOUN
: noun
This document is a placeholder for the language-specific documentation
for NOUN
.
Treebank Statistics (UD_Vietnamese)
There are 2561 NOUN
lemmas (42%), 2561 NOUN
types (42%) and 13951 NOUN
tokens (32%).
Out of 13 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: người, ông, anh, nhà, bà, khi, con, ngày, hùng, năm
The 10 most frequent NOUN
types: người, ông, anh, nhà, bà, khi, con, ngày, hùng, năm
The 10 most frequent ambiguous lemmas: ông (NOUN 348, PROPN 12), anh (NOUN 188, PROPN 1), bà (NOUN 156, PROPN 1), con (NOUN 144, ADJ 3), ngày (NOUN 114, X 1), năm (NOUN 111, NUM 17), thám_tử (NOUN 94, VERB 1), lần (NOUN 84, VERB 5), nhau (NOUN 76, PROPN 2), cái (NOUN 68, PART 4)
The 10 most frequent ambiguous types: ông (NOUN 348, PROPN 12), anh (NOUN 188, PROPN 1), bà (NOUN 156, PROPN 1), con (NOUN 144, ADJ 3), ngày (NOUN 114, X 1), năm (NOUN 111, NUM 17), thám_tử (NOUN 94, VERB 1), lần (NOUN 84, VERB 5), nhau (NOUN 76, PROPN 2), cái (NOUN 68, PART 4)
- ông
- anh
- bà
- con
- ngày
- năm
- thám_tử
- lần
- nhau
- cái
Morphology
The form / lemma ratio of NOUN
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “%”: %.
The 2nd highest number of forms (1) was observed with the lemma “&”: &.
The 3rd highest number of forms (1) was observed with the lemma “12_-_2003”: 12_-_2003.
NOUN
does not occur with any features.
Relations
NOUN
nodes are attached to their parents using 22 different relations: dobj (3767; 27% instances), compound (3504; 25% instances), nsubj (3029; 22% instances), nmod (2292; 16% instances), conj (535; 4% instances), root (400; 3% instances), advcl (181; 1% instances), appos (78; 1% instances), ccomp (58; 0% instances), xcomp (26; 0% instances), parataxis (24; 0% instances), dep (17; 0% instances), iobj (11; 0% instances), list (7; 0% instances), advmod (6; 0% instances), punct (5; 0% instances), auxpass (3; 0% instances), amod (2; 0% instances), mark (2; 0% instances), vocative (2; 0% instances), case (1; 0% instances), nummod (1; 0% instances)
Parents of NOUN
nodes belong to 11 different parts of speech: VERB (7370; 53% instances), NOUN (5156; 37% instances), ADJ (750; 5% instances), ROOT (400; 3% instances), ADP (172; 1% instances), PROPN (40; 0% instances), PUNCT (25; 0% instances), NUM (16; 0% instances), X (16; 0% instances), CONJ (5; 0% instances), DET (1; 0% instances)
6029 (43%) NOUN
nodes are leaves.
3966 (28%) NOUN
nodes have one child.
2089 (15%) NOUN
nodes have two children.
1867 (13%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 15.
Children of NOUN
nodes are attached using 28 different relations: compound (3470; 22% instances), case (1873; 12% instances), xcomp (1554; 10% instances), det (1432; 9% instances), punct (1396; 9% instances), amod (1159; 7% instances), nummod (1159; 7% instances), nmod (880; 6% instances), conj (697; 4% instances), cc (335; 2% instances), cop (322; 2% instances), advmod (291; 2% instances), nsubj (272; 2% instances), ccomp (241; 2% instances), discourse (107; 1% instances), appos (81; 1% instances), advcl (73; 0% instances), dep (66; 0% instances), parataxis (38; 0% instances), neg (34; 0% instances), mark (31; 0% instances), csubj (22; 0% instances), auxpass (21; 0% instances), aux (20; 0% instances), dobj (12; 0% instances), list (7; 0% instances), iobj (4; 0% instances), vocative (1; 0% instances)
Children of NOUN
nodes belong to 13 different parts of speech: NOUN (5156; 33% instances), VERB (2279; 15% instances), ADP (1879; 12% instances), PUNCT (1400; 9% instances), ADJ (1257; 8% instances), NUM (1211; 8% instances), PROPN (939; 6% instances), DET (673; 4% instances), X (347; 2% instances), SCONJ (196; 1% instances), CONJ (169; 1% instances), PART (88; 1% instances), INTJ (4; 0% instances)
NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]