This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home vi/pos issue tracker

PROPN: proper noun

This document is a placeholder for the language-specific documentation for PROPN.


Treebank Statistics (UD_Vietnamese)

There are 64 PROPN lemmas (1%), 64 PROPN types (1%) and 1837 PROPN tokens (4%). Out of 13 observed tags, the rank of PROPN is: 7 in number of lemmas, 7 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: này, đó, tôi, mình, đây, họ, gì, chúng_tôi, nào, ai

The 10 most frequent PROPN types: này, đó, tôi, mình, đây, họ, gì, chúng_tôi, nào, ai

The 10 most frequent ambiguous lemmas: này (PROPN 229, PART 3, NOUN 1), đó (PROPN 194, PART 3, NOUN 1), tôi (PROPN 125, NOUN 1), mình (PROPN 112, NOUN 8, X 1), đây (PROPN 86, PART 5, NOUN 3), họ (PROPN 62, NOUN 7), (PROPN 83, PART 6, X 4, NOUN 1), chúng_tôi (PROPN 64, NOUN 1), nào (PROPN 75, PART 2, X 1), ai (PROPN 63, NOUN 1)

The 10 most frequent ambiguous types: này (PROPN 229, PART 3, NOUN 1), đó (PROPN 194, PART 3, NOUN 1), tôi (PROPN 125, NOUN 1), mình (PROPN 112, NOUN 8, X 1), đây (PROPN 86, PART 5, NOUN 3), họ (PROPN 62, NOUN 7), (PROPN 83, PART 6, X 4, NOUN 1), chúng_tôi (PROPN 64, NOUN 1), nào (PROPN 75, PART 2, X 1), ai (PROPN 63, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “Ai”: Ai.

The 2nd highest number of forms (1) was observed with the lemma “Ai_nấy”: Ai_nấy.

The 3rd highest number of forms (1) was observed with the lemma “Bây_giờ”: Bây_giờ.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 11 different relations: det (886; 48% instances), nsubj (493; 27% instances), nmod (216; 12% instances), dobj (181; 10% instances), root (27; 1% instances), advcl (12; 1% instances), conj (8; 0% instances), ccomp (7; 0% instances), parataxis (5; 0% instances), auxpass (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: NOUN (939; 51% instances), VERB (740; 40% instances), ADJ (53; 3% instances), ADP (34; 2% instances), PROPN (28; 2% instances), ROOT (27; 1% instances), NUM (8; 0% instances), CONJ (5; 0% instances), X (3; 0% instances)

1533 (83%) PROPN nodes are leaves.

235 (13%) PROPN nodes have one child.

31 (2%) PROPN nodes have two children.

38 (2%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 9.

Children of PROPN nodes are attached using 24 different relations: case (165; 36% instances), punct (57; 12% instances), det (31; 7% instances), advmod (27; 6% instances), nsubj (21; 5% instances), discourse (20; 4% instances), amod (19; 4% instances), cop (19; 4% instances), neg (18; 4% instances), xcomp (18; 4% instances), nmod (15; 3% instances), cc (13; 3% instances), conj (8; 2% instances), nummod (5; 1% instances), advcl (3; 1% instances), auxpass (3; 1% instances), csubj (3; 1% instances), dep (3; 1% instances), mark (3; 1% instances), appos (2; 0% instances), ccomp (2; 0% instances), dobj (2; 0% instances), parataxis (2; 0% instances), aux (1; 0% instances)

Children of PROPN nodes belong to 13 different parts of speech: ADP (165; 36% instances), PUNCT (57; 12% instances), VERB (50; 11% instances), X (44; 10% instances), NOUN (40; 9% instances), PROPN (28; 6% instances), ADJ (27; 6% instances), PART (19; 4% instances), CONJ (11; 2% instances), DET (9; 2% instances), NUM (6; 1% instances), SCONJ (3; 1% instances), INTJ (1; 0% instances)


PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]