home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-CSUI: POS Tags: PROPN

There are 1193 PROPN lemmas (28%), 1194 PROPN types (25%) and 3835 PROPN tokens (14%). Out of 17 observed tags, the rank of PROPN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent PROPN lemmas: pt, AS, indonesia, Jakarta, bank, BI, tbk, Oktober, efek, Kamis

The 10 most frequent PROPN types: PT, AS, Indonesia, Jakarta, Bank, BI, tbk, Oktober, efek, Kamis

The 10 most frequent ambiguous lemmas: bank (NOUN 76, PROPN 10, X 1), efek (NOUN 2, PROPN 1), menteri (PROPN 8, NOUN 6), direktur (PROPN 6, NOUN 5), presiden (NOUN 4, PROPN 3), - (PUNCT 82, PROPN 13, SYM 1), obligasi (NOUN 40, PROPN 1), Energy (PROPN 10, X 1), of (PROPN 7, X 2), Pembangunan (PROPN 7, NOUN 1)

The 10 most frequent ambiguous types: Bank (PROPN 86, NOUN 14), efek (NOUN 2, PROPN 1), Menteri (PROPN 18, NOUN 6), Direktur (PROPN 16, NOUN 13), Presiden (NOUN 18, PROPN 16), - (PUNCT 82, PROPN 13, SYM 1), obligasi (NOUN 38, PROPN 1), NPL (PROPN 12, X 1), Energy (PROPN 10, X 1), Komisi (PROPN 8, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 1.000838 (the average of all parts of speech is 1.085880).

The 1st highest number of forms (2) was observed with the lemma “A”: A, A+idn.

The 2nd highest number of forms (1) was observed with the lemma “’s”: ’s.

The 3rd highest number of forms (1) was observed with the lemma “-”: -.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 17 different relations: flat:name (1427; 37% instances), nmod (1057; 28% instances), appos (310; 8% instances), nsubj (297; 8% instances), obl (230; 6% instances), obl:tmod (142; 4% instances), conj (135; 4% instances), nmod:tmod (97; 3% instances), obj (45; 1% instances), flat (32; 1% instances), nsubj:pass (19; 0% instances), obl:agent (19; 0% instances), root (15; 0% instances), nmod:poss (5; 0% instances), advcl (3; 0% instances), acl:relcl (1; 0% instances), iobj (1; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: PROPN (1806; 47% instances), NOUN (1181; 31% instances), VERB (727; 19% instances), NUM (40; 1% instances), X (40; 1% instances), ADJ (21; 1% instances), (15; 0% instances), AUX (2; 0% instances), PRON (2; 0% instances), DET (1; 0% instances)

2350 (61%) PROPN nodes are leaves.

542 (14%) PROPN nodes have one child.

466 (12%) PROPN nodes have two children.

477 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 19.

Children of PROPN nodes are attached using 25 different relations: flat:name (1418; 40% instances), punct (621; 17% instances), case (490; 14% instances), appos (227; 6% instances), nmod (214; 6% instances), conj (158; 4% instances), nummod (132; 4% instances), cc (84; 2% instances), acl:relcl (65; 2% instances), nmod:tmod (47; 1% instances), mark (20; 1% instances), amod (17; 0% instances), det (17; 0% instances), nsubj (11; 0% instances), nmod:lmod (10; 0% instances), cop (7; 0% instances), advmod (5; 0% instances), nmod:poss (4; 0% instances), flat (3; 0% instances), orphan (2; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), cc:preconj (1; 0% instances), obl (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (1806; 51% instances), PUNCT (621; 17% instances), ADP (490; 14% instances), NOUN (244; 7% instances), NUM (137; 4% instances), CCONJ (85; 2% instances), VERB (63; 2% instances), X (31; 1% instances), SCONJ (20; 1% instances), ADJ (18; 1% instances), DET (17; 0% instances), SYM (8; 0% instances), AUX (7; 0% instances), PRON (6; 0% instances), ADV (5; 0% instances)