This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-PUD: POS Tags: ADJ

There are 338 ADJ lemmas (8%), 351 ADJ types (7%) and 1026 ADJ tokens (5%). Out of 17 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent ADJ lemmas: besar, baru, lain, baik, pertama, akhir, sama, banyak, salah, tinggi

The 10 most frequent ADJ types: besar, lain, baru, pertama, terakhir, baik, sama, banyak, salah, lama

The 10 most frequent ambiguous lemmas: besar (ADJ 65, NOUN 5), baru (ADJ 33, NOUN 6), lain (ADJ 33, CCONJ 2, NOUN 1), baik (ADJ 28, CCONJ 5, VERB 1), pertama (ADJ 28, ADV 2), akhir (ADJ 23, ADV 12, NOUN 9, VERB 4), sama (ADJ 22, ADP 6, ADV 5, NOUN 4, VERB 2), banyak (DET 35, ADJ 19, ADV 10, NOUN 1), salah (ADJ 17, VERB 3, NOUN 1), tinggi (ADJ 17, NOUN 2)

The 10 most frequent ambiguous types: lain (ADJ 33, NOUN 1), baik (ADJ 21, CCONJ 4), sama (ADJ 21, ADP 1), banyak (DET 32, ADJ 18, ADV 10), jelas (ADJ 12, VERB 2), tinggi (ADJ 12, NOUN 1), kedua (NUM 14, ADJ 6), ketiga (ADJ 7, NUM 1), panjang (ADJ 6, NOUN 1), mungkin (AUX 11, ADJ 3, ADV 3)

Morphology

The form / lemma ratio of ADJ is 1.038462 (the average of all parts of speech is 1.137196).
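
The ratio above is consistent with dividing the number of distinct ADJ types by the number of distinct ADJ lemmas, using the counts given at the top of this page (351 types, 338 lemmas). A minimal Python sketch under that assumption; the function name is illustrative and not part of any UD tool:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts reported on this page for ADJ in UD_Indonesian-PUD.
adj_ratio = form_lemma_ratio(n_types=351, n_lemmas=338)
print(f"{adj_ratio:.6f}")
```

A value close to 1 means most ADJ lemmas appear in a single form; the few exceptions are superlatives and reduplications such as those listed below.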

The 1st highest number of forms (3) was observed with the lemma “besar”: besar, besar-besaran, terbesar.

The 2nd highest number of forms (2) was observed with the lemma “baik”: baik, terbaik.

The 3rd highest number of forms (2) was observed with the lemma “banyak”: banyak, terbanyak.

ADJ occurs with 2 features: Degree (58; 6% instances), NumType (58; 6% instances)

ADJ occurs with 2 feature-value pairs: Degree=Sup, NumType=Ord

ADJ occurs with 3 feature combinations. The most frequent feature combination is _, i.e. no features (910 tokens). Examples: besar, lain, baru, baik, sama, banyak, salah, lama, biasa, penting

Relations

ADJ nodes are attached to their parents using 14 different relations: amod (580; 57% instances), acl:relcl (132; 13% instances), advmod (126; 12% instances), root (60; 6% instances), xcomp (37; 4% instances), conj (29; 3% instances), advcl (22; 2% instances), ccomp (13; 1% instances), parataxis (9; 1% instances), acl (8; 1% instances), csubj (5; 0% instances), fixed (2; 0% instances), obl (2; 0% instances), compound (1; 0% instances)
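
The percentages in the list above appear to be each relation's count divided by the total number of ADJ tokens, rounded to a whole percent. The following Python sketch checks this against the counts copied from this page; the variable and function names are illustrative:

```python
from typing import Dict

# Relation counts for ADJ nodes, copied from this page.
ADJ_RELATIONS: Dict[str, int] = {
    "amod": 580, "acl:relcl": 132, "advmod": 126, "root": 60,
    "xcomp": 37, "conj": 29, "advcl": 22, "ccomp": 13,
    "parataxis": 9, "acl": 8, "csubj": 5, "fixed": 2,
    "obl": 2, "compound": 1,
}


def percentages(counts: Dict[str, int]) -> Dict[str, int]:
    """Share of each key in the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


adj_total = sum(ADJ_RELATIONS.values())  # should equal the ADJ token count
adj_percent = percentages(ADJ_RELATIONS)

for rel, n in ADJ_RELATIONS.items():
    print(f"{rel}\t{n}\t{adj_percent[rel]}%")
```

The counts sum to the 1026 ADJ tokens reported at the top of the page, and rare relations such as csubj round down to 0%.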

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (711; 69% instances), VERB (182; 18% instances), no parent, i.e. the ADJ is the root (60; 6% instances), ADJ (32; 3% instances), NUM (16; 2% instances), PROPN (15; 1% instances), PRON (4; 0% instances), ADP (2; 0% instances), SYM (2; 0% instances), DET (1; 0% instances), X (1; 0% instances)

629 (61%) ADJ nodes are leaves.

186 (18%) ADJ nodes have one child.

84 (8%) ADJ nodes have two children.

127 (12%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 9.
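
The child-degree counts above (leaves, one child, two children, three or more) could be recomputed from a CoNLL-U file roughly as follows. This is a sketch, assuming the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC); the function names are hypothetical and the two sample sentences are invented for illustration, not taken from the treebank:

```python
from collections import Counter


def child_degree_buckets(conllu_text: str, upos: str = "ADJ") -> Counter:
    """Count nodes with the given UPOS by number of children (0, 1, 2, 3+)."""
    buckets = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if not cols[0].isdigit():
                continue  # skip multiword ranges (1-2) and empty nodes (1.1)
            rows.append(cols)
        # Column 7 (index 6) is HEAD: count how often each ID is a head.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:
                # Key 3 stands for "three or more children".
                buckets[min(n_children[cols[0]], 3)] += 1
    return buckets


def to_conllu(sentences) -> str:
    """Join rows with tabs and sentences with blank lines."""
    return "\n\n".join(
        "\n".join("\t".join(row) for row in sent) for sent in sentences
    )


# Invented examples: "Rumah itu sangat besar ." and "Rumah besar".
SAMPLE = to_conllu([
    [
        ("1", "Rumah", "rumah", "NOUN", "_", "_", "4", "nsubj", "_", "_"),
        ("2", "itu", "itu", "DET", "_", "_", "1", "det", "_", "_"),
        ("3", "sangat", "sangat", "ADV", "_", "_", "4", "advmod", "_", "_"),
        ("4", "besar", "besar", "ADJ", "_", "_", "0", "root", "_", "_"),
        ("5", ".", ".", "PUNCT", "_", "_", "4", "punct", "_", "_"),
    ],
    [
        ("1", "Rumah", "rumah", "NOUN", "_", "_", "0", "root", "_", "_"),
        ("2", "besar", "besar", "ADJ", "_", "_", "1", "amod", "_", "_"),
    ],
])

buckets = child_degree_buckets(SAMPLE)
print(dict(buckets))
```

In the sample, the predicative "besar" has three children (nsubj, advmod, punct) and the attributive "besar" is a leaf, which mirrors the two most common ADJ configurations reported on this page.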

Children of ADJ nodes are attached using 26 different relations: nsubj (213; 24% instances), advmod (191; 22% instances), punct (126; 14% instances), obl (64; 7% instances), case:adv (43; 5% instances), advcl (41; 5% instances), conj (39; 4% instances), cc (26; 3% instances), mark (23; 3% instances), det (20; 2% instances), aux (16; 2% instances), obl:tmod (12; 1% instances), xcomp (12; 1% instances), compound:a (10; 1% instances), parataxis (8; 1% instances), csubj (6; 1% instances), nmod (6; 1% instances), ccomp (4; 0% instances), fixed (3; 0% instances), appos (2; 0% instances), case (2; 0% instances), cc:preconj (1; 0% instances), csubj:pass (1; 0% instances), dislocated (1; 0% instances), flat (1; 0% instances), nsubj:pass (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PRON (157; 18% instances), ADV (143; 16% instances), NOUN (139; 16% instances), PUNCT (126; 14% instances), VERB (83; 10% instances), ADP (47; 5% instances), PART (41; 5% instances), ADJ (32; 4% instances), CCONJ (27; 3% instances), SCONJ (21; 2% instances), DET (20; 2% instances), AUX (16; 2% instances), PROPN (15; 2% instances), NUM (3; 0% instances), SYM (2; 0% instances)