home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-GSD: POS Tags: ADP

There are 214 ADP lemmas (1%), 236 ADP types (1%) and 12019 ADP tokens (10%). Out of 16 observed tags, the rank of ADP is: 7 in number of lemmas, 7 in number of types and 5 in number of tokens.

The 10 most frequent ADP lemmas: di, pada, dari, dengan, untuk, dalam, oleh, sebagai, ke, seperti

The 10 most frequent ADP types: di, pada, dari, dengan, untuk, dalam, oleh, sebagai, ke, seperti

The 10 most frequent ambiguous lemmas: di (ADP 2333, VERB 11, CCONJ 8, PROPN 6, SCONJ 3, X 1), pada (ADP 1391, NOUN 2, CCONJ 1), dari (ADP 1164, CCONJ 1, PROPN 1), dengan (ADP 1158, CCONJ 3, NOUN 3, SCONJ 1), untuk (ADP 923, PROPN 2), dalam (ADP 808, NOUN 17, ADJ 6, PROPN 2), oleh (ADP 588, NOUN 3, CCONJ 2), sebagai (ADP 584, NOUN 11, ADV 1, PROPN 1, VERB 1), ke (ADP 359, NUM 63, DET 9, VERB 1, X 1), seperti (ADP 213, SCONJ 1)

The 10 most frequent ambiguous types: di (ADP 2200, VERB 11, CCONJ 8, SCONJ 3, PROPN 2, X 1), dari (ADP 1129, CCONJ 1, PROPN 1), dengan (ADP 1121, CCONJ 1), untuk (ADP 868, PROPN 1), dalam (ADP 715, NOUN 10, ADJ 5, PROPN 1), oleh (ADP 575, CCONJ 2), sebagai (ADP 554, VERB 1), ke (ADP 356, NUM 62, DET 9, VERB 1, X 1), seperti (ADP 204, SCONJ 1), secara (ADP 155, ADV 5)

Morphology

The form / lemma ratio of ADP is 1.102804 (the average of all parts of speech is 1.045328).

The 1st highest number of forms (4) was observed with the lemma “kepada”: kepada, kepadaku, kepadamu, kepadanya.

The 2nd highest number of forms (3) was observed with the lemma “untuk”: untuk, untukmu, untuknya.

The 3rd highest number of forms (2) was observed with the lemma “antara”: antara, antaranya.

ADP occurs with 6 features: Number (1358; 11% instances), Degree (867; 7% instances), Voice (296; 2% instances), Number[psor] (51; 0% instances), Person[psor] (51; 0% instances), PronType (23; 0% instances)

ADP occurs with 11 feature-value pairs: Degree=Pos, Degree=Sup, Number=Sing, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, PronType=Ind, PronType=Int, Voice=Act, Voice=Pass

ADP occurs with 15 feature combinations. The most frequent feature combination is _ (10599 tokens). Examples: di, pada, dari, dengan, untuk, oleh, sebagai, ke, seperti, secara

Relations

ADP nodes are attached to their parents using 16 different relations: case (11897; 99% instances), mark (45; 0% instances), root (16; 0% instances), advmod (13; 0% instances), dep (13; 0% instances), amod (8; 0% instances), fixed (8; 0% instances), appos (6; 0% instances), obl (3; 0% instances), advcl (2; 0% instances), compound (2; 0% instances), nsubj (2; 0% instances), nmod (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 13 different parts of speech: NOUN (7336; 61% instances), PROPN (2890; 24% instances), VERB (1395; 12% instances), ADJ (162; 1% instances), PRON (127; 1% instances), ADV (33; 0% instances), NUM (29; 0% instances), (16; 0% instances), SYM (16; 0% instances), ADP (7; 0% instances), DET (4; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances)

11960 (100%) ADP nodes are leaves.

20 (0%) ADP nodes have one child.

19 (0%) ADP nodes have two children.

20 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 8.

Children of ADP nodes are attached using 18 different relations: punct (34; 25% instances), nmod (28; 21% instances), fixed (10; 7% instances), advmod (8; 6% instances), nsubj (7; 5% instances), conj (6; 4% instances), case (5; 4% instances), compound (5; 4% instances), dep (5; 4% instances), amod (4; 3% instances), flat (4; 3% instances), mark (4; 3% instances), acl (3; 2% instances), advcl (3; 2% instances), det (3; 2% instances), nummod (3; 2% instances), obj (2; 1% instances), nsubj:pass (1; 1% instances)

Children of ADP nodes belong to 14 different parts of speech: NOUN (37; 27% instances), PUNCT (34; 25% instances), VERB (12; 9% instances), PROPN (10; 7% instances), ADP (7; 5% instances), ADV (7; 5% instances), ADJ (6; 4% instances), PRON (6; 4% instances), SCONJ (6; 4% instances), DET (4; 3% instances), NUM (3; 2% instances), CCONJ (1; 1% instances), PART (1; 1% instances), SYM (1; 1% instances)