Treebank Statistics: UD_Indonesian-CSUI: POS Tags: PART
There are 8 PART lemmas (0%), 8 PART types (0%) and 159 PART tokens (1%).
Out of 17 observed tags, the rank of PART is: 15 in number of lemmas, 15 in number of types and 15 in number of tokens.
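The lemma, type and token counts and the per-tag ranks can be sketched as follows. This is a minimal illustration, not the script that generated this page: the helper names (`row`, `read_tokens`, `pos_counts`, `rank`) and the two-sentence sample are hypothetical, and it assumes that a "type" is a distinct word form compared case-sensitively.

```python
from collections import defaultdict


def row(idx, form, lemma, upos):
    """Build one CoNLL-U token line; columns unused here are left as "_"."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Hypothetical two-sentence sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Dia tidak datang",
    row(1, "Dia", "dia", "PRON"),
    row(2, "tidak", "tidak", "PART"),
    row(3, "datang", "datang", "VERB"),
    "",
    "# text = Mereka belum datang",
    row(1, "Mereka", "mereka", "PRON"),
    row(2, "belum", "belum", "PART"),
    row(3, "datang", "datang", "VERB"),
    "",
])


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}


def rank(counts, upos, index):
    """1-based rank of `upos` by one count (0=lemmas, 1=types, 2=tokens).

    Ties are broken by first appearance in the data.
    """
    ordered = sorted(counts, key=lambda p: counts[p][index], reverse=True)
    return ordered.index(upos) + 1


counts = pos_counts(SAMPLE)
print(counts["PART"])            # (2, 2, 2)
print(rank(counts, "VERB", 0))   # 3
```

Applied to the full treebank file, the same counting would be expected to give the 8 lemmas, 8 types and 159 tokens reported for PART, provided the page uses the same definition of "type".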
The most frequent PART lemmas (all 8): tidak, belum, pun, bukan, lah, kah, tak, jangan
The most frequent PART types (all 8): tidak, belum, pun, bukan, lah, kah, tak, jangan
The most frequent ambiguous lemmas: belum (PART 28, ADJ 16, SCONJ 8), jangan (ADV 1, PART 1)
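A lemma counts as ambiguous here when it is observed with PART and with at least one other tag. A small sketch of that check, using the counts quoted above for belum and jangan; the function name, the ordering by total frequency and the count for tidak are assumptions made for illustration.

```python
from collections import Counter, defaultdict

# (lemma, UPOS) counts. The figures for "belum" and "jangan" are the ones
# quoted above; the count for "tidak" is a made-up placeholder standing in
# for an unambiguous lemma.
LEMMA_TAG_COUNTS = Counter({
    ("belum", "PART"): 28, ("belum", "ADJ"): 16, ("belum", "SCONJ"): 8,
    ("jangan", "ADV"): 1, ("jangan", "PART"): 1,
    ("tidak", "PART"): 5,  # hypothetical count
})


def ambiguous_lemmas(counts, upos="PART"):
    """Return [(lemma, Counter of tags)] for lemmas seen with `upos` and another tag."""
    by_lemma = defaultdict(Counter)
    for (lemma, tag), n in counts.items():
        by_lemma[lemma][tag] += n
    found = {
        lemma: tags for lemma, tags in by_lemma.items()
        if upos in tags and len(tags) > 1
    }
    # Assumption: order by total frequency of the lemma across all tags.
    return sorted(found.items(), key=lambda kv: sum(kv[1].values()), reverse=True)


result = ambiguous_lemmas(LEMMA_TAG_COUNTS)
for lemma, tags in result:
    print(lemma, ", ".join(f"{tag} {n}" for tag, n in tags.most_common()))
```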
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.085880).
The 1st highest number of forms (1) was observed with the lemma “belum”: belum.
The 2nd highest number of forms (1) was observed with the lemma “bukan”: bukan.
The 3rd highest number of forms (1) was observed with the lemma “jangan”: jangan.
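The ratio and the per-lemma form counts can be illustrated with a short sketch. It assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, which is consistent with 8 types / 8 lemmas = 1.000000 for PART; the function names and the verb example are hypothetical.

```python
from collections import defaultdict

# The eight PART lemmas listed above. Each occurs with a single form that is
# identical to the lemma, so every (form, lemma) pair repeats the word.
PART_PAIRS = [
    (w, w) for w in
    ["tidak", "belum", "pun", "bukan", "lah", "kah", "tak", "jangan"]
]

# Illustrative contrast only (not treebank data): one lemma with three forms.
VERB_PAIRS = [("makan", "makan"), ("memakan", "makan"), ("dimakan", "makan")]


def forms_per_lemma(pairs):
    """Map each lemma to the set of forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    n_forms = len({form for form, _ in pairs})
    n_lemmas = len({lemma for _, lemma in pairs})
    return n_forms / n_lemmas


print(f"{form_lemma_ratio(PART_PAIRS):.6f}")   # 1.000000
print(f"{form_lemma_ratio(VERB_PAIRS):.6f}")   # 3.000000
print(max(len(f) for f in forms_per_lemma(PART_PAIRS).values()))  # 1
```

Because every PART lemma has exactly one form, the "highest number of forms" ranking above is a tie at 1.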
PART occurs with 1 feature: Polarity (124; 78% instances)
PART occurs with 1 feature-value pair: Polarity=Neg
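PART occurs with 2 feature combinations.
The most frequent feature combination is Polarity=Neg (124 tokens). Since Polarity=Neg is the only observed feature-value pair, the other combination is the empty feature set, which covers the remaining 35 of the 159 tokens.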
Examples: tidak, belum, bukan, tak, jangan
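The feature figures above can be reproduced from the FEATS column. A minimal sketch, assuming that the 35 PART tokens not covered by Polarity=Neg (159 minus 124) have an empty feature set ("_"), which is the only possibility when a single feature-value pair is observed; `parse_feats` and `FEATS_COLUMN` are hypothetical names.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as "Polarity=Neg" into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


# FEATS values of the 159 PART tokens: 124 with Polarity=Neg as reported
# above, the remaining 35 assumed to be empty ("_").
FEATS_COLUMN = ["Polarity=Neg"] * 124 + ["_"] * 35

combinations = Counter(FEATS_COLUMN)
features = Counter(name for f in FEATS_COLUMN for name in parse_feats(f))
pairs = Counter(
    f"{name}={value}"
    for f in FEATS_COLUMN
    for name, value in parse_feats(f).items()
)

share = 100 * features["Polarity"] / len(FEATS_COLUMN)
print(len(features), len(pairs), len(combinations))  # 1 1 2
print(f"Polarity: {features['Polarity']} tokens, {share:.0f}% of instances")
```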
Relations
PART nodes are attached to their parents using 3 different relations: advmod (124; 78% instances), advmod:emph (33; 21% instances), conj (2; 1% instances)
Parents of PART nodes belong to 8 different parts of speech: VERB (79; 50% instances), ADJ (34; 21% instances), NOUN (18; 11% instances), SCONJ (9; 6% instances), ADV (8; 5% instances), PRON (6; 4% instances), PART (4; 3% instances), DET (1; 1% instances)
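The relation and parent-tag distributions come from the HEAD and DEPREL columns of each PART token. The sketch below shows one way to collect them; the helper names and the two sample sentences, including their annotation, are invented for illustration and are not taken from the treebank.

```python
from collections import Counter


def row(idx, form, upos, head, deprel):
    """Build one CoNLL-U token line; LEMMA, XPOS, FEATS, DEPS, MISC are left as "_"."""
    return "\t".join([str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"])


# Hypothetical sample with made-up annotation.
SAMPLE = "\n".join([
    "# text = Dia tidak datang",
    row(1, "Dia", "PRON", 3, "nsubj"),
    row(2, "tidak", "PART", 3, "advmod"),
    row(3, "datang", "VERB", 0, "root"),
    "",
    "# text = Itu pun bukan rumah",
    row(1, "Itu", "PRON", 4, "nsubj"),
    row(2, "pun", "PART", 1, "advmod:emph"),
    row(3, "bukan", "PART", 4, "advmod"),
    row(4, "rumah", "NOUN", 0, "root"),
    "",
])


def read_sentences(text):
    """Yield each sentence as {token_id: (upos, head, deprel)}."""
    for block in text.strip().split("\n\n"):
        sent = {}
        for line in block.splitlines():
            if line.startswith("#"):
                continue
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sent[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
        yield sent


def attachment_stats(text, upos="PART"):
    """Count incoming relations and parent tags for all nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in read_sentences(text):
        for tag, head, deprel in sent.values():
            if tag != upos:
                continue
            relations[deprel] += 1
            # HEAD 0 is the artificial root, which has no tag of its own.
            parent_pos[sent[head][0] if head else "ROOT"] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_pos.most_common())
```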
150 (94%) PART nodes are leaves.
9 (6%) PART nodes have one child.
The highest child degree of a PART node is 1.
Children of PART nodes are attached using 3 different relations: advmod:emph (4; 44% instances), cc (3; 33% instances), advmod (2; 22% instances)
Children of PART nodes belong to 3 different parts of speech: PART (4; 44% instances), CCONJ (3; 33% instances), ADV (2; 22% instances)
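Child degree is the number of tokens whose HEAD points at a given PART node; a leaf has degree 0. The following sketch counts degrees and the relations and tags of the children. The helper names and the sample sentences, including their annotation, are hypothetical and only serve to exercise the code.

```python
from collections import Counter


def row(idx, form, upos, head, deprel):
    """Build one CoNLL-U token line; LEMMA, XPOS, FEATS, DEPS, MISC are left as "_"."""
    return "\t".join([str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"])


# Hypothetical sample with made-up annotation.
SAMPLE = "\n".join([
    "# text = Dia belum juga datang",
    row(1, "Dia", "PRON", 4, "nsubj"),
    row(2, "belum", "PART", 4, "advmod"),
    row(3, "juga", "ADV", 2, "advmod"),
    row(4, "datang", "VERB", 0, "root"),
    "",
    "# text = Dia tidak datang",
    row(1, "Dia", "PRON", 3, "nsubj"),
    row(2, "tidak", "PART", 3, "advmod"),
    row(3, "datang", "VERB", 0, "root"),
    "",
])


def read_sentences(text):
    """Yield each sentence as {token_id: (upos, head, deprel)}."""
    for block in text.strip().split("\n\n"):
        sent = {}
        for line in block.splitlines():
            if line.startswith("#"):
                continue
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sent[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
        yield sent


def child_stats(text, upos="PART"):
    """Return (degree histogram, child relations, child tags) for `upos` nodes."""
    degrees, child_rel, child_pos = Counter(), Counter(), Counter()
    for sent in read_sentences(text):
        # Start every node tagged `upos` at zero children.
        n_children = {tid: 0 for tid, (tag, _, _) in sent.items() if tag == upos}
        for tag, head, deprel in sent.values():
            if head in n_children:
                n_children[head] += 1
                child_rel[deprel] += 1
                child_pos[tag] += 1
        degrees.update(n_children.values())  # one entry per `upos` node
    return degrees, child_rel, child_pos


degrees, child_rel, child_pos = child_stats(SAMPLE)
print("leaves:", degrees[0])               # leaves: 1
print("highest degree:", max(degrees))     # highest degree: 1
print(child_rel.most_common(), child_pos.most_common())
```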