Treebank Statistics: UD_Indonesian-PUD: POS Tags: PART
There are 7 PART
lemmas (0%), 7 PART
types (0%) and 225 PART
tokens (1%).
Out of 17 observed tags, the rank of PART
is: 15 in number of lemmas, 15 in number of types and 14 in number of tokens.
The 10 most frequent PART
lemmas: tidak, pun, tak, belum, kah, lah, bukan
The 10 most frequent PART
types: tidak, pun, tak, belum, kah, lah, bukan
The 10 most frequent ambiguous lemmas: tidak (PART 137, ADV 6), belum (PART 13, SCONJ 11, ADV 6, ADP 5, ADJ 4)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PART
is 1.000000 (the average of all parts of speech is 1.137428).
The 1st highest number of forms (1) was observed with the lemma “belum”: belum.
The 2nd highest number of forms (1) was observed with the lemma “bukan”: bukan.
The 3rd highest number of forms (1) was observed with the lemma “kah”: kah.
PART
occurs with 1 features: Polarity (177; 79% instances)
PART
occurs with 1 feature-value pairs: Polarity=Neg
PART
occurs with 2 feature combinations.
The most frequent feature combination is Polarity=Neg
(177 tokens).
Examples: tidak, tak, belum, bukan
Relations
PART
nodes are attached to their parents using 2 different relations: advmod (177; 79% instances), advmod:emph (48; 21% instances)
Parents of PART
nodes belong to 10 different parts of speech: VERB (123; 55% instances), ADJ (41; 18% instances), PRON (20; 9% instances), NOUN (19; 8% instances), ADV (6; 3% instances), SCONJ (6; 3% instances), DET (5; 2% instances), NUM (3; 1% instances), PART (1; 0% instances), PROPN (1; 0% instances)
224 (100%) PART
nodes are leaves.
1 (0%) PART
nodes have one child.
The highest child degree of a PART
node is 1.
Children of PART
nodes are attached using 1 different relations: advmod:emph (1; 100% instances)
Children of PART
nodes belong to 1 different parts of speech: PART (1; 100% instances)