Treebank Statistics: UD_Indonesian-GSD: POS Tags: DET
There are 52 DET
lemmas (0%), 50 DET
types (0%) and 3618 DET
tokens (3%).
Out of 17 observed tags, the rank of DET
is: 10 in number of lemmas, 10 in number of types and 9 in number of tokens.
The 10 most frequent DET
lemmas: ini, itu, buah, tersebut, nya, orang, beberapa, para, berbagai, suatu
The 10 most frequent DET
types: ini, itu, sebuah, tersebut, nya, seorang, beberapa, para, berbagai, suatu
The 10 most frequent ambiguous lemmas: ini (DET 1154, PRON 13), itu (DET 402, PRON 38), buah (DET 365, NOUN 31, VERB 4, PROPN 1), tersebut (DET 267, VERB 1), nya (DET 219, PRON 21, PROPN 2), orang (NOUN 271, DET 199, PRON 23, VERB 1), beberapa (DET 173, PRON 2), para (DET 141, PROPN 1), suatu (DET 94, PRON 15), semua (DET 66, PRON 31)
The 10 most frequent ambiguous types: ini (DET 1141, PRON 8), itu (DET 399, PRON 32), nya (PRON 1494, DET 221), seorang (DET 193, PRON 5), beberapa (DET 156, PRON 2), semua (DET 60, PRON 27), seluruh (DET 51, NOUN 1), sendiri (DET 51, ADJ 14, ADV 7, NOUN 1), banyak (ADV 82, DET 39, ADJ 5), masing-masing (DET 20, ADV 4, NOUN 1)
- ini
- itu
- nya
- seorang
- beberapa
- semua
- seluruh
- DET 51: Album ini dijual di gerai Coffee Toffee di seluruh Indonesia .
- NOUN 1: Di Indonesia , definisi BUMN menurut Undang-Undang Nomor 19 Tahun 2003 adalah badan usaha yang seluruh atau sebagian besar modal nya dimiliki oleh negara melalui penyertaan secara langsung yang berasal dari kekayaan negara yang dipisahkan .
- sendiri
- DET 51: Dan bagaimana dengan Aurelie sendiri ?
- ADJ 14: Wiwi bertanya kenapa tidak Eyang Tini yang mengambil nya sendiri ?
- ADV 7: Namun KAN sendiri tidak memiliki kekuasaan formal .
- NOUN 1: Pengguna juga dapat menambahkan sendiri topik berita yang diinginkan dan menambahkan nya dalam kategori berita .
- banyak
- masing-masing
Morphology
The form / lemma ratio of DET
is 0.961538 (the average of all parts of speech is 1.120343).
The 1st highest number of forms (2) was observed with the lemma “banyak”: banyak, kebanyakan.
The 2nd highest number of forms (2) was observed with the lemma “beberapa”: beberapa, berberapa.
The 3rd highest number of forms (2) was observed with the lemma “masing”: masing, masing-masing.
DET
occurs with 5 features: PronType (3612; 100% instances), Definite (920; 25% instances), Number (454; 13% instances), Typo (3; 0% instances), Abbr (1; 0% instances)
DET
occurs with 11 feature-value pairs: Abbr=Yes
, Definite=Def
, Definite=Ind
, Number=Plur
, Number=Sing
, PronType=Art
, PronType=Dem
, PronType=Emp
, PronType=Ind
, PronType=Tot
, Typo=Yes
DET
occurs with 13 feature combinations.
The most frequent feature combination is PronType=Dem
(1854 tokens).
Examples: ini, itu, tersebut, tertentu, begitu, berikut, tadi, begini, demikian, tesebut
Relations
DET
nodes are attached to their parents using 13 different relations: det (3544; 98% instances), nsubj (30; 1% instances), amod (13; 0% instances), fixed (9; 0% instances), obj (7; 0% instances), advmod (4; 0% instances), nsubj:pass (3; 0% instances), conj (2; 0% instances), obl (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), xcomp (1; 0% instances)
Parents of DET
nodes belong to 11 different parts of speech: NOUN (2893; 80% instances), PROPN (282; 8% instances), VERB (200; 6% instances), ADJ (125; 3% instances), PRON (67; 2% instances), NUM (27; 1% instances), ADV (12; 0% instances), ADP (6; 0% instances), SCONJ (3; 0% instances), DET (2; 0% instances), PART (1; 0% instances)
3541 (98%) DET
nodes are leaves.
66 (2%) DET
nodes have one child.
6 (0%) DET
nodes have two children.
5 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 4.
Children of DET
nodes are attached using 12 different relations: punct (39; 41% instances), advmod (22; 23% instances), advmod:emph (16; 17% instances), nmod (4; 4% instances), case (2; 2% instances), cc (2; 2% instances), det (2; 2% instances), mark (2; 2% instances), nsubj (2; 2% instances), obl (2; 2% instances), acl:relcl (1; 1% instances), nsubj:pass (1; 1% instances)
Children of DET
nodes belong to 11 different parts of speech: PUNCT (39; 41% instances), PART (18; 19% instances), ADJ (10; 11% instances), ADV (10; 11% instances), NOUN (5; 5% instances), PRON (4; 4% instances), ADP (2; 2% instances), CCONJ (2; 2% instances), DET (2; 2% instances), SCONJ (2; 2% instances), VERB (1; 1% instances)