Treebank Statistics: UD_Indonesian-GSD: Features: PronType
This feature is universal, but the value Emp is language-specific.
It occurs with 8 different values: Art, Dem, Emp, Ind, Int, Prs, Rel, Tot.
10528 tokens (9%) have a non-empty value of PronType.
153 types (1%) occur at least once with a non-empty value of PronType.
92 lemmas (1%) occur at least once with a non-empty value of PronType.
The feature is used with 4 part-of-speech tags: PRON (6408; 5% instances), DET (3628; 3% instances), ADV (404; 0% instances), NUM (88; 0% instances).
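Counts like the ones above can be reproduced from a CoNLL-U file by reading the UPOS and FEATS columns. The following is a minimal sketch, not the script that generated this page; the `SAMPLE` fragment is a made-up toy sentence rather than data from UD_Indonesian-GSD, and the helper names `parse_feats`, `iter_tokens` and `prontype_by_upos` are hypothetical.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, not taken from UD_Indonesian-GSD).
SAMPLE = """\
1\tDia\tdia\tPRON\t_\tNumber=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\tmembaca\tbaca\tVERB\t_\t_\t0\troot\t_\t_
3\tbuku\tbuku\tNOUN\t_\t_\t2\tobj\t_\t_
4\titu\titu\tDET\t_\tPronType=Dem\t3\tdet\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'A=1|B=2' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def prontype_by_upos(conllu_text):
    """Count tokens with a non-empty PronType, grouped by UPOS tag."""
    counts = Counter()
    for _form, _lemma, upos, feats in iter_tokens(conllu_text):
        if "PronType" in feats:
            counts[upos] += 1
    return counts

print(prontype_by_upos(SAMPLE))
```

On the real treebank, the same loop would be run over the text of the treebank's `.conllu` files; dividing each count by the total number of tokens gives the percentages quoted in parentheses.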
PRON
6408 PRON tokens (100% of all PRON tokens) have a non-empty value of PronType.
The most frequent other feature values with which PRON and PronType co-occurred: Person=EMPTY (3593; 56%), Number=EMPTY (3552; 55%).
PRON tokens may have the following values of PronType:
Dem (155; 2% of non-empty PronType): mana, itu, demikian, ini, sini, begitu, sana, situ
Ind (50; 1% of non-empty PronType): seseorang, sesuatu, seorang, sejumlah, beberapa, Banyak, nya
Int (217; 3% of non-empty PronType): apa, siapa, mana, Berapa, Dimana, Kenapa, apa-apa
Prs (2900; 45% of non-empty PronType): nya, ia, mereka, dia, diri, kita, ku, kamu, aku, mu
Rel (3043; 47% of non-empty PronType): yang, siapa, yg, apa
Tot (43; 1% of non-empty PronType): semua, keseluruhan, segala, kesemuanya, segenap
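As a quick consistency check, the per-value counts for PRON should add up to the 6408 PRON tokens reported above, and each percentage should be the count divided by that total. A small sketch, assuming the page rounds to the nearest whole percent (the variable names are illustrative only):

```python
# Per-value PRON counts as listed on this page.
pron_counts = {"Dem": 155, "Ind": 50, "Int": 217, "Prs": 2900, "Rel": 3043, "Tot": 43}

total = sum(pron_counts.values())

# Share of each value among PRON tokens with a non-empty PronType,
# rounded to whole percent (assumed rounding convention).
shares = {value: round(100 * n / total) for value, n in pron_counts.items()}

print(total)
print(shares)
```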
| Paradigm dia | Prs | Ind |
|---|---|---|
| nya | | |
| Person=3 | nya, ia, dia | |
PronType seems to be a lexical feature of PRON: 90% of lemmas (38) occur with only one value of PronType.
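The "lexical feature" heuristic can be read as: group the observed PronType values by lemma and count how many lemmas are seen with exactly one value. The sketch below illustrates that reading under stated assumptions; the function name `single_value_share` and the toy `(lemma, value)` pairs are hypothetical and are not the treebank's actual counts.

```python
from collections import defaultdict

def single_value_share(pairs):
    """Return (lemmas with exactly one value, all lemmas) for (lemma, value) pairs."""
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)

# Toy observations (hypothetical): 'dia' is seen with two values, the others with one.
toy_pairs = [
    ("dia", "Prs"),
    ("dia", "Ind"),
    ("yang", "Rel"),
    ("apa", "Int"),
    ("apa", "Int"),
]

single, total = single_value_share(toy_pairs)
print(f"{single} of {total} lemmas occur with only one value")
```

A high share suggests the value is a property of the lemma itself rather than something that varies with context.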
DET
3628 DET tokens (100% of all DET tokens) have a non-empty value of PronType.
The most frequent other feature values with which DET and PronType co-occurred: Number=EMPTY (3174; 87%), Definite=EMPTY (2708; 75%).
DET tokens may have the following values of PronType:
Art (931; 26% of non-empty PronType): sebuah, nya, seorang, suatu, sang, si, yang, seekor, The, Sauatu
Dem (1855; 51% of non-empty PronType): ini, itu, tersebut, tertentu, begitu, berikut, tadi, begini, demikian, tesebut
Emp (51; 1% of non-empty PronType): sendiri
Ind (529; 15% of non-empty PronType): beberapa, para, berbagai, banyak, sejumlah, sekelompok, kebanyakan, sebagian, semacam, serangkaian
Rel (4; 0% of non-empty PronType): yang
Tot (258; 7% of non-empty PronType): semua, setiap, seluruh, masing-masing, segala, per, tiap, berdua, keseluruhan, masing
| Paradigm yang | Art | Rel |
|---|---|---|
| _ | yang | yang |
| Definite=Def | yang | |
PronType seems to be a lexical feature of DET: 98% of lemmas (53) occur with only one value of PronType.
ADV
404 ADV tokens (12% of all ADV tokens) have a non-empty value of PronType.
ADV tokens may have the following values of PronType:
Dem (20; 5% of non-empty PronType): begitu
Ind (79; 20% of non-empty PronType): banyak
Int (287; 71% of non-empty PronType): apa, bagaimana, mengapa, kenapa, dimana, kapan, berapa, mana, Kemana
Rel (17; 4% of non-empty PronType): bagaimana, berapa, mengapa, kapan
Tot (1; 0% of non-empty PronType): segalanya
EMPTY (3096): juga, lebih, kemudian, hanya, masih, sangat, pernah, lagi, akhirnya, biasanya
| Paradigm bagaimana | Int | Rel |
|---|---|---|
| | bagaimana | bagaimana |
NUM
88 NUM tokens (2% of all NUM tokens) have a non-empty value of PronType.
The most frequent other feature values with which NUM and PronType co-occurred: NumType=Card (88; 100%).
NUM tokens may have the following values of PronType:
Tot (88; 100% of non-empty PronType): kedua, ketiga, keempat, Ke-400, ke-2, keenam, kelima, ketujuh
EMPTY (4180): satu, dua, 1, 2, 3, tiga, 5, 2010, 4, 2006
Relations with Agreement in PronType
The 10 most frequent relations where parent and child node agree in PronType:
PRON –[nmod:poss]–> PRON (34; 100%),
PRON –[conj]–> PRON (4; 100%),
ADV –[conj]–> ADV (2; 100%),
ADV –[parataxis]–> ADV (1; 100%),
PRON –[parataxis]–> PRON (1; 100%).
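One way to read the agreement table: for every dependency whose parent and child both carry a non-empty PronType, record the triple (parent UPOS, relation, child UPOS) and check whether the two values match. The sketch below works under that assumption, including the assumption that the arrow points from parent to child; the token tuples and the function name `agreement_counts` are hypothetical, and the toy sentence is not treebank data.

```python
from collections import Counter

# Toy sentence (hypothetical): (id, upos, head, deprel, PronType or None).
SENTENCE = [
    (1, "NOUN", 0, "root", None),
    (2, "PRON", 1, "nmod:poss", "Prs"),
    (3, "PRON", 2, "conj", "Prs"),
    (4, "PRON", 2, "conj", "Dem"),
]

def agreement_counts(sentence):
    """Count relations where both nodes have PronType, and those where the values agree."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _id, upos, head, deprel, value in sentence:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or value is None or parent[4] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if parent[4] == value:
            agree[key] += 1
    return agree, both

agree, both = agreement_counts(SENTENCE)
for key, n in agree.items():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n}; {100 * n // both[key]}%)")
```

Under this reading, the percentage in parentheses is the share of agreeing pairs among all pairs of that type in which both nodes have the feature.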