Treebank Statistics: UD_Indonesian-GSD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
24553 tokens (20%) have a non-empty value of Number.
3886 types (20%) occur at least once with a non-empty value of Number.
2552 lemmas (16%) occur at least once with a non-empty value of Number.
The feature is used with 3 part-of-speech tags: NOUN (21243; 17% instances), PRON (2856; 2% instances), DET (454; 0% instances).
NOUN
21243 NOUN tokens (80% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Plur(692; 3% of non-emptyNumber): orang-orang, anak-anak, negara-negara, undang-undang, lagu-lagu, kata-kata, kitab-kitab, kota-kota, raja-raja, kapal-kapalSing(20551; 97% of non-emptyNumber): tahun, orang, desa, nama, kota, bagian, bahasa, wilayah, saat, filmEMPTY(5193): tanggal, sepak, luas, band, atas, pusat, gelar, km, serial, sekarang
| Paradigm tahun | Sing | Plur |
|---|---|---|
| tahun, tahunan | tahun-tahun |
PRON
2856 PRON tokens (45% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (2815; 99%), Person=3 (2469; 86%).
PRON tokens may have the following values of Number:
Plur(466; 16% of non-emptyNumber): mereka, kita, kami, kalian, apa-apa, beberapaSing(2390; 84% of non-emptyNumber): nya, ia, dia, ku, kamu, aku, mu, engkau, seseorang, beliauEMPTY(3552): yang, apa, diri, siapa, mana, itu, demikian, semua, ini, sini
Number seems to be lexical feature of PRON. 100% lemmas (21) occur only with one value of Number.
DET
454 DET tokens (13% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (454; 100%), PronType=Ind (454; 100%).
DET tokens may have the following values of Number:
Plur(453; 100% of non-emptyNumber): beberapa, para, berbagai, banyak, sejumlah, kebanyakan, serangkaian, aneka, beragam, sekelompokSing(1; 0% of non-emptyNumber): sesuatuEMPTY(3174): ini, itu, sebuah, tersebut, nya, seorang, suatu, semua, setiap, seluruh
Number seems to be lexical feature of DET. 100% lemmas (13) occur only with one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[compound]–> NOUN (4094; 66%),
NOUN –[nmod]–> NOUN (1559; 66%),
NOUN –[conj]–> NOUN (966; 69%),
NOUN –[nmod:poss]–> PRON (964; 71%),
NOUN –[nsubj]–> NOUN (123; 64%),
NOUN –[amod]–> NOUN (76; 61%),
NOUN –[nmod:tmod]–> NOUN (51; 74%),
NOUN –[acl]–> NOUN (32; 73%),
NOUN –[clf]–> NOUN (11; 100%),
NOUN –[advcl]–> NOUN (8; 53%).