Treebank Statistics: UD_Indonesian-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
5385 tokens (28%) have a non-empty value of Number.
1981 types (42%) occur at least once with a non-empty value of Number.
1533 lemmas (42%) occur at least once with a non-empty value of Number.
The feature is used with 3 part-of-speech tags: NOUN (4656; 24% instances), PRON (620; 3% instances), DET (109; 1% instances).
NOUN
4656 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Plur(143; 3% of non-emptyNumber): orang-orang, negara-negara, anak-anak, batas-batas, 1970-an, bagian-bagian, batu-batu, bertahun-tahun, bulan-bulan, kota-kotaSing(4513; 97% of non-emptyNumber): tahun, orang, bulan, bagian, hari, negara, kota, laut, hal, perangEMPTY(32): SM, mercu, AIDS, ATM, BC, DFB, GIF, HFC, Kontituensi, MLA
| Paradigm tahun | Sing | Plur |
|---|---|---|
| tahun, bertahun-tahun, tahunan | bertahun-tahun, tahun-tahun |
Number seems to be lexical feature of NOUN. 96% lemmas (1454) occur only with one value of Number.
PRON
620 PRON tokens (47% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (620; 100%), Person=3 (501; 81%).
PRON tokens may have the following values of Number:
Plur(139; 22% of non-emptyNumber): mereka, kami, kita, kalianSing(481; 78% of non-emptyNumber): nya, ia, saya, dia, Anda, Aku, ku, kamuEMPTY(710): yang, itu, ini, mana, apa, diri, sana, siapa, seseorang, begitu
Number seems to be lexical feature of PRON. 100% lemmas (12) occur only with one value of Number.
DET
109 DET tokens (15% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (109; 100%), PronType=Ind (109; 100%).
DET tokens may have the following values of Number:
Plur(109; 100% of non-emptyNumber): para, banyak, beberapa, berbagai, serangkaian, sepasangEMPTY(630): ini, itu, nya, sebuah, tersebut, seorang, semua, sendiri, seluruh, setiap
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[nmod]–> NOUN (1283; 94%),
NOUN –[conj]–> NOUN (219; 95%),
NOUN –[nmod:poss]–> PRON (198; 77%),
NOUN –[nmod:tmod]–> NOUN (77; 96%),
NOUN –[nmod:lmod]–> NOUN (65; 97%),
NOUN –[nsubj]–> NOUN (50; 96%),
NOUN –[compound]–> NOUN (24; 100%),
NOUN –[nmod:poss]–> NOUN (18; 100%),
PRON –[nmod:lmod]–> NOUN (10; 77%),
NOUN –[acl:relcl]–> NOUN (9; 100%).