Treebank Statistics: UD_Indonesian-GSD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
24568 tokens (20%) have a non-empty value of Number
.
3888 types (20%) occur at least once with a non-empty value of Number
.
2553 lemmas (16%) occur at least once with a non-empty value of Number
.
The feature is used with 3 part-of-speech tags: NOUN (21254; 17% instances), PRON (2860; 2% instances), DET (454; 0% instances).
NOUN
21254 NOUN tokens (80% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Plur
(692; 3% of non-emptyNumber
): orang-orang, anak-anak, negara-negara, undang-undang, lagu-lagu, kata-kata, kitab-kitab, kota-kota, raja-raja, kapal-kapalSing
(20562; 97% of non-emptyNumber
): tahun, orang, desa, nama, kota, bagian, bahasa, wilayah, saat, filmEMPTY
(5226): tanggal, sepak, luas, band, atas, pusat, gelar, km, serial, sekarang
Paradigm tahun | Sing | Plur |
---|---|---|
tahun, tahunan | tahun-tahun |
PRON
2860 PRON tokens (45% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=Prs (2819; 99%), Person=3 (2473; 86%).
PRON
tokens may have the following values of Number
:
Plur
(466; 16% of non-emptyNumber
): mereka, kita, kami, kalian, apa-apa, beberapaSing
(2394; 84% of non-emptyNumber
): nya, ia, dia, ku, kamu, aku, mu, engkau, seseorang, beliauEMPTY
(3560): yang, apa, diri, siapa, mana, itu, demikian, semua, ini, sini
Number
seems to be lexical feature of PRON
. 100% lemmas (21) occur only with one value of Number
.
DET
454 DET tokens (13% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=EMPTY (454; 100%), PronType=Ind (454; 100%).
DET
tokens may have the following values of Number
:
Plur
(453; 100% of non-emptyNumber
): beberapa, para, berbagai, banyak, sejumlah, kebanyakan, serangkaian, aneka, beragam, sekelompokSing
(1; 0% of non-emptyNumber
): sesuatuEMPTY
(3164): ini, itu, sebuah, tersebut, nya, seorang, suatu, semua, setiap, seluruh
Number
seems to be lexical feature of DET
. 100% lemmas (13) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[compound]–> NOUN (4093; 66%),
NOUN –[nmod]–> NOUN (1553; 66%),
NOUN –[conj]–> NOUN (964; 69%),
NOUN –[nmod:poss]–> PRON (964; 71%),
NOUN –[nsubj]–> NOUN (123; 64%),
NOUN –[amod]–> NOUN (79; 62%),
NOUN –[nmod:tmod]–> NOUN (50; 74%),
NOUN –[acl]–> NOUN (31; 72%),
NOUN –[clf]–> NOUN (11; 100%),
NOUN –[advcl]–> NOUN (8; 53%).