This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home id/pos issue tracker

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Indonesian)

There are 1 DET lemmas (6%), 112 DET types (0%) and 3963 DET tokens (3%). Out of 16 observed tags, the rank of DET is: 6 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: ini, itu, sebuah, tersebut, satu, seorang, salah, beberapa, para, berbagai

The 10 most frequent ambiguous lemmas: _ (NOUN 27313, PROPN 22844, PUNCT 18228, VERB 13257, ADP 12019, ADV 4760, ADJ 4574, PRON 4397, NUM 4386, DET 3963, CONJ 3659, SCONJ 1475, PART 590, SYM 418, X 39, AUX 1)

The 10 most frequent ambiguous types: ini (DET 1136, NOUN 3), itu (DET 410, NOUN 5, CONJ 4, PRON 1, SCONJ 1), satu (DET 209, NUM 92, NOUN 20, ADJ 1), seorang (DET 191, NOUN 6, PRON 1), salah (DET 165, NOUN 23, ADJ 2, VERB 1, ADV 1), beberapa (DET 156, ADJ 1, NOUN 1), berbagai (DET 92, ADJ 1), semua (DET 72, NOUN 4, ADV 1), masing (DET 49, ADV 8, ADJ 2, NOUN 2), sebagian (DET 39, NOUN 12, ADV 3)

Morphology

The form / lemma ratio of DET is 112.000000 (the average of all parts of speech is 1437.312500).

The 1st highest number of forms (112) was observed with the lemma “_”: 2, Bagi, Begitu, Dr, Gepenglah, Keduanya, Oh, Pituruh, Sauatu, Semangkuk, Tangguh, Tetap, Tujuh, a, al, an, aneka, apapun, baik, banyak, banyaknya, beberapa, beginilah, beragam, berapa, berbagai, berberapa, berdua, berikut, buah, buruh, demikian, dibeberapa, how, in, ini, inilah, itu, itua, itulah, itupun, ke, kebanyakan, kedua, keempat, kelima, keseluruhan, ketiga, khususnya, la, lain, lainnya, macam, maka, manakah, manapun, masing, mayoritas, nya, orang, para, pembantaian, per, pula, pun, ratusan, ribuan, salah, sama, sang, satu, satu-satunya, satunya, se, seantero, sebagaian, sebagian, sebua, sebuah, sedikit, sedikitnya, seekor, segala, segenap, sejumlah, sekalipun, sekelompok, sekeping, sekitar, sekumpulan, seluruh, semacam, semua, semuanya, sendiri, seorang, sepasang, sepucuk, serangkaian, sesuatu, setiap, si, stu, suatu, tersebut, tertentu, tesebut, the, tiap, tsb, uap, yaitu.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 14 different relations: det (3627; 92% instances), mwe (211; 5% instances), nummod (47; 1% instances), nsubj (39; 1% instances), advmod (14; 0% instances), dobj (12; 0% instances), nsubjpass (4; 0% instances), acl (2; 0% instances), compound (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), conj (1; 0% instances), dep (1; 0% instances), iobj (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (3177; 80% instances), PROPN (304; 8% instances), DET (216; 5% instances), VERB (161; 4% instances), NUM (21; 1% instances), ADJ (20; 1% instances), PRON (20; 1% instances), SCONJ (17; 0% instances), ADV (13; 0% instances), CONJ (11; 0% instances), ADP (3; 0% instances)

3700 (93%) DET nodes are leaves.

212 (5%) DET nodes have one child.

50 (1%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 18 different relations: mwe (198; 63% instances), punct (44; 14% instances), det (17; 5% instances), amod (14; 4% instances), advmod (13; 4% instances), nummod (7; 2% instances), nmod (5; 2% instances), acl (4; 1% instances), case (2; 1% instances), compound (2; 1% instances), nsubj (2; 1% instances), cc (1; 0% instances), ccomp (1; 0% instances), dobj (1; 0% instances), mark (1; 0% instances), name (1; 0% instances), neg (1; 0% instances), nsubjpass (1; 0% instances)

Children of DET nodes belong to 13 different parts of speech: DET (216; 69% instances), PUNCT (44; 14% instances), ADJ (14; 4% instances), ADV (14; 4% instances), NOUN (8; 3% instances), NUM (5; 2% instances), VERB (4; 1% instances), PRON (3; 1% instances), ADP (2; 1% instances), PROPN (2; 1% instances), CONJ (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]