home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-STB: POS Tags: DET

There are 13 DET lemmas (3%), 12 DET types (2%) and 23 DET tokens (3%). Out of 13 observed tags, the rank of DET is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: මේ, එම, ඒ, සෑම, අදාළ, අනෙක්, එබඳු, ඕනෑ, කිසිඳු, තව

The 10 most frequent DET types: එම, මේ, ඒ, තවත්, සෑම, අදාළ, අනෙකුත්, එබඳු, ඕනෑම, කිසිඳු

The 10 most frequent ambiguous lemmas: මේ (DET 5, ADV 1, PRON 1), (PRON 9, DET 3, ADV 1), එබඳු (ADJ 2, DET 1), සියලු (NOUN 3, DET 1)

The 10 most frequent ambiguous types: එම (DET 5, PRON 1), මේ (DET 5, PRON 1), (PRON 7, DET 2), එබඳු (ADJ 2, DET 1)

Morphology

The form / lemma ratio of DET is 0.923077 (the average of all parts of speech is 1.145336).

The 1st highest number of forms (2) was observed with the lemma “ඒ”: එම, ඒ.

The 2nd highest number of forms (1) was observed with the lemma “අදාළ”: අදාළ.

The 3rd highest number of forms (1) was observed with the lemma “අනෙක්”: අනෙකුත්.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 2 different relations: det (22; 96% instances), dep (1; 4% instances)

Parents of DET nodes belong to 1 different parts of speech: NOUN (23; 100% instances)

23 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.