Treebank Statistics: UD_Sinhala-STB: POS Tags: DET
There are 13 DET lemmas (3%), 12 DET types (2%) and 23 DET tokens (3%).
Out of 13 observed tags, the rank of DET is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent DET lemmas: මේ, එම, ඒ, සෑම, අදාළ, අනෙක්, එබඳු, ඕනෑ, කිසිඳු, තව
The 10 most frequent DET types: එම, මේ, ඒ, තවත්, සෑම, අදාළ, අනෙකුත්, එබඳු, ඕනෑම, කිසිඳු
The 10 most frequent ambiguous lemmas: මේ (DET 5, ADV 1, PRON 1), ඒ (PRON 9, DET 3, ADV 1), එබඳු (ADJ 2, DET 1), සියලු (NOUN 3, DET 1)
The 10 most frequent ambiguous types: එම (DET 5, PRON 1), මේ (DET 5, PRON 1), ඒ (PRON 7, DET 2), එබඳු (ADJ 2, DET 1)
- එම
- මේ
- ඒ
- එබඳු
Morphology
The form / lemma ratio of DET is 0.923077 (the average of all parts of speech is 1.145336).
The 1st highest number of forms (2) was observed with the lemma “ඒ”: එම, ඒ.
The 2nd highest number of forms (1) was observed with the lemma “අදාළ”: අදාළ.
The 3rd highest number of forms (1) was observed with the lemma “අනෙක්”: අනෙකුත්.
DET does not occur with any features.
Relations
DET nodes are attached to their parents using 2 different relations: det (22; 96% instances), dep (1; 4% instances)
Parents of DET nodes belong to 1 different parts of speech: NOUN (23; 100% instances)
23 (100%) DET nodes are leaves.
The highest child degree of a DET node is 0.