Treebank Statistics: UD_Sinhala-STB: POS Tags: DET
There are 13 DET
lemmas (3%), 12 DET
types (2%) and 23 DET
tokens (3%).
Out of 13 observed tags, the rank of DET
is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent DET
lemmas: මේ, එම, ඒ, සෑම, අදාළ, අනෙක්, එබඳු, ඕනෑ, කිසිඳු, තව
The 10 most frequent DET
types: එම, මේ, ඒ, තවත්, සෑම, අදාළ, අනෙකුත්, එබඳු, ඕනෑම, කිසිඳු
The 10 most frequent ambiguous lemmas: මේ (DET 5, ADV 1, PRON 1), ඒ (PRON 9, DET 3, ADV 1), එබඳු (ADJ 2, DET 1), සියලු (NOUN 3, DET 1)
The 10 most frequent ambiguous types: එම (DET 5, PRON 1), මේ (DET 5, PRON 1), ඒ (PRON 7, DET 2), එබඳු (ADJ 2, DET 1)
- එම
- මේ
- ඒ
- එබඳු
Morphology
The form / lemma ratio of DET
is 0.923077 (the average of all parts of speech is 1.145336).
The 1st highest number of forms (2) was observed with the lemma “ඒ”: එම, ඒ.
The 2nd highest number of forms (1) was observed with the lemma “අදාළ”: අදාළ.
The 3rd highest number of forms (1) was observed with the lemma “අනෙක්”: අනෙකුත්.
DET
does not occur with any features.
Relations
DET
nodes are attached to their parents using 2 different relations: det (22; 96% instances), dep (1; 4% instances)
Parents of DET
nodes belong to 1 different parts of speech: NOUN (23; 100% instances)
23 (100%) DET
nodes are leaves.
The highest child degree of a DET
node is 0.