home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: DET

There are 4 DET lemmas (1%), 4 DET types (1%) and 7 DET tokens (1%). Out of 14 observed tags, the rank of DET is: 9 in number of lemmas, 9 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: මේ, කිසිම, කෙනෙක්, ටික

The 10 most frequent DET types: මේ, කිසිම, කෙනෙක්, ටික

The 10 most frequent ambiguous lemmas: මේ (PRON 5, DET 4), කෙනෙක් (DET 1, NOUN 1)

The 10 most frequent ambiguous types: මේ (DET 4, PRON 4), කෙනෙක් (NOUN 2, DET 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.100000).

The 1st highest number of forms (1) was observed with the lemma “කිසිම”: කිසිම.

The 2nd highest number of forms (1) was observed with the lemma “කෙනෙක්”: කෙනෙක්.

The 3rd highest number of forms (1) was observed with the lemma “ටික”: ටික.

DET occurs with 1 features: PronType (7; 100% instances)

DET occurs with 2 feature-value pairs: PronType=Dem, PronType=Ind

DET occurs with 2 feature combinations. The most frequent feature combination is PronType=Dem (4 tokens). Examples: මේ

Relations

DET nodes are attached to their parents using 2 different relations: det (6; 86% instances), nsubj (1; 14% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (6; 86% instances), VERB (1; 14% instances)

6 (86%) DET nodes are leaves.

0 (0%) DET nodes have one child.

1 (14%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 2 different relations: acl (1; 50% instances), compound (1; 50% instances)

Children of DET nodes belong to 2 different parts of speech: NOUN (1; 50% instances), SCONJ (1; 50% instances)