home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Uzbek-TueCL: POS Tags: DET

There are 5 DET lemmas (2%), 5 DET types (1%) and 9 DET tokens (1%). Out of 15 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: bu, hech, oʻsha, barcha, shunday

The 10 most frequent DET types: bu, hech, oʻsha, barcha, shunday

The 10 most frequent ambiguous lemmas: bu (PRON 4, DET 3), shunday (DET 1, PRON 1)

The 10 most frequent ambiguous types: bu (DET 2, PRON 1), shunday (DET 1, PRON 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.489437).

The 1st highest number of forms (1) was observed with the lemma “barcha”: barcha.

The 2nd highest number of forms (1) was observed with the lemma “bu”: bu.

The 3rd highest number of forms (1) was observed with the lemma “hech”: hech.

DET occurs with 1 features: PronType (9; 100% instances)

DET occurs with 3 feature-value pairs: PronType=Dem, PronType=Neg, PronType=Tot

DET occurs with 3 feature combinations. The most frequent feature combination is PronType=Dem (6 tokens). Examples: bu, oʻsha, shunday

Relations

DET nodes are attached to their parents using 3 different relations: det (6; 67% instances), compound (2; 22% instances), amod (1; 11% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (7; 78% instances), PRON (2; 22% instances)

9 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.