home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kazakh-KTB: POS Tags: DET

There are 35 DET lemmas (1%), 38 DET types (1%) and 220 DET tokens (2%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: осы, бұл, бір, өз, сол, ол, барлық, әр, мына, не

The 10 most frequent DET types: осы, бұл, бір, өз, ол, сол, барлық, әр, мына, не

The 10 most frequent ambiguous lemmas: осы (DET 39, PRON 14), бұл (PRON 53, DET 35), бір (NUM 44, DET 20, PRON 1, X 1), өз (PRON 31, DET 13), сол (PRON 20, DET 11), ол (PRON 88, DET 10), барлық (DET 9, ADJ 7, PRON 2), әр (DET 9, X 1), мына (DET 8, PRON 1), не (PRON 21, DET 7, CCONJ 4)

The 10 most frequent ambiguous types: осы (DET 22, PRON 2), бұл (DET 15, PRON 6), бір (NUM 21, DET 16, X 1), ол (PRON 13, DET 6), сол (DET 6, PRON 1), барлық (DET 6, ADJ 4), әр (DET 2, X 1), не (PRON 12, DET 7, CCONJ 3), ешқандай (DET 4, PRON 1), көп (ADJ 5, DET 4, ADV 1)

Morphology

The form / lemma ratio of DET is 1.085714 (the average of all parts of speech is 1.747153).

The 1st highest number of forms (3) was observed with the lemma “бұл”: Мұнша, бұ, бұл.

The 2nd highest number of forms (2) was observed with the lemma “сол”: сол, сонша.

The 3rd highest number of forms (1) was observed with the lemma “ана”: ана.

DET occurs with 3 features: PronType (220; 100% instances), Reflex (13; 6% instances), Case (3; 1% instances)

DET occurs with 8 feature-value pairs: Case=Nom, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Tot, Reflex=Yes

DET occurs with 7 feature combinations. The most frequent feature combination is PronType=Dem (112 tokens). Examples: осы, бұл, ол, сол, мына, бұ, мұндай, манағы, Анау, Мұнша

Relations

DET nodes are attached to their parents using 3 different relations: det (218; 99% instances), cc (1; 0% instances), nsubj (1; 0% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (212; 96% instances), PROPN (3; 1% instances), ADJ (2; 1% instances), VERB (2; 1% instances), DET (1; 0% instances)

215 (98%) DET nodes are leaves.

5 (2%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 5 different relations: appos (1; 20% instances), dep (1; 20% instances), det (1; 20% instances), fixed (1; 20% instances), punct (1; 20% instances)

Children of DET nodes belong to 5 different parts of speech: DET (1; 20% instances), PRON (1; 20% instances), PUNCT (1; 20% instances), SCONJ (1; 20% instances), X (1; 20% instances)