home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bengali-BRU: POS Tags: DET

There are 7 DET lemmas (6%), 6 DET types (4%) and 14 DET tokens (4%). Out of 14 observed tags, the rank of DET is: 5 in number of lemmas, 6 in number of types and 5 in number of tokens.

The 10 most frequent DET lemmas: কি, একটা, একটি, কোন, তাই, না, নেই

The 10 most frequent DET types: কি, একটা, কোন, তাই, নাই, নেই

The 10 most frequent ambiguous lemmas: কি (DET 7, PRON 5, PART 2), তাই (ADV 1, DET 1), না (PART 7, INTJ 3, DET 1)

The 10 most frequent ambiguous types: কি (DET 7, PRON 5, PART 2), তাই (ADV 1, DET 1), নাই (DET 1, PART 1)

Morphology

The form / lemma ratio of DET is 0.857143 (the average of all parts of speech is 1.290598).

The 1st highest number of forms (1) was observed with the lemma “একটা”: একটা.

The 2nd highest number of forms (1) was observed with the lemma “একটি”: একটা.

The 3rd highest number of forms (1) was observed with the lemma “কি”: কি.

DET occurs with 2 features: PronType (13; 93% instances), Definite (3; 21% instances)

DET occurs with 4 feature-value pairs: Definite=Ind, PronType=Art, PronType=Dem, PronType=Int

DET occurs with 4 feature combinations. The most frequent feature combination is PronType=Int (9 tokens). Examples: কি, কোন, নাই

Relations

DET nodes are attached to their parents using 3 different relations: det (12; 86% instances), compound (1; 7% instances), parataxis (1; 7% instances)

Parents of DET nodes belong to 4 different parts of speech: NOUN (10; 71% instances), VERB (2; 14% instances), ADJ (1; 7% instances), DET (1; 7% instances)

13 (93%) DET nodes are leaves.

0 (0%) DET nodes have one child.

1 (7%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 2 different relations: compound (1; 50% instances), punct (1; 50% instances)

Children of DET nodes belong to 2 different parts of speech: DET (1; 50% instances), PUNCT (1; 50% instances)