home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: DET

There are 8 DET lemmas (2%), 10 DET types (1%) and 32 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: ia, nowu, awy, _, cowu, awu, cewu, cewy

The 10 most frequent DET types: ia, nowu, awy, awu, cowu, Iage, ce, cewy, no, nowy

The 10 most frequent ambiguous lemmas: ia (DET 9, NOUN 2, NUM 2), _ (VERB 46, NOUN 41, ADV 24, PRON 18, PROPN 17, X 16, ADP 14, PUNCT 13, PART 3, DET 2)

The 10 most frequent ambiguous types: ia (DET 7, NUM 2, NOUN 1)

Morphology

The form / lemma ratio of DET is 1.250000 (the average of all parts of speech is 1.661638).

The 1st highest number of forms (2) was observed with the lemma “_”: awu, nowy.

The 2nd highest number of forms (2) was observed with the lemma “ia”: Iage, ia.

The 3rd highest number of forms (2) was observed with the lemma “nowu”: no, nowu.

DET occurs with 2 features: Definite (6; 19% instances), Number (1; 3% instances)

DET occurs with 2 feature-value pairs: Definite=Ind, Number=Plur

DET occurs with 3 feature combinations. The most frequent feature combination is _ (26 tokens). Examples: nowu, awy, ia, awu, cowu, ce, cewy, no, nowy

Relations

DET nodes are attached to their parents using 3 different relations: det (26; 81% instances), nmod (4; 13% instances), nsubj (2; 6% instances)

Parents of DET nodes belong to 3 different parts of speech: NOUN (30; 94% instances), ADV (1; 3% instances), VERB (1; 3% instances)

32 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.