
This page pertains to UD version 2.

Treebank Statistics: UD_Esperanto-Cairo: POS Tags: DET

There are 2 DET lemmas (2%), 3 DET types (3%) and 10 DET tokens (6%). Out of 14 observed tags, the rank of DET is: 11 in number of lemmas, 11 in number of types and 7 in number of tokens.

The 10 most frequent DET lemmas: la, tiu

The 10 most frequent DET types: la, tiu, tiun

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of DET is 1.500000 (the average of all parts of speech is 1.085714).
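The ratio above is consistent with the counts reported earlier on this page: 3 DET types divided by 2 DET lemmas gives 1.5. As a rough illustration, a minimal sketch of how such a ratio could be computed from CoNLL-U text follows. The function name `form_lemma_ratio` and the `SAMPLE` sentences are hypothetical and are not taken from UD_Esperanto-Cairo. The sketch assumes that forms are compared case-sensitively, which may differ from how the statistics on this page were generated.

```python
# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Esperanto-Cairo).
SAMPLE = """\
1\tla\tla\tDET\t_\t_\t2\tdet\t_\t_
2\thundo\thundo\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tvidas\tvidi\tVERB\t_\t_\t0\troot\t_\t_
4\ttiun\ttiu\tDET\t_\t_\t5\tdet\t_\t_
5\tkaton\tkato\tNOUN\t_\t_\t3\tobj\t_\t_

1\ttiu\ttiu\tDET\t_\t_\t2\tdet\t_\t_
2\tkato\tkato\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tdormas\tdormi\tVERB\t_\t_\t0\troot\t_\t_
"""


def form_lemma_ratio(conllu_text, upos):
    """Return (distinct forms) / (distinct lemmas) for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword tokens and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column (case-sensitive: an assumption)
            lemmas.add(cols[2])  # LEMMA column
    return len(forms) / len(lemmas) if lemmas else 0.0


print(form_lemma_ratio(SAMPLE, "DET"))
```

In the toy data, DET has the forms la, tiu and tiun under the lemmas la and tiu, so the sketch reproduces the same 3 / 2 shape as the figure reported above.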

The highest number of forms (2) was observed with the lemma “tiu”: tiu, tiun.

The 2nd highest number of forms (1) was observed with the lemma “la”: la.

DET occurs with 3 features: Case (2; 20% instances), Number (2; 20% instances), PronType (1; 10% instances)

DET occurs with 4 feature-value pairs: Case=Acc, Case=Nom, Number=Sing, PronType=Dem

DET occurs with 3 feature combinations. The most frequent feature combination is _ (8 tokens). Examples: la
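The three counts in this section (features, feature-value pairs, and full feature combinations) can all be read off the FEATS column of a CoNLL-U file. The following is a minimal sketch under stated assumptions: the function name `feature_stats` and the `SAMPLE` sentences are hypothetical and are not taken from UD_Esperanto-Cairo, and an empty FEATS value is assumed to be written as `_`, as in the combination listed above.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Esperanto-Cairo).
SAMPLE = """\
1\tla\tla\tDET\t_\t_\t2\tdet\t_\t_
2\thundo\thundo\tNOUN\t_\tCase=Nom|Number=Sing\t3\tnsubj\t_\t_
3\tvidas\tvidi\tVERB\t_\t_\t0\troot\t_\t_
4\ttiun\ttiu\tDET\t_\tCase=Acc|Number=Sing|PronType=Dem\t5\tdet\t_\t_
5\tkaton\tkato\tNOUN\t_\tCase=Acc|Number=Sing\t3\tobj\t_\t_

1\tla\tla\tDET\t_\t_\t2\tdet\t_\t_
2\tkato\tkato\tNOUN\t_\tCase=Nom|Number=Sing\t0\troot\t_\t_
"""


def feature_stats(conllu_text, upos):
    """Count features, feature-value pairs and whole FEATS strings for one tag."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword tokens and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = cols[5]
        combos[feats] += 1          # "_" stands for "no features"
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1     # e.g. "Case"
            pairs[pair] += 1        # e.g. "Case=Acc"
    return features, pairs, combos


features, pairs, combos = feature_stats(SAMPLE, "DET")
print(len(features), "features;", len(pairs), "pairs;", len(combos), "combinations")
print("most frequent combination:", combos.most_common(1)[0])
```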

Relations

DET nodes are attached to their parents using 1 relation: det (10; 100% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (9; 90% instances), ADJ (1; 10% instances)

9 (90%) DET nodes are leaves.

1 (10%) DET node has one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 1 relation: advmod (1; 100% instances)

Children of DET nodes belong to 1 part of speech: PART (1; 100% instances)
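
The relation statistics in this section (incoming relations, parent parts of speech, leaves and child degree) can be derived from the HEAD and DEPREL columns of each sentence. The following is a minimal sketch, not the tool that produced this page: the function name `relation_stats` and the `SAMPLE` sentences are hypothetical and are not taken from UD_Esperanto-Cairo, and sentences are assumed to be separated by a single blank line.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Esperanto-Cairo).
SAMPLE = """\
1\tne\tne\tPART\t_\t_\t2\tadvmod\t_\t_
2\ttiu\ttiu\tDET\t_\t_\t3\tdet\t_\t_
3\thundo\thundo\tNOUN\t_\t_\t4\tnsubj\t_\t_
4\tbojas\tboji\tVERB\t_\t_\t0\troot\t_\t_

1\tla\tla\tDET\t_\t_\t2\tdet\t_\t_
2\tkato\tkato\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tdormas\tdormi\tVERB\t_\t_\t0\troot\t_\t_
"""


def relation_stats(conllu_text, upos):
    """Count incoming relations, parent tags and child degrees for one UPOS tag."""
    deprels, parent_pos, child_degree = Counter(), Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        tokens = {}  # token id -> (UPOS, head id, DEPREL)
        for line in block.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword tokens, empty nodes and missing heads.
            if len(cols) != 10 or not cols[0].isdigit() or not cols[6].isdigit():
                continue
            tokens[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
        # Number of children per head id within this sentence.
        n_children = Counter(head for _tag, head, _rel in tokens.values())
        for tid, (tag, head, deprel) in tokens.items():
            if tag != upos:
                continue
            deprels[deprel] += 1
            parent_pos[tokens[head][0] if head in tokens else "ROOT"] += 1
            child_degree[n_children[tid]] += 1  # degree 0 means a leaf
    return deprels, parent_pos, child_degree


deprels, parent_pos, child_degree = relation_stats(SAMPLE, "DET")
print("relations:", dict(deprels))
print("parent tags:", dict(parent_pos))
print("leaves:", child_degree[0], "highest child degree:", max(child_degree))
```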