home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-TSA: POS Tags: DET

There are 3 DET lemmas (1%), 5 DET types (1%) and 116 DET tokens (13%). Out of 14 observed tags, the rank of DET is: 12 in number of lemmas, 11 in number of types and 2 in number of tokens.

The 10 most frequent DET lemmas: i, një, e

The 10 most frequent DET types: të, e, i, një, së

The 10 most frequent ambiguous lemmas: i (DET 99, PRON 5, CCONJ 1), një (DET 15, NUM 4)

The 10 most frequent ambiguous types: (DET 47, PART 17), e (DET 32, PRON 3, CCONJ 1), i (DET 18, PRON 3), një (DET 14, NUM 4)

Morphology

The form / lemma ratio of DET is 1.666667 (the average of all parts of speech is 1.167464).

The 1st highest number of forms (4) was observed with the lemma “i”: e, i, së, të.

The 2nd highest number of forms (1) was observed with the lemma “e”: e.

The 3rd highest number of forms (1) was observed with the lemma “një”: një.

DET occurs with 2 features: Gender (115; 99% instances), Number (1; 1% instances)

DET occurs with 3 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur

DET occurs with 4 feature combinations. The most frequent feature combination is Gender=Fem (70 tokens). Examples: e, të, një, së

Relations

DET nodes are attached to their parents using 5 different relations: det (68; 59% instances), det:adj (34; 29% instances), det:pron (12; 10% instances), det:noun (1; 1% instances), root (1; 1% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (59; 51% instances), ADJ (37; 32% instances), PRON (14; 12% instances), PROPN (5; 4% instances), (1; 1% instances)

115 (99%) DET nodes are leaves.

0 (0%) DET nodes have one child.

0 (0%) DET nodes have two children.

1 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 5.

Children of DET nodes are attached using 5 different relations: cop (1; 20% instances), nmod (1; 20% instances), nsubj (1; 20% instances), obl (1; 20% instances), punct (1; 20% instances)

Children of DET nodes belong to 4 different parts of speech: PROPN (2; 40% instances), AUX (1; 20% instances), NOUN (1; 20% instances), PUNCT (1; 20% instances)