
This page pertains to UD version 2.

Treebank Statistics: UD_Gheg-GPS: POS Tags: DET

There are 6 DET lemmas (1%), 35 DET types (1%) and 750 DET tokens (5%). Out of 15 observed tags, the rank of DET is: 13 in number of lemmas, 12 in number of types and 9 in number of tokens.
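As a rough illustration of how the three counts above relate, the following sketch counts distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. It assumes the standard 10-column CoNLL-U layout and case-sensitive comparison of forms; the function name `pos_counts` and the toy rows (with placeholder nouns `w1` to `w3`) are hypothetical and are not taken from the treebank, and this is not necessarily how the statistics on this page were produced.

```python
def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:          # column 4 is UPOS
            tokens += 1
            types.add(cols[1])       # column 2 is FORM
            lemmas.add(cols[2])      # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Toy input, constructed for illustration only (not real treebank sentences).
rows = [
    ["1", "ni", "një", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "w1", "w1", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "një", "një", "DET", "_", "_", "4", "det", "_", "_"],
    ["4", "w2", "w2", "NOUN", "_", "_", "2", "conj", "_", "_"],
    ["5", "ni", "një", "DET", "_", "_", "6", "det", "_", "_"],
    ["6", "w3", "w3", "NOUN", "_", "_", "2", "conj", "_", "_"],
]
sample = "\n".join("\t".join(r) for r in rows) + "\n"

n_lemmas, n_types, n_tokens = pos_counts(sample, "DET")
print(n_lemmas, n_types, n_tokens)
```

In the toy input, the two forms ni and një share one lemma, so the tag has one lemma, two types and three tokens.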

The 10 most frequent DET lemmas: një, e, të, i, së, a

The 10 most frequent DET types: ni, e, një, i, t, ni:, të, nji, nje, një:

The 10 most frequent ambiguous lemmas: një (DET 480, NUM 7, PRON 3), e (PRON 422, CCONJ 135, DET 99, INTJ 1), të (DET 90, PART 71, ADP 5, PRON 3), i (PRON 548, DET 69), a (PRON 199, CCONJ 33, PART 10, ADV 1, DET 1, INTJ 1, SCONJ 1)

The 10 most frequent ambiguous types: ni (DET 318, NUM 28, VERB 1), e (PRON 417, CCONJ 129, DET 92, INTJ 2), një (DET 75, NUM 7), i (PRON 530, DET 61), t (DET 46, PART 45, PRON 1), ni: (DET 33, NUM 2, VERB 1), të (DET 32, PART 13, ADP 4, PRON 1), nji (DET 22, NUM 4), nje (DET 10, NUM 3), s (PART 61, DET 5)

Morphology

The form / lemma ratio of DET is 5.833333 (the average of all parts of speech is 2.539450).
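The ratio above is consistent with dividing the number of DET types by the number of DET lemmas reported earlier on this page. A minimal check, assuming that this is how the ratio is defined:

```python
# Counts reported on this page for DET in UD_Gheg-GPS.
det_types = 35   # distinct word forms tagged DET
det_lemmas = 6   # distinct lemmas tagged DET

# Assumption: form / lemma ratio = distinct forms divided by distinct lemmas.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 5.833333
```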

The 1st highest number of forms (15) was observed with the lemma “një”: n, n/ni:, n:i, ni, ni:, nj, nj/, nja, nje, nji, nji:, njo, një, një/, një:.

The 2nd highest number of forms (7) was observed with the lemma “të”: t, te, tu, të, të/, të:, të:/.

The 3rd highest number of forms (5) was observed with the lemma “e”: e, e/, e:, ë, ë:h:h:.

DET occurs with 5 features: Number (227; 30% instances), Case (223; 30% instances), Gender (97; 13% instances), Definite (65; 9% instances), Foreign (1; 0% instances)

DET occurs with 11 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Ind, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

DET occurs with 27 feature combinations. The most frequent feature combination is _ (457 tokens). Examples: ni, ni:, nji, nje, një, të, t, një:, s, i

Relations

DET nodes are attached to their parents using 3 different relations: det (725; 97% instances), reparandum (24; 3% instances), fixed (1; 0% instances)

Parents of DET nodes belong to 10 different parts of speech: NOUN (499; 67% instances), ADJ (107; 14% instances), PRON (98; 13% instances), NUM (16; 2% instances), ADV (9; 1% instances), VERB (8; 1% instances), DET (7; 1% instances), PART (3; 0% instances), INTJ (2; 0% instances), ADP (1; 0% instances)
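The percentages in the parent breakdown can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of DET tokens, rounded half up to a whole percent; the helper name `percent` is hypothetical.

```python
# Parent part-of-speech counts for DET nodes, as listed above.
parent_counts = {
    "NOUN": 499, "ADJ": 107, "PRON": 98, "NUM": 16, "ADV": 9,
    "VERB": 8, "DET": 7, "PART": 3, "INTJ": 2, "ADP": 1,
}

total = sum(parent_counts.values())  # should equal the 750 DET tokens


def percent(count, total):
    """Whole-number percentage, rounded half up (an assumed convention)."""
    return int(100 * count / total + 0.5)


shares = {pos: percent(n, total) for pos, n in parent_counts.items()}
for pos, share in shares.items():
    print(f"{pos}: {parent_counts[pos]} ({share}%)")
```

Under this rounding, counts of 3 or fewer out of 750 come out as 0%, which matches the "0% instances" entries above.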

739 (99%) DET nodes are leaves.

8 (1%) DET nodes have one child.

2 (0%) DET nodes have two children.

1 (0%) DET node has three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 7 different relations: reparandum (8; 53% instances), advmod (2; 13% instances), aux:part (1; 7% instances), case (1; 7% instances), conj (1; 7% instances), mark (1; 7% instances), nsubj (1; 7% instances)

Children of DET nodes belong to 8 different parts of speech: DET (7; 47% instances), PART (2; 13% instances), ADP (1; 7% instances), AUX (1; 7% instances), NUM (1; 7% instances), PRON (1; 7% instances), SCONJ (1; 7% instances), VERB (1; 7% instances)
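The child statistics in this section can be cross-checked against each other. The sketch below assumes that the single node with "three or more" children has exactly three, which is what the reported highest child degree of 3 implies.

```python
# Number of DET nodes by child degree, from the figures above.
# Assumption: the one node with "three or more" children has exactly 3.
degree_counts = {0: 739, 1: 8, 2: 2, 3: 1}

nodes = sum(degree_counts.values())
children = sum(degree * n for degree, n in degree_counts.items())

# Child counts by relation and by part of speech, as listed above.
child_relations = {
    "reparandum": 8, "advmod": 2, "aux:part": 1, "case": 1,
    "conj": 1, "mark": 1, "nsubj": 1,
}
child_pos = {
    "DET": 7, "PART": 2, "ADP": 1, "AUX": 1,
    "NUM": 1, "PRON": 1, "SCONJ": 1, "VERB": 1,
}

print(nodes, children)
print(sum(child_relations.values()), sum(child_pos.values()))
```

All three ways of counting give 15 children in total, so for example the 8 reparandum children account for 8 / 15, or about 53%.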