home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-Autogramm: POS Tags: DET

There are 1 DET lemmas (6%), 37 DET types (2%) and 1737 DET tokens (15%). Out of 16 observed tags, the rank of DET is: 6 in number of lemmas, 10 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: =t, i=, oː=, w=, ti=, uː=, oːn, t=, uːn, j=

The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)

The 10 most frequent ambiguous types: =t (DET 383, CCONJ 117, SCONJ 9), i= (DET 246, PRON 8), w= (DET 118, PRON 6), ti= (DET 115, PRON 15, SCONJ 3), oːn (DET 84, PRON 1), t= (DET 73, PRON 4), =b (DET 56, SCONJ 6), eːn (VERB 228, DET 30), beːn (DET 18, ADV 7, PRON 1), beːt (DET 5, ADP 1)

Morphology

The form / lemma ratio of DET is 37.000000 (the average of all parts of speech is 126.812500).

The 1st highest number of forms (37) was observed with the lemma “_”: -a, -aː, =b, =eː, =t, aː=, aːn, aːt, baliːnaːj, beːn, beːt, eː=, eːn, eːt, i=, j=, mhasi, oː=, oːn, oːnaːj, oːt, t=, taː=, taːt, teː=, teːn, ti=, toː=, toːn, toːt, tuː=, tuːt, u=, uː=, uːn, uːt, w=.

DET occurs with 7 features: Gender (1733; 100% instances), Definite (1035; 60% instances), Number (805; 46% instances), Case (734; 42% instances), PronType (283; 16% instances), Deixis (279; 16% instances), Degree (3; 0% instances)

DET occurs with 13 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Dim, Deixis=Prox, Deixis=Remt, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PronType=Dem

DET occurs with 43 feature combinations. The most frequent feature combination is Gender=Fem (360 tokens). Examples: =t, ti=, t=, toː=, tuː=, oːt, eːt, toːt, teː=, tuːt

Relations

DET nodes are attached to their parents using 7 different relations: det (1692; 97% instances), discourse (25; 1% instances), fixed (6; 0% instances), reparandum (6; 0% instances), dep (5; 0% instances), dep:comp (2; 0% instances), dislocated:subj (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (1473; 85% instances), PRON (83; 5% instances), ADJ (65; 4% instances), ADP (35; 2% instances), NUM (27; 2% instances), PROPN (21; 1% instances), ADV (9; 1% instances), VERB (9; 1% instances), SCONJ (6; 0% instances), X (5; 0% instances), PART (3; 0% instances), INTJ (1; 0% instances)

1714 (99%) DET nodes are leaves.

23 (1%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 2 different relations: punct (22; 96% instances), advmod (1; 4% instances)

Children of DET nodes belong to 2 different parts of speech: PUNCT (22; 96% instances), PART (1; 4% instances)