home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: DET

There are 1 DET lemmas (6%), 33 DET types (3%) and 933 DET tokens (16%). Out of 16 observed tags, the rank of DET is: 6 in number of lemmas, 7 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: =t, i=, oː=, =b, uː=, w=, ti=, oːn, t=, uːn

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: =t (DET 157, CCONJ 67, SCONJ 3, PRON 1), i= (DET 145, PRON 2, SCONJ 2), =b (DET 91, SCONJ 2, PRON 1), w= (DET 62, SCONJ 4), ti= (DET 50, SCONJ 5, PRON 1), oːn (DET 44, PRON 1), eːn (VERB 62, DET 14), beːn (DET 7, ADV 3, PRON 1), =eː (PRON 23, ADP 13, SCONJ 12, DET 1), beːt (ADP 1, DET 1)

Morphology

The form / lemma ratio of DET is 33.000000 (the average of all parts of speech is 76.500000).

The 1st highest number of forms (33) was observed with the lemma “_”: =b, =eː, =t, aː=, aːn, baliːnaːj, beːn, beːt, deː, eː=, eːn, eːt, i=, j=, mhasi, oː=, oːn, oːnaːj, oːt, t=, taː=, teː=, ti=, toː=, toːn, toːt, tuː=, tuːt, u=, uː=, uːn, uːt, w=.

DET occurs with 7 features: Gender (930; 100% instances), Definite (603; 65% instances), Case (456; 49% instances), Number (429; 46% instances), PronType (129; 14% instances), Deixis (126; 14% instances), Degree (2; 0% instances)

DET occurs with 13 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Dim, Deixis=Prox, Deixis=Remt, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PronType=Dem

DET occurs with 34 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc (166 tokens). Examples: i=, j=, u=

Relations

DET nodes are attached to their parents using 7 different relations: det (907; 97% instances), discourse (10; 1% instances), fixed (5; 1% instances), reparandum (5; 1% instances), dep (4; 0% instances), acl:relcl (1; 0% instances), dep:comp (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (738; 79% instances), VERB (81; 9% instances), PRON (35; 4% instances), ADJ (28; 3% instances), PROPN (13; 1% instances), NUM (12; 1% instances), ADP (11; 1% instances), SCONJ (5; 1% instances), ADV (4; 0% instances), X (4; 0% instances), DET (1; 0% instances), PART (1; 0% instances)

918 (98%) DET nodes are leaves.

13 (1%) DET nodes have one child.

2 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 6 different relations: punct (11; 65% instances), advmod (2; 12% instances), cc (1; 6% instances), dep (1; 6% instances), det (1; 6% instances), nmod:poss (1; 6% instances)

Children of DET nodes belong to 7 different parts of speech: PUNCT (11; 65% instances), ADP (1; 6% instances), ADV (1; 6% instances), CCONJ (1; 6% instances), DET (1; 6% instances), PART (1; 6% instances), PRON (1; 6% instances)