Treebank Statistics: UD_Beja-NSC: POS Tags: DET
There are 1 DET
lemmas (6%), 33 DET
types (3%) and 933 DET
tokens (16%).
Out of 16 observed tags, the rank of DET
is: 6 in number of lemmas, 7 in number of types and 3 in number of tokens.
The 10 most frequent DET
lemmas: _
The 10 most frequent DET
types: =t, i=, oː=, =b, uː=, w=, ti=, oːn, t=, uːn
The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)
The 10 most frequent ambiguous types: =t (DET 157, CCONJ 67, SCONJ 3, PRON 1), i= (DET 145, PRON 2, SCONJ 2), =b (DET 91, SCONJ 2, PRON 1), w= (DET 62, SCONJ 4), ti= (DET 50, SCONJ 5, PRON 1), oːn (DET 44, PRON 1), eːn (VERB 62, DET 14), beːn (DET 7, ADV 3, PRON 1), =eː (PRON 23, ADP 13, SCONJ 12, DET 1), beːt (ADP 1, DET 1)
- =t
- DET 157: oːn ti= ʃaː =t =oːn tamna ti= i̠ːjʔaː gʷʔana /
- CCONJ 67: ontʔa // bak ʔabkin / w= hi ini =oː =hoːb // ajwa / adi =t
- SCONJ 3: iraːnaj miːmaʃa =ji dar =iː hasamani =t ti= naː =t =iːb // t= ʔadi saffiimtini =heːb han andi /
- PRON 1: t= ʔidda =t =eː ti= i= buːn =i =t jhaksi =t / hiːreːreː jʔeːn =hoːb / handi =i / khiː / uː= jhaːm / hangiːt //
- i=
- =b
- w=
- ti=
- oːn
- eːn
- beːn
- =eː
- PRON 23: winneːt ʔareːji eːn / ʔakra reːr // mhaj koː =jeː j= ʔar =eː //
- ADP 13: ʃamattan =i =ji ʃamat =eː =ka ani i= mhiːn =i naːjeː mhan /
- SCONJ 12: oː= kna hoːj bi= ibarin =eː =na ki= thaːj eːn /
- DET 1: uː= tak areː / ti= ndeː =t =i =da jʔeːtiːt / w= ʔoːr =oːk rhan / ti= karaːma =t =eː firar# / ti= tifirʔi =jeːt iktimna / afirha =b akajeː =wa / i= dhaj =iːb / hawaːjeː =wa rhan indi eːn //
- beːt
Morphology
The form / lemma ratio of DET
is 33.000000 (the average of all parts of speech is 76.500000).
The 1st highest number of forms (33) was observed with the lemma “_”: =b, =eː, =t, aː=, aːn, baliːnaːj, beːn, beːt, deː, eː=, eːn, eːt, i=, j=, mhasi, oː=, oːn, oːnaːj, oːt, t=, taː=, teː=, ti=, toː=, toːn, toːt, tuː=, tuːt, u=, uː=, uːn, uːt, w=.
DET
occurs with 7 features: Gender (930; 100% instances), Definite (603; 65% instances), Case (456; 49% instances), Number (429; 46% instances), PronType (129; 14% instances), Deixis (126; 14% instances), Degree (2; 0% instances)
DET
occurs with 13 feature-value pairs: Case=Acc
, Case=Gen
, Case=Nom
, Definite=Def
, Definite=Ind
, Degree=Dim
, Deixis=Prox
, Deixis=Remt
, Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
, PronType=Dem
DET
occurs with 34 feature combinations.
The most frequent feature combination is Definite=Def|Gender=Masc
(166 tokens).
Examples: i=, j=, u=
Relations
DET
nodes are attached to their parents using 7 different relations: det (907; 97% instances), discourse (10; 1% instances), fixed (5; 1% instances), reparandum (5; 1% instances), dep (4; 0% instances), acl:relcl (1; 0% instances), dep:comp (1; 0% instances)
Parents of DET
nodes belong to 12 different parts of speech: NOUN (738; 79% instances), VERB (81; 9% instances), PRON (35; 4% instances), ADJ (28; 3% instances), PROPN (13; 1% instances), NUM (12; 1% instances), ADP (11; 1% instances), SCONJ (5; 1% instances), ADV (4; 0% instances), X (4; 0% instances), DET (1; 0% instances), PART (1; 0% instances)
918 (98%) DET
nodes are leaves.
13 (1%) DET
nodes have one child.
2 (0%) DET
nodes have two children.
The highest child degree of a DET
node is 2.
Children of DET
nodes are attached using 6 different relations: punct (11; 65% instances), advmod (2; 12% instances), cc (1; 6% instances), dep (1; 6% instances), det (1; 6% instances), nmod:poss (1; 6% instances)
Children of DET
nodes belong to 7 different parts of speech: PUNCT (11; 65% instances), ADP (1; 6% instances), ADV (1; 6% instances), CCONJ (1; 6% instances), DET (1; 6% instances), PART (1; 6% instances), PRON (1; 6% instances)