home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: DET

There are 15 DET lemmas (1%), 29 DET types (1%) and 1205 DET tokens (12%). Out of 16 observed tags, the rank of DET is: 11 in number of lemmas, 10 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: an, un, ho, e, ma, o, he, holl, bep, hon

The 10 most frequent DET types: ar, an, ur, al, un, r, e, ho, n, o

The 10 most frequent ambiguous lemmas: an (DET 863, X 2), e (ADP 242, PART 214, DET 37, X 2), ma (DET 27, SCONJ 25), o (PART 40, DET 22, INTJ 2), holl (DET 15, PRON 8, ADV 1), a (PART 303, ADP 59, DET 6, PRON 5), pep (DET 5, X 3), da (ADP 253, DET 3, X 3), kement (DET 2, X 1), peseurt (ADJ 1, DET 1)

The 10 most frequent ambiguous types: ar (DET 382, X 13), an (DET 256, X 4), ur (DET 62, PART 9), e (PART 208, ADP 92, DET 33, X 2), n (DET 22, ADV 1), o (PART 34, DET 22, PRON 8), _ (PRON 53, DET 18, ADV 4, PUNCT 1), he (DET 16, PRON 2), ma (SCONJ 16, DET 14), holl (DET 15, PRON 8, ADV 1)

Morphology

The form / lemma ratio of DET is 1.933333 (the average of all parts of speech is 1.406011).

The 1st highest number of forms (7) was observed with the lemma “an”: ‘r, _, al, an, ar, n, r.

The 2nd highest number of forms (4) was observed with the lemma “ma”: am, m, ma, va.

The 3rd highest number of forms (4) was observed with the lemma “un”: ‘n, ul, un, ur.

DET occurs with 4 features: Poss (153; 13% instances), Gender[psor] (54; 4% instances), Number (20; 2% instances), PronType (1; 0% instances)

DET occurs with 6 feature-value pairs: Gender[psor]=Fem, Gender[psor]=Masc, Number=Plur, Number=Sing, Poss=Yes, PronType=Int

DET occurs with 7 feature combinations. The most frequent feature combination is _ (1031 tokens). Examples: ar, an, ur, al, un, r, n, _, holl, ul

Relations

DET nodes are attached to their parents using 2 different relations: det (1204; 100% instances), nmod (1; 0% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (1162; 96% instances), PROPN (11; 1% instances), ADJ (9; 1% instances), NUM (9; 1% instances), PRON (9; 1% instances), VERB (4; 0% instances), PUNCT (1; 0% instances)

1190 (99%) DET nodes are leaves.

15 (1%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 4 different relations: case (6; 40% instances), dep (6; 40% instances), cc (2; 13% instances), punct (1; 7% instances)

Children of DET nodes belong to 4 different parts of speech: ADP (6; 40% instances), X (6; 40% instances), CCONJ (2; 13% instances), PUNCT (1; 7% instances)