home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Yoruba-YTB: POS Tags: DET

There are 21 DET lemmas (1%), 21 DET types (1%) and 279 DET tokens (3%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: àwọn, yìí, náà, gbogbo, o, wọ̀nyí, Oríṣìíríṣìí, báyìí, Bákanáà, Imo

The 10 most frequent DET types: àwọn, yìí, náà, gbogbo, o, wọ̀nyí, Oríṣìíríṣìí, báyìí, Bákanáà, Imo

The 10 most frequent ambiguous lemmas: àwọn (DET 127, PRON 12), náà (DET 39, ADV 23, PRON 13, ADJ 4), o (DET 6, NOUN 2, PRON 1), lo (DET 1, VERB 1), à (PRON 3, DET 1), èyí (PRON 15, DET 1, NOUN 1)

The 10 most frequent ambiguous types: àwọn (DET 120, PRON 11), náà (DET 38, ADV 23, PRON 13, ADJ 4), o (DET 4, NOUN 2, PRON 2), lo (DET 1, VERB 1), (ADV 2, DET 1, VERB 1), èyí (PRON 15, DET 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.007344).

The 1st highest number of forms (2) was observed with the lemma “náà”: ná, náà.

The 2nd highest number of forms (1) was observed with the lemma “Bákanáà”: Bákanáà.

The 3rd highest number of forms (1) was observed with the lemma “Imo”: Imo.

DET occurs with 3 features: Number (120; 43% instances), PronType (120; 43% instances), Typo (1; 0% instances)

DET occurs with 3 feature-value pairs: Number=Plur, PronType=Dem, Typo=Yes

DET occurs with 3 feature combinations. The most frequent feature combination is _ (158 tokens). Examples: yìí, náà, gbogbo, Àwọn, o, wọ̀nyí, Oríṣìíríṣìí, báyìí, Bákanáà, Imo

Relations

DET nodes are attached to their parents using 7 different relations: det (269; 96% instances), nmod (4; 1% instances), nsubj (2; 1% instances), ccomp (1; 0% instances), conj (1; 0% instances), fixed (1; 0% instances), obl (1; 0% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (221; 79% instances), VERB (19; 7% instances), PROPN (15; 5% instances), PRON (12; 4% instances), ADJ (7; 3% instances), ADV (4; 1% instances), NUM (1; 0% instances)

262 (94%) DET nodes are leaves.

14 (5%) DET nodes have one child.

1 (0%) DET nodes have two children.

2 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 11 different relations: case (5; 23% instances), conj (3; 14% instances), fixed (3; 14% instances), amod (2; 9% instances), nsubj (2; 9% instances), punct (2; 9% instances), acl (1; 5% instances), advmod (1; 5% instances), compound:svc (1; 5% instances), obj (1; 5% instances), xcomp (1; 5% instances)

Children of DET nodes belong to 8 different parts of speech: VERB (6; 27% instances), PRON (5; 23% instances), PART (3; 14% instances), ADJ (2; 9% instances), ADP (2; 9% instances), PUNCT (2; 9% instances), ADV (1; 5% instances), NOUN (1; 5% instances)