home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: DET

There are 180 DET lemmas (1%), 231 DET types (1%) and 4224 DET tokens (3%). Out of 17 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: nowu, ia, awu, cewu, _, nonowu, iduia, roia, muga, jicewu

The 10 most frequent DET types: nowu, ia, awu, cewu, nono, aiduia, aroia, amuga, jice, ecewu

The 10 most frequent ambiguous lemmas: nowu (DET 1997, NOUN 9), ia (DET 769, NOUN 109, PRON 76, ADP 16, VERB 14, NUM 12, PROPN 3), awu (DET 570, ADV 29, NOUN 24, PROPN 20, X 7), cewu (DET 175, PROPN 27, VERB 5, NOUN 3), _ (NOUN 5910, VERB 3398, ADV 1856, PRON 1359, ADP 1308, PROPN 1165, X 926, PUNCT 459, DET 149, INTJ 122, SCONJ 55, CCONJ 30, PART 29), iduia (DET 32, NOUN 1), roia (NOUN 39, DET 31, ADP 8, PROPN 4, CCONJ 3), muga (NOUN 57, DET 22, VERB 2), ecewu (DET 18, PRON 8), akudawu (DET 15, NOUN 1)

The 10 most frequent ambiguous types: nowu (DET 1448, PROPN 231), ia (DET 649, NOUN 105, ADP 15, PROPN 9, NUM 4), awu (DET 425, ADV 10, NOUN 10, PROPN 9, X 7), cewu (DET 98, NOUN 71, ADP 59, PROPN 25), nono (ADV 347, DET 68, PROPN 21, NOUN 17), aroia (DET 24, NOUN 22, PROPN 2), jice (NOUN 37, ADP 26, DET 19, ADV 10), ecewu (DET 13, PRON 5, VERB 3, NOUN 1), akudawu (DET 11, NOUN 1), kowu (DET 13, NOUN 6)

Morphology

The form / lemma ratio of DET is 1.283333 (the average of all parts of speech is 1.360106).

The 1st highest number of forms (57) was observed with the lemma “_”: Cibaiaritowu, Inodowu, Kakodiwu, Pegadowu, adorodoge, agareuge, aiadugodoge, aicereuge, aidugirireuge, aidurirewuge, aijedoge, aiwu, aiwuge, akoreuge, akowage, altardoge, amalecitadoge, amedage, amorreudoge, aogobodoge, aokurewuge, aomage, aonagarege, aopegokareuge, apostolodoge, apowuge, aredumage, arege, atugarege, awuge, awuiage, awurimage, bakowu, bitowu, boetojiwu, cemagowu, ia, igoia, imeduia, iogoduia, itaiduia, iwu, jiowu, jiwu, jorduwadowu, kodiwu, kogadowu, koriwu, kowu, nowu, owu, paginorudowu, pagodudowu, pemegadowu, pijiwu, pudabowu, towu.

The 2nd highest number of forms (2) was observed with the lemma “awu”: awu, awure.

The 3rd highest number of forms (2) was observed with the lemma “cewu”: Ce, cewu.

DET occurs with 5 features: Deixis (3095; 73% instances), PronType (1593; 38% instances), Definite (1003; 24% instances), Number (652; 15% instances), Mood (5; 0% instances)

DET occurs with 8 feature-value pairs: Definite=Ind, Deixis=Med, Deixis=Prox, Deixis=Remt, Mood=Ind, Number=Sing, PronType=Art, PronType=Dem

DET occurs with 11 feature combinations. The most frequent feature combination is Deixis=Med (1767 tokens). Examples: nowu, nono, kowu, Inowu, mano, 9Nowu, pudabowu, owu, 2Nowu, 8Nowu

Relations

DET nodes are attached to their parents using 13 different relations: det (3984; 94% instances), nmod (86; 2% instances), root (48; 1% instances), nsubj (31; 1% instances), obl (24; 1% instances), dep (13; 0% instances), ccomp (10; 0% instances), obj (10; 0% instances), parataxis (9; 0% instances), conj (5; 0% instances), advmod (2; 0% instances), advcl (1; 0% instances), flat (1; 0% instances)

Parents of DET nodes belong to 16 different parts of speech: NOUN (3234; 77% instances), VERB (267; 6% instances), PROPN (200; 5% instances), ADV (182; 4% instances), PRON (105; 2% instances), X (65; 2% instances), ADP (59; 1% instances), (48; 1% instances), DET (33; 1% instances), SCONJ (9; 0% instances), AUX (7; 0% instances), INTJ (5; 0% instances), NUM (5; 0% instances), ADJ (3; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

4045 (96%) DET nodes are leaves.

128 (3%) DET nodes have one child.

32 (1%) DET nodes have two children.

19 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 5.

Children of DET nodes are attached using 14 different relations: punct (74; 29% instances), advmod (46; 18% instances), det (32; 12% instances), nmod (31; 12% instances), nsubj (29; 11% instances), case (10; 4% instances), obl (10; 4% instances), dep (7; 3% instances), obj (5; 2% instances), conj (4; 2% instances), parataxis (4; 2% instances), cc (3; 1% instances), mark (3; 1% instances), discourse (1; 0% instances)

Children of DET nodes belong to 12 different parts of speech: PUNCT (74; 29% instances), ADV (49; 19% instances), NOUN (40; 15% instances), DET (33; 13% instances), PRON (21; 8% instances), PROPN (16; 6% instances), ADP (9; 3% instances), SCONJ (5; 2% instances), CCONJ (3; 1% instances), NUM (3; 1% instances), VERB (3; 1% instances), X (3; 1% instances)