Treebank Statistics: UD_Bororo-BDT: POS Tags: DET
There are 180 DET
lemmas (1%), 231 DET
types (1%) and 4224 DET
tokens (3%).
Out of 17 observed tags, the rank of DET
is: 8 in number of lemmas, 8 in number of types and 8 in number of tokens.
The 10 most frequent DET
lemmas: nowu, ia, awu, cewu, _, nonowu, iduia, roia, muga, jicewu
The 10 most frequent DET
types: nowu, ia, awu, cewu, nono, aiduia, aroia, amuga, jice, ecewu
The 10 most frequent ambiguous lemmas: nowu (DET 1997, NOUN 9), ia (DET 769, NOUN 109, PRON 76, ADP 16, VERB 14, NUM 12, PROPN 3), awu (DET 570, ADV 29, NOUN 24, PROPN 20, X 7), cewu (DET 175, PROPN 27, VERB 5, NOUN 3), _ (NOUN 5910, VERB 3398, ADV 1856, PRON 1359, ADP 1308, PROPN 1165, X 926, PUNCT 459, DET 149, INTJ 122, SCONJ 55, CCONJ 30, PART 29), iduia (DET 32, NOUN 1), roia (NOUN 39, DET 31, ADP 8, PROPN 4, CCONJ 3), muga (NOUN 57, DET 22, VERB 2), ecewu (DET 18, PRON 8), akudawu (DET 15, NOUN 1)
The 10 most frequent ambiguous types: nowu (DET 1448, PROPN 231), ia (DET 649, NOUN 105, ADP 15, PROPN 9, NUM 4), awu (DET 425, ADV 10, NOUN 10, PROPN 9, X 7), cewu (DET 98, NOUN 71, ADP 59, PROPN 25), nono (ADV 347, DET 68, PROPN 21, NOUN 17), aroia (DET 24, NOUN 22, PROPN 2), jice (NOUN 37, ADP 26, DET 19, ADV 10), ecewu (DET 13, PRON 5, VERB 3, NOUN 1), akudawu (DET 11, NOUN 1), kowu (DET 13, NOUN 6)
- nowu
- ia
- awu
- DET 425: Ure awu feso tadawuge eiamedu boce kodi emeartorudo tagoino duji .
- ADV 10: Ure taiwodo kodudu bogai , jordure nonogo ikaji , akore awu rugadu !
- NOUN 10: Cenagore cemaragodumode awu meriji , cemaragodumode boecoji .
- PROPN 9: Akore awu Pao Aroe Eimejera o rogu .
- X 7: 2Mare Jesus akore ei Taerdure awu baikurireugei .
- cewu
- nono
- aroia
- jice
- ecewu
- DET 13: Icare ure ecewu Oieigo ko : Oie e e e e …
- PRON 5: Du koiare ecewu kodure boiwu , pega kodure turewo kori .
- VERB 3: Icare ere tugu ecewu kodoto ( kodo kigadu .
- NOUN 1: Kuogorewu bopaguduia Cibaiu Bakororo awiria uiroga ecewu Arigao Bororo bukejewuge eimejera atogewu AIJEIARI BAKORORO biria reuwore .
- akudawu
- kowu
Morphology
The form / lemma ratio of DET
is 1.283333 (the average of all parts of speech is 1.360106).
The 1st highest number of forms (57) was observed with the lemma “_”: Cibaiaritowu, Inodowu, Kakodiwu, Pegadowu, adorodoge, agareuge, aiadugodoge, aicereuge, aidugirireuge, aidurirewuge, aijedoge, aiwu, aiwuge, akoreuge, akowage, altardoge, amalecitadoge, amedage, amorreudoge, aogobodoge, aokurewuge, aomage, aonagarege, aopegokareuge, apostolodoge, apowuge, aredumage, arege, atugarege, awuge, awuiage, awurimage, bakowu, bitowu, boetojiwu, cemagowu, ia, igoia, imeduia, iogoduia, itaiduia, iwu, jiowu, jiwu, jorduwadowu, kodiwu, kogadowu, koriwu, kowu, nowu, owu, paginorudowu, pagodudowu, pemegadowu, pijiwu, pudabowu, towu.
The 2nd highest number of forms (2) was observed with the lemma “awu”: awu, awure.
The 3rd highest number of forms (2) was observed with the lemma “cewu”: Ce, cewu.
DET
occurs with 5 features: Deixis (3095; 73% instances), PronType (1593; 38% instances), Definite (1003; 24% instances), Number (652; 15% instances), Mood (5; 0% instances)
DET
occurs with 8 feature-value pairs: Definite=Ind
, Deixis=Med
, Deixis=Prox
, Deixis=Remt
, Mood=Ind
, Number=Sing
, PronType=Art
, PronType=Dem
DET
occurs with 11 feature combinations.
The most frequent feature combination is Deixis=Med
(1767 tokens).
Examples: nowu, nono, kowu, Inowu, mano, 9Nowu, pudabowu, owu, 2Nowu, 8Nowu
Relations
DET
nodes are attached to their parents using 13 different relations: det (3984; 94% instances), nmod (86; 2% instances), root (48; 1% instances), nsubj (31; 1% instances), obl (24; 1% instances), dep (13; 0% instances), ccomp (10; 0% instances), obj (10; 0% instances), parataxis (9; 0% instances), conj (5; 0% instances), advmod (2; 0% instances), advcl (1; 0% instances), flat (1; 0% instances)
Parents of DET
nodes belong to 16 different parts of speech: NOUN (3234; 77% instances), VERB (267; 6% instances), PROPN (200; 5% instances), ADV (182; 4% instances), PRON (105; 2% instances), X (65; 2% instances), ADP (59; 1% instances), (48; 1% instances), DET (33; 1% instances), SCONJ (9; 0% instances), AUX (7; 0% instances), INTJ (5; 0% instances), NUM (5; 0% instances), ADJ (3; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)
4045 (96%) DET
nodes are leaves.
128 (3%) DET
nodes have one child.
32 (1%) DET
nodes have two children.
19 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 5.
Children of DET
nodes are attached using 14 different relations: punct (74; 29% instances), advmod (46; 18% instances), det (32; 12% instances), nmod (31; 12% instances), nsubj (29; 11% instances), case (10; 4% instances), obl (10; 4% instances), dep (7; 3% instances), obj (5; 2% instances), conj (4; 2% instances), parataxis (4; 2% instances), cc (3; 1% instances), mark (3; 1% instances), discourse (1; 0% instances)
Children of DET
nodes belong to 12 different parts of speech: PUNCT (74; 29% instances), ADV (49; 19% instances), NOUN (40; 15% instances), DET (33; 13% instances), PRON (21; 8% instances), PROPN (16; 6% instances), ADP (9; 3% instances), SCONJ (5; 2% instances), CCONJ (3; 1% instances), NUM (3; 1% instances), VERB (3; 1% instances), X (3; 1% instances)