Treebank Statistics: UD_Bororo-BDT: POS Tags: PROPN
There are 996 PROPN lemmas (8%), 1335 PROPN types (7%) and 5764 PROPN tokens (4%).
Out of 17 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 7 in number of tokens.
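The lemma, type and token counts above can be reproduced, in principle, by a single pass over the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page. It assumes that a "type" is a distinct word form compared case-sensitively, and the function name `upos_counts` and the toy `SAMPLE` sentence are hypothetical and not taken from the treebank.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for CoNLL-U text."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        tok_id, form, lemma, upos = cols[0], cols[1], cols[2], cols[3]
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

# Toy input, invented for illustration only.
SAMPLE = "\n".join([
    "# text = toy sentence, not taken from the treebank",
    "1\tAna\tAna\tPROPN\t_\t_\t4\tnsubj\t_\t_",
    "2\tAnas\tAna\tPROPN\t_\t_\t1\tflat\t_\t_",
    "3\tBob\tBob\tPROPN\t_\t_\t1\tconj\t_\t_",
    "4\truns\trun\tVERB\t_\t_\t0\troot\t_\t_",
    "5\tAna\tAna\tPROPN\t_\t_\t4\tobj\t_\t_",
])

counts = upos_counts(SAMPLE)
print(counts["PROPN"])  # (lemmas, types, tokens) for the toy input
```

Sorting the tags by each of the three counts in descending order would then give the ranks reported above.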
The 10 most frequent PROPN lemmas: _, pao, bakororo, o, bororo, Eceraedu, ro, Aije, remawu, pemegarewu
The 10 most frequent PROPN types: pao, nowu, bakororo, o, bororo, Ecerae, Aije, remawu, pemegarewu, João
The 10 most frequent ambiguous lemmas: _ (NOUN 5910, VERB 3398, ADV 1856, PRON 1359, ADP 1308, PROPN 1165, X 926, PUNCT 459, DET 149, INTJ 122, SCONJ 55, CCONJ 30, PART 29), pao (PROPN 314, ADV 14, X 6, NOUN 2), bakororo (NOUN 4, PROPN 2), o (NOUN 673, ADV 130, PROPN 110, VERB 66), bororo (NOUN 48, PROPN 18, ADV 2), Eceraedu (PROPN 85, NOUN 11), ro (VERB 518, NOUN 129, PROPN 82, ADV 4, PRON 2, ADJ 1), Aije (PROPN 73, ADP 4, ADV 1, VERB 1), remawu (PROPN 67, ADV 28, VERB 12, NOUN 5, X 1), pemegarewu (PROPN 18, NOUN 4)
The 10 most frequent ambiguous types: pao (X 4, PROPN 3), nowu (DET 1448, PROPN 231), bakororo (PROPN 2, VERB 2, NOUN 1), o (NOUN 171, PROPN 138, ADV 93, VERB 6), bororo (NOUN 47, PROPN 11, VERB 2), Ecerae (PROPN 81, NOUN 20, VERB 1), Aije (PROPN 73, ADP 4, ADV 1, VERB 1), remawu (PROPN 67, ADV 28, NOUN 24, X 1), pemegarewu (NOUN 266, PROPN 18, X 3), João (PROPN 63, NOUN 2)
- pao
- nowu
- bakororo
  - PROPN 2: Oieigo jewetuiiga ojewetu bokodoriware iga oieigo wararere tagaru uia cibaiurewu bakororo tuiagajejewu okwabijire .
  - VERB 2: Oieigo jewetuiaiga ojewetu bokodoriware iga oieigo wararere tagaru uia aiadugodoge ewadarudodu epa bakororo tuiagajejewu okwabijire .
  - NOUN 1: Arowe eregodure , pana bakororo eregodure .
- o
- bororo
- Ecerae
- Aije
- remawu
- pemegarewu
- João
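An "ambiguous type" in the lists above is a word form that occurs with more than one part-of-speech tag. A rough sketch of how such a table could be built from (form, tag) pairs follows. The function name `ambiguous_types` and the toy input are hypothetical, and the placeholder forms are not Bororo words.

```python
from collections import Counter, defaultdict

def ambiguous_types(tagged, target="PROPN"):
    """Map each form tagged both as `target` and as some other UPOS
    to its (tag, count) list, most frequent tag first."""
    by_form = defaultdict(Counter)
    for form, upos in tagged:
        by_form[form][upos] += 1
    return {
        form: tags.most_common()
        for form, tags in by_form.items()
        if target in tags and len(tags) > 1
    }

# Toy (form, UPOS) pairs, invented for illustration only.
toy = [
    ("aa", "DET"), ("aa", "DET"), ("aa", "PROPN"),
    ("bb", "PROPN"),
    ("cc", "NOUN"), ("cc", "VERB"),
]

result = ambiguous_types(toy)
print(result)
```

Only `aa` is reported here: `bb` has a single tag, and `cc` is ambiguous but never tagged PROPN.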
Morphology
The form / lemma ratio of PROPN is 1.340361 (the average of all parts of speech is 1.360106).
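The ratio appears to be the number of distinct PROPN types divided by the number of distinct PROPN lemmas. This is an assumption about how the page computes it, but the figures reported above (1335 types, 996 lemmas) reproduce the stated value. The helper name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# Counts taken from the PROPN summary on this page.
ratio = form_lemma_ratio(1335, 996)
print(f"{ratio:.6f}")  # 1.340361
```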
The 1st highest number of forms (357) was observed with the lemma “_”: !’Jacuba’to, 2Iorduware, 2Mare, 2Oiogwari, 2Paduie, 4Mare, 6Mare, 7Mare, 8Mare, 8Tare, Absinto, Aerduware, Aeto, Aiguio, Aiquio, Akaru, Akarubo, Akaruio, Akuie, Akurubo, Apoguru, Apostolodo, Ararebo, Araru, Araruga, Arerebo, Arodo, Aroia, Aruio, Awudugugo, Awuru, Awuruio, Azul, Bairugo, Bakeraoto, Bakoroakaru, Bakororoia, Bakorororodo, Barubaru, Barubo, Batista, Batistare, Biapagare, Biaruru, Birimodo, Birmodo, Blasto, Boeco, Bokarebo, Bokodoga, Bokodoriware, Boqueraoto, Butugo, Care, Cibaibo, Ciocio, Corinto, Cristo, Diretorarodo, Ecerabaru, Edugo, Eimejerare, Eimejerarodo, Ekuie, Emeraoto, Emoduie, Enawuio, Enogwari, Enoiogwari, Eparu, Erasto, Erewakawuio, Erore’guru, Eugênio, Ewiapagare, Ewiriga, Festo, Goia, Gólgota, Ikuiabaru, Ikuiebo, Inodoguru, Iorubodare, Iparebo, Irojibo, Israel, Iwiapagare, Jakaduio, Jaruru, Jaruruto, Jessé, Jonare, Joruduware, José, Joware, Jurerodo, Jureto, Jurociwuio, Kaborewu, Kadagubo, Kagarubo, Kare, Keoguru, Kewoguru, Kie, Kijibo, Koedugo, Kogebowu, Kogedugo, Koguio, Koguiowuio, Kudorouio, Kugarubo, Kuiadawuio, Kurojibo, Kuruguga, Mare, Maria, Mariguru, Marigurubo, Mata, Meribo, Meriribaru, Meriribo, Meririrbo, Metugubo, Mileto, Missare, Muguio, Nowaboia, Okogebo, Okogebowu, Onaregeduie, Orowaribo, Orowariboia, Pagimejerare, Paoie, Pemegareuie, Piloto, Pirojibo, Preto, Pudumie, Samuel, Taboguru, Tabowu, Tadugo, Taemaru, Taerduware, Taeto, Tamagodo, Tawudugugo, Toduio, Toiogwari, Toriga, Tugare, Tumeartoru, Tururu, Tuwagowu, Uibo, Ukiga, Uruguio, Urukuio, Utoboga, aigo, akie, apo, are, aru, awie, badojebare, baiga, baiporoto, bakaru, bakowu, bakurireuto, bakuru, bapoto, baru, baruto, bataru, bie, birido’ta’nowu, boecoto, boepare, boeto, bokodoribaru, bororoto, burejoia, butudugo, butuie, caminhãoto, cedagaru, cedogeare, ceerduware, cegere’piga’kuru, cemugo, cenaguie, cenoia, cerrado, cewu, ciga, codo, curu, duie, duru, durururu, eto, finado, ga, guru, ia, iga, 
ioga, ipo, jado, jaruruio, jeonare, jewetuiaiga, jewetuiiga, jewoduie, jewoduio, jo, jodo, joie, jokoduie, jorduware, jorugo, joruto, ju, judeudogeie, jugo, juie, jumento, jureie, kaewu, kaworu, kiarigo, kimoduio, koia, kugo, kuio, kujibo, lmarugo, maereuto, maiwuto, makaguragare, manoto, marenaru, mariguduie, meri, meririe, meto, metuia, meturewoto, mil, moto, motoia, motoiado, mototo, muga, mugatowu, muguie, negedroguie, noidoia, nonoie, nonowu, nowagoroia, noware, nowu, nowugeraduie, o, ogeie, oiado, oie, oieigo, oinowu, okeare, okituware, okware, onare, oreie, oto, owu, padu, paduie, paerduware, pagaboie, pagadoduie, paginorudoduie, pajarugo, paraduio, pare, paruto, pawaboto, pegagoduie, pegamoduie, pegare, pegodo, peioga, pemegadowu, pemegamodeduie, pemegare, pemegareugeie, pijidoduie, pobo, pogodo, porodo, powari, profetare, pudabowu, pudare, puredugoduie, raboduie, raduie, rakaguragare, rakuduie, raru, remawuie, remoduio, reru, reruio, rie, roga, roia, roie, roiware, roreru, roto, rotodo, ru, rugadu, ruio, ruru, sinagoga, tado, tare, tl, to, towu, tugo, tuie, uiaiga, uie, ukeroia, ukuie, umanare, umuguio, unorare, upagare, uporu, uru, utoriga, utugare, utugodo, uwie, woie, wuru.
The 2nd highest number of forms (3) was observed with the lemma “Bakororo”: BAKORORO, Bakororo, Bakororodoge.
The 3rd highest number of forms (2) was observed with the lemma “Arigao”: Arigao, Arigaodoge.
PROPN occurs with 1 feature: Mood (16; 0% instances)
PROPN occurs with 1 feature-value pair: Mood=Ind
PROPN occurs with 2 feature combinations.
The most frequent feature combination is _ (5748 tokens).
Examples: pao, nowu, bakororo, o, bororo, Ecerae, Aije, remawu, pemegarewu, João
Relations
PROPN nodes are attached to their parents using 11 different relations: nmod (2152; 37% instances), flat (1182; 21% instances), nsubj (1177; 20% instances), obl (527; 9% instances), root (305; 5% instances), conj (245; 4% instances), ccomp (90; 2% instances), obj (48; 1% instances), dep (25; 0% instances), parataxis (7; 0% instances), advcl (6; 0% instances)
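As a consistency check, the relation counts above should sum to the 5764 PROPN tokens, and each percentage should be the count's share of that total rounded to a whole number. The sketch below uses only the counts listed on this page. The variable names are illustrative.

```python
# Incoming relation counts for PROPN nodes, copied from the list above.
RELATION_COUNTS = {
    "nmod": 2152, "flat": 1182, "nsubj": 1177, "obl": 527,
    "root": 305, "conj": 245, "ccomp": 90, "obj": 48,
    "dep": 25, "parataxis": 7, "advcl": 6,
}

total = sum(RELATION_COUNTS.values())

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in RELATION_COUNTS.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```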
Parents of PROPN nodes belong to 18 different parts of speech: PROPN (2024; 35% instances), NOUN (1417; 25% instances), VERB (1236; 21% instances), no part of speech, i.e. the artificial root (305; 5% instances), PRON (205; 4% instances), X (165; 3% instances), ADV (154; 3% instances), AUX (71; 1% instances), ADP (58; 1% instances), NUM (58; 1% instances), DET (16; 0% instances), INTJ (13; 0% instances), PART (13; 0% instances), PUNCT (13; 0% instances), ADJ (6; 0% instances), SCONJ (5; 0% instances), CCONJ (4; 0% instances), SYM (1; 0% instances)
2360 (41%) PROPN nodes are leaves.
2129 (37%) PROPN nodes have one child.
823 (14%) PROPN nodes have two children.
452 (8%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 9.
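The child-degree figures above (leaves, one child, two children, three or more) can be derived from the HEAD column alone by counting how many tokens point at each node. The following is a minimal sketch under the assumption that each sentence is given as a list of `(id, upos, head)` tuples. The function name `degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter

def degree_histogram(sentences, upos="PROPN"):
    """Count nodes tagged `upos` by number of children: 0, 1, 2, or 3 (= 3+)."""
    hist = Counter()
    for sent in sentences:
        # Number of dependents of each node id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[min(n_children[tok_id], 3)] += 1
    return hist

# Toy sentence as (id, UPOS, head) tuples, invented for illustration only.
toy_sentence = [
    (1, "PROPN", 2),
    (2, "VERB", 0),
    (3, "PROPN", 2),
    (4, "PROPN", 3),
    (5, "PUNCT", 2),
]

hist = degree_histogram([toy_sentence])
print(dict(hist))
```

In the toy sentence, tokens 1 and 4 are leaves and token 3 has one child, so the histogram has two leaves and one single-child node.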
Children of PROPN nodes are attached using 16 different relations: nmod (1506; 28% instances), flat (1155; 21% instances), case (721; 13% instances), punct (688; 13% instances), nsubj (597; 11% instances), advmod (247; 5% instances), det (197; 4% instances), conj (178; 3% instances), dep (40; 1% instances), cc (10; 0% instances), obl (10; 0% instances), parataxis (10; 0% instances), obj (9; 0% instances), mark (6; 0% instances), advcl (1; 0% instances), discourse (1; 0% instances)
Children of PROPN nodes belong to 15 different parts of speech: PROPN (2024; 38% instances), NOUN (1110; 21% instances), PUNCT (688; 13% instances), ADP (674; 13% instances), ADV (286; 5% instances), DET (200; 4% instances), NUM (131; 2% instances), X (89; 2% instances), PRON (71; 1% instances), VERB (53; 1% instances), SCONJ (26; 0% instances), CCONJ (12; 0% instances), AUX (8; 0% instances), INTJ (3; 0% instances), ADJ (1; 0% instances)