
This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: ADP

There are 1273 ADP lemmas (10%), 1659 ADP types (9%) and 16590 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent ADP lemmas: ji, apo, keje, _, kae, ai, bogai, piji, tada, koia

The 10 most frequent ADP types: ji, tabo, keje, kae, ei, bogai, piji, ai, ii, tai

The 10 most frequent ambiguous lemmas: ji (ADP 2814, ADV 387, VERB 20, NOUN 18, X 10, PRON 6), apo (ADP 1683, ADV 48, NOUN 16, PROPN 9, X 7, VERB 3), keje (ADP 1500, VERB 85, NOUN 50, SCONJ 45, PRON 1), _ (NOUN 5910, VERB 3398, ADV 1856, PRON 1359, ADP 1308, PROPN 1165, X 926, PUNCT 459, DET 149, INTJ 122, SCONJ 55, CCONJ 30, PART 29), kae (ADP 1013, NOUN 18, ADV 10, SCONJ 3, PRON 1), ai (ADP 730, VERB 24, NOUN 11, ADV 7), bogai (ADP 592, NOUN 32, ADV 13, VERB 3, PRON 1), piji (ADP 484, X 45, ADV 19, VERB 16, NOUN 6), tada (ADP 280, NOUN 83, PRON 7, PROPN 3, VERB 2), koia (ADP 257, VERB 46, NOUN 43, ADV 8, PROPN 3, CCONJ 2, PRON 2)
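The per-tag counts above can be turned into a rough measure of how strongly each lemma leans towards ADP. The following is a minimal sketch using the first three lemmas only; the variable names are illustrative, and "ADP share" is not a statistic reported on this page.

```python
# Tag counts for the three most frequent ambiguous lemmas, copied from the list above.
lemma_tag_counts = {
    "ji":   {"ADP": 2814, "ADV": 387, "VERB": 20, "NOUN": 18, "X": 10, "PRON": 6},
    "apo":  {"ADP": 1683, "ADV": 48, "NOUN": 16, "PROPN": 9, "X": 7, "VERB": 3},
    "keje": {"ADP": 1500, "VERB": 85, "NOUN": 50, "SCONJ": 45, "PRON": 1},
}

# Share of each lemma's occurrences that carry the ADP tag, in percent.
adp_share = {
    lemma: 100 * counts["ADP"] / sum(counts.values())
    for lemma, counts in lemma_tag_counts.items()
}

for lemma, share in adp_share.items():
    print(f"{lemma}: {share:.1f}% ADP")
```

A high share suggests the lemma is predominantly an adposition in this treebank, while a lower share points to more frequent use under other tags.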

The 10 most frequent ambiguous types: ji (ADP 1598, X 4), keje (ADP 1262, ADV 2), ei (ADP 463, NOUN 26, ADV 6, PROPN 6), bogai (ADP 459, NOUN 25), piji (ADP 386, X 53, ADV 12), ai (ADP 349, NOUN 7), tai (ADP 290, NOUN 14), to (ADP 261, VERB 120, NOUN 103, X 2, PROPN 1), tada (ADP 259, NOUN 59, PROPN 3), koia (ADP 228, NOUN 36, PROPN 4, CCONJ 2)

Morphology

The form / lemma ratio of ADP is 1.303221 (the average of all parts of speech is 1.360106).
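The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given earlier on this page. The sketch below assumes that this is how the figure is derived; the variable names are illustrative.

```python
# Counts reported for ADP earlier on this page.
adp_types = 1659   # distinct word forms tagged ADP
adp_lemmas = 1273  # distinct lemmas tagged ADP

# Assumption: "form / lemma ratio" means types divided by lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")
```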

The 1st highest number of forms (284) was observed with the lemma “_”: Cedaregodure’trocamento’kae, Cegodo’ca’nowu’colchete’kae, Emodukae, Enogwariji, Epijire, Etada, Etagoda, Pagaobijire, ae, aiba, akaba, akagemokaba, akaragudukaba, akiba, akuruba, apo, apueceba, apuie, areduba, areduie, aroereboeie, atugoie, awaraba, awie, awuie, bae, baiji, bakujebiji, biji, biriji, boetoji, bokwae, buregi, butugugodae, cebegi, cebiji, cedae, cedoda, cedododai, cedogi, ceeda, cegi, cegiwae, ceibagi, ceiogi, cemaragodae, cenodobiji, cewoadae, cibae, ciegi, cigi, da, duotodai, dupiji, eegai, eegorai, eemarukai, eiameduto, eiaogwai, eiaoto, eiemejerai, eimejarai, eimejerai, ekaguruto, ekai, ekudaeto, ekurito, ekuruto, emagai, emeardaeto, enogituwai, enogwato, enorai, epagai, epagakai, epemegai, eporai, erojai, etae, etaorato, etaoto, etododai, ewadaruto, ewai, ewarito, ewiato, ewugeto, gi, gigi, iaogwai, icai, idurumagai, iegai, ii’café’keje, iiageje, iji, ikajeje, ikeje, ikimejerai, inai, iogi, iogorai, iordukai, iorduwakai, iporai, irawuje, iregodukai, irojai, itaogeje, itawuje, itododai, itogajeje, itoreduje, itowuje, iwiegai, iwogai, iwugeje, jabiji, jae, jagi, jekareae, jerigiji, ji, jiba, jipagi, jire, jiwae, jorudae, kae, kae’café’kae, kahae, keadae, kiji, koda, kodukae, koriji, kuda, kuiada, kujebiji, kujiagi, kuriae, kuridodae, mae, maragodae, meardae, mearudae, megi, meriji, meririji, moriji, morogigi, mugukae, nada, nowu’bolicho’kae, nowu’caminhao’kae, oiadoda, oiagi, oiji, okae, okudae, okwabiji, okwabijire, okwamagudae, otobiji, otodai, pabiji, padogi, paduwo’Jirau’kae, pagi, pagododai, paiadoda, paiogi, pariji, piji, pijire, pintada, puae, pubiji, pudada, pudae, pudai, puguda, pugujiagi, puibagi, puibiji, pumegi, raikae, rekodaiji, rekodaji, rekodajire, roçada, tabo, tabo’caminhão’tada, taboba, tabowu, tabowuto, tada, tadawu, tadawuto, tadoda, tadodawu, taerito, tagajeje, tagaoto, tageje, tagi, tagiba, tagodobo, tagoiadoda, taguda, taiadoda, taiwu, takorewu, 
tamagokaba, taogajeje, taogajejewu, taoto, tarigorewu, tawabo, tawaigace, tawugeje, tawureto, testamento, to, toda, todogajeje, todoguruto, todowu, togudugajeje, togwadawu, togwawu, toiadoda, toriguruto, toripegato, torito, torowu, toruto, toto, tototototo, towubo, towugeje, towujewu, tuboreto, tudada, tudoda, tudugaregece, tudugoce, tugarece, tugeje, tuginoiwu, tuginoiwuto, tuguda, tuguwu, tuiagajejewu, tuiagajejjewu, tuiagejewu, tuiaoto, tuiato, tuiegarewowu, tuierigajeje, tuiewowu, tuigarece, tuiorubo, tujeto, tumoridojeba, tumoridowu, tumorijewu, turojaiwu, turuguce, tuwabo, tuwaito, tuwiaboroto, tuwiato, tuwirebo, tuwirito, tuwugajeje, tuwugeje, ucebae, uiagudae, ukae, ukudae, umaragodae, uwadodae, uwaiji.

The 2nd highest number of forms (13) was observed with the lemma “ai”: Kae, ai, aire, aiwu, akai, cenai, etai, iai, inai, inaidu, pudai, tagai, tai.

The 3rd highest number of forms (13) was observed with the lemma “ji”: ai, duji, ei, i, ii, iji, ji, jiba, jire, jiwu, jiwuge, pai, pudui.

ADP occurs with 11 features: Number (13379; 81% instances), Person (13227; 80% instances), Nomzr (645; 4% instances), Subord (293; 2% instances), Reflex (284; 2% instances), Mood (208; 1% instances), Clusivity (164; 1% instances), Int (131; 1% instances), PronType (68; 0% instances), Speech (47; 0% instances), Gender (1; 0% instances)

ADP occurs with 15 feature-value pairs: Clusivity=Ex, Clusivity=In, Gender=Fem, Int=Yes, Mood=Ind, Nomzr=Rel, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Rcp, Reflex=Yes, Speech=Ind, Subord=Yes

ADP occurs with 31 feature combinations. The most frequent feature combination is Number=Sing|Person=3 (8253 tokens). Examples: ji, tabo, kae, bogai, piji, koia, to, keje, ae, duji

Relations

ADP nodes are attached to their parents using 13 different relations: case (11057; 67% instances), obl (5125; 31% instances), root (139; 1% instances), nmod (104; 1% instances), nsubj (99; 1% instances), dep (17; 0% instances), obj (15; 0% instances), parataxis (12; 0% instances), mark (10; 0% instances), conj (4; 0% instances), advcl (3; 0% instances), ccomp (3; 0% instances), discourse (2; 0% instances)
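The percentages in this list can be reproduced from the raw counts. The sketch below assumes each percentage is the count divided by the total number of ADP tokens, rounded to the nearest integer; it also checks that the relation counts add up to that total.

```python
# Relation counts for ADP nodes, copied from the list above.
relation_counts = {
    "case": 11057, "obl": 5125, "root": 139, "nmod": 104, "nsubj": 99,
    "dep": 17, "obj": 15, "parataxis": 12, "mark": 10, "conj": 4,
    "advcl": 3, "ccomp": 3, "discourse": 2,
}

# Every ADP node has exactly one incoming relation, so this should
# equal the ADP token count given at the top of the page.
total = sum(relation_counts.values())

# Assumption: percentages are rounded to the nearest whole number.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
for rel, pct in percentages.items():
    print(f"{rel}: {pct}%")
```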

Parents of ADP nodes belong to 17 different parts of speech: NOUN (9657; 58% instances), VERB (5329; 32% instances), PROPN (674; 4% instances), ADV (360; 2% instances), ADP (163; 1% instances), no parent, i.e. root (139; 1% instances), X (110; 1% instances), PRON (62; 0% instances), NUM (30; 0% instances), PART (25; 0% instances), DET (9; 0% instances), CCONJ (8; 0% instances), AUX (7; 0% instances), INTJ (6; 0% instances), ADJ (5; 0% instances), SCONJ (5; 0% instances), PUNCT (1; 0% instances)
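Counts like the relation and parent-tag figures above could be gathered from a treebank file in CoNLL-U format roughly as follows. This is a minimal sketch, not the script that generated this page: the function name `adp_attachment_stats` and the `(root)` label are illustrative, and the sample input uses placeholder forms rather than real Bororo data.

```python
from collections import Counter

def adp_attachment_stats(conllu_text):
    """Count deprels and parent UPOS tags for ADP tokens in CoNLL-U text."""
    deprels, parent_upos = Counter(), Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        rows = {}
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip comment lines such as sent_id
            cols = line.split("\t")
            # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
            if not cols[0].isdigit():
                continue
            rows[int(cols[0])] = cols
        for cols in rows.values():
            if cols[3] != "ADP":  # column 4 is UPOS
                continue
            deprels[cols[7]] += 1  # column 8 is DEPREL
            head = int(cols[6])    # column 7 is HEAD
            # HEAD 0 means the token is the root and has no parent word.
            parent_upos["(root)" if head == 0 else rows[head][3]] += 1
    return deprels, parent_upos

# Toy two-sentence input with placeholder forms (not real treebank data).
sample = "\n".join([
    "# sent_id = toy-1",
    "1\tw1\tw1\tNOUN\t_\t_\t3\tobl\t_\t_",
    "2\tw2\tw2\tADP\t_\t_\t1\tcase\t_\t_",
    "3\tw3\tw3\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\tw4\tw4\tADP\t_\t_\t0\troot\t_\t_",
])
deprels, parents = adp_attachment_stats(sample)
print(deprels)
print(parents)
```

Counting root-attached tokens under their own label keeps the parent-tag total equal to the number of ADP tokens.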

15908 (96%) ADP nodes are leaves.

441 (3%) ADP nodes have one child.

135 (1%) ADP nodes have two children.

106 (1%) ADP nodes have three or more children.

The highest child degree of an ADP node is 6.

Children of ADP nodes are attached using 16 different relations: nmod (208; 19% instances), punct (164; 15% instances), case (163; 15% instances), nsubj (117; 11% instances), advmod (111; 10% instances), dep (78; 7% instances), det (59; 5% instances), obl (54; 5% instances), parataxis (45; 4% instances), conj (40; 4% instances), obj (29; 3% instances), mark (9; 1% instances), cc (4; 0% instances), flat (3; 0% instances), advcl (2; 0% instances), cop (1; 0% instances)

Children of ADP nodes belong to 13 different parts of speech: NOUN (322; 30% instances), PUNCT (164; 15% instances), ADP (163; 15% instances), ADV (125; 11% instances), PRON (67; 6% instances), DET (59; 5% instances), PROPN (58; 5% instances), VERB (48; 4% instances), X (44; 4% instances), AUX (16; 1% instances), SCONJ (12; 1% instances), CCONJ (6; 1% instances), NUM (3; 0% instances)
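The child statistics can be cross-checked against each other. The sketch below, with illustrative variable names, verifies that the child counts by relation and by part of speech give the same total, and estimates how many children the nodes with three or more children have on average.

```python
# Child counts by relation and by part of speech, copied from the two lists above.
child_relations = [208, 164, 163, 117, 111, 78, 59, 54, 45, 40, 29, 9, 4, 3, 2, 1]
child_pos = [322, 164, 163, 125, 67, 59, 58, 48, 44, 16, 12, 6, 3]

total_children = sum(child_relations)
# Both breakdowns describe the same set of child nodes.
pos_total = sum(child_pos)

# Number of ADP nodes by number of children, from the figures above.
degree_counts = {"0": 15908, "1": 441, "2": 135, "3+": 106}
total_nodes = sum(degree_counts.values())

# Children not accounted for by one-child and two-child nodes
# must belong to the nodes with three or more children.
children_of_high_degree = total_children - 1 * degree_counts["1"] - 2 * degree_counts["2"]
mean_high_degree = children_of_high_degree / degree_counts["3+"]

print(total_children, pos_total, total_nodes)
print(f"{mean_high_degree:.2f}")
```

The resulting mean lies between 3 and the reported maximum of 6, so the figures are mutually consistent.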