home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: ADP

There are 54 ADP lemmas (4%), 131 ADP types (6%) and 754 ADP tokens (11%). Out of 16 observed tags, the rank of ADP is: 5 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: ji, apo, to, keje, _, kae, piji, bogai, ai, koia

The 10 most frequent ADP types: ji, tabo, to, kae, keje, piji, ii, ai, bogai, apo

The 10 most frequent ambiguous lemmas: ji (ADP 253, NOUN 5, ADV 2), apo (ADP 98, ADV 1), to (ADP 56, VERB 26, NOUN 3, ADV 1, X 1), keje (ADP 46, SCONJ 13, ADV 5, NOUN 1), _ (NOUN 201, VERB 142, ADV 84, PUNCT 64, X 56, ADP 44, PRON 42, PROPN 36, DET 10, PART 6, SCONJ 6, CCONJ 2, ADJ 1), kae (ADP 38, PART 2, ADV 1, NOUN 1), bogai (ADP 26, ADV 1), ai (ADP 23, NOUN 1), koia (ADP 18, CCONJ 1, SCONJ 1), tada (ADP 16, NOUN 2)

The 10 most frequent ambiguous types: ji (ADP 203, NOUN 5), tabo (ADP 66, ADV 1), to (ADP 65, VERB 26, NOUN 2, X 1), keje (ADP 29, SCONJ 11), ii (ADP 22, NOUN 1), tada (ADP 16, NOUN 1), koiare (ADP 2, ADV 1, SCONJ 1, VERB 1), koia (ADP 7, SCONJ 1), ae (ADP 4, NOUN 4), dyji (X 15, ADP 4, PART 1, SCONJ 1)

Morphology

The form / lemma ratio of ADP is 2.425926 (the average of all parts of speech is 1.661916).

The 1st highest number of forms (29) was observed with the lemma “_”: ae, aiba, apo, apoba, cedododai, eiato, eigoiare, etododai, gi, inai, itada, iwogai, iwugeje, ji, jire, joki, kae, kaiba, koia, kujiagi, piji, rekodaji, tabo, taboba, tada, to, toce, tuiagajejewy, tuwygeje.

The 2nd highest number of forms (15) was observed with the lemma “apo”: abo, akabo, apo, apore, apowu, cedabo, ebo, itabo, itabowy, pagabo, pudabo, tabo, tabore, tagabo, tapo.

The 3rd highest number of forms (13) was observed with the lemma “ji”: ae, ai, dyji, ei, eji, i, ii, iji, ji, jiba, jiwyji, pudyi, tai.

ADP occurs with 11 features: Number (600; 80% instances), Person (596; 79% instances), Reflex (29; 4% instances), Mood (26; 3% instances), Nomzr (21; 3% instances), Int (13; 2% instances), Clusivity (8; 1% instances), Gender (2; 0% instances), PronType (2; 0% instances), Speech (2; 0% instances), Subord (2; 0% instances)

ADP occurs with 16 feature-value pairs: Clusivity=Ex, Clusivity=In, Gender=Fem, Int=Yes, Mood=Ind, Nomzr=Rel, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Bi, PronType=Rcp, Reflex=Yes, Speech=Ind, Subord=Yes

ADP occurs with 33 feature combinations. The most frequent feature combination is Number=Sing|Person=3 (427 tokens). Examples: ji, to, tabo, kae, keje, bogai, piji, apo, tada, kajeje

Relations

ADP nodes are attached to their parents using 16 different relations: case (537; 71% instances), obl (179; 24% instances), mark (7; 1% instances), discourse (6; 1% instances), acl (4; 1% instances), advcl (3; 0% instances), conj (3; 0% instances), dep (3; 0% instances), root (3; 0% instances), ccomp (2; 0% instances), dislocated (2; 0% instances), compound (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 10 different parts of speech: NOUN (478; 63% instances), VERB (195; 26% instances), PROPN (44; 6% instances), PRON (15; 2% instances), ADV (9; 1% instances), ADP (4; 1% instances), (3; 0% instances), X (3; 0% instances), DET (2; 0% instances), SCONJ (1; 0% instances)

724 (96%) ADP nodes are leaves.

19 (3%) ADP nodes have one child.

7 (1%) ADP nodes have two children.

4 (1%) ADP nodes have three or more children.

The highest child degree of a ADP node is 6.

Children of ADP nodes are attached using 11 different relations: punct (16; 31% instances), nsubj (7; 14% instances), advmod (5; 10% instances), nmod (5; 10% instances), conj (4; 8% instances), parataxis (4; 8% instances), dep (3; 6% instances), obl (3; 6% instances), case (2; 4% instances), advcl (1; 2% instances), obj (1; 2% instances)

Children of ADP nodes belong to 7 different parts of speech: PUNCT (16; 31% instances), NOUN (13; 25% instances), ADV (6; 12% instances), PRON (6; 12% instances), ADP (4; 8% instances), PROPN (4; 8% instances), VERB (2; 4% instances)