This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: ADV

There are 133 ADV lemmas (11%), 161 ADV types (8%) and 681 ADV tokens (10%). Out of 16 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent ADV lemmas: icare, _, oino, pugeje, ty, woe, toro, dykeje, mato, ica

The 10 most frequent ADV types: icare, pugeje, oino, ty, toro, Dykeje, oinore, woe, mato, jii

The 10 most frequent ambiguous lemmas: icare (ADV 104, ADJ 1), _ (NOUN 201, VERB 142, ADV 84, PUNCT 64, X 56, ADP 44, PRON 42, PROPN 36, DET 10, PART 6, SCONJ 6, CCONJ 2, ADJ 1), ty (VERB 66, ADV 22, PRON 8, NOUN 2, X 1), toro (ADV 19, NOUN 1), dykeje (ADV 17, SCONJ 14, X 1), mato (ADV 14, NOUN 1), pemega (VERB 13, NOUN 10, ADV 8, ADJ 1), rugadu (ADV 8, NOUN 1, X 1), kodire (ADV 6, SCONJ 2), ca (ADV 6, INTJ 6, PART 1, X 1)

The 10 most frequent ambiguous types: icare (ADV 75, ADJ 1), pugeje (ADV 51, ADJ 1), ty (ADV 24, PART 2, X 1), toro (ADV 20, NOUN 1), Dykeje (ADV 17, SCONJ 1), mato (ADV 14, NOUN 1), jamedy (ADV 9, PRON 3, NOUN 1, X 1), je (ADV 9, ADP 2, INTJ 1, X 1), kodire (SCONJ 4, ADV 3), rugadu (ADV 8, X 1)

Morphology

The form / lemma ratio of ADV is 1.210526 (the average of all parts of speech is 1.661916).
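The reported ratio can be reproduced from the counts given earlier on this page, assuming it is simply the number of distinct ADV types (161) divided by the number of distinct ADV lemmas (133). The variable names below are illustrative only.

```python
# Counts taken from the summary line of this page.
adv_types = 161   # distinct ADV word forms
adv_lemmas = 133  # distinct ADV lemmas

# Assumption: form / lemma ratio = types / lemmas.
ratio = adv_types / adv_lemmas
print(f"{ratio:.6f}")  # 1.210526
```

A value close to 1 means that most ADV lemmas appear in a single form; the treebank-wide average of 1.661916 is noticeably higher.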

The highest number of forms (47) was observed with the lemma “_”: Dukodi, Dyinody, Dykeje, Dykejere, Dykodie, Kocare, Kode, Kodi, Woie, boekare, boetoji, care, dykaere, dykodi, dytabore, guragare, iadukeje, icai, icare, inoba, jamedy, jaogwai, je, jii, kejeboe, kimore, kodo, koiaie, kuri, kuricigore, marigudu, nono, nonore, oinono, oinore, pugeje, raka, rakakare, remawure, reore, rugadu, toro, tubiji, tuku, ty, woere, woje.

The 2nd highest number of forms (5) was observed with the lemma “keje”: Dykeje, Dykejere, dukeje, kejeba, kejere.

The 3rd highest number of forms (3) was observed with the lemma “inoba”: Noba, inoba, nuba.

ADV occurs with 12 features: Mood (85; 12% instances), AdvType (42; 6% instances), Deixis (42; 6% instances), Polarity (16; 2% instances), PronType (7; 1% instances), Int (5; 1% instances), Nomzr (3; 0% instances), Speech (3; 0% instances), Intens (2; 0% instances), Voice (2; 0% instances), Aspect (1; 0% instances), Tense (1; 0% instances)

ADV occurs with 18 feature-value pairs: AdvType=Loc, AdvType=Man, AdvType=Mod, AdvType=Tim, Aspect=IncProg, Deixis=Med, Deixis=Prox, Deixis=Remt, Int=Yes, Intens=Yes, Mood=Ind, Nomzr=Clau, Nomzr=Rel, Polarity=Neg, PronType=Int, Speech=Ind, Tense=Fut, Voice=Cau

ADV occurs with 21 feature combinations. The most frequent feature combination is _ (499 tokens). Examples: icare, pugeje, ty, oino, Dykeje, mato, jii, jamedy, je, rugadu

Relations

ADV nodes are attached to their parents using 17 different relations: advmod (559; 82% instances), advcl (47; 7% instances), parataxis (13; 2% instances), compound (12; 2% instances), conj (10; 1% instances), root (7; 1% instances), mark (6; 1% instances), ccomp (5; 1% instances), case (4; 1% instances), dep (4; 1% instances), nsubj (4; 1% instances), discourse (3; 0% instances), nmod (2; 0% instances), obl (2; 0% instances), dislocated (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)
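A relation table of this kind can be derived from a CoNLL-U file by counting the DEPREL column over tokens whose UPOS is ADV. The following is a minimal sketch, not the script that generated this page: the function names `relation_counts` and `percentages` are hypothetical, and `SAMPLE` is an invented toy sentence with placeholder forms, not treebank data.

```python
from collections import Counter

def relation_counts(conllu_text, upos="ADV"):
    """Count DEPREL values of tokens whose UPOS equals `upos`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token line
        tok_id, tag, deprel = cols[0], cols[3], cols[7]
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        if tag == upos:
            counts[deprel] += 1
    return counts

def percentages(counts):
    """Convert raw counts to whole-number percentages of the total."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}

# Hypothetical toy input (placeholder forms w1..w4), for illustration only.
SAMPLE = "\n".join([
    "# text = (hypothetical toy sentence)",
    "1\tw1\t_\tADV\t_\t_\t2\tadvmod\t_\t_",
    "2\tw2\t_\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tw3\t_\tADV\t_\t_\t2\tadvcl\t_\t_",
    "4\tw4\t_\tADV\t_\t_\t2\tadvmod\t_\t_",
])

counts = relation_counts(SAMPLE)
print(counts.most_common())  # [('advmod', 2), ('advcl', 1)]
print(percentages(counts))   # {'advmod': 67, 'advcl': 33}
```

Applied to the figures above, the same rounding gives 559 / 681 ≈ 82% for advmod, and the 17 relation counts sum to the 681 ADV tokens.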

Parents of ADV nodes belong to 9 different parts of speech: VERB (520; 76% instances), NOUN (74; 11% instances), ADV (35; 5% instances), PRON (23; 3% instances), PROPN (8; 1% instances), no parent, i.e. root (7; 1% instances), X (7; 1% instances), ADP (6; 1% instances), DET (1; 0% instances)

597 (88%) ADV nodes are leaves.

57 (8%) ADV nodes have one child.

15 (2%) ADV nodes have two children.

12 (2%) ADV nodes have three or more children.

The highest child degree of an ADV node is 7.
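As a quick consistency check, the four child-degree classes above should partition the 681 ADV tokens, and each percentage should follow from rounding the share to a whole number. The sketch below assumes ordinary rounding; the dictionary keys are illustrative labels.

```python
total_adv = 681  # ADV tokens, from the summary line of this page

# Number of ADV nodes by number of children, as reported above.
degree_counts = {"0": 597, "1": 57, "2": 15, "3+": 12}

# The classes are mutually exclusive, so they should cover every ADV token.
print(sum(degree_counts.values()) == total_adv)  # True

# Whole-number percentage of ADV tokens in each class.
shares = {k: round(100 * n / total_adv) for k, n in degree_counts.items()}
print(shares)  # {'0': 88, '1': 8, '2': 2, '3+': 2}
```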

Children of ADV nodes are attached using 19 different relations: punct (37; 28% instances), advmod (16; 12% instances), nsubj (14; 11% instances), compound (13; 10% instances), conj (8; 6% instances), obl (8; 6% instances), ccomp (6; 5% instances), parataxis (6; 5% instances), dep (5; 4% instances), case (4; 3% instances), appos (3; 2% instances), nmod (3; 2% instances), obj (2; 2% instances), cop (1; 1% instances), det (1; 1% instances), discourse (1; 1% instances), dislocated (1; 1% instances), flat (1; 1% instances), nummod (1; 1% instances)

Children of ADV nodes belong to 12 different parts of speech: PUNCT (37; 28% instances), ADV (35; 27% instances), NOUN (19; 15% instances), PRON (12; 9% instances), ADP (9; 7% instances), VERB (8; 6% instances), PROPN (3; 2% instances), X (3; 2% instances), DET (2; 2% instances), AUX (1; 1% instances), NUM (1; 1% instances), PART (1; 1% instances)
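The two breakdowns of children (by relation and by part of speech) describe the same set of nodes, so their totals should agree. The sketch below re-enters the counts from the two lists above and checks this; the variable names are illustrative.

```python
# Children of ADV nodes by incoming relation, as listed above.
by_relation = {
    "punct": 37, "advmod": 16, "nsubj": 14, "compound": 13, "conj": 8,
    "obl": 8, "ccomp": 6, "parataxis": 6, "dep": 5, "case": 4,
    "appos": 3, "nmod": 3, "obj": 2, "cop": 1, "det": 1,
    "discourse": 1, "dislocated": 1, "flat": 1, "nummod": 1,
}

# Children of ADV nodes by part of speech, as listed above.
by_pos = {
    "PUNCT": 37, "ADV": 35, "NOUN": 19, "PRON": 12, "ADP": 9, "VERB": 8,
    "PROPN": 3, "X": 3, "DET": 2, "AUX": 1, "NUM": 1, "PART": 1,
}

total_children = sum(by_relation.values())
print(total_children)                        # 131
print(total_children == sum(by_pos.values()))  # True
print(len(by_relation), len(by_pos))         # 19 12

# Percentages are taken over all children, not over ADV tokens.
print(round(100 * by_relation["punct"] / total_children))  # 28
```

The total of 131 is also compatible with the degree figures: 57 nodes with one child, 15 with two, and 12 with at least three account for at least 57 + 30 + 36 = 123 children.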