home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tupinamba-TuDeT: POS Tags: VERB

There are 295 VERB lemmas (24%), 537 VERB types (27%) and 692 VERB tokens (15%). Out of 14 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: ʔi, _, so, potar, iko, ur, meʔeŋ, awsuβ, mono, suβ

The 10 most frequent VERB types: eʔi, ojaβo, eʔipe, aimono, aso, neʔi, oso, rawsupa, witekoβo, Ejori

The 10 most frequent ambiguous lemmas: ʔi (VERB 66, NOUN 5, DET 1), _ (NOUN 87, VERB 34, PUNCT 12, ADP 9, PRON 9, PROPN 9, PART 6, ADV 5, NUM 2, DET 1, X 1), so (VERB 32, NOUN 16), potar (VERB 28, NOUN 4), iko (NOUN 43, VERB 27, DET 5, ADV 1), ur (VERB 21, NOUN 8), meʔeŋ (VERB 14, NOUN 4), awsuβ (VERB 13, NOUN 12), mono (VERB 8, NOUN 2), suβ (VERB 7, NOUN 1)

The 10 most frequent ambiguous types: oso (NOUN 1, VERB 1), reja (VERB 2, NOUN 1), saʔaŋa (VERB 2, NOUN 1), Aʔe (PRON 2, PART 1, VERB 1), aʔuβ (ADV 1, VERB 1), imoja (NOUN 1, VERB 1), nekɨriri (NOUN 1, VERB 1), oka (NOUN 1, VERB 1), pesema (NOUN 1, VERB 1), peʔaβo (NOUN 1, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.820339 (the average of all parts of speech is 1.577170).

The 1st highest number of forms (34) was observed with the lemma “_”: Amoramwe, Ejemorɨrɨj, Ejotĩ, Ojenomũnomũ, Ojerokɨpe, aimono, asekɨj, ipopwa, moasɨ́aβo, mojɨrõŋatuaβo, momurukatuaβo, mopena, moʔemoʔebo, najejɨj, nesunesupa, oiemoatãmo, ojemomotaβejẽmo, ojemoŋetaŋetaβo, ojmoariβeukar, ojoporuporwaβo, omemwãnamo, opja, oporaseja, osapukaja, pejeaŋune, pepuʔã, petejẽumẽ, sasapa, tapeikuwa, tapeimoaβaíβ, tapejmono, tomonarõ, toroʔekatu, ʔarine.

The 2nd highest number of forms (20) was observed with the lemma “potar”: Aipotakatu, Ajpotaretekatu, Ejpota, Ereipotape, Naipotar, Najpotareʔɨm, Ojpotakatu, Ojpotakatupe, mota, motarete, motaretekatu, naipotari, najpotari, ojoamotareʔɨ̃, oreamotareʔɨm, pota, potara, potarete, potareʔɨma, serupota.

The 3rd highest number of forms (16) was observed with the lemma “iko”: Aiko, Aikoβe, Oiko, ajko, ejkoβo, ereiko, erejko, nerejkoj, nojko, oikoβo, ojkoβo, oroiko, peikoβo, tojko, tojkone, witekoβo.

VERB occurs with 28 features: VerbForm (261; 38% instances), Person (244; 35% instances), Person[subj] (238; 34% instances), Number (198; 29% instances), Voice (110; 16% instances), Person[obj] (103; 15% instances), Rel (101; 15% instances), Mood (92; 13% instances), Number[subj] (68; 10% instances), Reflex (55; 8% instances), Polarity (43; 6% instances), Clusivity (41; 6% instances), Intens (31; 4% instances), Int (29; 4% instances), Aspect (27; 4% instances), Tense (18; 3% instances), Corf (12; 2% instances), Red (10; 1% instances), Number[obj] (8; 1% instances), Incorp (7; 1% instances), Case (4; 1% instances), Priv (4; 1% instances), Foc (3; 0% instances), Nomzr (3; 0% instances), Person[psor] (3; 0% instances), Animacy (2; 0% instances), Hum (2; 0% instances), Recip (2; 0% instances)

VERB occurs with 50 feature-value pairs: Animacy=Hum, Aspect=Compl, Aspect=Iter, Aspect=Lus, Case=Loc, Clusivity=Ex, Clusivity=In, Corf=Yes, Foc=Yes, Hum=Yes, Incorp=Yes, Int=Yes, Intens=Yes, Mood=Cnd, Mood=Imp, Mood=Per, Mood=Sub, Nomzr=Circ, Nomzr=Rel, Number=Plur, Number=Sing, Number[obj]=Sing, Number[subj]=Plur, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[obj]=1, Person[obj]=2, Person[obj]=3, Person[psor]=1, Person[psor]=2, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, Priv=Yes, Recip=Yes, Red=Di, Red=Mo, Reflex=Yes, Rel=Cont, Rel=NCont, Tense=Fut, Tense=Past, VerbForm=Ger, Voice=Cau, Voice=Mid, Voice=Rcp, Voice=SCau

VERB occurs with 272 feature combinations. The most frequent feature combination is VerbForm=Ger (44 tokens). Examples: ojaβo, meʔeŋa, pota, momewaβo, mota, supa, rawsupa, witekoβo, moŋetaβo, ojkoβo

Relations

VERB nodes are attached to their parents using 12 different relations: root (313; 45% instances), advcl (171; 25% instances), parataxis (106; 15% instances), xcomp (36; 5% instances), dep (22; 3% instances), conj (16; 2% instances), obl (9; 1% instances), ccomp (7; 1% instances), discourse (6; 1% instances), nmod (3; 0% instances), obj (2; 0% instances), dislocated (1; 0% instances)

Parents of VERB nodes belong to 4 different parts of speech: (313; 45% instances), VERB (277; 40% instances), NOUN (101; 15% instances), SCONJ (1; 0% instances)

85 (12%) VERB nodes are leaves.

152 (22%) VERB nodes have one child.

160 (23%) VERB nodes have two children.

295 (43%) VERB nodes have three or more children.

The highest child degree of a VERB node is 18.

Children of VERB nodes are attached using 22 different relations: punct (508; 29% instances), obl (258; 15% instances), obj (200; 12% instances), advmod (152; 9% instances), advcl (151; 9% instances), parataxis (112; 6% instances), discourse (89; 5% instances), nsubj (82; 5% instances), xcomp (40; 2% instances), dep (33; 2% instances), conj (28; 2% instances), ccomp (19; 1% instances), nmod (18; 1% instances), case (11; 1% instances), vocative (11; 1% instances), dislocated (7; 0% instances), appos (5; 0% instances), acl (3; 0% instances), mark (3; 0% instances), cc (2; 0% instances), compound (2; 0% instances), iobj (2; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (508; 29% instances), PUNCT (508; 29% instances), VERB (277; 16% instances), ADV (165; 10% instances), PART (80; 5% instances), PRON (65; 4% instances), ADP (52; 3% instances), PROPN (45; 3% instances), DET (16; 1% instances), INTJ (13; 1% instances), SCONJ (3; 0% instances), CCONJ (2; 0% instances), NUM (1; 0% instances), X (1; 0% instances)