Treebank Statistics: UD_Nheengatu-CompLin: POS Tags: VERB
There are 522 VERB lemmas (27%), 1202 VERB types (40%) and 4287 VERB tokens (16%).
Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 3 in number of tokens.
The 10 most frequent VERB lemmas: sú, munhã, nheẽ, maã, rikú, sika, putari, yuri, mbeú, pisika
The 10 most frequent VERB types: unheẽ, usú, usika, umaã, umunhã, urikú, upitá, upisika, umbeú, uri
The 10 most frequent ambiguous lemmas: sú (VERB 238, AUX 215), maã (PRON 161, VERB 159, NOUN 30, DET 27, PART 25), rikú (VERB 126, NOUN 1), putari (VERB 81, AUX 79), yuíri (VERB 52, ADV 44, CCONJ 26, AUX 1), kwáu (AUX 58, VERB 48), suaxara (VERB 33, NOUN 13), katú (ADV 41, VERB 23, ADJ 20, PART 5), purakí (VERB 21, NOUN 1), kwá (DET 179, PRON 48, ADV 20, VERB 19, AUX 2)
The 10 most frequent ambiguous types: usú (VERB 111, AUX 74), urikú (VERB 56, AUX 1), xasú (VERB 22, AUX 4), asú (AUX 30, VERB 15), katú (ADV 41, VERB 23, ADJ 20, PART 5), resú (VERB 15, AUX 5), yasú (AUX 16, VERB 11), pusé (VERB 11, ADJ 1), sasí (VERB 7, NOUN 2), sakú (VERB 7, ADJ 2, ADV 1)
- usú
- urikú
- xasú
- asú
- katú
- resú
- yasú
- pusé
- sasí
- sakú
Morphology
The form / lemma ratio of VERB is 2.302682 (the average of all parts of speech is 1.529412).
The 1st highest number of forms (16) was observed with the lemma “sú”: Asuwara, Ekũi, Rekũi, asú, hasú, ikũi, isú, kũi, pasú, pesú, resú, tausú, usutiwa, usú, xasú, yasú.
The 2nd highest number of forms (12) was observed with the lemma “munhã”: amunhã, hamunhã, munhã, pemunhã, remunhã, tamunhã, taumunhã, umunhã, uyumunhã, xamunhã, yamunhã, yumunhã.
The 3rd highest number of forms (11) was observed with the lemma “manduári”: Amanduariwara, amanduári, hamanduarisawa, manduári, pemanduarisawa, pemanduári, remanduarisawa, remanduári, umanduári, yamanduarisawa, yamanduári.
VERB occurs with 13 features: VerbForm (4287; 100% instances), Mood (4177; 97% instances), Person (4007; 93% instances), Number (1683; 39% instances), Style (403; 9% instances), Rel (115; 3% instances), Voice (54; 1% instances), Typo (49; 1% instances), Red (20; 0% instances), Aspect (13; 0% instances), Degree (2; 0% instances), Tense (2; 0% instances), Derivation (1; 0% instances)
VERB occurs with 24 feature-value pairs: Aspect=Freq, Aspect=Hab, Aspect=Iter, Degree=Aug, Derivation=Priv, Mood=Imp, Mood=Imp,Ind, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Red=Yes, Rel=Cont, Rel=NCont, Style=Arch, Style=Rare, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Inf, VerbForm=Vnoun, Voice=Mid,Pass
VERB occurs with 64 feature combinations.
The most frequent feature combination is Mood=Ind|Person=3|VerbForm=Fin (2236 tokens).
Examples: unheẽ, usú, usika, umaã, umunhã, urikú, upitá, upisika, umbeú, uri
Relations
VERB nodes are attached to their parents using 14 different relations: root (2193; 51% instances), parataxis (751; 18% instances), advcl (503; 12% instances), ccomp (309; 7% instances), acl:relcl (231; 5% instances), xcomp (130; 3% instances), conj (98; 2% instances), advcl:relcl (19; 0% instances), acl (18; 0% instances), csubj (17; 0% instances), obj (7; 0% instances), obl (7; 0% instances), nsubj (3; 0% instances), nmod:poss (1; 0% instances)
Parents of VERB nodes belong to 10 different parts of speech: (2193; 51% instances), VERB (1696; 40% instances), NOUN (162; 4% instances), PRON (128; 3% instances), ADJ (50; 1% instances), ADV (34; 1% instances), PART (19; 0% instances), NUM (2; 0% instances), PROPN (2; 0% instances), ADP (1; 0% instances)
80 (2%) VERB nodes are leaves.
322 (8%) VERB nodes have one child.
766 (18%) VERB nodes have two children.
3119 (73%) VERB nodes have three or more children.
The highest child degree of a VERB node is 10.
Children of VERB nodes are attached using 30 different relations: punct (3516; 24% instances), advmod (2686; 18% instances), nsubj (1929; 13% instances), obj (1540; 10% instances), obl (1393; 9% instances), parataxis (736; 5% instances), mark (556; 4% instances), aux (546; 4% instances), advcl (523; 4% instances), ccomp (411; 3% instances), iobj (208; 1% instances), xcomp (188; 1% instances), expl (137; 1% instances), conj (100; 1% instances), vocative (92; 1% instances), cc (73; 0% instances), dislocated (45; 0% instances), discourse (21; 0% instances), case (10; 0% instances), goeswith (5; 0% instances), acl:relcl (3; 0% instances), csubj (3; 0% instances), nmod (3; 0% instances), dep (2; 0% instances), det (2; 0% instances), acl (1; 0% instances), advcl:relcl (1; 0% instances), amod (1; 0% instances), compound (1; 0% instances), nmod:poss (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: PUNCT (3516; 24% instances), NOUN (3253; 22% instances), PRON (1949; 13% instances), VERB (1696; 12% instances), PART (1589; 11% instances), ADV (1194; 8% instances), SCONJ (550; 4% instances), AUX (546; 4% instances), PROPN (178; 1% instances), ADJ (79; 1% instances), CCONJ (73; 0% instances), ADP (47; 0% instances), INTJ (36; 0% instances), DET (12; 0% instances), NUM (9; 0% instances), X (6; 0% instances)