
This page pertains to UD version 2.

Treebank Statistics: UD_Bambara-CRB: POS Tags: VERB

There are 275 VERB lemmas (26%), 292 VERB types (26%) and 2498 VERB tokens (18%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: kó, tagara, tága, kɛ́, bɛ́, _, dí, fɔ́, nà, yé

The 10 most frequent VERB types: ko, tagara, taga, kè, bè, na, ye, fò, di, nana

The 10 most frequent ambiguous lemmas: (PART 358, VERB 228, SCONJ 25, NOUN 8), tága (VERB 119, NOUN 1), kɛ́ (VERB 95, NOUN 1), bɛ́ (VERB 82, DET 1), _ (NOUN 94, VERB 71, PUNCT 63, PRON 60, ADP 44, PART 37, DET 12, ADJ 9, PROPN 9, ADV 8, CCONJ 6, SCONJ 3, NUM 2), (VERB 70, NOUN 14), (ADP 171, VERB 68), sɔ̀rɔ (VERB 59, NOUN 1), dún (VERB 38, PART 22, NOUN 1), (VERB 29, NOUN 25)

The 10 most frequent ambiguous types: ko (PART 388, VERB 249, SCONJ 25, NOUN 9), taga (VERB 120, NOUN 1), (AUX 234, VERB 71, DET 2), na (ADP 131, VERB 71, NOUN 15, AUX 4), ye (AUX 372, ADP 181, VERB 71), di (VERB 69, PRON 2), sòrò (VERB 56, NOUN 1), to (VERB 52, NOUN 2), ta (VERB 43, NOUN 22, INTJ 16), dun (VERB 39, PART 33, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.061818 (the average of all parts of speech is 1.060207).
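The ratio above appears to be the number of distinct VERB types divided by the number of distinct VERB lemmas. The following is a minimal sketch under that assumption; the variable names are illustrative, and the formula is inferred from the counts reported on this page rather than taken from the tool that generated it.

```python
# Counts copied from this page. The formula itself is an assumption
# that happens to reproduce the published figure.
verb_types = 292   # distinct VERB word forms
verb_lemmas = 275  # distinct VERB lemmas

form_lemma_ratio = verb_types / verb_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 1.061818
```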

The highest number of forms (47) was observed with the lemma “_”: ban, banna, bila, bɔ, bɛ, da, delila, di, don, donna, dun, dɔn, fara, feere, fililen, fiyɛ, fɔ, gɛlɛya, kan, ko, kologɛlɛya, kɔlɔsira, kɛ, laban, lafilila, latilen, mɔsɔn, na, san, se, selen, sigi, surunyana, sɔn, t’, ta, taa, taga, tagara, tigɛ, tlala, to, tɛmɛ, wele, wulila, yaala, ye.

The 2nd highest number of forms (3) was observed with the lemma “bɛ́”: be, bè, bɛ.

The 3rd highest number of forms (3) was observed with the lemma “dòn”: don, dò, dòn.

VERB occurs with 5 features: Polarity (570; 23% instances), Aspect (490; 20% instances), Valency (448; 18% instances), VerbForm (37; 1% instances), Voice (15; 1% instances)

VERB occurs with 8 feature-value pairs: Aspect=Imp, Aspect=Perf, Aspect=Prog, Polarity=Neg, Polarity=Pos, Valency=1, VerbForm=Part, Voice=Cau

VERB occurs with 9 feature combinations. The most frequent feature combination is _ (1876 tokens). Examples: ko, taga, kè, na, fò, di, sòrò, to, se, ta

Relations

VERB nodes are attached to their parents using 16 different relations: root (966; 39% instances), xcomp (547; 22% instances), parataxis (434; 17% instances), parataxis:obj (222; 9% instances), advcl (162; 6% instances), ccomp (79; 3% instances), acl (69; 3% instances), amod (4; 0% instances), conj (4; 0% instances), reparandum (3; 0% instances), discourse (2; 0% instances), fixed (2; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances), nmod:poss (1; 0% instances), nsubj (1; 0% instances)
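The percentages above are consistent with each relation count divided by the 2498 VERB tokens and rounded to the nearest integer. A minimal sketch of that arithmetic follows, assuming ordinary rounding; the dictionary holds only the seven most frequent relations, and the names are illustrative.

```python
# Counts copied from this page; the rounding rule is an assumption.
total_verb_tokens = 2498

relation_counts = {
    "root": 966,
    "xcomp": 547,
    "parataxis": 434,
    "parataxis:obj": 222,
    "advcl": 162,
    "ccomp": 79,
    "acl": 69,
}

# Share of VERB tokens attached with each relation, as a whole percent.
shares = {
    rel: round(100 * count / total_verb_tokens)
    for rel, count in relation_counts.items()
}

for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The rarer relations (4 tokens or fewer each) all round down to 0%, which is why they appear as "0% instances" in the list.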

Parents of VERB nodes belong to 9 different parts of speech (counting the artificial root, which has no tag, as one): VERB (1480; 59% instances), root with no tag (966; 39% instances), PRON (20; 1% instances), NOUN (12; 0% instances), ADV (11; 0% instances), PROPN (5; 0% instances), AUX (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

17 (1%) VERB nodes are leaves.

90 (4%) VERB nodes have one child.

368 (15%) VERB nodes have two children.

2023 (81%) VERB nodes have three or more children.

The highest child degree of a VERB node is 10.
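The child-degree figures above could be recomputed from a CoNLL-U file roughly as follows. This is a sketch, not the script that produced this page: the function names are hypothetical, the parser handles only the standard 10-column CoNLL-U layout, and the sample sentence is invented toy data rather than text from the treebank.

```python
from collections import Counter


def verb_child_degrees(conllu_text):
    """Return the number of children of every VERB node, in file order."""
    degrees = []
    for block in conllu_text.strip().split("\n\n"):
        upos = {}             # token id -> UPOS tag
        children = Counter()  # head id -> number of dependents
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            # Skip multiword-token ranges (1-2) and empty nodes (1.1).
            if len(cols) != 10 or not cols[0].isdigit():
                continue
            upos[cols[0]] = cols[3]   # column 4 is UPOS
            children[cols[6]] += 1    # column 7 is HEAD
        degrees.extend(children[i] for i, tag in upos.items() if tag == "VERB")
    return degrees


def bucket(degrees):
    """Group degrees into the four buckets reported on this page."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    return Counter(labels.get(d, "three or more children") for d in degrees)


# Invented toy sentence (not from UD_Bambara-CRB): token 2 is a VERB with
# four children, token 4 is a VERB with none.
rows = [
    ("1", "a", "a", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "b", "b", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "c", "c", "NOUN", "_", "_", "2", "obj", "_", "_"),
    ("4", "d", "d", "VERB", "_", "_", "2", "xcomp", "_", "_"),
    ("5", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

degrees = verb_child_degrees(sample)
print(degrees)          # [4, 0]
print(bucket(degrees))
print(max(degrees))     # highest child degree in the sample: 4
```

As a consistency check on the figures above, the four buckets sum to 17 + 90 + 368 + 2023 = 2498, the total number of VERB tokens.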

Children of VERB nodes are attached using 25 different relations: nsubj (1847; 20% instances), punct (1608; 17% instances), aux (1376; 15% instances), obl (1118; 12% instances), obj (1044; 11% instances), xcomp (549; 6% instances), discourse (529; 6% instances), parataxis (442; 5% instances), parataxis:obj (281; 3% instances), mark (192; 2% instances), advcl (164; 2% instances), vocative (109; 1% instances), ccomp (79; 1% instances), dislocated (43; 0% instances), acl (36; 0% instances), advmod (23; 0% instances), cc (5; 0% instances), reparandum (5; 0% instances), case (4; 0% instances), nmod:poss (4; 0% instances), conj (3; 0% instances), dep (3; 0% instances), orphan (3; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: PRON (2031; 21% instances), NOUN (1870; 20% instances), PUNCT (1608; 17% instances), VERB (1480; 16% instances), AUX (1381; 15% instances), PART (452; 5% instances), ADV (179; 2% instances), SCONJ (177; 2% instances), PROPN (144; 2% instances), INTJ (111; 1% instances), NUM (19; 0% instances), CCONJ (7; 0% instances), ADP (5; 0% instances), ADJ (4; 0% instances), DET (1; 0% instances)