
This page pertains to UD version 2.

Treebank Statistics: UD_Sindhi-Isra: POS Tags: VERB

There are 798 VERB lemmas (14%), 1984 VERB types (18%) and 13093 VERB tokens (14%). Among the 15 observed tags, VERB ranks 3rd by number of lemmas, 2nd by number of types and 3rd by number of tokens.

The 10 most frequent VERB lemmas: ڪر, ويو, ڪيو, آهي, وڃ, چئو, پيو, ڪن, اچ, ره

The 10 most frequent VERB types: ڪري, چيو, ڪرڻ, ويو, ڪيو, وڃي, ڪئي, اچي, ويا, پيو

The 10 most frequent ambiguous lemmas: ڪيو (VERB 529, PROPN 10, NOUN 1), آهي (AUX 4663, VERB 499, PRON 4), چئو (VERB 448, ADJ 3, NOUN 1), ڪن (VERB 322, NOUN 13), ڏي (VERB 187, ADP 4, ADV 1), _ (NOUN 3850, PROPN 838, VERB 185, ADJ 66, NUM 58, ADV 30, ADP 24, PART 24, PUNCT 22, PRON 17, AUX 16, SCONJ 9, DET 8, INTJ 2, CCONJ 1), لڳ (VERB 171, ADJ 5, NOUN 3, ADP 2, ADV 1), ڏنو (VERB 158, NOUN 2, PROPN 1), ڏس (VERB 152, NOUN 9), رک (VERB 128, NOUN 7)

The 10 most frequent ambiguous types: ڪري (VERB 568, ADP 63), چيو (VERB 411, NOUN 1), ڪيو (VERB 331, PROPN 10, NOUN 1), ڏنو (VERB 116, NOUN 2, PROPN 1), ٿيڻ (VERB 87, AUX 1), پيا (VERB 78, NOUN 2), ٿئي (VERB 72, AUX 19), هوندو (VERB 64, AUX 16), ڪن (VERB 60, DET 11, NOUN 10), ملي (VERB 54, ADV 1)
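The ambiguity lists above can be thought of as per-lemma (or per-type) tag counts, keeping only the entries that occur with more than one tag. The following is a minimal sketch of that idea, not the script that generated this page; the function names and the placeholder data (`lemma_a`, `lemma_b`) are hypothetical.

```python
from collections import Counter, defaultdict

def pos_distribution(pairs):
    """Count how often each key (lemma or word form) occurs with each UPOS tag."""
    dist = defaultdict(Counter)
    for key, upos in pairs:
        dist[key][upos] += 1
    return dist

def ambiguous(dist):
    """Keep only the keys observed with more than one UPOS tag."""
    return {key: tags for key, tags in dist.items() if len(tags) > 1}

# Hypothetical toy data: (lemma, UPOS) pairs as they might be read from a CoNLL-U file.
toy_pairs = [
    ("lemma_a", "VERB"),
    ("lemma_a", "VERB"),
    ("lemma_a", "NOUN"),
    ("lemma_b", "VERB"),
]

dist = pos_distribution(toy_pairs)
amb = ambiguous(dist)
for key, tags in amb.items():
    listing = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    print(f"{key} ({listing})")
```

Passing (word form, UPOS) pairs instead of (lemma, UPOS) pairs would give the corresponding list of ambiguous types.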

Morphology

The form / lemma ratio of VERB is 2.486216 (the average of all parts of speech is 1.872520).
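As a quick check, the ratio can be reproduced from the counts reported at the top of this page, assuming it is defined as the number of distinct VERB types divided by the number of distinct VERB lemmas. That definition is an assumption, and the function name is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas

# Counts taken from this page: 1984 VERB types, 798 VERB lemmas.
verb_ratio = form_lemma_ratio(1984, 798)
print(f"{verb_ratio:.6f}")
```

Under this assumption the result agrees with the reported value to six decimal places.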

The 1st highest number of forms (159) was observed with the lemma “_”: Kill, آيوآهي, آيوآهيان, آيوآهين, ايندين, اُٿيو, اٿندا, بچاءِ, ترندي, تري, جاڳ, جاڳيو, جلدي, جنبي, حاصل, ذميوارآهن, رسائي, رنو, رهائڻ, رهندينءَ, رهه, رهين, رُلن, رُنو, رڙيو, زور, ساراهيو, سانگي, ساهي, سبب, سجمهڻ, سـُڪي, سمهندو, سنواريون, سوار, سينگارڻ, سيکاريون, سڙن, طئه, قرارڏئي, لاتو, لاڳاپيل, ليلائيندي, ليلائڻ, ليٽيا, لڙڪي, لکنديس, لڪايو, مارو, ماريندا, ماڻس, ملتوي, ملنداسون, ملندس, منتقل, مڙي, مڪسل, نچندو, نڪرڻ, هئي, هارائڻ, هنڊايو, هوئي, واري, ورتائين, وهنجاريو, وهنجندي, ويوهو, وٺندومانس, وڄائي, وڄايون, وڍڻ, وڪوڙي, وڪڻڻ, ياد, ٺهرائي, ٻاھر, ٻڌاءَ, ٻڌائيندس, ٻڌائيندين, ٻڌايوته, ٻڌندئي, ٽـُڪي, ٽـُڪڻ, ٽڙيل, پاءِ, پائيندو, پرن, پهچين, پويان, پياري, پياريون, پيٺل, پچائيندي, پڇندو, پڇين, پڙهجي, پڙهندو, پڙهون, ٿائو, ٿوڏنو, ڀاڱي, ڀرجي, ڀريا, ڀيري, ڀيڻ, ڀڄائي, ڀڄبو, ڄمي, چـَندا, چيتو, چيوته, چيومانس, چٽڻ, ڇرڪي, ڇـُهيو, ڇندا, ڇهي, ڇُهڻ, ڇڏينداسين, ڊوڙن, ڊوڙندي, ڊوڙيون, ڊگهيريون, ڌوئندي, ڍڪايو, ڍڪيان, ڍڪڻ, ڏسجانءِ, ڏٺل, ڏکڻ, ڏڪڻي, ڦٻئون, ڦٻائي, ڦٽائي, کاڌا, کـڻ, کلندو, کلون, کلين, کينو, کٽن, کڙڪائيندو, ڪائو, ڪاهڻ, ڪنڙ, ڪهدو, ڪوڙ, ڪيٻائين, ڪڙمي, گامزن, گجندي, گهرائي, گهربل, گهرندينءَ, گهليندا, گهمندا, ڳالهائينديمانوَ, ۾ملهايو.

The 2nd highest number of forms (30) was observed with the lemma “آهي”: آهيان, آهيو, اٿئي, هئائون, هئم, هئڻ, هوندا, هوندي, هونديون, هياسين, ٿي, ٿيئي, ٿيا, ٿيل, ٿيم, ٿين, ٿيندا, ٿينداسون, ٿينداسين, ٿيندس, ٿيندو, ٿيندي, ٿينديس, ٿيندين, ٿينديون, ٿيون, ٿيڻ, ٿيڻي, ٿِي, ھئڻ.

The 3rd highest number of forms (21) was observed with the lemma “ڏي”: ڏبا, ڏبيون, ڏي, ڏين, ڏيندا, ڏينداسين, ڏيندس, ڏيندوسانءِ, ڏيندي, ڏينديس, ڏيندين, ڏينديون, ڏينس, ڏينم, ڏيو, ڏيون, ڏيکاريو, ڏيکاريون, ڏيڻ, ڏيڻو, ڏيڻيون.

VERB occurs with 15 features: Aspect (12909; 99% instances), Number (7841; 60% instances), Gender (4491; 34% instances), Person (4307; 33% instances), VerbForm (3402; 26% instances), Voice (3222; 25% instances), Tense (343; 3% instances), Person[subj] (82; 1% instances), Number[subj] (76; 1% instances), Number[obj] (36; 0% instances), Person[obj] (33; 0% instances), Gender[subj] (23; 0% instances), Case (18; 0% instances), Gender[obj] (11; 0% instances), Mood (3; 0% instances)
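The percentages above appear to be shares of all 13093 VERB tokens, rounded to the nearest whole percent, which is why rare features such as Mood show up as 0%. A small sketch under that assumption follows; the helper name is hypothetical and the exact rounding rule used by the page generator is not stated.

```python
def share_percent(count, total):
    """Share of `total` as a whole percent, rounding halves up (assumed rule)."""
    return int(100 * count / total + 0.5)

VERB_TOKENS = 13093  # VERB token count reported on this page

# A few feature counts copied from the list above.
feature_counts = {"Aspect": 12909, "Number": 7841, "Tense": 343, "Mood": 3}

shares = {name: share_percent(n, VERB_TOKENS) for name, n in feature_counts.items()}
for name, pct in shares.items():
    print(f"{name}: {pct}%")
```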

VERB occurs with 35 feature-value pairs: Aspect=Imp, Aspect=Perf, Aspect=Prog, Case=Acc, Case=Nom, Gender=Fem, Gender=Masc, Gender[obj]=Fem, Gender[obj]=Masc, Gender[subj]=Fem, Gender[subj]=Masc, Mood=Sub, Number=Plur, Number=Sing, Number[obj]=Plur, Number[obj]=Sing, Number[subj]=Plur, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[obj]=1, Person[obj]=2, Person[obj]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Inf, VerbForm=Vnoun, Voice=Act, Voice=Pass

VERB occurs with 312 feature combinations. The most frequent feature combination is Aspect=Imp|VerbForm=Inf (1543 tokens). Examples: ڪرڻ, ٿيڻ, ڏيڻ, چوڻ, وڃڻ, ڏسڻ, اچڻ, کائڻ, رهڻ, رکڻ
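Feature combinations such as `Aspect=Imp|VerbForm=Inf` follow the CoNLL-U FEATS format: `Name=Value` pairs joined by `|`, with `_` for an empty feature set. The sketch below shows one way to split such strings and count both whole combinations and individual features. The sample strings are hypothetical toy data, not treebank counts.

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict ('_' means no features)."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values of four VERB tokens.
sample = [
    "Aspect=Imp|VerbForm=Inf",
    "Aspect=Perf|Gender=Masc|Number=Sing",
    "Aspect=Imp|VerbForm=Inf",
    "_",
]

combo_counts = Counter(sample)  # whole combinations
feature_counts = Counter(name for feats in sample for name in parse_feats(feats))

print(combo_counts.most_common(1))
print(dict(feature_counts))
```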

Relations

VERB nodes are attached to their parents using 21 different relations: root (4432; 34% instances), advcl (3688; 28% instances), compound (2456; 19% instances), conj (808; 6% instances), xcomp (533; 4% instances), acl (418; 3% instances), nmod (357; 3% instances), ccomp (185; 1% instances), obl (91; 1% instances), amod (47; 0% instances), nsubj (34; 0% instances), acl:relcl (23; 0% instances), parataxis (7; 0% instances), compound:redup (4; 0% instances), obj (3; 0% instances), discourse (2; 0% instances), appos (1; 0% instances), case (1; 0% instances), dislocated (1; 0% instances), iobj (1; 0% instances), vocative (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (6873; 52% instances), no parent, i.e. the VERB is the sentence root (4432; 34% instances), NOUN (1243; 9% instances), ADJ (266; 2% instances), AUX (154; 1% instances), DET (41; 0% instances), PRON (37; 0% instances), ADV (15; 0% instances), NUM (15; 0% instances), PROPN (15; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances)

2865 (22%) VERB nodes are leaves.

1191 (9%) VERB nodes have one child.

1199 (9%) VERB nodes have two children.

7838 (60%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.
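The child-degree figures above can be derived from the HEAD column of a CoNLL-U file: the degree of a node is the number of tokens whose HEAD points to it. The sketch below illustrates this on a hypothetical five-token sentence; the function names and toy values are assumptions, and it is not the script that produced this page.

```python
from collections import Counter

def child_degrees(heads):
    """Number of children per token; heads are 1-based token ids, 0 marks the root."""
    counts = Counter(heads)
    return [counts.get(i, 0) for i in range(1, len(heads) + 1)]

def bucket(degree):
    """Map a child degree to the four classes reported on this page."""
    return ("leaf", "one", "two")[degree] if degree < 3 else "three_or_more"

# Hypothetical sentence: token 2 is the root verb with three children.
toy_heads = [2, 0, 2, 5, 2]
toy_upos = ["NOUN", "VERB", "NOUN", "ADP", "NOUN"]

degrees = child_degrees(toy_heads)
verb_degrees = [d for d, upos in zip(degrees, toy_upos) if upos == "VERB"]
verb_buckets = Counter(bucket(d) for d in verb_degrees)

print(degrees)
print(dict(verb_buckets))
```

The four class counts reported above (2865, 1191, 1199 and 7838) add up to 13093, the total number of VERB tokens.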

Children of VERB nodes are attached using 33 different relations: obl (6444; 15% instances), punct (5136; 12% instances), nsubj (5084; 12% instances), obj (4188; 10% instances), compound (4050; 10% instances), advcl (3518; 8% instances), mark (3093; 7% instances), aux (2721; 7% instances), advmod (2479; 6% instances), xcomp (1392; 3% instances), conj (831; 2% instances), cc (803; 2% instances), iobj (505; 1% instances), dep (376; 1% instances), ccomp (222; 1% instances), det (118; 0% instances), nmod (110; 0% instances), advmod:emph (93; 0% instances), discourse (81; 0% instances), nsubj:pass (76; 0% instances), vocative (52; 0% instances), amod (49; 0% instances), dislocated (45; 0% instances), case (31; 0% instances), parataxis (31; 0% instances), acl (26; 0% instances), cop (23; 0% instances), nmod:poss (5; 0% instances), nummod (5; 0% instances), compound:redup (4; 0% instances), acl:relcl (2; 0% instances), csubj (1; 0% instances), flat (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (14492; 35% instances), VERB (6873; 17% instances), PUNCT (5136; 12% instances), AUX (2835; 7% instances), ADV (2453; 6% instances), SCONJ (1846; 4% instances), DET (1689; 4% instances), ADP (1375; 3% instances), PRON (1256; 3% instances), ADJ (1050; 3% instances), PROPN (1046; 3% instances), CCONJ (802; 2% instances), PART (600; 1% instances), NUM (83; 0% instances), INTJ (59; 0% instances)