home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: VERB

There are 398 VERB lemmas (3%), 1004 VERB types (8%) and 12695 VERB tokens (9%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: کر، ہو، دے، کہہ، ہے، لے، بتا، آ، رکھ، رہ

The 10 most frequent VERB types: کیا، کہا، کر، کرنے، کی، ہو، دیا، کرتے، ہے، دی

The 10 most frequent ambiguous lemmas: کر (VERB 4173, AUX 295, ADP 8, PRON 4, PROPN 2, DET 1, NOUN 1, PART 1), ہو (VERB 1548, AUX 693, NOUN 2, ADJ 1, ADP 1), دے (VERB 1161, PROPN 4, NOUN 2), کہہ (VERB 943, NOUN 1), ہے (AUX 3549, VERB 447, PROPN 4, ADP 1, PRON 1, PUNCT 1), لے (VERB 368, ADP 3, PROPN 2, PART 1), آ (VERB 307, PROPN 23, PRON 2, ADJ 1, CCONJ 1, NOUN 1), رکھ (VERB 283, PROPN 1), رہ (AUX 652, VERB 279, PROPN 1), بنا (VERB 211, ADP 4, PART 2, NOUN 1)

The 10 most frequent ambiguous types: کیا (VERB 986, PRON 21, DET 12, AUX 3, PART 1, PROPN 1), کر (VERB 778, AUX 230, PART 1), کرنے (VERB 641, AUX 7), کی (ADP 3558, VERB 634, PROPN 5, AUX 2, NOUN 1), ہو (VERB 602, AUX 15), کرتے (VERB 436, AUX 13), ہے (AUX 2436, VERB 312, PROPN 2, PUNCT 1), دی (VERB 262, PROPN 3), کیے (VERB 141, AUX 1), کریں (VERB 137, PROPN 1)

Morphology

The form / lemma ratio of VERB is 2.522613 (the average of all parts of speech is 1.103404).

The 1st highest number of forms (32) was observed with the lemma “کر”: کئے, کر, کرا, کرتا, کرتی, کرتیں, کرتے, کرنا, کرنی, کرنے, کرو, کروا, کروں, کروںگا, کرکے, کرینگے, کریگا, کریگی, کریں, کریں_گی, کریں_گے, کریںگے, کرے, کرےنگے, کرےگا, کرےگی, کہا, کی, کیا, کیں, کیے, گے.

The 2nd highest number of forms (26) was observed with the lemma “دے”: دئیے, دئے, دلاکر, دی, دیئے, دیا, دیتا, دیتی, دیتے, دیجئے, دیدی, دینا, دینی, دینے, دیکر, دیں, دیں_گی, دیں_گے, دیںگی, دیںگے, دیے, دے, دےکر, دےگا, دےگی, کر.

The 3rd highest number of forms (21) was observed with the lemma “ہو”: ہو, ہوئ, ہوئہی, ہوئی, ہوئیں, ہوئے, ہوا, ہوتا, ہوتی, ہوتیں, ہوتے, ہونا, ہونگے, ہونی, ہونے, ہوکر, ہوگا, ہوگی, ہوں, ہوں_گی, ہوں_گے.

VERB occurs with 11 features: VerbForm (10132; 80% instances), Voice (8711; 69% instances), Number (8539; 67% instances), Gender (7550; 59% instances), Aspect (7067; 56% instances), Person (1670; 13% instances), Case (1611; 13% instances), Mood (1051; 8% instances), Tense (779; 6% instances), Polite (107; 1% instances), Echo (2; 0% instances)

VERB occurs with 26 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Nom, Echo=Rdp, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Polite=Infm, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 326 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|VerbForm=Part|Voice=Act (2350 tokens). Examples: کہا، کیا، بتایا، دیا، ہوا، آیا، رہا، بنایا، لیا، رکھا

Relations

VERB nodes are attached to their parents using 17 different relations: root (4521; 36% instances), advcl (2190; 17% instances), conj (1285; 10% instances), obj (1185; 9% instances), acl:relcl (698; 5% instances), compound (680; 5% instances), nmod (582; 5% instances), acl (451; 4% instances), amod (397; 3% instances), ccomp (280; 2% instances), xcomp (204; 2% instances), nsubj (139; 1% instances), dep (57; 0% instances), obl (14; 0% instances), iobj (9; 0% instances), mark (2; 0% instances), case (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (5730; 45% instances), (4521; 36% instances), NOUN (1753; 14% instances), PRON (377; 3% instances), PROPN (148; 1% instances), ADJ (138; 1% instances), ADV (13; 0% instances), DET (4; 0% instances), NUM (4; 0% instances), PART (3; 0% instances), ADP (2; 0% instances), AUX (2; 0% instances)

762 (6%) VERB nodes are leaves.

300 (2%) VERB nodes have one child.

1422 (11%) VERB nodes have two children.

10211 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 17.

Children of VERB nodes are attached using 26 different relations: obl (9543; 17% instances), compound (8293; 15% instances), aux (7879; 14% instances), nsubj (6269; 11% instances), obj (6084; 11% instances), punct (5208; 9% instances), mark (3958; 7% instances), advcl (2105; 4% instances), cc (1410; 3% instances), advmod (1305; 2% instances), conj (1295; 2% instances), iobj (885; 2% instances), xcomp (647; 1% instances), acl (404; 1% instances), ccomp (271; 0% instances), case (166; 0% instances), dep (127; 0% instances), amod (14; 0% instances), vocative (14; 0% instances), nmod (11; 0% instances), discourse (8; 0% instances), cop (6; 0% instances), acl:relcl (5; 0% instances), dislocated (3; 0% instances), det (1; 0% instances), nummod (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (19065; 34% instances), AUX (7931; 14% instances), VERB (5730; 10% instances), PUNCT (5208; 9% instances), PROPN (3830; 7% instances), ADJ (3476; 6% instances), PRON (3389; 6% instances), SCONJ (2300; 4% instances), ADP (1731; 3% instances), CCONJ (1389; 2% instances), ADV (943; 2% instances), PART (808; 1% instances), NUM (79; 0% instances), DET (22; 0% instances), INTJ (8; 0% instances), X (3; 0% instances)