home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: VERB

There are 398 VERB lemmas (3%), 1004 VERB types (8%) and 12695 VERB tokens (9%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: کرنا، ہونا، دےنا، کہہنا، ہے، لےنا، بتانا، آنا، رکھنا، رہنا

The 10 most frequent VERB types: کیا، کہا، کر، کرنے، کی، ہو، دیا، کرتے، ہے، دی

The 10 most frequent ambiguous lemmas: کرنا (VERB 4173, AUX 295), ہونا (VERB 1548, AUX 693), ہے (AUX 3549, VERB 447, PROPN 4, ADP 1, PRON 1, PUNCT 1), رہنا (AUX 652, VERB 279), تھا (AUX 990, VERB 89), جانا (AUX 2195, VERB 81, PROPN 2), پانا (VERB 57, AUX 15), پڑنا (AUX 61, VERB 32), کھانا (VERB 13, NOUN 8), چاہئے (AUX 107, VERB 10)

The 10 most frequent ambiguous types: کیا (VERB 986, PRON 21, DET 12, AUX 3, PART 1, PROPN 1), کر (VERB 778, AUX 230, PART 1), کرنے (VERB 641, AUX 7), کی (ADP 3558, VERB 634, PROPN 5, AUX 2, NOUN 1), ہو (VERB 602, AUX 15), کرتے (VERB 436, AUX 13), ہے (AUX 2436, VERB 312, PROPN 2, PUNCT 1), دی (VERB 262, PROPN 3), کیے (VERB 141, AUX 1), کریں (VERB 137, PROPN 1)

Morphology

The form / lemma ratio of VERB is 2.522613 (the average of all parts of speech is 1.103404).

The 1st highest number of forms (32) was observed with the lemma “کرنا”: کئے, کر, کرا, کرتا, کرتی, کرتیں, کرتے, کرنا, کرنی, کرنے, کرو, کروا, کروں, کروںگا, کرکے, کرینگے, کریگا, کریگی, کریں, کریں_گی, کریں_گے, کریںگے, کرے, کرےنگے, کرےگا, کرےگی, کہا, کی, کیا, کیں, کیے, گے.

The 2nd highest number of forms (26) was observed with the lemma “دےنا”: دئیے, دئے, دلاکر, دی, دیئے, دیا, دیتا, دیتی, دیتے, دیجئے, دیدی, دینا, دینی, دینے, دیکر, دیں, دیں_گی, دیں_گے, دیںگی, دیںگے, دیے, دے, دےکر, دےگا, دےگی, کر.

The 3rd highest number of forms (21) was observed with the lemma “ہونا”: ہو, ہوئ, ہوئہی, ہوئی, ہوئیں, ہوئے, ہوا, ہوتا, ہوتی, ہوتیں, ہوتے, ہونا, ہونگے, ہونی, ہونے, ہوکر, ہوگا, ہوگی, ہوں, ہوں_گی, ہوں_گے.

VERB occurs with 11 features: VerbForm (10132; 80% instances), Voice (8711; 69% instances), Number (8539; 67% instances), Gender (7550; 59% instances), Aspect (7067; 56% instances), Person (1670; 13% instances), Case (1611; 13% instances), Mood (1051; 8% instances), Tense (779; 6% instances), Polite (107; 1% instances), Echo (2; 0% instances)

VERB occurs with 26 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Nom, Echo=Rdp, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Polite=Infm, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 326 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|VerbForm=Part|Voice=Act (2350 tokens). Examples: کہا، کیا، بتایا، دیا، ہوا، آیا، رہا، بنایا، لیا، رکھا

Relations

VERB nodes are attached to their parents using 17 different relations: root (4521; 36% instances), advcl (2190; 17% instances), conj (1285; 10% instances), obj (1185; 9% instances), acl:relcl (698; 5% instances), compound (680; 5% instances), nmod (582; 5% instances), acl (451; 4% instances), amod (397; 3% instances), ccomp (280; 2% instances), xcomp (204; 2% instances), nsubj (139; 1% instances), dep (57; 0% instances), obl (14; 0% instances), iobj (9; 0% instances), mark (2; 0% instances), case (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (5730; 45% instances), (4521; 36% instances), NOUN (1753; 14% instances), PRON (377; 3% instances), PROPN (148; 1% instances), ADJ (138; 1% instances), ADV (13; 0% instances), DET (4; 0% instances), NUM (4; 0% instances), PART (3; 0% instances), ADP (2; 0% instances), AUX (2; 0% instances)

762 (6%) VERB nodes are leaves.

300 (2%) VERB nodes have one child.

1422 (11%) VERB nodes have two children.

10211 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 17.

Children of VERB nodes are attached using 26 different relations: obl (9543; 17% instances), compound (8293; 15% instances), aux (7879; 14% instances), nsubj (6269; 11% instances), obj (6084; 11% instances), punct (5208; 9% instances), mark (3958; 7% instances), advcl (2105; 4% instances), cc (1410; 3% instances), advmod (1305; 2% instances), conj (1295; 2% instances), iobj (885; 2% instances), xcomp (647; 1% instances), acl (404; 1% instances), ccomp (271; 0% instances), case (166; 0% instances), dep (127; 0% instances), amod (14; 0% instances), vocative (14; 0% instances), nmod (11; 0% instances), discourse (8; 0% instances), cop (6; 0% instances), acl:relcl (5; 0% instances), dislocated (3; 0% instances), det (1; 0% instances), nummod (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (19065; 34% instances), AUX (7931; 14% instances), VERB (5730; 10% instances), PUNCT (5208; 9% instances), PROPN (3830; 7% instances), ADJ (3476; 6% instances), PRON (3389; 6% instances), SCONJ (2300; 4% instances), ADP (1731; 3% instances), CCONJ (1389; 2% instances), ADV (943; 2% instances), PART (808; 1% instances), NUM (79; 0% instances), DET (22; 0% instances), INTJ (8; 0% instances), X (3; 0% instances)