home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: VERB

There are 399 VERB lemmas (3%), 987 VERB types (8%) and 11862 VERB tokens (9%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: کر، ہو، کہہ، دے، ہے، بتا، آ، رہ، رکھ، بنا

The 10 most frequent VERB types: کیا، کہا، کر، کرنے، کی، ہو، کرتے، ہے، بتایا، دیا

The 10 most frequent ambiguous lemmas: کر (VERB 4173, AUX 291, ADP 8, PRON 4, PROPN 2, DET 1, NOUN 1, PART 1), ہو (VERB 1548, AUX 693, NOUN 2, ADJ 1, ADP 1), کہہ (VERB 943, NOUN 1), دے (VERB 747, AUX 403, PROPN 4, NOUN 2), ہے (AUX 3511, VERB 446, PROPN 4, ADP 1, PRON 1, PUNCT 1), آ (VERB 292, PROPN 23, AUX 14, PRON 2, ADJ 1, CCONJ 1, NOUN 1), رہ (AUX 638, VERB 280, PROPN 1), رکھ (VERB 272, AUX 11, PROPN 1), بنا (VERB 211, ADP 4, AUX 4, PART 2, NOUN 1), لے (VERB 190, AUX 170, ADP 3, PROPN 2, PART 1)

The 10 most frequent ambiguous types: کیا (VERB 986, PRON 21, DET 12, AUX 3, PART 1, PROPN 1), کر (VERB 778, AUX 230, PART 1), کرنے (VERB 641, AUX 7), کی (ADP 3557, VERB 634, PROPN 5, AUX 3, NOUN 1), ہو (VERB 602, AUX 15), کرتے (VERB 436, AUX 13), ہے (AUX 2435, VERB 313, PROPN 2, PUNCT 1), دیا (AUX 249, VERB 230), دی (VERB 194, AUX 68, PROPN 3), کیے (VERB 141, AUX 1)

Morphology

The form / lemma ratio of VERB is 2.473684 (the average of all parts of speech is 1.101903).

The 1st highest number of forms (32) was observed with the lemma “کر”: کئے, کر, کرا, کرتا, کرتی, کرتیں, کرتے, کرنا, کرنی, کرنے, کرو, کروا, کروں, کروںگا, کرکے, کرینگے, کریگا, کریگی, کریں, کریں_گی, کریں_گے, کریںگے, کرے, کرےنگے, کرےگا, کرےگی, کہا, کی, کیا, کیں, کیے, گے.

The 2nd highest number of forms (25) was observed with the lemma “دے”: دئیے, دئے, دلاکر, دی, دیئے, دیا, دیتا, دیتی, دیتے, دیجئے, دیدی, دینا, دینی, دینے, دیکر, دیں, دیں_گی, دیں_گے, دیںگی, دیںگے, دیے, دے, دےکر, دےگی, کر.

The 3rd highest number of forms (21) was observed with the lemma “ہو”: ہو, ہوئ, ہوئہی, ہوئی, ہوئیں, ہوئے, ہوا, ہوتا, ہوتی, ہوتیں, ہوتے, ہونا, ہونگے, ہونی, ہونے, ہوکر, ہوگا, ہوگی, ہوں, ہوں_گی, ہوں_گے.

VERB occurs with 11 features: VerbForm (9328; 79% instances), Voice (8711; 73% instances), Number (7794; 66% instances), Gender (6845; 58% instances), Aspect (6330; 53% instances), Case (1578; 13% instances), Person (1544; 13% instances), Mood (1015; 9% instances), Tense (770; 6% instances), Polite (97; 1% instances), Echo (2; 0% instances)

VERB occurs with 26 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Nom, Echo=Rdp, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Polite=Infm, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 304 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|VerbForm=Part|Voice=Act (2350 tokens). Examples: کہا، کیا، بتایا، دیا، ہوا، آیا، رہا، بنایا، لیا، رکھا

Relations

VERB nodes are attached to their parents using 18 different relations: root (4521; 38% instances), advcl (1842; 16% instances), obj (1466; 12% instances), conj (1284; 11% instances), acl:relcl (698; 6% instances), nmod (583; 5% instances), acl (449; 4% instances), amod (397; 3% instances), advmod (349; 3% instances), nsubj (140; 1% instances), dep (57; 0% instances), compound (35; 0% instances), obl (14; 0% instances), iobj (9; 0% instances), case (6; 0% instances), xcomp (6; 0% instances), mark (4; 0% instances), cc (2; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (4898; 41% instances), (4521; 38% instances), NOUN (1752; 15% instances), PRON (377; 3% instances), PROPN (148; 1% instances), ADJ (138; 1% instances), ADV (13; 0% instances), DET (4; 0% instances), NUM (4; 0% instances), PART (3; 0% instances), ADP (2; 0% instances), AUX (2; 0% instances)

86 (1%) VERB nodes are leaves.

177 (1%) VERB nodes have one child.

1392 (12%) VERB nodes have two children.

10207 (86%) VERB nodes have three or more children.

The highest child degree of a VERB node is 17.

Children of VERB nodes are attached using 23 different relations: obl (9522; 17% instances), aux (8746; 16% instances), compound (7278; 13% instances), obj (6369; 11% instances), nsubj (6285; 11% instances), punct (5154; 9% instances), mark (4223; 8% instances), advcl (1796; 3% instances), advmod (1654; 3% instances), cc (1411; 3% instances), conj (1303; 2% instances), iobj (884; 2% instances), xcomp (449; 1% instances), acl (409; 1% instances), case (247; 0% instances), dep (128; 0% instances), vocative (14; 0% instances), acl:relcl (7; 0% instances), cop (6; 0% instances), nmod (5; 0% instances), amod (4; 0% instances), dislocated (4; 0% instances), nummod (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (19059; 34% instances), AUX (8799; 16% instances), PUNCT (5209; 9% instances), VERB (4898; 9% instances), PROPN (3832; 7% instances), PRON (3387; 6% instances), ADJ (3114; 6% instances), SCONJ (2295; 4% instances), ADP (2055; 4% instances), CCONJ (1389; 2% instances), ADV (942; 2% instances), PART (808; 1% instances), NUM (79; 0% instances), DET (22; 0% instances), X (9; 0% instances), INTJ (2; 0% instances)