home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-Vedic: POS Tags: VERB

There are 2973 VERB lemmas (21%), 13044 VERB types (35%) and 39836 VERB tokens (19%). Out of 13 observed tags, the rank of VERB is: 3 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: kṛ, bhū, vid, hu, as, yaj, ah, vac, dhā, i

The 10 most frequent VERB types: bhavati, veda, asi, uvāca, āha, juhoti, āhuḥ, juhuyāt, karoti, kṛtvā

The 10 most frequent ambiguous lemmas: kṛ (VERB 1415, ADV 1), bhū (VERB 1328, AUX 610, NOUN 130), vid (VERB 997, ADJ 86), as (AUX 1191, VERB 789), yaj (VERB 703, ADJ 2), dhā (VERB 571, ADJ 23), i (VERB 476, AUX 15), gam (VERB 414, NOUN 1), dā (VERB 313, ADJ 32), sthā (VERB 277, AUX 3, ADJ 2)

The 10 most frequent ambiguous types: bhavati (VERB 507, AUX 348), veda (VERB 386, NOUN 14), asi (VERB 369, AUX 271, NOUN 2), vidvān (VERB 148, ADJ 2), yajamānaḥ (VERB 146, NOUN 2), abhavat (VERB 135, AUX 3), kuryāt (VERB 114, ADV 1), eti (VERB 113, AUX 1), _ (NOUN 2255, ADJ 407, CCONJ 331, SCONJ 216, NUM 186, PRON 124, VERB 106, INTJ 86, ADP 73, ADV 54, DET 14), ādāya (VERB 98, ADJ 2)

Morphology

The form / lemma ratio of VERB is 4.387487 (the average of all parts of speech is 2.674382).

The 1st highest number of forms (188) was observed with the lemma “kṛ”: _, acakriran, akaram, akarat, akaraḥ, akarma, akarot, akarta, akaḥ, akirat, akran, akrata, akrataṁ, akuruta, akurvan, akurvata, akāri, akāriṣam, akārṣma, akṛta, akṛṇot, akṛṇotana, akṛṇoḥ, akṛṇuta, akṛṇutam, akṛṇvan, akṛṇvata, cakartha, cakra, cakrathuḥ, cakre, cakrire, cakrivasaḥ, cakriyāḥ, cakruḥ, cakruṣe, cakrāte, cakrāṇaḥ, cakāra, cakṛma, cakṛmā, cakṛṣe, karase, karasi, karat, karatam, karataḥ, karati, karavāma, karavāmahai, karavāṇi, karaḥ, kariṣyan, kariṣyantaḥ, kariṣyanti, kariṣyantīm, kariṣyasi, kariṣyataḥ, kariṣyatha, kariṣyathaḥ, kariṣyati, kariṣye, kariṣyāmaḥ, kariṣyāmi, karma, karomi, karoti, karotu, karta, kartavyam, kartavyau, kartavyaḥ, kartavye, kartavyā, kartavyāḥ, kartum, kartvam, kartvena, kartvāni, kartā, karāma, karāmahe, kaḥ, kira, kran, krantaḥ, kriyamāṇam, kriyamāṇe, kriyamāṇā, kriyamāṇām, kriyamāṇāya, kriyante, kriyate, kriyatām, kriyeran, kriyete, kriyāma, krān, kurmahe, kurmaḥ, kuru, kurudhvam, kuruta, kurutaḥ, kurute, kurutāt, kuruṣva, kurvan, kurvantaḥ, kurvanti, kurvataḥ, kurvate, kurvatām, kurve, kurvāṇaḥ, kurvāṇāḥ, kurvīta, kurvīya, kuryuḥ, kuryām, kuryāma, kuryāt, kāryam, kāryau, kāryaḥ, kāryā, kāryāḥ, kāryāṇi, kṛdhi, kṛdhvam, kṛdhī, kṛta, kṛtam, kṛtasya, kṛtaḥ, kṛte, kṛtebhyaḥ, kṛtena, kṛthaḥ, kṛthāḥ, kṛtvā, kṛtvāya, kṛtvī, kṛtyam, kṛtā, kṛtābhiḥ, kṛtām, kṛtāni, kṛtānām, kṛtāsu, kṛtāḥ, kṛṇavan, kṛṇavase, kṛṇavat, kṛṇavate, kṛṇavaḥ, kṛṇavāma, kṛṇavāmā, kṛṇmahe, kṛṇmasi, kṛṇmaḥ, kṛṇomi, kṛṇota, kṛṇoti, kṛṇotu, kṛṇoṣi, kṛṇu, kṛṇudhvam, kṛṇuhi, kṛṇuta, kṛṇutam, kṛṇute, kṛṇuthaḥ, kṛṇuṣva, kṛṇvan, kṛṇvantam, kṛṇvantaḥ, kṛṇvanti, kṛṇvantu, kṛṇvataḥ, kṛṇvatī, kṛṇve, kṛṇvānaḥ, kṛṇvānā, kṛṇvānām, kṛṇvānāḥ, kṛṣe, kṛṣva.

The 2nd highest number of forms (113) was observed with the lemma “vid”: avedam, avediṣuḥ, avediṣyam, avediṣyan, avedīt, avet, avidam, avidan, avidat, avidaḥ, avide, aviduḥ, avidāma, avindan, avindat, avindata, avindaḥ, avitsi, veda, vedat, vedate, vedati, veditavyam, veditavyaḥ, veditavye, veditavyāḥ, vediṣyante, vediṣyanti, vedyam, vedyaḥ, vedyāya, vedāma, vedāni, vetsyan, vetsyatha, vettavai, vettha, vetti, vettu, vida, vidam, vidan, vidanta, vidat, vidatam, vidatha, viddhi, vide, videya, videḥ, viditam, viditaḥ, viditvā, viditāt, vidma, vidmasi, vidre, viduḥ, viduṣaḥ, viduṣe, viduṣā, viduṣām, viduṣī, vidvān, vidvāṁ, vidvāṃsam, vidvāṃsaḥ, vidyamāne, vidyante, vidyate, vidyeran, vidyeta, vidyuḥ, vidyām, vidyāma, vidyāt, vidyāta, vidyātām, vidāma, vidāmakran, vidānaḥ, vidāne, vidāyyaḥ, vidāḥ, vidāṃcakāra, vinda, vindan, vindante, vindanti, vindase, vindasva, vindata, vindate, vindati, vindatu, vinderan, vindet, vindeta, vindeya, vindeyam, vindeyuḥ, vindāmi, vindāvahai, vitse, vittaḥ, vittha, vittvā, vittāt, viveda, vivide, vividuḥ, vividvān, vivitse.

The 3rd highest number of forms (97) was observed with the lemma “gam”: _, agacchan, agacchanta, agacchat, agacchatam, agacchaḥ, agamam, agan, aganma, aganmahi, agasmahi, agata, agathāḥ, agman, agmata, ajagan, ajagmiran, ajīgamat, gaccha, gacchadhvam, gacchan, gacchantam, gacchanti, gacchantu, gacchasva, gacchata, gacchatam, gacchataḥ, gacchate, gacchatha, gacchathaḥ, gacchati, gacchatu, gacchatyaḥ, gacchatām, gacchatāt, gacchatī, gacchet, gaccheyam, gaccheyuḥ, gacchān, gacchāsi, gacchāti, gahi, gamadhyai, gamadhye, gamanti, gamantu, gamanīyām, gamat, gamaḥ, gamema, gamemahi, gamet, gameyam, gamiṣyasi, gamiṣyati, gamiṣyāmi, gamiṣyāvaḥ, gamyāt, gamyāḥ, gamāma, gamātha, gan, ganma, ganta, gantam, gantana, gantavyam, gantoḥ, gantu, gantum, gantā, gata, gatam, gatasya, gataḥ, gate, gatena, gatvā, gatvī, gatām, gatān, gatāni, gatāḥ, gman, gmantā, gmiṣīya, gmīya, jagamyāt, jaganvān, jaganvāṃsaḥ, jagmathuḥ, jagmatuḥ, jagmire, jagmuḥ, jagāma.

VERB occurs with 9 features: Number (35938; 90% instances), Tense (35863; 90% instances), Mood (27437; 69% instances), Person (27437; 69% instances), VerbForm (12399; 31% instances), Case (8501; 21% instances), Gender (8501; 21% instances), Voice (1032; 3% instances), Compound (439; 1% instances)

VERB occurs with 33 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Compound=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Jus, Mood=Opt, Mood=Pot, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pqp, Tense=Pres, VerbForm=Conv, VerbForm=Gdv, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 303 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres (7148 tokens). Examples: bhavati, juhoti, karoti, dadhāti, eti, gacchati, avarunddhe, jāyate, prīṇāti, asti

Relations

VERB nodes are attached to their parents using 52 different relations: root (19462; 49% instances), flat (3147; 8% instances), conj (3041; 8% instances), advcl (2616; 7% instances), acl (2372; 6% instances), advcl:tcl (1598; 4% instances), ccomp (1227; 3% instances), acl:relcl (1218; 3% instances), parataxis (994; 2% instances), advcl:cond (522; 1% instances), csubj (381; 1% instances), orphan (350; 1% instances), obj (298; 1% instances), advcl:manner (290; 1% instances), acl:ptcp (269; 1% instances), advcl:ccomp (261; 1% instances), nsubj (228; 1% instances), nmod (211; 1% instances), acl:dpct (192; 0% instances), xcomp (148; 0% instances), iobj (147; 0% instances), advcl:dpct (134; 0% instances), advcl:fin (96; 0% instances), advcl:caus (92; 0% instances), obl (65; 0% instances), acl:attr (42; 0% instances), advcl:concess (40; 0% instances), xcomp:result (39; 0% instances), obl:goal (33; 0% instances), advcl:lcl (31; 0% instances), dislocated (27; 0% instances), obl:lmod (26; 0% instances), compound:coord (22; 0% instances), vocative (21; 0% instances), acl:crel (20; 0% instances), ccomp:rel (20; 0% instances), obl:benef (20; 0% instances), obl:instr (20; 0% instances), nmod:appos (19; 0% instances), obl:source (17; 0% instances), amod (15; 0% instances), obl:manner (14; 0% instances), obl:agent (12; 0% instances), obl:path (10; 0% instances), obl:soc (8; 0% instances), appos (5; 0% instances), advcl:consec (4; 0% instances), obl:grad (4; 0% instances), acl:cont (3; 0% instances), obl:tmod (3; 0% instances), compound (1; 0% instances), discourse (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: (19462; 49% instances), VERB (11093; 28% instances), NOUN (5182; 13% instances), PRON (2397; 6% instances), ADJ (1072; 3% instances), ADV (229; 1% instances), PART (162; 0% instances), NUM (93; 0% instances), ADP (87; 0% instances), CCONJ (33; 0% instances), INTJ (12; 0% instances), SCONJ (10; 0% instances), DET (3; 0% instances), AUX (1; 0% instances)

3405 (9%) VERB nodes are leaves.

10894 (27%) VERB nodes have one child.

10751 (27%) VERB nodes have two children.

14786 (37%) VERB nodes have three or more children.

The highest child degree of a VERB node is 18.

Children of VERB nodes are attached using 60 different relations: obj (14305; 16% instances), advmod (11623; 13% instances), nsubj (11470; 13% instances), obl (5117; 6% instances), mark (4397; 5% instances), conj (4164; 5% instances), discourse (3705; 4% instances), advcl (3435; 4% instances), iobj (2403; 3% instances), obl:instr (2395; 3% instances), ccomp (2273; 3% instances), flat (2256; 3% instances), obl:goal (2082; 2% instances), vocative (1605; 2% instances), advcl:tcl (1572; 2% instances), obl:lmod (1408; 2% instances), advcl:ccomp (1344; 2% instances), xcomp (1279; 1% instances), parataxis (942; 1% instances), cc (891; 1% instances), obl:source (845; 1% instances), obl:tmod (826; 1% instances), advcl:manner (726; 1% instances), xcomp:result (725; 1% instances), obl:manner (698; 1% instances), advcl:fin (495; 1% instances), advcl:cond (473; 1% instances), obl:soc (422; 0% instances), csubj (388; 0% instances), advcl:dpct (343; 0% instances), obl:agent (342; 0% instances), obl:benef (337; 0% instances), aux (290; 0% instances), obl:path (233; 0% instances), orphan (180; 0% instances), advcl:caus (135; 0% instances), cop (89; 0% instances), nmod (83; 0% instances), dislocated (82; 0% instances), mark:sim (75; 0% instances), acl (65; 0% instances), advcl:concess (46; 0% instances), det (46; 0% instances), acl:dpct (40; 0% instances), nummod (32; 0% instances), advcl:lcl (28; 0% instances), case (28; 0% instances), ccomp:rel (26; 0% instances), compound:coord (25; 0% instances), obl:grad (20; 0% instances), acl:relcl (19; 0% instances), case:sim (19; 0% instances), acl:ptcp (13; 0% instances), amod (13; 0% instances), nmod:appos (7; 0% instances), acl:attr (6; 0% instances), appos (6; 0% instances), advcl:consec (4; 0% instances), acl:crel (3; 0% instances), compound (3; 0% instances)

Children of VERB nodes belong to 13 different parts of speech: NOUN (35168; 40% instances), PRON (13009; 15% instances), VERB (11093; 13% instances), ADV (9802; 11% instances), PART (8981; 10% instances), ADJ (5226; 6% instances), SCONJ (1377; 2% instances), CCONJ (884; 1% instances), NUM (561; 1% instances), AUX (379; 0% instances), INTJ (231; 0% instances), ADP (189; 0% instances), DET (2; 0% instances)