home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Nepali-BK: POS Tags: VERB

There are 58 VERB lemmas (21%), 113 VERB types (31%) and 187 VERB tokens (23%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent VERB lemmas: गर्नु, भन्नु, लाग्नु, हुनु, आउनु, भाग्नु, कराउनु, घस्नु, आर्जनु, जानु

The 10 most frequent VERB types: भनेर, गर्दा, भागेछ, आर्जन, आएर, कराउँदै, गर्नुपर्छ, घस्दे, पारेर, भन्ने

The 10 most frequent ambiguous lemmas: हुनु (VERB 14, AUX 8), हो (VERB 4, PART 1), (AUX 2, VERB 1)

The 10 most frequent ambiguous types: (AUX 5, VERB 3), रहेछ (VERB 3, AUX 1), हो (AUX 2, VERB 2, PART 1)

Morphology

The form / lemma ratio of VERB is 1.948276 (the average of all parts of speech is 1.329630).

The 1st highest number of forms (11) was observed with the lemma “गर्नु”: गरिएको, गरेका, गरेको, गर्छ, गर्छन्, गर्थ्यो, गर्दा, गर्दै, गर्न, गर्नुपर्छ, गर्ने.

The 2nd highest number of forms (11) was observed with the lemma “हुनु”: छ, थियो, नहुने, भइन्छ, भएको, रहेको, रहेछ, हुदैन, हुन्, हुन्छ, होला.

The 3rd highest number of forms (9) was observed with the lemma “लाग्नु”: ला, लाको, लागी, लागे, लागेको, लागेन, लागेर, लाग्दैन, लाग्यो.

VERB occurs with 9 features: VerbForm (185; 99% instances), Mood (177; 95% instances), Person (177; 95% instances), Tense (177; 95% instances), Aspect (176; 94% instances), Number (175; 94% instances), Evident (9; 5% instances), Polarity (5; 3% instances), Voice (5; 3% instances)

VERB occurs with 20 feature-value pairs: Aspect=Imp, Aspect=Perf, Aspect=Prog, Evident=Nfh, Mood=Imp, Mood=Ind, Mood=Nec, Number=Plur, Number=Sing, Person=2, Person=3, Polarity=Neg, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 36 feature combinations. The most frequent feature combination is Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Conv (34 tokens). Examples: भनेर, आएर, पारेर, फर्केर, लागेर, लिएर, खर्चेर, गएर, घस्दिपछि, जिकेर

Relations

VERB nodes are attached to their parents using 11 different relations: root (67; 36% instances), advcl (55; 29% instances), acl (19; 10% instances), ccomp (14; 7% instances), xcomp (12; 6% instances), parataxis (10; 5% instances), conj (4; 2% instances), compound (2; 1% instances), compound:redup (2; 1% instances), acl:relcl (1; 1% instances), reparandum (1; 1% instances)

Parents of VERB nodes belong to 4 different parts of speech: VERB (102; 55% instances), (67; 36% instances), NOUN (17; 9% instances), PROPN (1; 1% instances)

45 (24%) VERB nodes are leaves.

35 (19%) VERB nodes have one child.

21 (11%) VERB nodes have two children.

86 (46%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 20 different relations: punct (120; 23% instances), discourse (63; 12% instances), advmod (57; 11% instances), advcl (55; 10% instances), obj (43; 8% instances), nsubj (40; 8% instances), cc (37; 7% instances), obl (35; 7% instances), xcomp (19; 4% instances), ccomp (18; 3% instances), iobj (12; 2% instances), parataxis (11; 2% instances), aux (6; 1% instances), acl (5; 1% instances), conj (4; 1% instances), compound (2; 0% instances), compound:redup (2; 0% instances), dislocated (2; 0% instances), det (1; 0% instances), reparandum (1; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: PUNCT (120; 23% instances), NOUN (111; 21% instances), VERB (102; 19% instances), ADV (61; 11% instances), PART (60; 11% instances), CCONJ (36; 7% instances), PRON (21; 4% instances), AUX (6; 1% instances), PROPN (5; 1% instances), ADJ (3; 1% instances), INTJ (3; 1% instances), DET (2; 0% instances), X (2; 0% instances), ADP (1; 0% instances)