home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: VERB

There are 102 VERB lemmas (27%), 102 VERB types (24%) and 148 VERB tokens (22%). Out of 14 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: ගන්න, වෙන්න, කර, ආ, කරනවා, දැක, පැන, බඳිනවා, යන්න, වෙන

The 10 most frequent VERB types: උනා, ගත්තා, ගියා, ආපු, කලා, හිටියා, කරන්න, කියන, ගන්න, පැනලා

The 10 most frequent ambiguous lemmas: කිය (SCONJ 3, VERB 2), අර (PRON 1, VERB 1), පිටත් (ADV 1, VERB 1), වැඩ (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: පිටත් (ADV 1, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.100000).

The 1st highest number of forms (4) was observed with the lemma “කර”: කරන්න, කලා, කලේ, කෙරුනා.

The 2nd highest number of forms (3) was observed with the lemma “ගන්න”: ගත්තා, ගත්තේ, ගන්න.

The 3rd highest number of forms (3) was observed with the lemma “දැක”: දැකපු, දැකලා, දැක්කා.

VERB occurs with 7 features: VerbForm (148; 100% instances), Tense (89; 60% instances), Mood (73; 49% instances), Aspect (31; 21% instances), Voice (10; 7% instances), Number (7; 5% instances), Person (7; 5% instances)

VERB occurs with 15 feature-value pairs: Aspect=Perf, Aspect=Prog, Mood=Ind, Mood=Pot, Number=Sing, Person=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 25 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Past|VerbForm=Fin (48 tokens). Examples: උනා, ගත්තා, ගියා, හිටියා, කලා, කලේ, දැක්කා, ඇතිඋනා, උනයි, උනා.

Relations

VERB nodes are attached to their parents using 8 different relations: root (63; 43% instances), advcl (34; 23% instances), acl (21; 14% instances), compound:lvc (14; 9% instances), xcomp (11; 7% instances), compound:svc (2; 1% instances), parataxis (2; 1% instances), ccomp (1; 1% instances)

Parents of VERB nodes belong to 8 different parts of speech: (63; 43% instances), VERB (59; 40% instances), NOUN (17; 11% instances), PRON (3; 2% instances), ADJ (2; 1% instances), PROPN (2; 1% instances), ADV (1; 1% instances), AUX (1; 1% instances)

26 (18%) VERB nodes are leaves.

32 (22%) VERB nodes have one child.

19 (13%) VERB nodes have two children.

71 (48%) VERB nodes have three or more children.

The highest child degree of a VERB node is 8.

Children of VERB nodes are attached using 20 different relations: nsubj (67; 18% instances), obl (67; 18% instances), punct (62; 17% instances), obj (55; 15% instances), advcl (36; 10% instances), compound:lvc (20; 5% instances), advmod (16; 4% instances), xcomp (15; 4% instances), iobj (12; 3% instances), mark (5; 1% instances), obl:tmod (4; 1% instances), compound (2; 1% instances), compound:svc (2; 1% instances), discourse (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), nmod (1; 0% instances), nmod:poss (1; 0% instances), parataxis (1; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (178; 48% instances), PUNCT (62; 17% instances), VERB (59; 16% instances), PROPN (26; 7% instances), ADV (15; 4% instances), PRON (12; 3% instances), ADJ (6; 2% instances), SCONJ (5; 1% instances), PART (3; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), NUM (1; 0% instances)