Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: VERB
There are 102 VERB lemmas (27%), 102 VERB types (24%) and 148 VERB tokens (22%).
Out of 14 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent VERB lemmas: ගන්න, වෙන්න, කර, ආ, කරනවා, දැක, පැන, බඳිනවා, යන්න, වෙන
The 10 most frequent VERB types: උනා, ගත්තා, ගියා, ආපු, කලා, හිටියා, කරන්න, කියන, ගන්න, පැනලා
The 10 most frequent ambiguous lemmas: කිය (SCONJ 3, VERB 2), අර (PRON 1, VERB 1), පිටත් (ADV 1, VERB 1), වැඩ (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: පිටත් (ADV 1, VERB 1)
- පිටත්
Morphology
The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.100000).
The 1st highest number of forms (4) was observed with the lemma “කර”: කරන්න, කලා, කලේ, කෙරුනා.
The 2nd highest number of forms (3) was observed with the lemma “ගන්න”: ගත්තා, ගත්තේ, ගන්න.
The 3rd highest number of forms (3) was observed with the lemma “දැක”: දැකපු, දැකලා, දැක්කා.
VERB occurs with 7 features: VerbForm (148; 100% instances), Tense (89; 60% instances), Mood (73; 49% instances), Aspect (31; 21% instances), Voice (10; 7% instances), Number (7; 5% instances), Person (7; 5% instances)
VERB occurs with 15 feature-value pairs: Aspect=Perf, Aspect=Prog, Mood=Ind, Mood=Pot, Number=Sing, Person=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass
VERB occurs with 25 feature combinations.
The most frequent feature combination is Mood=Ind|Tense=Past|VerbForm=Fin (48 tokens).
Examples: උනා, ගත්තා, ගියා, හිටියා, කලා, කලේ, දැක්කා, ඇතිඋනා, උනයි, උනා.
Relations
VERB nodes are attached to their parents using 8 different relations: root (63; 43% instances), advcl (34; 23% instances), acl (21; 14% instances), compound:lvc (14; 9% instances), xcomp (11; 7% instances), compound:svc (2; 1% instances), parataxis (2; 1% instances), ccomp (1; 1% instances)
Parents of VERB nodes belong to 8 different parts of speech: (63; 43% instances), VERB (59; 40% instances), NOUN (17; 11% instances), PRON (3; 2% instances), ADJ (2; 1% instances), PROPN (2; 1% instances), ADV (1; 1% instances), AUX (1; 1% instances)
26 (18%) VERB nodes are leaves.
32 (22%) VERB nodes have one child.
19 (13%) VERB nodes have two children.
71 (48%) VERB nodes have three or more children.
The highest child degree of a VERB node is 8.
Children of VERB nodes are attached using 20 different relations: nsubj (67; 18% instances), obl (67; 18% instances), punct (62; 17% instances), obj (55; 15% instances), advcl (36; 10% instances), compound:lvc (20; 5% instances), advmod (16; 4% instances), xcomp (15; 4% instances), iobj (12; 3% instances), mark (5; 1% instances), obl:tmod (4; 1% instances), compound (2; 1% instances), compound:svc (2; 1% instances), discourse (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), nmod (1; 0% instances), nmod:poss (1; 0% instances), parataxis (1; 0% instances)
Children of VERB nodes belong to 14 different parts of speech: NOUN (178; 48% instances), PUNCT (62; 17% instances), VERB (59; 16% instances), PROPN (26; 7% instances), ADV (15; 4% instances), PRON (12; 3% instances), ADJ (6; 2% instances), SCONJ (5; 1% instances), PART (3; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), NUM (1; 0% instances)