home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-STB: POS Tags: PRON

There are 15 PRON lemmas (3%), 21 PRON types (4%) and 44 PRON tokens (5%). Out of 13 observed tags, the rank of PRON is: 7 in number of lemmas, 7 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: ඔහු, එය, ඒ, අපි, සිය, අප, එකිනෙක, එහි, ඔවුන්, ඔවුහු

The 10 most frequent PRON types: ඔහු, ඒ, එය, එහි, ඊට, ඔහුට, සිය, අප, අපට, අපේ

The 10 most frequent ambiguous lemmas: (PRON 9, DET 3, ADV 1), එහි (ADV 2, PRON 1), කිහිප (NOUN 3, PRON 1), මේ (DET 5, ADV 1, PRON 1)

The 10 most frequent ambiguous types: (PRON 7, DET 2), එහි (ADV 3, PRON 3), එම (DET 5, PRON 1), කිහිපයක් (NOUN 1, PRON 1), මේ (DET 5, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.400000 (the average of all parts of speech is 1.145336).

The 1st highest number of forms (4) was observed with the lemma “එය”: ඉන්, ඊට, එය, එහි.

The 2nd highest number of forms (3) was observed with the lemma “ඒ”: එම, එය, ඒ.

The 3rd highest number of forms (3) was observed with the lemma “ඔහු”: ඔව්හු, ඔහු, ඔහුට.

PRON occurs with 9 features: PronType (42; 95% instances), Case (35; 80% instances), Number (29; 66% instances), Gender (21; 48% instances), Animacy (7; 16% instances), Person (5; 11% instances), Poss (5; 11% instances), Definite (1; 2% instances), Typo (1; 2% instances)

PRON occurs with 20 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Abl, Case=Acc, Case=Dat, Case=Loc, Case=Nom, Definite=Ind, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Prs, PronType=Rcp, Typo=Yes

PRON occurs with 24 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|PronType=Prs (6 tokens). Examples: ඔහු

Relations

PRON nodes are attached to their parents using 9 different relations: nsubj (23; 52% instances), obj (7; 16% instances), nmod:poss (4; 9% instances), nmod (3; 7% instances), dep (2; 5% instances), det (2; 5% instances), det:poss (1; 2% instances), obl (1; 2% instances), obl:lmod (1; 2% instances)

Parents of PRON nodes belong to 5 different parts of speech: NOUN (25; 57% instances), VERB (16; 36% instances), ADJ (1; 2% instances), ADV (1; 2% instances), AUX (1; 2% instances)

36 (82%) PRON nodes are leaves.

8 (18%) PRON nodes have one child.

The highest child degree of a PRON node is 1.

Children of PRON nodes are attached using 2 different relations: case (7; 88% instances), nmod (1; 13% instances)

Children of PRON nodes belong to 3 different parts of speech: PART (6; 75% instances), ADP (1; 13% instances), NOUN (1; 13% instances)