home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: PROPN

There are 19 PROPN lemmas (5%), 22 PROPN types (5%) and 60 PROPN tokens (9%). Out of 14 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent PROPN lemmas: අප්පුවා, එතනා, සිරිමල්, වත්හිමි, අග්බෝ, බුවනෙකබා, ඇතුගල, කුරුණෑගල, වීරගල, ඇතුගල්පුර

The 10 most frequent PROPN types: අප්පුවා, සිරිමල්, එතනා, වත්හිමි, අග්බෝ, බුවනෙකබා, අප්පුවාට, ඇතුගල, කුරුණෑගල, වීරගල

The 10 most frequent ambiguous lemmas: එතනා (PROPN 10, NOUN 1), (PRON 5, PROPN 1)

The 10 most frequent ambiguous types: එතනා (PROPN 8, NOUN 1), (PRON 5, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.157895 (the average of all parts of speech is 1.100000).

The 1st highest number of forms (3) was observed with the lemma “එතනා”: එතනා, එතනාට, එතනාව.

The 2nd highest number of forms (2) was observed with the lemma “අප්පුවා”: අප්පුවා, අප්පුවාට.

The 3rd highest number of forms (1) was observed with the lemma “අග්බෝ”: අග්බෝ.

PROPN occurs with 4 features: Number (47; 78% instances), Gender (27; 45% instances), Case (19; 32% instances), Animacy (15; 25% instances)

PROPN occurs with 10 feature-value pairs: Animacy=Anim, Animacy=Hum, Case=Abl, Case=Acc, Case=Dat, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Sing

PROPN occurs with 22 feature combinations. The most frequent feature combination is Number=Sing (11 tokens). Examples: සිරිමල්, එතනා, අප්පුවා, අග්බෝ, වීරගල, අප්පුවාට, බුවනෙකබා, යාපහුව, රන්බමරකෙත, හැංගවත්ත

Relations

PROPN nodes are attached to their parents using 12 different relations: nsubj (16; 27% instances), compound (14; 23% instances), obl (7; 12% instances), nmod (6; 10% instances), obj (5; 8% instances), flat:name (4; 7% instances), iobj (3; 5% instances), acl (1; 2% instances), appos (1; 2% instances), conj (1; 2% instances), discourse (1; 2% instances), root (1; 2% instances)

Parents of PROPN nodes belong to 6 different parts of speech: VERB (26; 43% instances), NOUN (19; 32% instances), PROPN (10; 17% instances), ADJ (3; 5% instances), (1; 2% instances), SCONJ (1; 2% instances)

41 (68%) PROPN nodes are leaves.

14 (23%) PROPN nodes have one child.

3 (5%) PROPN nodes have two children.

2 (3%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 13 different relations: flat:name (6; 21% instances), case (3; 11% instances), nmod (3; 11% instances), acl (2; 7% instances), amod (2; 7% instances), appos (2; 7% instances), compound (2; 7% instances), nsubj (2; 7% instances), punct (2; 7% instances), conj (1; 4% instances), det (1; 4% instances), nmod:poss (1; 4% instances), obj (1; 4% instances)

Children of PROPN nodes belong to 8 different parts of speech: PROPN (10; 36% instances), NOUN (9; 32% instances), ADJ (2; 7% instances), PUNCT (2; 7% instances), VERB (2; 7% instances), ADP (1; 4% instances), PRON (1; 4% instances), SCONJ (1; 4% instances)