Treebank Statistics: UD_Sinhala-Appuwa: POS Tags: PROPN
There are 19 PROPN lemmas (5%), 22 PROPN types (5%) and 60 PROPN tokens (9%).
Out of 14 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.
The 10 most frequent PROPN lemmas: අප්පුවා, එතනා, සිරිමල්, වත්හිමි, අග්බෝ, බුවනෙකබා, ඇතුගල, කුරුණෑගල, වීරගල, ඇතුගල්පුර
The 10 most frequent PROPN types: අප්පුවා, සිරිමල්, එතනා, වත්හිමි, අග්බෝ, බුවනෙකබා, අප්පුවාට, ඇතුගල, කුරුණෑගල, වීරගල
The 10 most frequent ambiguous lemmas: එතනා (PROPN 10, NOUN 1), ඒ (PRON 5, PROPN 1)
The 10 most frequent ambiguous types: එතනා (PROPN 8, NOUN 1), ඒ (PRON 5, PROPN 1)
- එතනා
- ඒ
Morphology
The form / lemma ratio of PROPN is 1.157895 (the average of all parts of speech is 1.100000).
The 1st highest number of forms (3) was observed with the lemma “එතනා”: එතනා, එතනාට, එතනාව.
The 2nd highest number of forms (2) was observed with the lemma “අප්පුවා”: අප්පුවා, අප්පුවාට.
The 3rd highest number of forms (1) was observed with the lemma “අග්බෝ”: අග්බෝ.
PROPN occurs with 4 features: Number (47; 78% instances), Gender (27; 45% instances), Case (19; 32% instances), Animacy (15; 25% instances)
PROPN occurs with 10 feature-value pairs: Animacy=Anim, Animacy=Hum, Case=Abl, Case=Acc, Case=Dat, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Sing
PROPN occurs with 22 feature combinations.
The most frequent feature combination is Number=Sing (11 tokens).
Examples: සිරිමල්, එතනා, අප්පුවා, අග්බෝ, වීරගල, අප්පුවාට, බුවනෙකබා, යාපහුව, රන්බමරකෙත, හැංගවත්ත
Relations
PROPN nodes are attached to their parents using 12 different relations: nsubj (16; 27% instances), compound (14; 23% instances), obl (7; 12% instances), nmod (6; 10% instances), obj (5; 8% instances), flat:name (4; 7% instances), iobj (3; 5% instances), acl (1; 2% instances), appos (1; 2% instances), conj (1; 2% instances), discourse (1; 2% instances), root (1; 2% instances)
Parents of PROPN nodes belong to 6 different parts of speech: VERB (26; 43% instances), NOUN (19; 32% instances), PROPN (10; 17% instances), ADJ (3; 5% instances), (1; 2% instances), SCONJ (1; 2% instances)
41 (68%) PROPN nodes are leaves.
14 (23%) PROPN nodes have one child.
3 (5%) PROPN nodes have two children.
2 (3%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 5.
Children of PROPN nodes are attached using 13 different relations: flat:name (6; 21% instances), case (3; 11% instances), nmod (3; 11% instances), acl (2; 7% instances), amod (2; 7% instances), appos (2; 7% instances), compound (2; 7% instances), nsubj (2; 7% instances), punct (2; 7% instances), conj (1; 4% instances), det (1; 4% instances), nmod:poss (1; 4% instances), obj (1; 4% instances)
Children of PROPN nodes belong to 8 different parts of speech: PROPN (10; 36% instances), NOUN (9; 32% instances), ADJ (2; 7% instances), PUNCT (2; 7% instances), VERB (2; 7% instances), ADP (1; 4% instances), PRON (1; 4% instances), SCONJ (1; 4% instances)