Treebank Statistics: UD_Sinhala-Appuwa: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
61 tokens (9%) have a non-empty value of Gender.
33 types (8%) occur at least once with a non-empty value of Gender.
25 lemmas (7%) occur at least once with a non-empty value of Gender.
The feature is used with 3 part-of-speech tags: NOUN (33; 5% of instances), PROPN (27; 4% of instances), PRON (1; 0% of instances).
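Counts like the ones above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` rows are invented toy data (not taken from the treebank), the function names are hypothetical, and it assumes that a type is a distinct word form compared without case folding.

```python
from collections import Counter

# Toy CoNLL-U rows, invented for illustration only (not from UD_Sinhala-Appuwa).
SAMPLE = """\
# text = toy sentence
1\tking\tking\tNOUN\t_\tGender=Masc|Number=Sing\t0\troot\t_\t_
2\tking's\tking\tNOUN\t_\tGender=Masc|Number=Sing\t1\tnmod\t_\t_
3\tqueen\tqueen\tNOUN\t_\tGender=Fem|Number=Sing\t1\tconj\t_\t_
4\tcloth\tcloth\tNOUN\t_\tNumber=Sing\t1\tobj\t_\t_
5\tthat\tthat\tPRON\t_\t_\t4\tdet\t_\t_
"""

def gender_of(feats):
    """Return the Gender value from a FEATS column, or None if it is absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None

def gender_stats(text):
    """Count tokens, per-UPOS tokens, types and lemmas that carry Gender."""
    total = 0
    by_upos = Counter()
    forms, lemmas = set(), set()
    for line in text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if gender_of(cols[5]) is not None:
            by_upos[cols[3]] += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
    return total, by_upos, len(forms), len(lemmas)

total, by_upos, n_types, n_lemmas = gender_stats(SAMPLE)
tagged = sum(by_upos.values())
print(f"{tagged} tokens ({round(100 * tagged / total)}%) have a non-empty value of Gender.")
```

On the toy rows, 3 of the 5 tokens carry Gender, all of them NOUN, spread over 3 types and 2 lemmas.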
NOUN
33 NOUN tokens (13% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (32; 97%), Definite=EMPTY (31; 94%), Animacy=EMPTY (28; 85%).
NOUN tokens may have the following values of Gender:
Fem (13; 39% of non-empty Gender): දුව, නැන්දා, අගබිසොව, අගමෙහෙසිය, එතනා, දූලා, බිරිඳගේ, බිසව, බිසොවක්
Masc (20; 61% of non-empty Gender): රජතුමා, කුමාරයා, කුමාරයාට, ඇතා, කුමාරයාගේ, කුමාරයාත්, කුමාරයායි, කුමාරයාව, කුමාරයෙක්, කුමාරා
EMPTY (226): රජතුමා, රජ, රෙදි, ඇමැතිවරු, පදිංචියට, මාළිගාවේ, යුද්ධේ, රටේ, ඇතා, ඉදිරියේ
Gender seems to be a lexical feature of NOUN. 100% of lemmas (17) occur with only one value of Gender.
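The lexical-feature check can be read as: for each lemma of the given part of speech, collect the Gender values it occurs with, then report how many lemmas have exactly one. The sketch below illustrates that reading under stated assumptions; the `SAMPLE` rows are invented toy data and `lexical_share` is a hypothetical helper, not the page generator's own code.

```python
from collections import defaultdict

# Toy CoNLL-U rows, invented for illustration only (not from UD_Sinhala-Appuwa).
SAMPLE = """\
1\tking\tking\tNOUN\t_\tGender=Masc\t0\troot\t_\t_
2\tking's\tking\tNOUN\t_\tGender=Masc\t1\tnmod\t_\t_
3\tqueen\tqueen\tNOUN\t_\tGender=Fem\t1\tconj\t_\t_
4\tfriend\tfriend\tNOUN\t_\tGender=Masc\t1\tconj\t_\t_
5\tfriend\tfriend\tNOUN\t_\tGender=Fem\t1\tconj\t_\t_
6\tcloth\tcloth\tNOUN\t_\tNumber=Sing\t1\tobj\t_\t_
"""

def lexical_share(text, upos="NOUN"):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for line in text.splitlines():
        cols = line.split("\t")
        # Keep only regular token rows of the requested part of speech.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            if pair.startswith("Gender="):
                values[cols[2]].add(pair.split("=", 1)[1])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, total = lexical_share(SAMPLE)
print(f"{round(100 * single / total)}% of lemmas ({total}) occur with only one value of Gender.")
```

In the toy rows the lemma `friend` occurs with both values, so only 2 of the 3 gendered lemmas count as single-valued. A share of 100%, as reported for NOUN here, suggests that gender is a property of the lemma rather than of the individual word form.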
PROPN
27 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=EMPTY (25; 93%), Animacy=EMPTY (20; 74%), Number=Sing (20; 74%).
PROPN tokens may have the following values of Gender:
Fem (12; 44% of non-empty Gender): එතනා, සිරිමල්, එතනාව
Masc (15; 56% of non-empty Gender): වත්හිමි, අප්පුවා, අග්බෝ, බුවනෙකබා, ගලබැද්දේරාළ, බණ්ඩාර
EMPTY (33): සිරිමල්, අප්පුවා, අප්පුවාට, ඇතුගල, කුරුණෑගල, බුවනෙකබා, වීරගල, අග්බෝ, ඇතුගල්පුරේ, එතනාට
PRON
1 PRON token (4% of all PRON tokens) has a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=Dat (1; 100%), Number=EMPTY (1; 100%), Person=EMPTY (1; 100%), PronType=Prs (1; 100%).
PRON tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): ඇයට
EMPTY (26): ඒ, මේ, එක, මං, සිය, අර, එකයි, ඒකට, ඔය, ඔයා
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[compound]–> PROPN (5; 71%),
PROPN –[nsubj]–> NOUN (2; 100%),
PROPN –[conj]–> PROPN (1; 100%).
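
The agreement figures above can be approximated by walking every dependency edge and comparing the Gender of parent and child. The sketch below assumes that the percentage is the share of agreeing pairs among edges where both nodes carry a non-empty Gender; that reading is an assumption, as are the invented `SAMPLE` rows and the hypothetical `agreement` helper.

```python
from collections import Counter

# Toy CoNLL-U sentence, invented for illustration only (not from UD_Sinhala-Appuwa).
SAMPLE = """\
1\tA\ta\tPROPN\t_\tGender=Fem\t2\tcompound\t_\t_
2\tB\tb\tNOUN\t_\tGender=Fem\t0\troot\t_\t_
3\tC\tc\tPROPN\t_\tGender=Masc\t2\tcompound\t_\t_
4\tD\td\tNOUN\t_\t_\t2\tconj\t_\t_
"""

def gender_of(feats):
    """Return the Gender value from a FEATS column, or None if it is absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None

def agreement(text):
    """Count, per (parent UPOS, deprel, child UPOS), agreeing and comparable edges."""
    agree, both = Counter(), Counter()
    for block in text.strip().split("\n\n"):  # sentences are separated by blank lines
        rows = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                rows[cols[0]] = cols
        for cols in rows.values():
            parent = rows.get(cols[6])  # HEAD is 0 for the root, which has no parent row
            if parent is None:
                continue
            g_child, g_parent = gender_of(cols[5]), gender_of(parent[5])
            if g_child is None or g_parent is None:
                continue  # only edges where both nodes have Gender are comparable
            key = (parent[3], cols[7], cols[3])
            both[key] += 1
            if g_child == g_parent:
                agree[key] += 1
    return agree, both

agree, both = agreement(SAMPLE)
for (p_upos, deprel, c_upos), n in agree.most_common(10):
    share = round(100 * n / both[(p_upos, deprel, c_upos)])
    print(f"{p_upos} –[{deprel}]–> {c_upos} ({n}; {share}%)")
```

In the toy sentence, one of the two comparable `compound` edges agrees, and the `conj` edge is skipped because its child has no Gender.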