Treebank Statistics: UD_Sinhala-STB: Features: Number
This feature is universal.
It occurs with 3 different values: Plur, Ptan, Sing.
304 tokens (35%) have a non-empty value of Number.
249 types (50%) occur at least once with a non-empty value of Number.
215 lemmas (52%) occur at least once with a non-empty value of Number.
The feature is used with 4 part-of-speech tags: NOUN (242; 28% instances), PRON (29; 3% instances), PROPN (28; 3% instances), VERB (5; 1% instances).
NOUN
242 NOUN tokens (79% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Animacy=EMPTY (182; 75%), Gender=Neut (161; 67%).
NOUN tokens may have the following values of Number:
Plur(59; 24% of non-emptyNumber): කොටි, අංශ, අභියෝග, අයවැය, අස්සන්, ආණ්ඩුව, ආයතන, ආරාධනා, ආරාමවලට, උපදේශPtan(10; 4% of non-emptyNumber): යුද, කලකට, කලක්, ගිනි, දේශපාලන, බදු, විනය, සල්ලි, හමුදාවේSing(173; 71% of non-emptyNumber): මහතා, කිරීම, ජනතාව, තත්ත්වය, අයවැය, අවස්ථාව, ආණ්ඩුව, ආර්ථික, ආර්ථිකය, උද්ධමනයEMPTY(66): ආර්ථික, සිදු, හමුදා, අද, අහෝසි, දේශපාලන, බොහෝ, වෙනස්, අත්අඩංගුවේ, අනාගත
| Paradigm දේශපාලන | Plur | Ptan |
|---|---|---|
| Animacy=Inan | දේශපාලන | |
| Gender=Neut | දේශපාලන |
Number seems to be lexical feature of NOUN. 97% lemmas (183) occur only with one value of Number.
PRON
29 PRON tokens (66% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Poss=EMPTY (29; 100%), Animacy=EMPTY (24; 83%), Person=EMPTY (24; 83%), Case=Nom (18; 62%).
PRON tokens may have the following values of Number:
Plur(4; 14% of non-emptyNumber): අපට, ඔව්හු, ඔවුනට, ඔවුන්Sing(25; 86% of non-emptyNumber): ඔහු, එය, එහි, ඒ, ඔහුට, ඉන්, කිහිපයක්, මීට, මෙයEMPTY(15): ඒ, ඊට, සිය, අප, අපේ, එකිනෙකා, එම, තම, මේ
| Paradigm ඔහු | Sing | Plur |
|---|---|---|
| Animacy=Anim|Case=Nom|Typo=Yes | ඔව්හු | |
| Case=Dat|Gender=Masc|Person=3 | ඔහුට | |
| Case=Dat|Gender=Masc | ඔහුට | |
| Case=Nom|Gender=Masc|Person=3 | ඔහු | |
| Case=Nom|Gender=Masc | ඔහු |
PROPN
28 PROPN tokens (74% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Foreign=EMPTY (27; 96%), Animacy=EMPTY (20; 71%), Case=Nom (20; 71%), Definite=EMPTY (16; 57%).
PROPN tokens may have the following values of Number:
Sing(28; 100% of non-emptyNumber): ලංකාව, මහින්ද, රනිල්, රාජපක්ෂ, වික්රමසිංහ, ෆොන්සේකා, අමෙරිකාවේ, ඉන්දියාව, ඉරානය, චීනයEMPTY(10): ශ්රී, කොසෝවෝ, යුනෙස්කෝ, ලිප්ටන්, ෂැවොලින්, සර්බියානු
Number seems to be lexical feature of PROPN. 100% lemmas (19) occur only with one value of Number.
VERB
5 VERB tokens (5% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Aspect=EMPTY (5; 100%), Mood=Ind (3; 60%), Tense=Pres (3; 60%), VerbForm=Fin (3; 60%), Voice=Act (3; 60%).
VERB tokens may have the following values of Number:
Plur(2; 40% of non-emptyNumber): කරමු, ගනිතිSing(3; 60% of non-emptyNumber): කිරීමට, දරයි, යැවීමේEMPTY(102): කර, තිබේ, ඇත්තේ, කළ, කළේ, දී, පාවා, වන්නේ, විය, වී
| Paradigm කර | Sing | Plur |
|---|---|---|
| Definite=Def|VerbForm=Ger | කිරීමට | |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | කරමු |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[nsubj]–> PRON (7; 78%),
PROPN –[flat]–> NOUN (7; 88%),
PROPN –[flat]–> PROPN (6; 67%),
NOUN –[acl]–> NOUN (1; 100%),
NOUN –[compound:prt]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod:poss]–> PROPN (1; 100%),
PROPN –[conj]–> PROPN (1; 100%),
VERB –[compound:lvc]–> NOUN (1; 100%).