Treebank Statistics: UD_Sinhala-STB: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
191 tokens (22%) have a non-empty value of Definite.
166 types (33%) occur at least once with a non-empty value of Definite.
142 lemmas (34%) occur at least once with a non-empty value of Definite.
The feature is used with 5 part-of-speech tags: NOUN (175; 20% instances), PROPN (12; 1% instances), VERB (2; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances).
NOUN
175 NOUN tokens (57% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (158; 90%), Animacy=EMPTY (138; 79%), Gender=Neut (119; 68%).
NOUN tokens may have the following values of Definite:
Def(126; 72% of non-emptyDefinite): මහතා, කිරීම, ආණ්ඩුව, ජනතාව, තත්ත්වය, අයවැය, අවස්ථාව, ආර්ථිකය, උද්ධමනය, ක්රමයInd(49; 28% of non-emptyDefinite): ප්රධානයකු, හැඟීමක්, අදහසක්, කතාවක්, කයිවාරුවක්, කලකට, කලක්, කලාපයෙකි, කාරණයෙකි, කාලයකින්EMPTY(133): ආර්ථික, දේශපාලන, යුද, සිදු, හමුදා, අද, අහෝසි, කොටි, ජනතාවට, බොහෝ
| Paradigm තත්ත්ව | Ind | Def |
|---|---|---|
| Case=Acc | තත්ත්වයක් | |
| Case=Loc | තත්ත්වයක | |
| Case=Nom | තත්ත්වය |
Definite seems to be lexical feature of NOUN. 94% lemmas (126) occur only with one value of Definite.
PROPN
12 PROPN tokens (32% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (12; 100%), Person=EMPTY (12; 100%), Animacy=EMPTY (11; 92%), Foreign=EMPTY (11; 92%), Gender=Neut (11; 92%).
PROPN tokens may have the following values of Definite:
Def(11; 92% of non-emptyDefinite): ලංකාව, අමෙරිකාවේ, ඉන්දියාව, ඉරානය, චීනය, ටැන්සානියාව, පලස්තීනය, පාකිස්ථානය, ලංකාවට, සිංගප්පූරුවInd(1; 8% of non-emptyDefinite): ලංකාවක්EMPTY(26): ශ්රී, මහින්ද, රනිල්, රාජපක්ෂ, වික්රමසිංහ, ෆොන්සේකා, කොසෝවෝ, ජුලියස්, නියරේරේ, මාඕ
| Paradigm ලංකා | Ind | Def |
|---|---|---|
| Case=Dat | ලංකාවට | |
| Case=Nom | ලංකාවක් | ලංකාව |
VERB
2 VERB tokens (2% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Aspect=EMPTY (2; 100%), Mood=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Ger (2; 100%), Voice=EMPTY (2; 100%).
VERB tokens may have the following values of Definite:
Def(2; 100% of non-emptyDefinite): කිරීමට, යැවීමේEMPTY(105): කර, තිබේ, ඇත්තේ, කළ, කළේ, දී, පාවා, වන්නේ, විය, වී
ADV
1 ADV tokens (3% of all ADV tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: AdvType=Tim (1; 100%).
ADV tokens may have the following values of Definite:
Def(1; 100% of non-emptyDefinite): අවසානයේEMPTY(35): ඉතා, එහි, දැන්, අද, පෙර, මුළුමනින්, අඛණ්ඩව, එදා, එලෙස, එසේ
PRON
1 PRON tokens (2% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Animacy=EMPTY (1; 100%), Case=Nom (1; 100%), Gender=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=Ind (1; 100%).
PRON tokens may have the following values of Definite:
Ind(1; 100% of non-emptyDefinite): කිහිපයක්EMPTY(43): ඔහු, ඒ, එය, එහි, ඊට, ඔහුට, සිය, අප, අපට, අපේ
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[compound:prt]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod:poss]–> PROPN (1; 100%),
PROPN –[conj]–> PROPN (1; 100%),
VERB –[compound:lvc]–> NOUN (1; 100%).