Treebank Statistics: UD_Sinhala-STB: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
191 tokens (22%) have a non-empty value of Definite.
166 types (33%) occur at least once with a non-empty value of Definite.
142 lemmas (34%) occur at least once with a non-empty value of Definite.
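Token, type and lemma counts of this kind can be recomputed from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `coverage`) are hypothetical, the sample rows are toy data rather than treebank content, and it assumes that a "type" is a distinct word form.

```python
from collections import namedtuple

Token = namedtuple("Token", "form lemma upos feats")

def parse_feats(column):
    """Turn a CoNLL-U FEATS string such as 'Definite=Def|Number=Sing' into a dict."""
    if column == "_":
        return {}
    return dict(pair.split("=", 1) for pair in column.split("|"))

def read_tokens(conllu_text):
    """Yield one Token per word line, skipping comments, multiword ranges and empty nodes."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield Token(cols[1], cols[2], cols[3], parse_feats(cols[5]))

def coverage(tokens, feature="Definite"):
    """Return (with feature, total) pairs for tokens, types (word forms) and lemmas."""
    tokens = list(tokens)
    marked = [t for t in tokens if feature in t.feats]
    return {
        "tokens": (len(marked), len(tokens)),
        "types": (len({t.form for t in marked}), len({t.form for t in tokens})),
        "lemmas": (len({t.lemma for t in marked}), len({t.lemma for t in tokens})),
    }

# Toy rows (hypothetical forms and lemmas), one list per CoNLL-U token line.
SAMPLE_ROWS = [
    ["1", "housa", "hous", "NOUN", "_", "Definite=Def|Number=Sing", "3", "nsubj", "_", "_"],
    ["2", "housak", "hous", "NOUN", "_", "Definite=Ind|Number=Sing", "3", "obj", "_", "_"],
    ["3", "see", "see", "VERB", "_", "_", "0", "root", "_", "_"],
    ["4", "housa", "hous", "NOUN", "_", "Definite=Def|Number=Sing", "3", "obl", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

result = coverage(read_tokens(SAMPLE))
print(result)
# tokens: 3 of 4, types: 2 of 3, lemmas: 1 of 2
```

Dividing the first number of each pair by the second gives percentages comparable to those reported above.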
The feature is used with 5 part-of-speech tags: NOUN (175; 20% instances), PROPN (12; 1% instances), VERB (2; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances).
NOUN
175 NOUN tokens (57% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (158; 90%), Animacy=EMPTY (138; 79%), Gender=Neut (119; 68%).
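As an illustration of how such co-occurrence counts could be obtained, the sketch below counts the values of all other features on tokens of one part of speech that carry Definite, recording EMPTY when a feature is absent. It is only a sketch under assumptions: the function name `cooccurring_values` is hypothetical, the input is toy data, and treating "every feature seen anywhere on that part of speech" as the set of candidate features is a guess at how EMPTY is determined.

```python
from collections import Counter

def cooccurring_values(tokens, upos, feature="Definite"):
    """tokens: iterable of (upos, feats_dict) pairs.

    Returns {"Feature=Value": (count, percent)} over tokens of the given
    UPOS that have a non-empty value of `feature`.
    """
    same_pos = [feats for pos, feats in tokens if pos == upos]
    # Assumption: candidate features are those seen on this UPOS at least once.
    known = {name for feats in same_pos for name in feats} - {feature}
    marked = [feats for feats in same_pos if feature in feats]
    counts = Counter()
    for feats in marked:
        for name in known:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return {value: (n, round(100 * n / len(marked))) for value, n in counts.items()}

# Toy input (hypothetical annotations, not treebank data).
toy_tokens = [
    ("NOUN", {"Definite": "Def", "Number": "Sing", "Gender": "Neut"}),
    ("NOUN", {"Definite": "Ind", "Number": "Sing"}),
    ("NOUN", {"Number": "Plur", "Animacy": "Anim"}),
    ("VERB", {"VerbForm": "Ger"}),
]

stats = cooccurring_values(toy_tokens, "NOUN")
for value, (n, pct) in sorted(stats.items(), key=lambda item: -item[1][0]):
    print(f"{value} ({n}; {pct}%)")
```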
NOUN tokens may have the following values of Definite:
Def (126; 72% of non-empty Definite): මහතා, කිරීම, ආණ්ඩුව, ජනතාව, තත්ත්වය, අයවැය, අවස්ථාව, ආර්ථිකය, උද්ධමනය, ක්රමය
Ind (49; 28% of non-empty Definite): ප්රධානයකු, හැඟීමක්, අදහසක්, කතාවක්, කයිවාරුවක්, කලකට, කලක්, කලාපයෙකි, කාරණයෙකි, කාලයකින්
EMPTY (133): ආර්ථික, දේශපාලන, යුද, සිදු, හමුදා, අද, අහෝසි, කොටි, ජනතාවට, බොහෝ
Paradigm තත්ත්ව | Ind | Def |
---|---|---|
Case=Acc | තත්ත්වයක් | |
Case=Loc | තත්ත්වයක | |
Case=Nom | | තත්ත්වය |
Definite seems to be a lexical feature of NOUN. 94% of lemmas (126) occur with only one value of Definite.
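The "lexical feature" observation can be checked by grouping the observed Definite values by lemma. The sketch below assumes that the percentage is taken over lemmas that occur at least once with a non-empty value. The function name `single_value_share` and the placeholder lemmas `lemma_a` and `lemma_b` are hypothetical; only තත්ත්ව, which appears with both values in the paradigm above, comes from this page.

```python
from collections import defaultdict

def single_value_share(pairs):
    """pairs: iterable of (lemma, value) for tokens with a non-empty value.

    Returns (lemmas with exactly one distinct value, all lemmas seen).
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)

# Toy observations: one lemma seen with both values, two with a single value.
observations = [
    ("තත්ත්ව", "Def"),
    ("තත්ත්ව", "Ind"),
    ("lemma_a", "Def"),
    ("lemma_a", "Def"),
    ("lemma_b", "Ind"),
]

single, total = single_value_share(observations)
print(f"{single} of {total} lemmas occur with only one value")
# prints "2 of 3 lemmas occur with only one value"
```

A high share suggests that the value is fixed per lemma rather than inflected, which is what the statement above reports for NOUN.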
PROPN
12 PROPN tokens (32% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (12; 100%), Person=EMPTY (12; 100%), Animacy=EMPTY (11; 92%), Foreign=EMPTY (11; 92%), Gender=Neut (11; 92%).
PROPN tokens may have the following values of Definite:
Def (11; 92% of non-empty Definite): ලංකාව, අමෙරිකාවේ, ඉන්දියාව, ඉරානය, චීනය, ටැන්සානියාව, පලස්තීනය, පාකිස්ථානය, ලංකාවට, සිංගප්පූරුව
Ind (1; 8% of non-empty Definite): ලංකාවක්
EMPTY (26): ශ්රී, මහින්ද, රනිල්, රාජපක්ෂ, වික්රමසිංහ, ෆොන්සේකා, කොසෝවෝ, ජුලියස්, නියරේරේ, මාඕ
Paradigm ලංකා | Ind | Def |
---|---|---|
Case=Dat | ලංකාවට | |
Case=Nom | ලංකාවක් | ලංකාව |
VERB
2 VERB tokens (2% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Aspect=EMPTY (2; 100%), Mood=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Ger (2; 100%), Voice=EMPTY (2; 100%).
VERB tokens may have the following values of Definite:
Def (2; 100% of non-empty Definite): කිරීමට, යැවීමේ
EMPTY (105): කර, තිබේ, ඇත්තේ, කළ, කළේ, දී, පාවා, වන්නේ, විය, වී
ADV
1 ADV token (3% of all ADV tokens) has a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: AdvType=Tim (1; 100%).
ADV tokens may have the following values of Definite:
Def (1; 100% of non-empty Definite): අවසානයේ
EMPTY (35): ඉතා, එහි, දැන්, අද, පෙර, මුළුමනින්, අඛණ්ඩව, එදා, එලෙස, එසේ
PRON
1 PRON token (2% of all PRON tokens) has a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Animacy=EMPTY (1; 100%), Case=Nom (1; 100%), Gender=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=Ind (1; 100%).
PRON tokens may have the following values of Definite:
Ind (1; 100% of non-empty Definite): කිහිපයක්
EMPTY (43): ඔහු, ඒ, එය, එහි, ඊට, ඔහුට, සිය, අප, අපට, අපේ
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[compound:prt]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod:poss]–> PROPN (1; 100%),
PROPN –[conj]–> PROPN (1; 100%),
VERB –[compound:lvc]–> NOUN (1; 100%).
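Agreement figures like those listed above could be derived by comparing each node with its head. The sketch below is one possible reading, not the page's actual procedure: it assumes that only parent-child pairs in which both nodes have a non-empty Definite are considered, and that the percentage is the share of those pairs with equal values. The function name `agreement_by_relation` and the toy sentence are hypothetical.

```python
from collections import Counter

def agreement_by_relation(sentence, feature="Definite"):
    """sentence: list of dicts with keys id, head, upos, deprel, feats.

    Returns {(parent UPOS, deprel, child UPOS): (agreeing pairs, all pairs)},
    counting only pairs where both nodes carry the feature.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0, which is not a token
            continue
        child_value = child["feats"].get(feature)
        parent_value = parent["feats"].get(feature)
        if child_value is None or parent_value is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child_value == parent_value:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}

# Toy sentence (hypothetical annotation, not taken from the treebank).
toy_sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Definite": "Def"}},
    {"id": 2, "head": 1, "upos": "NOUN", "deprel": "conj", "feats": {"Definite": "Def"}},
    {"id": 3, "head": 1, "upos": "PROPN", "deprel": "nmod:poss", "feats": {"Definite": "Ind"}},
    {"id": 4, "head": 1, "upos": "VERB", "deprel": "acl", "feats": {}},
]

relations = agreement_by_relation(toy_sentence)
for (parent, deprel, child), (ok, n) in relations.items():
    print(f"{parent} -[{deprel}]-> {child} ({ok}; {round(100 * ok / n)}%)")
```

In the toy sentence the `conj` pair agrees, the `nmod:poss` pair does not, and the VERB child is skipped because it has no Definite value.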