Treebank Statistics: UD_Erzya-JR: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
6401 tokens (31%) have a non-empty value of Definite.
3467 types (52%) occur at least once with a non-empty value of Definite.
1751 lemmas (57%) occur at least once with a non-empty value of Definite.
The feature is used with 12 part-of-speech tags: NOUN (4278; 21% instances), PROPN (692; 3% instances), PRON (465; 2% instances), ADJ (395; 2% instances), VERB (211; 1% instances), DET (137; 1% instances), NUM (123; 1% instances), ADV (73; 0% instances), ADP (13; 0% instances), PART (11; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances).
NOUN
4278 NOUN tokens (84% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number[psor]=EMPTY (4270; 100%), Person[psor]=EMPTY (4270; 100%).
NOUN tokens may have the following values of Definite:
Def(1571; 37% of non-emptyDefinite): бандитэсь, партизантнэ, кенкшенть, бандитнэ, бандитнэнь, чись, ломанесь, моданть, тайганть, цёрасьInd(2707; 63% of non-emptyDefinite): лангс, ёнов, лангсо, кудов, тев, ялгат, ломань, течи, ёндо, лангаEMPTY(842): кедензэ, прянзо, авазо, сельмензэ, аванзо, чамазо, кедьсэнзэ, лангозонзо, пильгензэ, седеезэ
| Paradigm ланго | Ind | Def |
|---|---|---|
| Case=Ela|NounType=Relat|Number=Plur,Sing | лангсто | |
| Case=Ela|Number=Plur,Sing | лангсто | |
| Case=Gen|NounType=Relat|Number=Sing | лангонть | |
| Case=Ill|Clitic=Add|Number=Plur,Sing | лангскак | |
| Case=Ill|NounType=Relat|Number=Plur,Sing | лангс, ланкс | |
| Case=Ill|NounType=Relat|Number=Plur,Sing|Typo=Yes | ланкс | |
| Case=Ine|NounType=Relat|Number=Plur,Sing | лангсо, ланксо | |
| Case=Ine|NounType=Relat|Number=Plur,Sing|Number[psor]=Sing|Person[psor]=3 | Лангсонзо | |
| Case=Ine|NounType=Relat|Number=Plur,Sing|Typo=Yes | ланксо | |
| Case=Ine|Number=Plur,Sing | лангсо, ланксо | |
| Case=Ine|Number=Plur,Sing|Number[psor]=Sing|Person[psor]=3 | лангсонзо | |
| Case=Lat|Number=Plur,Sing | лангов | |
| Case=Nom|NounType=Relat|Number=Sing | лангось | |
| Case=Prl|NounType=Relat|Number=Plur,Sing | ланга | |
| Case=Prl|Number=Plur,Sing | ланга | |
| Case=Prl|Number=Plur,Sing|Number[psor]=Sing|Person[psor]=2 | лангат | |
| Case=Prl|Number=Plur,Sing|Number[psor]=Sing|Person[psor]=3 | ланганзо |
PROPN
692 PROPN tokens (98% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Gender=EMPTY (576; 83%), Case=Nom (487; 70%), Number=Sing (486; 70%), Animacy=Hum (467; 67%), NameType=Giv (362; 52%).
PROPN tokens may have the following values of Definite:
Def(10; 1% of non-emptyDefinite): Ведеськак, Игаузусь, Инечись, Кучаевтнень, Нишкенть, Паргелесь, Равось, Ховра, Христозонть, ЦярданесьInd(682; 99% of non-emptyDefinite): Микол, Ястребов, Любань, Палько, Люба, Федоров, Маря, Кирё, Кечай, МиколоньEMPTY(16): Степан, Ардан, Баёвасо, Бездна, Кедяровонь, Надюм, Нефёдкань, Обранынкань, Олодимирэнь, Охон
| Paradigm Цярдань | Ind | Def |
|---|---|---|
| Case=Lat|Clitic=Add|Number=Plur,Sing | Цярданевгак | |
| Case=Lat|Number=Plur,Sing | Цярданев | |
| Case=Nom|Number=Sing | Цярдань | Цярданесь |
Definite seems to be lexical feature of PROPN. 97% lemmas (177) occur only with one value of Definite.
PRON
465 PRON tokens (39% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Person=EMPTY (465; 100%), Variant=EMPTY (465; 100%), Case=Nom (328; 71%), Number=Sing (306; 66%).
PRON tokens may have the following values of Definite:
Def(86; 18% of non-emptyDefinite): вейкесь, мейсь, эрьвась, сь, конась, ламотне, нетне, нть, весементь, конасонтьInd(379; 82% of non-emptyDefinite): весе, те, мезе, кона, конань, неть, конат, истямо, тень, мезеньEMPTY(734): сон, мон, сонзэ, тон, сонсь, минь, сынь, минек, тензэ, сынст
| Paradigm мезе | Ind | Def |
|---|---|---|
| Case=Abl|Number=Plur,Sing|PronType=Ind | мезде | |
| Case=Abl|Number=Plur,Sing|PronType=Int | мезде | |
| Case=Abl|Number=Plur,Sing|PronType=Rel | мезде | |
| Case=Gen|ExtPos=PRON|Number=Plur,Sing|PronType=Int | мезень | |
| Case=Gen|Number=Plur,Sing|PronType=Ind | мезень | |
| Case=Gen|Number=Plur,Sing|PronType=Int | мезень, мень | |
| Case=Ill|Number=Plur,Sing|PronType=Int | мезес | |
| Case=Ine|Number=Plur,Sing|Number[subj]=Sing|Person[subj]=3|PronType=Ind|Tense=Past | мейсэль | |
| Case=Ine|Number=Plur,Sing|PronType=Int | мейсэ | |
| Case=Nom|ExtPos=PRON|Number=Sing|PronType=Int | мезе | |
| Case=Nom|ExtPos=PRON|Number=Plur|PronType=Int | месть | |
| Case=Nom|Number=Sing|Number[subj]=Sing|Person[subj]=3|PronType=Int|Tense=Past | мезель | |
| Case=Nom|Number=Sing|PronType=Ind | мезе | |
| Case=Nom|Number=Sing|PronType=Int | мезе | |
| Case=Nom|Number=Sing|PronType=Rel | мезе | мезесь |
| Case=Nom|Number=Plur|PronType=Int | мезть | |
| Case=Tra|Number=Plur,Sing|PronType=Int | мезекс |
ADJ
395 ADJ tokens (44% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Number[subj]=EMPTY (374; 95%), Person[subj]=EMPTY (374; 95%), Tense=EMPTY (374; 95%), Case=Nom (348; 88%), Number=Sing (316; 80%).
ADJ tokens may have the following values of Definite:
Def(39; 10% of non-emptyDefinite): омбоцесь, васенценть, вишкинетне, колмоцесь, омбоценть, Остаткась, Превеесь, Псись, беднойтне, берятненьInd(356; 90% of non-emptyDefinite): кодамо, од, омбоце, мазый, лембе, паро, покш, виев, кельме, кодатEMPTY(506): од, арась, паро, покш, якстере, пиже, сэрей, тусто, кедровой, берянь
| Paradigm покш | Ind | Def |
|---|---|---|
| Case=Abl|Clitic=Add|Number=Plur,Sing | Покштояк | |
| Case=Dat|Number=Plur | покштненень | |
| Case=Nom|Number=Sing | покш | покшось |
| Case=Nom|Number=Sing|Number[subj]=Sing|Person[subj]=3|Tense=Past | покшоль | |
| Case=Nom|Number=Plur | покшт |
VERB
211 VERB tokens (6% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Mood=EMPTY (211; 100%), Number[obj]=EMPTY (211; 100%), Person[obj]=EMPTY (211; 100%), Number[subj]=EMPTY (208; 99%), Person[subj]=EMPTY (208; 99%), Case=Nom (147; 70%), Tense=EMPTY (122; 58%).
VERB tokens may have the following values of Definite:
Def(43; 20% of non-emptyDefinite): молицятнень, сыцятнень, Ацирьгадоманть, Ванстыцясь, Неезденть, ванстомкась, ванстомкатне, ванстыцятне, вастнематне, видематненьInd(168; 80% of non-emptyDefinite): сэредиця, вечкевикс, касыця, молиця, Ярсамодо, аштиця, валгиця, вечкема, кадовозь, солавтозьEMPTY(3518): мерсь, лиссь, ютась, мольсь, ашти, неяви, совась, маряви, саизе, сась
| Paradigm молемс | Ind | Def |
|---|---|---|
| Case=Gen|Nomzr=Ag|Number=Plur|Tense=Pres|VerbForm=Part | молицятнень | |
| Case=Gen|Nomzr=Ag|Number=Plur|VerbForm=Part | молицятнень | |
| Case=Gen|Number=Sing|VerbForm=Vnoun | молеманть | |
| Case=Nom|Nomzr=Ag|Number=Sing|Tense=Pres|VerbForm=Part | молиця | |
| Case=Tra|Nomzr=Ag|Number=Plur,Sing|Tense=Pres|VerbForm=Part | молицякс |
Definite seems to be lexical feature of VERB. 92% lemmas (140) occur only with one value of Definite.
DET
137 DET tokens (60% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=Nom (137; 100%), Reflex=EMPTY (137; 100%), Number=Sing (128; 93%).
DET tokens may have the following values of Definite:
Def(2; 1% of non-emptyDefinite): Истятнэ, конаськакInd(135; 99% of non-emptyDefinite): эрьва, те, истямо, лия, ламо, кона, зяро, истят, весе, сеEMPTY(91): те, эсь, ве, се, не, теке, Зяро, Кона, Нона, конань
| Paradigm истямо | Ind | Def |
|---|---|---|
| Number=Sing | истямо | |
| Number=Plur | истят | Истятнэ |
NUM
123 NUM tokens (72% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: Case=Nom (108; 88%), Number=Sing (99; 80%), NumType=Card (86; 70%).
NUM tokens may have the following values of Definite:
Def(7; 6% of non-emptyDefinite): Веенстнэнь, Кавтонтень, Кавтотне, Колмонть, Омбонстнэ, колмоценстнэ, нилетнеInd(116; 94% of non-emptyDefinite): вейке, кавто, колмо, ниле, вейкеть, кавто-колмо, колоньгеменьшка, Комсь, вейкеде, вейксэEMPTY(47): ве, кавто, колмо, веенст, Кавонстонь-кавонстонь, Кеветеешка, Колмонь-колмонь, вейкекс, ветешка, кавто-колмо
| Paradigm кавто | Ind | Def |
|---|---|---|
| Case=Cmp|Number=Plur,Sing|NumType=Card | Кавтошка | |
| Case=Dat|Number=Sing|NumType=Card | Кавтонтень | |
| Case=Nom|Number=Sing | кавто | |
| Case=Nom|Number=Sing|NumType=Card | кавто | |
| Case=Nom|Number=Plur|NumType=Card | Кавтотне |
ADV
73 ADV tokens (4% of all ADV tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: PronType=EMPTY (66; 90%), AdvType=EMPTY (50; 68%).
ADV tokens may have the following values of Definite:
Ind(73; 100% of non-emptyDefinite): колияк, ламо, зярдо-бути, зярдояк, истямо, мекев, аламо, аламодо, вельть, кодамоEMPTY(1600): ансяк, кода, пек, истя, мейле, ней, уш, седе, прок, яла
Definite seems to be lexical feature of ADV. 100% lemmas (40) occur only with one value of Definite.
ADP
13 ADP tokens (3% of all ADP tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADP and Definite co-occurred: AdpType=Post (13; 100%), Number[psor]=EMPTY (13; 100%), Person[psor]=EMPTY (13; 100%), AdvType=Loc (8; 62%).
ADP tokens may have the following values of Definite:
Ind(13; 100% of non-emptyDefinite): перька, ваксс, вакссо, томбалев, ало, вакска, удаловEMPTY(463): марто, мельга, кис, эйстэ, мельганзо, пачк, эйсэ, мартонзо, туртов, ваксс
PART
11 PART tokens (8% of all PART tokens) have a non-empty value of Definite.
PART tokens may have the following values of Definite:
Ind(11; 100% of non-emptyDefinite): ялатеке, допрок, тыц, Каня, Эрь, ведьEMPTY(125): жо, бути, ли, прок, вана, буто, эно, Арази, Ведь, Бульчом
INTJ
2 INTJ tokens (2% of all INTJ tokens) have a non-empty value of Definite.
INTJ tokens may have the following values of Definite:
Ind(2; 100% of non-emptyDefinite): Бах, ОйEMPTY(117): вана, ох, виде, ну, вай, Арась, Эх, ура, Я, Да
SCONJ
1 SCONJ tokens (2% of all SCONJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which SCONJ and Definite co-occurred: AdvType=EMPTY (1; 100%).
SCONJ tokens may have the following values of Definite:
Ind(1; 100% of non-emptyDefinite): текеEMPTY(65): зярдо, бути, теке, кода, куш, што, штобу, прок, хоть, Коли
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[conj]–> NOUN (153; 88%),
NOUN –[nummod]–> NUM (77; 63%),
PROPN –[nmod]–> PROPN (23; 96%),
ADJ –[conj]–> ADJ (20; 91%),
PROPN –[flat:name]–> PROPN (16; 89%),
PROPN –[conj]–> PROPN (13; 100%),
NOUN –[obl]–> NOUN (11; 55%),
NOUN –[amod]–> VERB (9; 53%),
NOUN –[amod]–> DET (8; 89%),
NOUN –[fixed]–> NOUN (8; 100%).