Treebank Statistics: UD_Pomak-Philotis: Features: Case
This feature is universal.
It occurs with 4 different values: Acc
, Gen
, Nom
, Voc
.
28087 tokens (32%) have a non-empty value of Case
.
5879 types (54%) occur at least once with a non-empty value of Case
.
1979 lemmas (50%) occur at least once with a non-empty value of Case
.
The feature is used with 7 part-of-speech tags: NOUN (12757; 15% instances), PRON (8230; 9% instances), DET (2933; 3% instances), ADJ (2219; 3% instances), PROPN (1086; 1% instances), VERB (471; 1% instances), NUM (391; 0% instances).
NOUN
12757 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Number=Sing (10031; 79%), Definite=Ind (6883; 54%), Deixis=EMPTY (6883; 54%).
NOUN
tokens may have the following values of Case
:
Acc
(8062; 63% of non-emptyCase
): déne, godíny, kóštono, rábato, vódo, čulǽka, vréme, mǽsto, rábaty, vakýtGen
(396; 3% of non-emptyCase
): bubájku, žanójne, kópeløtune, májci, čulǽkune, synúne, vasiļázune, brátu, momójne, čárüneNom
(4021; 32% of non-emptyCase
): májka, čulǽkon, čulǽk, kópeløno, bubájko, žanána, mómičeno, momána, rábata, astinomíjenaVoc
(278; 2% of non-emptyCase
): sýne, dǽdo, ma, bubá, bábo, pópe, báte, dóšterø, čárü, žónoEMPTY
(102): gün, keré, korf, dumá, kerét, sredénošt, senǽ, i.d., gündé, inkǽr
Paradigm čulǽk | Nom | Acc | Gen | Voc |
---|---|---|---|---|
Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | čulǽkase | |||
Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | čulǽkot | |||
Definite=Def|Deixis=Remt|Number=Sing | čulǽkon, čulékon, čulékan | čulǽkane | čulǽkune, čulékune | |
Definite=Ind|Degree=Dim|Number=Sing | čulǽček | čulǽčka | čulǽčku | |
Definite=Ind|Number=Sing | čulǽk, čulék | čulǽka, čulék | čulǽku | čulǽku |
Definite=Ind|Number=Plur | čulǽkove | |||
Definite=Ind|Number=Count | čulǽka |
PRON
8230 PRON tokens (94% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: PronType=Prs (8223; 100%), Reflex=EMPTY (5822; 71%), Number=Sing (4754; 58%), Gender=EMPTY (4650; 57%), Person=3 (4177; 51%).
PRON
tokens may have the following values of Case
:
Acc
(4309; 52% of non-emptyCase
): só, go, jé, gi, mó, sa, tó, tóga, to, móneGen
(2676; 33% of non-emptyCase
): sí, mú, mí, mu, tí, ji, jí, mi, ti, hiNom
(1245; 15% of non-emptyCase
): toj, tja, ja, ty, to, tíje, nýje, výje, teh, jeEMPTY
(540): kaná, kakná, kanána, síčkono, kaknána, kanáta, síčko, kaknása, síčkoso, Kanása
Paradigm ja | Nom | Acc | Gen |
---|---|---|---|
Animacy=Hum|Gender=Masc|Number=Plur|Person=3|PronType=Prs | tíje | tæh | |
Animacy=Nhum|Gender=Masc|Number=Plur|Person=3|PronType=Prs | to | to | |
Gender=Masc|Number=Sing|Person=3|PronType=Prs | toj | go, tóga, gu, tógu, néga | mú, mo, tómu |
Gender=Masc|Number=Plur|Person=3|PronType=Prs | to | Tæm | |
Gender=Fem|Number=Sing|Person=3 | jé | ||
Gender=Fem|Number=Sing|Person=3|PronType=Prs | tja, te, tje, tæ | jé, týje, jo, ja, néje, týjo | jí, hi, je, tój |
Gender=Fem|Number=Plur|Person=3|PronType=Prs | to | to | |
Gender=Neut|Number=Sing|Person=3|PronType=Prs | to | go, to, gu | mú, mo, tómu, mó |
Gender=Neut|Number=Plur|Person=3|PronType=Prs | to | to | |
Number=Sing|Person=1|PronType=Int | Móne | ||
Number=Sing|Person=1|PronType=Prs | ja, je | mó, móne, ma, me, méne, máne | mí, móne, máne, Méne |
Number=Sing|Person=2|PronType=Prs | ty, ti | tó, tébe, ta, te | tí, tébe, di, t |
Number=Sing|Person=3|PronType=Prs | to | ||
Number=Plur|Person=1|PronType=Prs | nýje, níje | nú, námi, mí, no, ny | mí, nú, ny, no, Námi, ni |
Number=Plur|Person=2|PronType=Prs | výje, ve | vú, vámi, u | vú, u, vi, vo, vámi |
Number=Plur|Person=3|PronType=Prs | teh, te | gi, te, teh, gy | mí, tæm |
DET
2933 DET tokens (86% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: DeixisRef=EMPTY (2558; 87%), Number=Sing (2399; 82%), Deixis=EMPTY (1630; 56%), Gender=Masc (1538; 52%).
DET
tokens may have the following values of Case
:
Acc
(1670; 57% of non-emptyCase
): annó, annók, ennó, inazí, drúgo, žýne, drúgy, žókne, ennók, kakvóGen
(80; 3% of non-emptyCase
): vritsǽm, annómu, drúgumune, kutrómu, žómune, inózimu, žǽmne, druzǽmne, annój, bannómuNom
(1180; 40% of non-emptyCase
): adín, kutrí, žýjen, inazí, vrítsi, anná, žíne, badín, kotrí, annóVoc
(3; 0% of non-emptyCase
): móje, nášoEMPTY
(490): bir, vrit, nǽko, kólko, inélkus, her, kač, sǽko, kólkono, bu
ADJ
2219 ADJ tokens (85% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Number=Sing (1714; 77%), Definite=Ind (1297; 58%), Deixis=EMPTY (1297; 58%).
ADJ
tokens may have the following values of Case
:
Acc
(1145; 52% of non-emptyCase
): cǽla, cǽlo, gulǽma, gulǽmo, Pomácko, húbava, húbavo, kámatno, móske, górnonoGen
(46; 2% of non-emptyCase
): stárumune, mládumune, míčkumune, stároj, stáru, čúdnune, žývumune, Evréjinu, Evréjinune, KarakačéninuneNom
(1012; 46% of non-emptyCase
): stáryjen, mládyjen, starána, míčkyjen, gulǽmyjen, gulǽma, čárckyjen, gulǽm, star, PomáckaVoc
(16; 1% of non-emptyCase
): májčin, bábino, májčino, alláhovo, kámatny, mílu, míčko, stárku, červénoEMPTY
(393): mlógo, razý, málko, ájni, bajá, mlógu, mífko, nétekin, üčünǧǘno, ájnisi
Paradigm star | Nom | Acc | Gen | Voc |
---|---|---|---|---|
Animacy=Hum|Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Masc|Number=Plur | Stárite | |||
Animacy=Hum|Definite=Def|Deixis=Remt|Gender=Masc|Number=Plur | stárine | stárehne, stárene | stáremne | |
Animacy=Hum|Definite=Ind|Gender=Masc|Number=Plur | stári | stáreh | ||
Animacy=Nhum|Definite=Def|Deixis=Remt|Gender=Masc|Number=Plur | stáryne | |||
Definite=Def|Deixis=Prox|DeixisRef=1|Gender=Neut|Number=Sing | Stároso | |||
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Masc|Number=Sing | stáryjet | stárokte | stárumute | |
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Fem|Number=Sing | staráta | stároto | ||
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Fem|Number=Plur | stáryte | |||
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Neut|Number=Sing | stároto | |||
Definite=Def|Deixis=Remt|Gender=Masc|Number=Sing | stáryjen, stárijon, stáryen | stárokne, stárane | stárumune, stáromune | |
Definite=Def|Deixis=Remt|Gender=Fem|Number=Sing | starána | stárono | stárojne | |
Definite=Def|Deixis=Remt|Gender=Fem|Number=Plur | stáryne | |||
Definite=Ind|Degree=Dim|Gender=Masc|Number=Sing | stárku | |||
Definite=Ind|Gender=Masc|Number=Sing | star | stára, stárok | stáru | |
Definite=Ind|Gender=Fem|Number=Sing | stará | stáro, stára | stároj | |
Definite=Ind|Gender=Fem|Number=Plur | stáry | stáry |
PROPN
1086 PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Number=Sing (971; 89%), Definite=Ind (906; 83%).
PROPN
tokens may have the following values of Case
:
Acc
(548; 50% of non-emptyCase
): Ksánti, Elláda, Ǧumágün, Komotiní, Panedélnik, Kavála, Srǽdo, Siría, Tórnik, NedéleGen
(32; 3% of non-emptyCase
): Kélčetune, Melihí, Mári, Isénu, Nasradínu, Ajší, Asánu, Asíp, Azraílu, GalínkiNom
(469; 43% of non-emptyCase
): Aminǽ, Isén, Alí, Galínka, Nasradín, Hilmijá, Kélčeno, Ǧemilǽ, Ají, MáraVoc
(37; 3% of non-emptyCase
): Kíme, Jaút, Aminǽ, Pétre, Ahmét, Hamdí, Hasán, Mustufá, BABU, HóǧaEMPTY
(12): HIPNOSEDON, PAME, VULBEGAL, H., Kopsidá, LEXOTANIL, Miaúli, STEDON, XANAX
Paradigm Isén | Nom | Acc | Gen | Voc |
---|---|---|---|---|
Definite=Ind|Number=Sing | Isén | Isénu | Iséne | |
Isén | Isén, Iséna |
VERB
471 VERB tokens (3% of all VERB
tokens) have a non-empty value of Case
.
The most frequent other feature values with which VERB
and Case
co-occurred: Mood=EMPTY (471; 100%), Person=EMPTY (471; 100%), Tense=EMPTY (471; 100%), VerbForm=Part (470; 100%), Voice=Pass (470; 100%), Aspect=Perf (425; 90%), Number=Sing (345; 73%).
VERB
tokens may have the following values of Case
:
Acc
(211; 45% of non-emptyCase
): skrýto, platéno, umrǽta, atvóreny, nagadéno, spúšanokne, spúšanono, ukrádena, umrǽtokne, astávenaGen
(6; 1% of non-emptyCase
): davédenu, pǽtumune, skrýtumune, umrétune, zbrátem, šaštísanuneNom
(254; 54% of non-emptyCase
): naučéna, naučény, pǽtyjen, zarýtyjen, začúden, pǽti, zaglavény, zatvóren, začúdeni, kápnatiEMPTY
(14539): víka, reklól, íma, trǽbava, móža, reklála, hódi, právi, zøl, atišlól
Paradigm skrýjem | Nom | Acc | Gen |
---|---|---|---|
Animacy=Hum|Definite=Ind|Gender=Masc|Number=Plur | skrýte | ||
Definite=Def|Deixis=Remt|Gender=Masc|Number=Sing | skrýtumune | ||
Definite=Def|Deixis=Remt|Gender=Neut|Number=Sing | skrýtono | ||
Definite=Ind|Gender=Masc|Number=Sing | skryt | ||
Definite=Ind|Gender=Fem|Number=Sing | skrýta | skrýto | |
Definite=Ind|Gender=Neut|Number=Sing | skrýto | skrýto |
NUM
391 NUM tokens (33% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumType=Card (391; 100%), Definite=Ind (278; 71%), Deixis=EMPTY (278; 71%), Gender=Masc (257; 66%), Animacy=EMPTY (241; 62%), Number=Sing (237; 61%).
NUM
tokens may have the following values of Case
:
Acc
(259; 66% of non-emptyCase
): annó, annók, dvamína, ennók, dvamínana, dvomínehne, trimína, ennó, annókne, annónoGen
(8; 2% of non-emptyCase
): dvémne, annój, annómune, dvamínem, dvomínemne, trimínemne, trímneNom
(124; 32% of non-emptyCase
): dvomínana, dvamínana, trimínana, dvomína, adín, anná, trimína, annó, adínyjen, dvomínataEMPTY
(785): tri, dve, dva, dvéne, kyrk, beš, on, tríne, 6, jedí
Paradigm adín | Nom | Acc | Gen |
---|---|---|---|
Definite=Def|Deixis=Prox|DeixisRef=1|Gender=Fem | annóso | ||
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Masc | annógate | ||
Definite=Def|Deixis=Prox|DeixisRef=2|Gender=Neut | annóto, ennóto | ||
Definite=Def|Deixis=Remt|Gender=Masc | adínyjen, edíņon | annókne | |
Definite=Def|Deixis=Remt|Gender=Fem | annána | annóno | |
Definite=Def|Deixis=Remt|Gender=Neut | annóno | annóno | annómune |
Definite=Ind|Degree=Dim|Gender=Masc | anníček | ||
Definite=Ind|Degree=Dim|Gender=Fem | anníčka | ||
Definite=Ind|Gender=Masc | adín, jedín | annók, ennók, jennók, jedín | |
Definite=Ind|Gender=Fem | anná, enná | annó, ennó | annój |
Definite=Ind|Gender=Neut | annó | annó, ennó, jennó |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[det]–> DET (1833; 84%),
NOUN –[amod]–> ADJ (1404; 88%),
NOUN –[nmod]–> NOUN (635; 64%),
NOUN –[conj]–> NOUN (381; 95%),
NOUN –[amod]–> VERB (151; 90%),
NOUN –[nmod]–> PROPN (107; 58%),
PROPN –[nmod]–> PROPN (85; 86%),
ADJ –[det]–> DET (60; 73%),
ADJ –[conj]–> ADJ (56; 92%),
PROPN –[nmod]–> NOUN (46; 75%).