home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: Features: Case

This feature is universal. It occurs with 7 different values: Acc, Dat, Gen, Ins, Loc, Nom, Voc.

295929 tokens (60%) have a non-empty value of Case. 51533 types (83%) occur at least once with a non-empty value of Case. 22186 lemmas (78%) occur at least once with a non-empty value of Case. The feature is used with 7 part-of-speech tags: NOUN (135027; 27% instances), ADJ (68871; 14% instances), ADP (48326; 10% instances), DET (17556; 4% instances), PRON (15863; 3% instances), PROPN (7815; 2% instances), NUM (2471; 0% instances).

NOUN

135027 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (95304; 71%), Animacy=EMPTY (79149; 59%).

NOUN tokens may have the following values of Case:

Paradigm člověkNomAccDatGenVocLocIns
Number=Singčlověkčlověkačlověkučlověkačlověkučlověkem
Number=Plurlidé, lidilidilidemlidíLidilidechlidmi

ADJ

68871 ADJ tokens (93% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Polarity=Pos (66036; 96%), Degree=Pos (62554; 91%), VerbForm=EMPTY (60939; 88%), Voice=EMPTY (60939; 88%), Number=Sing (44618; 65%), Animacy=EMPTY (41318; 60%).

ADJ tokens may have the following values of Case:

Paradigm mladýNomAccDatGenVocLocIns
Animacy=Anim|Degree=Pos|Gender=Masc|Number=Singmladýmladéhomladémumladého
Animacy=Anim|Degree=Pos|Gender=Masc|Number=Plurmladímladémladýmmladýchmladímladými
Animacy=Anim|Degree=Cmp|Gender=Masc|Number=Singmladšímladšího
Animacy=Anim|Degree=Cmp|Gender=Masc|Number=Plurmladšímladšímladším
Animacy=Anim|Degree=Sup|Gender=Masc|Number=Singnejmladšímu
Animacy=Anim|Degree=Sup|Gender=Masc|Number=Plurnejmladší
Animacy=Inan|Degree=Pos|Gender=Masc|Number=SingmladýMladýmladéhomladém
Animacy=Inan|Degree=Pos|Gender=Masc|Number=Plurmladémladýchmladými
Animacy=Inan|Degree=Cmp|Gender=Masc|Number=Singmladší
Animacy=Inan|Degree=Cmp|Gender=Masc|Number=Plurmladšímladšímladších
Animacy=Inan|Degree=Sup|Gender=Masc|Number=Plurnejmladšínejmladší
Degree=Pos|Gender=Fem|Number=SingmladámladoumladéMladémladou
Degree=Pos|Gender=Fem|Number=Plurmladémladémladých
Degree=Pos|Gender=Neut|Number=Singmladémladé
Degree=Pos|Gender=Neut|Number=Plurmladámladých
Degree=Cmp|Gender=Fem|Number=SingmladšíMladšímladšímladšímladší
Degree=Cmp|Gender=Fem|Number=Plurmladšímmladšíchmladšími
Degree=Cmp|Gender=Neut|Number=Singmladšího
Degree=Sup|Gender=Fem|Number=SingnejmladšíNejmladšínejmladší
Degree=Sup|Gender=Neut|Number=Singnejmladšího

ADP

48326 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (44494; 92%).

ADP tokens may have the following values of Case:

Paradigm oAccGenLoc
ooo

DET

17556 DET tokens (89% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Number[psor]=EMPTY (15879; 90%), Person=EMPTY (15879; 90%), Animacy=EMPTY (15130; 86%), Poss=EMPTY (14439; 82%), Number=Sing (12040; 69%).

DET tokens may have the following values of Case:

Paradigm tenNomAccDatGenLocIns
Animacy=Anim|Gender=Masc|Number=Singtoho
Animacy=Anim|Gender=Masc|Number=Plurtity
Animacy=Inan|Gender=Masc|Number=Singten
Animacy=Inan|Gender=Masc|Number=Plurtyty
Gender=Masc,Neut|Number=Singtomutohotomtím
Gender=Masc|Number=Singten
Gender=Fem|Number=Singtatutou
Gender=Fem|Number=Sing|Style=Coll
Gender=Fem|Number=Dualtěma
Gender=Fem|Number=Plurtyty
Gender=Neut|Number=Singtoto
Gender=Neut|Number=PlurtaTa
Number=Plurtěmtěchtěchtěmi

PRON

15863 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: Gender=EMPTY (13052; 82%), PrepCase=EMPTY (13006; 82%), PronType=Prs (12495; 79%), Person=EMPTY (12410; 78%), Number=EMPTY (10157; 64%), Variant=Short (9196; 58%), Reflex=Yes (9042; 57%).

PRON tokens may have the following values of Case:

Paradigm tyNomAccDatGenVocLocIns
Number=Singtytebetebetytebou
Number=Sing|Variant=Shortti
Number=PlurvyvásvámvásVásvámi

PROPN

7815 PROPN tokens (80% of all PROPN tokens) have a non-empty value of Case.

The most frequent other feature values with which PROPN and Case co-occurred: Abbr=EMPTY (7810; 100%), Number=Sing (7154; 92%), Gender=Masc (4783; 61%).

PROPN tokens may have the following values of Case:

Paradigm PrahaNomAccDatGenLocIns
PrahaPrahuPrazePrahyPrazePrahou

NUM

2471 NUM tokens (34% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (2411; 98%), NumType=Card (2411; 98%), Number=Plur (1281; 52%), Gender=EMPTY (1272; 51%).

NUM tokens may have the following values of Case:

Paradigm jedenNomAccDatGenLocIns
Animacy=Anim|Gender=Mascjednoho
Animacy=Inan|Gender=Mascjeden
Gender=Masc,Neutjednomujednohojednomjedním
Gender=Mascjeden
Gender=Femjednajednujednéjednéjednéjednou
Gender=Neutjednojedno

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[amod]–> ADJ (59222; 98%), NOUN –[case]–> ADP (38428; 96%), NOUN –[conj]–> NOUN (13551; 95%), NOUN –[det]–> DET (8688; 78%), ADJ –[conj]–> ADJ (3470; 94%), PRON –[case]–> ADP (2185; 99%), PROPN –[case]–> ADP (1846; 83%), ADJ –[nsubj]–> NOUN (1446; 58%), DET –[case]–> ADP (1416; 97%), ADP –[fixed]–> NOUN (1346; 100%).