home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: Features: Case

This feature is universal. It occurs with 3 different values: Acc, Gen, Nom.

334414 tokens (45%) have a non-empty value of Case. 1 types (0) occur at least once with a non-empty value of Case. 231 lemmas (5%) occur at least once with a non-empty value of Case. The feature is used with 16 part-of-speech tags: NOUN (209062; 28% instances), ADJ (63518; 9% instances), PRON (22765; 3% instances), ADV (20740; 3% instances), PROPN (11495; 2% instances), NUM (3282; 0% instances), SCONJ (1516; 0% instances), ADP (685; 0% instances), PUNCT (486; 0% instances), CCONJ (396; 0% instances), VERB (214; 0% instances), AUX (92; 0% instances), DET (70; 0% instances), X (48; 0% instances), PART (44; 0% instances), INTJ (1; 0% instances).

NOUN

209062 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (185021; 89%), Gender=Masc (143234; 69%).

NOUN tokens may have the following values of Case:

ADJ

63518 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (60596; 95%), Definite=Def (42979; 68%), Gender=Masc (31950; 50%).

ADJ tokens may have the following values of Case:

PRON

22765 PRON tokens (73% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: PronType=Prs (22577; 99%), Definite=Def (20814; 91%), Person=3 (20363; 89%), Number=Sing (18430; 81%), Gender=Masc (14810; 65%).

PRON tokens may have the following values of Case:

ADV

20740 ADV tokens (78% of all ADV tokens) have a non-empty value of Case.

The most frequent other feature values with which ADV and Case co-occurred: Number=Sing (20580; 99%), Gender=Masc (19871; 96%), Definite=Com (15606; 75%).

ADV tokens may have the following values of Case:

PROPN

11495 PROPN tokens (20% of all PROPN tokens) have a non-empty value of Case.

The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (11066; 96%), Gender=Masc (9848; 86%).

PROPN tokens may have the following values of Case:

Case seems to be lexical feature of PROPN. 98% lemmas (212) occur only with one value of Case.

NUM

3282 NUM tokens (22% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (3200; 98%), Number=Sing (2886; 88%), Definite=Com (2317; 71%), Gender=Masc (1937; 59%).

NUM tokens may have the following values of Case:

SCONJ

1516 SCONJ tokens (6% of all SCONJ tokens) have a non-empty value of Case.

The most frequent other feature values with which SCONJ and Case co-occurred: Number=Sing (1251; 83%), Definite=Ind (1118; 74%), Gender=Masc (938; 62%).

SCONJ tokens may have the following values of Case:

ADP

685 ADP tokens (1% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (685; 100%).

ADP tokens may have the following values of Case:

PUNCT

486 PUNCT tokens (1% of all PUNCT tokens) have a non-empty value of Case.

PUNCT tokens may have the following values of Case:

CCONJ

396 CCONJ tokens (1% of all CCONJ tokens) have a non-empty value of Case.

CCONJ tokens may have the following values of Case:

VERB

214 VERB tokens (0% of all VERB tokens) have a non-empty value of Case.

The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (214; 100%), Mood=EMPTY (214; 100%), Voice=EMPTY (214; 100%), Person=EMPTY (193; 90%), Number=Sing (187; 87%), Gender=Masc (152; 71%).

VERB tokens may have the following values of Case:

AUX

92 AUX tokens (1% of all AUX tokens) have a non-empty value of Case.

The most frequent other feature values with which AUX and Case co-occurred: Mood=EMPTY (92; 100%), Voice=EMPTY (92; 100%), Number=Sing (88; 96%), Person=EMPTY (87; 95%), Gender=Masc (76; 83%).

AUX tokens may have the following values of Case:

DET

70 DET tokens (1% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Definite=Ind (45; 64%), Gender=Masc (42; 60%), Number=Dual (39; 56%).

DET tokens may have the following values of Case:

X

48 X tokens (5% of all X tokens) have a non-empty value of Case.

The most frequent other feature values with which X and Case co-occurred: Mood=EMPTY (48; 100%), Voice=EMPTY (48; 100%), Person=EMPTY (47; 98%), Gender=Masc (43; 90%).

X tokens may have the following values of Case:

Paradigm NoneNomAccGen
Definite=Com|Gender=Masc|Number=Sing__
Definite=Com|Gender=Masc|Number=Dual_
Definite=Com|Gender=Masc|Number=Plur_
Definite=Def|Gender=Masc|Number=Sing_
Definite=Def|Gender=Masc|Number=Dual__
Definite=Ind|Gender=Masc|Number=Sing_
Definite=Ind|Gender=Masc|Number=Dual_
Definite=Ind|Gender=Masc|Number=Plur_
Definite=Ind|Gender=Fem|Number=Dual_

PART

44 PART tokens (1% of all PART tokens) have a non-empty value of Case.

The most frequent other feature values with which PART and Case co-occurred: Polarity=EMPTY (44; 100%).

PART tokens may have the following values of Case:

INTJ

1 INTJ tokens (2% of all INTJ tokens) have a non-empty value of Case.

INTJ tokens may have the following values of Case:

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[amod]–> ADJ (48200; 88%), NOUN –[nmod:poss]–> NOUN (36026; 66%), NOUN –[nmod]–> NOUN (22628; 56%), NOUN –[conj]–> NOUN (12324; 87%), NOUN –[nmod:poss]–> PRON (9312; 62%), ADJ –[conj]–> ADJ (1714; 95%), ADJ –[amod]–> ADJ (980; 78%), NOUN –[nmod:poss]–> ADJ (436; 56%), ADJ –[nsubj]–> NOUN (318; 58%), ADV –[amod]–> ADJ (306; 55%).