## Treebank Statistics: UD_Arabic-NYUAD: Features: `Case`

This feature is universal.
It occurs with 3 different values: `Acc`, `Gen`, `Nom`.

334414 tokens (45%) have a non-empty value of `Case`.
1 type (0) occurs at least once with a non-empty value of `Case`.
231 lemmas (5%) occur at least once with a non-empty value of `Case`.
The feature is used with 16 part-of-speech tags: `NOUN` (209062; 28% instances), `ADJ` (63518; 9% instances), `PRON` (22765; 3% instances), `ADV` (20740; 3% instances), `PROPN` (11495; 2% instances), `NUM` (3282; 0% instances), `SCONJ` (1516; 0% instances), `ADP` (685; 0% instances), `PUNCT` (486; 0% instances), `CCONJ` (396; 0% instances), `VERB` (214; 0% instances), `AUX` (92; 0% instances), `DET` (70; 0% instances), `X` (48; 0% instances), `PART` (44; 0% instances), `INTJ` (1; 0% instances).
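The per-tag counts above can be reproduced from a CoNLL-U file by checking the FEATS column of every token. The following is a minimal sketch, not the script that generated this page; the `SAMPLE` fragment and the helper names `parse_feats` and `case_by_upos` are hypothetical and used only for illustration.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U fragment (10 tab-separated columns per token).
# It is NOT taken from the treebank; forms and lemmas are left as "_".
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "1\t_\t_\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tnsubj\t_\t_",
    "2\t_\t_\tVERB\t_\tAspect=Perf|Voice=Act\t0\troot\t_\t_",
    "3\t_\t_\tNOUN\t_\tCase=Acc|Number=Sing\t2\tobj\t_\t_",
    "4\t_\t_\tADJ\t_\tCase=Acc|Number=Sing\t3\tamod\t_\t_",
    "5\t_\t_\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means the token has no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def case_by_upos(conllu_text):
    """Return (tokens with Case per UPOS, all tokens per UPOS)."""
    with_case, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Case" in parse_feats(cols[5]):
            with_case[upos] += 1
    return with_case, total


with_case, total = case_by_upos(SAMPLE)
for upos, n in with_case.most_common():
    print(f"{upos}: {n} of {total[upos]} tokens have Case")
```

On a real treebank file, the same function would be called on the file's contents instead of `SAMPLE`.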

### `NOUN`

209062 `NOUN` tokens (96% of all `NOUN` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `NOUN` and `Case` co-occurred: `Number=Sing` (185021; 89%), `Gender=Masc` (143234; 69%).

`NOUN` tokens may have the following values of `Case`:

* `Acc` (37726; 18% of non-empty `Case`): _
* `Gen` (142071; 68% of non-empty `Case`): _
* `Nom` (29265; 14% of non-empty `Case`): _
* `EMPTY` (9192): _
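As a worked check of how the percentages in these listings relate to the raw counts, the sketch below recomputes the `NOUN` figures. The counts are copied from the listing above; rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Counts copied from the NOUN listing above.
noun_case = {"Acc": 37726, "Gen": 142071, "Nom": 29265}
noun_empty = 9192

# Tokens with any Case value; this should match the 209062 reported for NOUN.
non_empty = sum(noun_case.values())

# Share of each value among the non-empty tokens (assumed: nearest whole percent).
shares = {value: round(100 * n / non_empty) for value, n in noun_case.items()}

# Share of all NOUN tokens that carry a Case value at all.
coverage = round(100 * non_empty / (non_empty + noun_empty))

print(non_empty, shares, coverage)
```

The three value shares are taken relative to the non-empty tokens only, while the coverage figure is relative to all `NOUN` tokens, including the `EMPTY` ones.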

### `ADJ`

63518 `ADJ` tokens (94% of all `ADJ` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `ADJ` and `Case` co-occurred: `Number=Sing` (60596; 95%), `Definite=Def` (42979; 68%), `Gender=Masc` (31950; 50%).

`ADJ` tokens may have the following values of `Case`:

* `Acc` (11857; 19% of non-empty `Case`): _
* `Gen` (40502; 64% of non-empty `Case`): _
* `Nom` (11159; 18% of non-empty `Case`): _
* `EMPTY` (4086): _

### `PRON`

22765 `PRON` tokens (73% of all `PRON` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `PRON` and `Case` co-occurred: `PronType=Prs` (22577; 99%), `Definite=Def` (20814; 91%), `Person=3` (20363; 89%), `Number=Sing` (18430; 81%), `Gender=Masc` (14810; 65%).

`PRON` tokens may have the following values of `Case`:

* `Acc` (4584; 20% of non-empty `Case`): _
* `Gen` (16343; 72% of non-empty `Case`): _
* `Nom` (1838; 8% of non-empty `Case`): _
* `EMPTY` (8474): _

### `ADV`

20740 `ADV` tokens (78% of all `ADV` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `ADV` and `Case` co-occurred: `Number=Sing` (20580; 99%), `Gender=Masc` (19871; 96%), `Definite=Com` (15606; 75%).

`ADV` tokens may have the following values of `Case`:

* `Acc` (18316; 88% of non-empty `Case`): _
* `Gen` (1915; 9% of non-empty `Case`): _
* `Nom` (509; 2% of non-empty `Case`): _
* `EMPTY` (5787): _

### `PROPN`

11495 `PROPN` tokens (20% of all `PROPN` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `PROPN` and `Case` co-occurred: `Number=Sing` (11066; 96%), `Gender=Masc` (9848; 86%).

`PROPN` tokens may have the following values of `Case`:

* `Acc` (2351; 20% of non-empty `Case`): _
* `Gen` (7191; 63% of non-empty `Case`): _
* `Nom` (1953; 17% of non-empty `Case`): _
* `EMPTY` (46830): _

`Case` seems to be a **lexical feature** of `PROPN`. 98% of lemmas (212) occur with only one value of `Case`.
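The "lexical feature" observation rests on counting how many lemmas are only ever seen with a single `Case` value. A minimal sketch of that count follows; the lemma names and observations are hypothetical, and the page's own threshold for calling a feature lexical is not stated here, so none is applied.

```python
from collections import defaultdict

# Hypothetical (lemma, Case value) observations for one part-of-speech tag.
observations = [
    ("lemma_a", "Gen"), ("lemma_a", "Gen"),
    ("lemma_b", "Nom"),
    ("lemma_c", "Acc"), ("lemma_c", "Gen"),
    ("lemma_d", "Gen"),
]


def single_value_share(pairs):
    """Return (lemmas seen with exactly one Case value, all lemmas)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_value_share(observations)
print(f"{single} of {total} lemmas ({100 * single / total:.0f}%) "
      "occur with only one Case value")
```

A high share suggests that the value is tied to the lemma rather than to the syntactic context, although rare lemmas inflate the share because a lemma seen once trivially has one value.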

### `NUM`

3282 `NUM` tokens (22% of all `NUM` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `NUM` and `Case` co-occurred: `NumForm=Word` (3200; 98%), `Number=Sing` (2886; 88%), `Definite=Com` (2317; 71%), `Gender=Masc` (1937; 59%).

`NUM` tokens may have the following values of `Case`:

* `Acc` (920; 28% of non-empty `Case`): _
* `Gen` (2039; 62% of non-empty `Case`): _
* `Nom` (323; 10% of non-empty `Case`): _
* `EMPTY` (11865): _

### `SCONJ`

1516 `SCONJ` tokens (6% of all `SCONJ` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `SCONJ` and `Case` co-occurred: `Number=Sing` (1251; 83%), `Definite=Ind` (1118; 74%), `Gender=Masc` (938; 62%).

`SCONJ` tokens may have the following values of `Case`:

* `Acc` (182; 12% of non-empty `Case`): _
* `Gen` (292; 19% of non-empty `Case`): _
* `Nom` (1042; 69% of non-empty `Case`): _
* `EMPTY` (24518): _

### `ADP`

685 `ADP` tokens (1% of all `ADP` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `ADP` and `Case` co-occurred: `AdpType=Prep` (685; 100%).

`ADP` tokens may have the following values of `Case`:

* `Acc` (111; 16% of non-empty `Case`): _
* `Gen` (505; 74% of non-empty `Case`): _
* `Nom` (69; 10% of non-empty `Case`): _
* `EMPTY` (91009): _

### `PUNCT`

486 `PUNCT` tokens (1% of all `PUNCT` tokens) have a non-empty value of `Case`.

`PUNCT` tokens may have the following values of `Case`:

* `Acc` (48; 10% of non-empty `Case`): _
* `Gen` (345; 71% of non-empty `Case`): _
* `Nom` (93; 19% of non-empty `Case`): _
* `EMPTY` (74662): _

### `CCONJ`

396 `CCONJ` tokens (1% of all `CCONJ` tokens) have a non-empty value of `Case`.

`CCONJ` tokens may have the following values of `Case`:

* `Acc` (67; 17% of non-empty `Case`): _
* `Gen` (248; 63% of non-empty `Case`): _
* `Nom` (81; 20% of non-empty `Case`): _
* `EMPTY` (49636): _

### `VERB`

214 `VERB` tokens (0% of all `VERB` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `VERB` and `Case` co-occurred: `Aspect=EMPTY` (214; 100%), `Mood=EMPTY` (214; 100%), `Voice=EMPTY` (214; 100%), `Person=EMPTY` (193; 90%), `Number=Sing` (187; 87%), `Gender=Masc` (152; 71%).

`VERB` tokens may have the following values of `Case`:

* `Acc` (83; 39% of non-empty `Case`): _
* `Gen` (91; 43% of non-empty `Case`): _
* `Nom` (40; 19% of non-empty `Case`): _
* `EMPTY` (55001): _

### `AUX`

92 `AUX` tokens (1% of all `AUX` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `AUX` and `Case` co-occurred: `Mood=EMPTY` (92; 100%), `Voice=EMPTY` (92; 100%), `Number=Sing` (88; 96%), `Person=EMPTY` (87; 95%), `Gender=Masc` (76; 83%).

`AUX` tokens may have the following values of `Case`:

* `Acc` (17; 18% of non-empty `Case`): _
* `Gen` (55; 60% of non-empty `Case`): _
* `Nom` (20; 22% of non-empty `Case`): _
* `EMPTY` (7631): _

### `DET`

70 `DET` tokens (1% of all `DET` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `DET` and `Case` co-occurred: `Definite=Ind` (45; 64%), `Gender=Masc` (42; 60%), `Number=Dual` (39; 56%).

`DET` tokens may have the following values of `Case`:

* `Acc` (38; 54% of non-empty `Case`): _
* `Gen` (20; 29% of non-empty `Case`): _
* `Nom` (12; 17% of non-empty `Case`): _
* `EMPTY` (6292): _

### `X`

48 `X` tokens (5% of all `X` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `X` and `Case` co-occurred: `Mood=EMPTY` (48; 100%), `Voice=EMPTY` (48; 100%), `Person=EMPTY` (47; 98%), `Gender=Masc` (43; 90%).

`X` tokens may have the following values of `Case`:

* `Acc` (27; 56% of non-empty `Case`): _
* `Gen` (9; 19% of non-empty `Case`): _
* `Nom` (12; 25% of non-empty `Case`): _
* `EMPTY` (869): _

| Paradigm None | Nom | Acc | Gen |
|---|---|---|---|
| Definite=Com\|Gender=Masc\|Number=Sing | _ | _ | |
| Definite=Com\|Gender=Masc\|Number=Dual | _ | | |
| Definite=Com\|Gender=Masc\|Number=Plur | _ | | |
| Definite=Def\|Gender=Masc\|Number=Sing | _ | | |
| Definite=Def\|Gender=Masc\|Number=Dual | _ | _ | |
| Definite=Ind\|Gender=Masc\|Number=Sing | _ | | |
| Definite=Ind\|Gender=Masc\|Number=Dual | _ | | |
| Definite=Ind\|Gender=Masc\|Number=Plur | _ | | |
| Definite=Ind\|Gender=Fem\|Number=Dual | _ | | |

### `PART`

44 `PART` tokens (1% of all `PART` tokens) have a non-empty value of `Case`.

The most frequent other feature values with which `PART` and `Case` co-occurred: `Polarity=EMPTY` (44; 100%).

`PART` tokens may have the following values of `Case`:

* `Acc` (6; 14% of non-empty `Case`): _
* `Gen` (30; 68% of non-empty `Case`): _
* `Nom` (8; 18% of non-empty `Case`): _
* `EMPTY` (8568): _

### `INTJ`

1 `INTJ` token (2% of all `INTJ` tokens) has a non-empty value of `Case`.

`INTJ` tokens may have the following values of `Case`:

* `Gen` (1; 100% of non-empty `Case`): _
* `EMPTY` (55): _

## Relations with Agreement in `Case`

The 10 most frequent relations where parent and child node agree in `Case`:

* `NOUN –[amod]–> ADJ` (48200; 88%)
* `NOUN –[nmod:poss]–> NOUN` (36026; 66%)
* `NOUN –[nmod]–> NOUN` (22628; 56%)
* `NOUN –[conj]–> NOUN` (12322; 87%)
* `NOUN –[nmod:poss]–> PRON` (9312; 62%)
* `ADJ –[conj]–> ADJ` (1714; 95%)
* `ADJ –[amod]–> ADJ` (980; 78%)
* `NOUN –[nmod:poss]–> ADJ` (436; 56%)
* `ADJ –[nsubj]–> NOUN` (318; 58%)
* `ADV –[amod]–> ADJ` (306; 55%)
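Agreement counts of this kind can be obtained by comparing each token's `Case` value with that of its head. The sketch below is one possible way to do it, not the method used to build this page. The token tuples are hypothetical, and taking the denominator to be the pairs in which both nodes carry a `Case` value is an assumption, since the page does not say what the percentages are relative to.

```python
from collections import Counter

# Hypothetical sentence: (id, upos, case_or_None, head_id, deprel).
# Head 0 marks the root, which has no parent token.
tokens = [
    (1, "NOUN", "Nom", 3, "nsubj"),
    (2, "ADJ", "Nom", 1, "amod"),
    (3, "VERB", None, 0, "root"),
    (4, "NOUN", "Acc", 3, "obj"),
    (5, "NOUN", "Gen", 4, "nmod:poss"),
    (6, "ADJ", "Gen", 5, "amod"),
]


def case_agreement(sentence):
    """Count parent-child pairs that agree in Case, keyed by relation pattern."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _, upos, case, head, deprel in sentence:
        parent = by_id.get(head)
        # Assumption: only pairs where both nodes have a Case value are counted.
        if parent is None or case is None or parent[2] is None:
            continue
        key = f"{parent[1]} -[{deprel}]-> {upos}"
        both[key] += 1
        if case == parent[2]:
            agree[key] += 1
    return agree, both


agree, both = case_agreement(tokens)
for key, n in both.most_common():
    print(f"{key}: {agree[key]} of {n} pairs agree in Case")
```

In this toy sentence both `amod` pairs agree, while the `nmod:poss` pair does not, which mirrors the general pattern in the listing where adjectival modifiers agree with their head more often than possessive modifiers do.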