home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic: Features: Case

This feature is universal. It occurs with 3 different values: Acc, Gen, Nom.

144002 tokens (51%) have a non-empty value of Case. 15061 types (57%) occur at least once with a non-empty value of Case. 6378 lemmas (38%) occur at least once with a non-empty value of Case. The feature is used with 6 part-of-speech tags: NOUN (92051; 33% instances), ADJ (29221; 10% instances), PRON (9991; 4% instances), ADP (5971; 2% instances), DET (4562; 2% instances), NUM (2206; 1% instances).

NOUN

92051 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (70854; 77%).

NOUN tokens may have the following values of Case:

Paradigm يَومNomAccGen
Definite=Cons|Number=Singيوميوميوم
Definite=Cons|Number=Dualيومي, يومى
Definite=Cons|Number=Plurأيامايام, أيامأيام
Definite=Def|Number=Singاليوماليوماليوم
Definite=Def|Number=Dualاليومين
Definite=Def|Number=Plurالايام, الأيامالأيام, الايامالايام, الأيام
Definite=Ind|Number=Singيوميوما, يوماًيوم
Definite=Ind|Number=Dualيومينيومين
Definite=Ind|Number=Plurاياماأيام, ايام

ADJ

29221 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (27495; 94%), Definite=Def (18902; 65%), Gender=Masc (15039; 51%).

ADJ tokens may have the following values of Case:

Paradigm مِصرِيّNomAccGen
Definite=Cons|Gender=Masc|Number=Singمصري
Definite=Def|Gender=Masc|Number=Singالمصري, المصرىالمصريالمصري, المصرى
Definite=Def|Gender=Masc|Number=Plurالمصريونالمصريينالمصريين
Definite=Def|Gender=Fem|Number=Singالمصريةالمصرية, المصـــريةالمصرية, المصريةـ
Definite=Def|Gender=Fem|Number=Dualالمصريتين
Definite=Ind|Gender=Masc|Number=Singمصريمصرياًمصري
Definite=Ind|Gender=Masc|Number=Dualمصريين
Definite=Ind|Gender=Masc|Number=Plurمصريونمصريينمصريين
Definite=Ind|Gender=Fem|Number=Singمصريةمصريةمصرية
Definite=Ind|Gender=Fem|Number=Dualمصريتان
Definite=Ind|Gender=Fem|Number=Plurمصريات

PRON

9991 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: PronType=Prs (9991; 100%), Person=3 (9693; 97%), Number=Sing (8642; 86%), Gender=Masc (5866; 59%).

PRON tokens may have the following values of Case:

Paradigm هُوَNomAccGen
Gender=Masc|Number=Sing|Person=1أنا, انانيي
Gender=Masc|Number=Sing|Person=2أنتكك
Gender=Masc|Number=Sing|Person=3هوهه, طائرته, إدانته, لاراضيه, حكومته, بلاده, مستقبله, تجهيزه, انتشاره, زنزانته, استعداداته, والده, وغربه
Gender=Masc|Number=Dual|Person=3هماهماهما
Gender=Masc|Number=Plur|Person=1نحننانا, لمساعدتنا
Gender=Masc|Number=Plur|Person=2انتم, أنتمكمكم
Gender=Masc|Number=Plur|Person=3همهمهم, شفائهم, بهم, بأنفسهم, استبعادهم, لهم, بلادهم
Gender=Fem|Number=Sing|Person=2ك
Gender=Fem|Number=Sing|Person=3هي, هى, وهيهاها, بضمانها, نهايتها, تجارتها, تجميدها, إليها, بهويتها, مواجهتها, أهدافها, أعضائها, مستشفياتها, بفقدانها, تخصيصها
Gender=Fem|Number=Dual|Person=3هماهماهما
Gender=Fem|Number=Plur|Person=3هنهنهن

ADP

5971 ADP tokens (14% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (5971; 100%).

ADP tokens may have the following values of Case:

Paradigm بَعدَNomAccGen
بعدبعدبعد

DET

4562 DET tokens (79% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Number=Sing (4281; 94%), PronType=Rel (2513; 55%), Gender=Fem (2397; 53%).

DET tokens may have the following values of Case:

Paradigm اَلَّذِيNomAccGen
Gender=Masc|Number=Singالذي, الذىالذي, الذىالذي, الذى
Gender=Masc|Number=Dualاللذاناللذيناللذين
Gender=Masc|Number=Plurالذينالذينالذين
Gender=Fem|Number=Singالتي, التىالتي, التىالتي, التى
Gender=Fem|Number=Dualاللتاناللتيناللتين
Gender=Fem|Number=Plurاللواتي, اللاتى, اللاتي

NUM

2206 NUM tokens (28% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (2206; 100%), Definite=Cons (1519; 69%), Number=Sing (1155; 52%).

NUM tokens may have the following values of Case:

Paradigm مِليُونNomAccGen
Definite=Cons|Number=Singمليونمليون, ملـيونمليون
Definite=Cons|Number=Dualمليونامليونيمليوني
Definite=Cons|Number=Plurملايينملايينملايين
Definite=Def|Number=Singالمليونالمليون
Definite=Def|Number=Plurالملايينالملايين
Definite=Ind|Number=Singمليوناً, مليونامليون, ملــيون
Definite=Ind|Number=Plurملايين
Number=Sing|Polarity=Negمليون

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[nmod]–> NOUN (26173; 71%), NOUN –[amod]–> ADJ (22129; 97%), NOUN –[conj]–> NOUN (5137; 97%), NOUN –[nmod]–> PRON (3153; 66%), NOUN –[det]–> DET (1729; 87%), ADJ –[nmod]–> NOUN (1169; 59%), NOUN –[obl]–> NOUN (1032; 56%), ADJ –[conj]–> ADJ (831; 98%), NOUN –[cc]–> NOUN (823; 71%), NOUN –[obl:arg]–> NOUN (768; 59%).