home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: Features: Case

This feature is universal. It occurs with 3 different values: Acc, Gen, Nom.

146800 tokens (52%) have a non-empty value of Case. 15350 types (62%) occur at least once with a non-empty value of Case. 6577 lemmas (43%) occur at least once with a non-empty value of Case. The feature is used with 7 part-of-speech tags: NOUN (93686; 33% instances), ADJ (29351; 10% instances), PRON (10877; 4% instances), ADP (6005; 2% instances), DET (4670; 2% instances), NUM (2208; 1% instances), PROPN (3; 0% instances).

NOUN

93686 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (72092; 77%).

NOUN tokens may have the following values of Case:

Paradigm يَومNomAccGen
Definite=Cons|Number=Singيوميوميوم
Definite=Cons|Number=Dualيومي, يومى
Definite=Cons|Number=Plurأيامايام, أيامأيام
Definite=Def|Number=Singاليوماليوماليوم
Definite=Def|Number=Dualاليومين
Definite=Def|Number=Plurالايام, الأيامالأيام, الايامالايام, الأيام
Definite=Ind|Number=Singيوميوما, يوماًيوم
Definite=Ind|Number=Dualيومينيومين
Definite=Ind|Number=Plurاياماأيام, ايام

ADJ

29351 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (27614; 94%), Definite=Def (18961; 65%), Gender=Masc (15122; 52%).

ADJ tokens may have the following values of Case:

Paradigm مِصرِيّNomAccGen
Definite=Cons|Gender=Masc|Number=Singمصري
Definite=Def|Gender=Masc|Number=Singالمصري, المصرىالمصريالمصري, المصرى
Definite=Def|Gender=Masc|Number=Plurالمصريونالمصريينالمصريين
Definite=Def|Gender=Fem|Number=Singالمصريةالمصرية, المصـــريةالمصرية, المصريةـ
Definite=Def|Gender=Fem|Number=Dualالمصريتين
Definite=Ind|Gender=Masc|Number=Singمصريمصرياًمصري
Definite=Ind|Gender=Masc|Number=Dualمصريين
Definite=Ind|Gender=Masc|Number=Plurمصريونمصريينمصريين
Definite=Ind|Gender=Fem|Number=Singمصريةمصريةمصرية
Definite=Ind|Gender=Fem|Number=Dualمصريتان
Definite=Ind|Gender=Fem|Number=Plurمصريات

PRON

10877 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: PronType=Prs (10877; 100%), Person=3 (10131; 93%), Number=Sing (9002; 83%), Gender=Masc (6639; 61%).

PRON tokens may have the following values of Case:

Paradigm هُوَNomAccGen
Gender=Masc|Number=Sing|Person=1أنا, انانيي, ني
Gender=Masc|Number=Sing|Person=2أنتكك
Gender=Masc|Number=Sing|Person=3هوهه, إدانته, استعداداته, انتشاره, بلاده, تجهيزه, حكومته, زنزانته, طائرته, لاراضيه, مستقبله, والده, وغربه
Gender=Masc|Number=Dual|Person=2كما
Gender=Masc|Number=Dual|Person=3هماهماهما
Gender=Masc|Number=Plur|Person=1نحننانا, لمساعدتنا
Gender=Masc|Number=Plur|Person=2انتم, أنتمكمكم
Gender=Masc|Number=Plur|Person=3همهمهم, استبعادهم, بأنفسهم, بلادهم, بهم, شفائهم, لهم
Gender=Fem|Number=Sing|Person=2ك
Gender=Fem|Number=Sing|Person=3هي, هى, وهيهاها, أعضائها, أهدافها, إليها, بضمانها, بفقدانها, بهويتها, تجارتها, تجميدها, تخصيصها, مستشفياتها, مواجهتها, نهايتها
Gender=Fem|Number=Dual|Person=3هماهماهما
Gender=Fem|Number=Plur|Person=3هنهنهن

ADP

6005 ADP tokens (14% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (6005; 100%).

ADP tokens may have the following values of Case:

Paradigm بَعدَNomAccGen
بعدبعدبعد

DET

4670 DET tokens (79% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: Number=Sing (4384; 94%), PronType=Rel (2532; 54%), Gender=Fem (2417; 52%).

DET tokens may have the following values of Case:

Paradigm اَلَّذِيNomAccGen
Gender=Masc|Number=Singالذي, الذىالذي, الذىالذي, الذى
Gender=Masc|Number=Dualاللذاناللذيناللذين
Gender=Masc|Number=Plurالذينالذينالذين
Gender=Fem|Number=Singالتي, التىالتي, التىالتي, التى
Gender=Fem|Number=Dualاللتاناللتيناللتين
Gender=Fem|Number=Plurاللواتي, اللاتى, اللاتي

NUM

2208 NUM tokens (28% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (2208; 100%), Definite=Cons (1519; 69%), Number=Sing (1155; 52%).

NUM tokens may have the following values of Case:

Paradigm مِليُونNomAccGen
Definite=Cons|Number=Singمليونمليون, ملـيونمليون
Definite=Cons|Number=Dualمليونامليونيمليوني
Definite=Cons|Number=Plurملايينملايينملايين
Definite=Def|Number=Singالمليونالمليون
Definite=Def|Number=Plurالملايينالملايين
Definite=Ind|Number=Singمليوناً, مليونامليون, ملــيون
Definite=Ind|Number=Plurملايين
Number=Sing|Polarity=Negمليون

PROPN

3 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Case.

PROPN tokens may have the following values of Case:

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[nmod]–> NOUN (26755; 69%), NOUN –[amod]–> ADJ (22252; 97%), NOUN –[conj]–> NOUN (5270; 91%), NOUN –[nmod]–> PRON (3509; 63%), NOUN –[det]–> DET (1743; 86%), NOUN –[obl:arg]–> NOUN (1229; 63%), ADJ –[nmod]–> NOUN (1195; 58%), NOUN –[obl]–> NOUN (1103; 57%), ADJ –[conj]–> ADJ (841; 94%), NOUN –[appos]–> NOUN (344; 90%).