home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

66056 tokens (23%) have a non-empty value of Gender. 9183 types (35%) occur at least once with a non-empty value of Gender. 3386 lemmas (20%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: ADJ (29221; 10% instances), VERB (20901; 7% instances), PRON (9991; 4% instances), DET (4562; 2% instances), NUM (700; 0% instances), AUX (681; 0% instances).

ADJ

29221 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (27495; 94%), Case=Gen (19101; 65%), Definite=Def (18902; 65%).

ADJ tokens may have the following values of Gender:

Paradigm مِصرِيّMascFem
Case=Acc|Definite=Def|Number=Singالمصريالمصرية, المصـــرية
Case=Acc|Definite=Def|Number=Plurالمصريين
Case=Acc|Definite=Ind|Number=Singمصرياًمصرية
Case=Acc|Definite=Ind|Number=Dualمصريين
Case=Acc|Definite=Ind|Number=Plurمصريين
Case=Gen|Definite=Def|Number=Singالمصري, المصرىالمصرية, المصريةـ
Case=Gen|Definite=Def|Number=Dualالمصريتين
Case=Gen|Definite=Def|Number=Plurالمصريين
Case=Gen|Definite=Ind|Number=Singمصريمصرية
Case=Gen|Definite=Ind|Number=Plurمصريينمصريات
Case=Nom|Definite=Cons|Number=Singمصري
Case=Nom|Definite=Def|Number=Singالمصري, المصرىالمصرية
Case=Nom|Definite=Def|Number=Plurالمصريون
Case=Nom|Definite=Ind|Number=Singمصريمصرية
Case=Nom|Definite=Ind|Number=Dualمصريتان
Case=Nom|Definite=Ind|Number=Plurمصريون

VERB

20901 VERB tokens (100% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (20629; 99%), Voice=Act (19687; 94%), Number=Sing (19547; 94%), Aspect=Perf (11091; 53%), Mood=EMPTY (11091; 53%), VerbForm=EMPTY (11091; 53%).

VERB tokens may have the following values of Gender:

Paradigm قَالMascFem
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيقولتقول
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Passيقال
Aspect=Imp|Mood=Ind|Number=Plur|Person=1|VerbForm=Fin|Voice=Actنقول
Aspect=Imp|Mood=Ind|Number=Plur|Person=3|VerbForm=Fin|Voice=Actيقولون
Aspect=Imp|Mood=Sub|Number=Sing|Person=1|VerbForm=Fin|Voice=Actأقول
Aspect=Imp|Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيقول
Aspect=Perf|Number=Sing|Person=1|Voice=Actقلت
Aspect=Perf|Number=Sing|Person=3|Voice=Actقالقالت
Aspect=Perf|Number=Sing|Person=3|Voice=Passقيل
Aspect=Perf|Number=Dual|Person=3|Voice=Actقالا
Aspect=Perf|Number=Plur|Person=3|Voice=Actقالوا

PRON

9991 PRON tokens (100% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (9991; 100%), Person=3 (9693; 97%), Number=Sing (8642; 86%), Case=Gen (6743; 67%).

PRON tokens may have the following values of Gender:

Paradigm هُوَMascFem
Case=Acc|Number=Sing|Person=1ني
Case=Acc|Number=Sing|Person=2ك
Case=Acc|Number=Sing|Person=3هها
Case=Acc|Number=Dual|Person=3هماهما
Case=Acc|Number=Plur|Person=1نا
Case=Acc|Number=Plur|Person=2كم
Case=Acc|Number=Plur|Person=3همهن
Case=Gen|Number=Sing|Person=1ي
Case=Gen|Number=Sing|Person=2كك
Case=Gen|Number=Sing|Person=3ه, طائرته, إدانته, حكومته, لاراضيه, مستقبله, تجهيزه, بلاده, انتشاره, استعداداته, وغربه, والده, زنزانتهها, نهايتها, بضمانها, تجارتها, إليها, تجميدها, بهويتها, مواجهتها, أهدافها, أعضائها, مستشفياتها, بفقدانها, تخصيصها
Case=Gen|Number=Dual|Person=3هماهما
Case=Gen|Number=Plur|Person=1نا, لمساعدتنا
Case=Gen|Number=Plur|Person=2كم
Case=Gen|Number=Plur|Person=3هم, استبعادهم, بلادهم, لهم, شفائهم, بأنفسهم, بهمهن
Case=Nom|Number=Sing|Person=1أنا, انا
Case=Nom|Number=Sing|Person=2أنت
Case=Nom|Number=Sing|Person=3هوهي, هى, وهي
Case=Nom|Number=Dual|Person=3هماهما
Case=Nom|Number=Plur|Person=1نحن
Case=Nom|Number=Plur|Person=2انتم, أنتم
Case=Nom|Number=Plur|Person=3همهن

DET

4562 DET tokens (79% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4281; 94%), Case=Gen (3096; 68%), PronType=Rel (2513; 55%).

DET tokens may have the following values of Gender:

Paradigm اَلَّذِيMascFem
Case=Acc|Number=Singالذي, الذىالتي, التى
Case=Acc|Number=Dualاللذيناللتين
Case=Acc|Number=Plurالذين
Case=Gen|Number=Singالذي, الذىالتي, التى
Case=Gen|Number=Dualاللذيناللتين
Case=Gen|Number=Plurالذيناللواتي, اللاتى, اللاتي
Case=Nom|Number=Singالذي, الذىالتي, التى
Case=Nom|Number=Dualاللذاناللتان
Case=Nom|Number=Plurالذين

NUM

700 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (700; 100%), Number=EMPTY (700; 100%), Case=Gen (410; 59%), Definite=Cons (401; 57%).

NUM tokens may have the following values of Gender:

Paradigm ثَلَاثَةMascFem
Case=Acc|Definite=Consثلاثةثلاث
Case=Acc|Definite=Defالثلاثـــــة, الثلاثةالثلاث
Case=Acc|Definite=Indثلاثةثلاثا
Case=Gen|Definite=Comالثلاثة
Case=Gen|Definite=Consثلاثةثلاث
Case=Gen|Definite=Defالثلاثةالثلاث
Case=Gen|Definite=Indثلاثةثلاث
Case=Nom|Definite=Consثلاثةثلاث
Case=Nom|Definite=Defالثلاثةالثلاث
Case=Nom|Definite=Indثلاثةثلاث

AUX

681 AUX tokens (100% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=3 (670; 98%), Number=Sing (657; 96%), Voice=Act (624; 92%), Aspect=Perf (389; 57%), Mood=EMPTY (389; 57%), VerbForm=EMPTY (389; 57%).

AUX tokens may have the following values of Gender:

Paradigm كَانMascFem
Aspect=Imp|Mood=Ind|Number=Sing|Person=2|VerbForm=Fin|Voice=Actتكون
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكونتكون
Aspect=Imp|Mood=Ind|Number=Plur|Person=2|VerbForm=Fin|Voice=Actتكونون
Aspect=Imp|Mood=Jus|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكنتكن
Aspect=Imp|Mood=Sub|Number=Sing|Person=1|VerbForm=Fin|Voice=Actاكون
Aspect=Imp|Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكونتكون
Aspect=Imp|Mood=Sub|Number=Dual|Person=3|VerbForm=Fin|Voice=Actيكونا
Aspect=Imp|Mood=Sub|Number=Plur|Person=1|VerbForm=Fin|Voice=Actنكون
Aspect=Perf|Number=Sing|Person=1|Voice=Actكنت
Aspect=Perf|Number=Sing|Person=3|Voice=Actكانكانت
Aspect=Perf|Number=Plur|Person=3|Voice=Actكانوا
Mood=Imp|Number=Sing|VerbForm=Finكن

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: VERB –[conj]–> VERB (1880; 75%), VERB –[nsubj]–> DET (1542; 74%), VERB –[ccomp]–> VERB (1478; 58%), ADJ –[conj]–> ADJ (841; 99%), VERB –[advcl]–> VERB (747; 67%), VERB –[obj]–> PRON (659; 54%), VERB –[xcomp]–> VERB (624; 98%), VERB –[nsubj]–> PRON (612; 99%), VERB –[xcomp]–> ADJ (505; 95%), VERB –[obl]–> ADJ (423; 59%).