This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ar/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Arabic)

This feature is universal. It occurs with 2 different values: Fem, Masc.

66056 tokens (23%) have a non-empty value of Gender. 9183 types (35%) occur at least once with a non-empty value of Gender. 3386 lemmas (20%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: ADJ (29221; 10% instances), VERB (21543; 8% instances), PRON (12942; 5% instances), DET (1611; 1% instances), NUM (700; 0% instances), AUX (39; 0% instances).

ADJ

29221 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (27495; 94%), Case=Gen (19101; 65%), Definite=Def (18902; 65%).

ADJ tokens may have the following values of Gender:

Paradigm مِصرِيّMascFem
Case=Acc|Definite=Def|Number=Singالمصريالمصرية, المصـــرية
Case=Acc|Definite=Def|Number=Plurالمصريين
Case=Acc|Definite=Ind|Number=Singمصرياًمصرية
Case=Acc|Definite=Ind|Number=Dualمصريين
Case=Acc|Definite=Ind|Number=Plurمصريين
Case=Gen|Definite=Def|Number=Singالمصري, المصرىالمصرية, المصريةـ
Case=Gen|Definite=Def|Number=Dualالمصريتين
Case=Gen|Definite=Def|Number=Plurالمصريين
Case=Gen|Definite=Ind|Number=Singمصريمصرية
Case=Gen|Definite=Ind|Number=Plurمصريينمصريات
Case=Nom|Definite=Def|Number=Singالمصري, المصرىالمصرية
Case=Nom|Definite=Def|Number=Plurالمصريون
Case=Nom|Definite=Ind|Number=Singمصريمصرية
Case=Nom|Definite=Ind|Number=Dualمصريتان
Case=Nom|Definite=Ind|Number=Plurمصريون
Case=Nom|Definite=Red|Number=Singمصري

VERB

21543 VERB tokens (100% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (21260; 99%), Voice=Act (20272; 94%), Number=Sing (20165; 94%), Aspect=Perf (11442; 53%), VerbForm=EMPTY (11442; 53%), Mood=EMPTY (11442; 53%).

VERB tokens may have the following values of Gender:

Paradigm كَانMascFem
Aspect=Imp|Mood=Ind|Number=Sing|Person=1|VerbForm=Fin|Voice=Actأكون
Aspect=Imp|Mood=Ind|Number=Sing|Person=2|VerbForm=Fin|Voice=Actتكون
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكونتكون
Aspect=Imp|Mood=Ind|Number=Plur|Person=2|VerbForm=Fin|Voice=Actتكونون
Aspect=Imp|Mood=Jus|Number=Sing|Person=1|VerbForm=Fin|Voice=Actاكن
Aspect=Imp|Mood=Jus|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكنتكن
Aspect=Imp|Mood=Jus|Number=Plur|Person=3|VerbForm=Fin|Voice=Actيكونوا
Aspect=Imp|Mood=Sub|Number=Sing|Person=1|VerbForm=Fin|Voice=Actاكون
Aspect=Imp|Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Actيكونتكون, تكـــون
Aspect=Imp|Mood=Sub|Number=Dual|Person=3|VerbForm=Fin|Voice=Actيكونا
Aspect=Imp|Mood=Sub|Number=Plur|Person=1|VerbForm=Fin|Voice=Actنكون
Aspect=Imp|Mood=Sub|Number=Plur|Person=3|VerbForm=Fin|Voice=Actيكونوا
Aspect=Perf|Number=Sing|Person=1|Voice=Actكنت
Aspect=Perf|Number=Sing|Person=2|Voice=Actكنت
Aspect=Perf|Number=Sing|Person=3|Voice=Actكانكانت
Aspect=Perf|Number=Dual|Person=3|Voice=Actكاناكانتا
Aspect=Perf|Number=Plur|Person=1|Voice=Actكنا
Aspect=Perf|Number=Plur|Person=3|Voice=Actكانوا
Mood=Imp|Number=Sing|VerbForm=Finكن

PRON

12942 PRON tokens (93% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (11359; 88%), PronType=Prs (9991; 77%), Person=3 (9693; 75%), Case=Gen (8770; 68%).

PRON tokens may have the following values of Gender:

Paradigm هُوَMascFem
Case=Acc|Number=Sing|Person=1ني
Case=Acc|Number=Sing|Person=2ك
Case=Acc|Number=Sing|Person=3هها
Case=Acc|Number=Dual|Person=3هماهما
Case=Acc|Number=Plur|Person=1نا
Case=Acc|Number=Plur|Person=2كم
Case=Acc|Number=Plur|Person=3همهن
Case=Gen|Number=Sing|Person=1ي
Case=Gen|Number=Sing|Person=2كك
Case=Gen|Number=Sing|Person=3ه, وغربه, تجهيزه, لاراضيه, والده, طائرته, حكومته, إدانته, انتشاره, مستقبله, بلاده, استعداداته, زنزانتهها, أهدافها, مواجهتها, بضمانها, نهايتها, تخصيصها, تجميدها, أعضائها, إليها, تجارتها, بهويتها, مستشفياتها, بفقدانها
Case=Gen|Number=Dual|Person=3هماهما
Case=Gen|Number=Plur|Person=1نا, لمساعدتنا
Case=Gen|Number=Plur|Person=2كم
Case=Gen|Number=Plur|Person=3هم, شفائهم, استبعادهم, بهم, بأنفسهم, لهم, بلادهمهن
Case=Nom|Number=Sing|Person=1أنا, انا
Case=Nom|Number=Sing|Person=2أنت
Case=Nom|Number=Sing|Person=3هوهي, هى, وهي
Case=Nom|Number=Dual|Person=3هماهما
Case=Nom|Number=Plur|Person=1نحن
Case=Nom|Number=Plur|Person=2انتم, أنتم
Case=Nom|Number=Plur|Person=3همهن

DET

1611 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Dem (1599; 99%), Number=Sing (1564; 97%), Case=Gen (1069; 66%).

DET tokens may have the following values of Gender:

Paradigm هٰذَاMascFem
Case=Acc|Number=Singهذاهذه, هٰذه, هــــذه
Case=Acc|Number=Dualهذين
Case=Acc|Number=Plurهؤلاء
Case=Gen|Number=Singهذا, هٰذاهذه, هٰذه, هـــذه, هذــه
Case=Gen|Number=Dualهذينهاتين
Case=Gen|Number=Plurهؤلاء, هٰؤلاء
Case=Nom|Number=Singهذا, هٰذاهذه, هٰذه
Case=Nom|Number=Dualهٰذانهاتان
Case=Nom|Number=Plurهؤلاء

NUM

700 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=EMPTY (700; 100%), NumForm=Word (700; 100%), Case=Gen (410; 59%), Definite=Red (401; 57%).

NUM tokens may have the following values of Gender:

Paradigm ثَلَاثَةMascFem
Case=Acc|Definite=Defالثلاثة, الثلاثـــــةالثلاث
Case=Acc|Definite=Indثلاثةثلاثا
Case=Acc|Definite=Redثلاثةثلاث
Case=Gen|Definite=Comالثلاثة
Case=Gen|Definite=Defالثلاثةالثلاث
Case=Gen|Definite=Indثلاثةثلاث
Case=Gen|Definite=Redثلاثةثلاث
Case=Nom|Definite=Defالثلاثةالثلاث
Case=Nom|Definite=Indثلاثةثلاث
Case=Nom|Definite=Redثلاثةثلاث

AUX

39 AUX tokens (100% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (39; 100%), Person=3 (39; 100%), Voice=Act (39; 100%), Aspect=Perf (38; 97%).

AUX tokens may have the following values of Gender:

Paradigm لَيسMascFem
ليسليست

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: VERB –[conj]–> VERB (1886; 75%), VERB –[ccomp]–> VERB (1478; 58%), VERB –[dobj]–> PRON (999; 50%), ADJ –[conj]–> ADJ (842; 99%), VERB –[advcl]–> VERB (749; 67%), VERB –[nsubj]–> PRON (655; 91%), VERB –[xcomp]–> VERB (624; 98%), VERB –[xcomp]–> ADJ (505; 95%), VERB –[advmod]–> ADJ (423; 59%), VERB –[dobj]–> ADJ (332; 60%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]