
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

67429 tokens (24%) have a non-empty value of Gender. 9336 types (37%) occur at least once with a non-empty value of Gender. 3447 lemmas (23%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: ADJ (29351; 10% of all tokens), VERB (21116; 7% of all tokens), PRON (10877; 4% of all tokens), DET (4668; 2% of all tokens), NUM (702; 0% of all tokens), AUX (685; 0% of all tokens), NOUN (27; 0% of all tokens), PROPN (3; 0% of all tokens).
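The counts above can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6) and counts only syntactic words, skipping multiword-token ranges and empty nodes. The helper names `parse_feats` and `gender_counts_by_upos` are hypothetical, and the sample lines are made up.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Gen|Gender=Masc' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(lines):
    """Return (Counter of UPOS -> tokens with Gender, total number of tokens)."""
    with_gender = Counter()
    total = 0
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # blank sentence separator or sentence-level comment
        cols = line.split("\t")
        # Assumption: only plain integer IDs are counted, so multiword-token
        # ranges (1-2) and empty nodes (1.1) are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[cols[3]] += 1
    return with_gender, total


# Tiny made-up sample (not taken from the treebank), just to show the call.
sample = [
    "# text = made-up example",
    "1\tw1\tl1\tADJ\t_\tCase=Gen|Gender=Fem|Number=Sing\t2\tamod\t_\t_",
    "2\tw2\tl2\tNOUN\t_\tCase=Gen|Number=Sing\t0\troot\t_\t_",
    "3\tw3\tl3\tVERB\t_\tGender=Masc|Person=3\t2\tacl\t_\t_",
    "",
]
counts, total = gender_counts_by_upos(sample)
for upos, n in counts.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

To run this on real data, pass an open file object for a `.conllu` file instead of `sample`.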

ADJ

29351 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (27614; 94%), Case=Gen (19121; 65%), Definite=Def (18961; 65%).
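Co-occurrence figures of this kind are counts of the other feature=value pairs found on tokens of one part of speech that also carry Gender. The sketch below illustrates the idea under stated assumptions: tokens are already parsed into (UPOS, feature dict) pairs, the helper name `cooccurring_features` is hypothetical, the sample data is made up, and the EMPTY pseudo-value that appears in some lists on this page is not modelled.

```python
from collections import Counter


def cooccurring_features(tokens, upos, feature="Gender"):
    """Count other feature=value pairs on `upos` tokens that have `feature`.

    Returns (number of matching tokens, Counter of 'Name=Value' strings).
    """
    n = 0
    pairs = Counter()
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for name, value in feats.items():
            if name != feature:
                pairs[f"{name}={value}"] += 1
    return n, pairs


# Made-up tokens, for illustration only.
sample = [
    ("ADJ", {"Case": "Gen", "Gender": "Fem", "Number": "Sing"}),
    ("ADJ", {"Case": "Nom", "Gender": "Masc", "Number": "Sing"}),
    ("ADJ", {"Case": "Gen"}),                     # no Gender: ignored
    ("VERB", {"Gender": "Masc", "Person": "3"}),  # other POS: ignored
]
n, pairs = cooccurring_features(sample, "ADJ")
for pair, count in pairs.most_common(3):
    print(f"{pair} ({count}; {100 * count / n:.0f}%)")
```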

ADJ tokens may have the following values of Gender:

Paradigm مِصرِيّ (values: Masc, Fem)
Case=Acc|Definite=Def|Number=Sing; Masc: المصري; Fem: المصرية, المصـــرية
Case=Acc|Definite=Def|Number=Plur; Masc: المصريين
Case=Acc|Definite=Ind|Number=Sing; Masc: مصرياً; Fem: مصرية
Case=Acc|Definite=Ind|Number=Dual; Masc: مصريين
Case=Acc|Definite=Ind|Number=Plur; Masc: مصريين
Case=Gen|Definite=Def|Number=Sing; Masc: المصري, المصرى; Fem: المصرية, المصريةـ
Case=Gen|Definite=Def|Number=Dual; Fem: المصريتين
Case=Gen|Definite=Def|Number=Plur; Masc: المصريين
Case=Gen|Definite=Ind|Number=Sing; Masc: مصري; Fem: مصرية
Case=Gen|Definite=Ind|Number=Plur; Masc: مصريين; Fem: مصريات
Case=Nom|Definite=Cons|Number=Sing; Masc: مصري
Case=Nom|Definite=Def|Number=Sing; Masc: المصري, المصرى; Fem: المصرية
Case=Nom|Definite=Def|Number=Plur; Masc: المصريون
Case=Nom|Definite=Ind|Number=Sing; Masc: مصري; Fem: مصرية
Case=Nom|Definite=Ind|Number=Dual; Fem: مصريتان
Case=Nom|Definite=Ind|Number=Plur; Masc: مصريون

VERB

21116 VERB tokens (100% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (20722; 98%), Voice=Act (19895; 94%), Number=Sing (19679; 93%), Aspect=Perf (11132; 53%), Mood=EMPTY (11132; 53%), VerbForm=EMPTY (11132; 53%).

VERB tokens may have the following values of Gender:

Paradigm قَال (values: Masc, Fem)
Aspect=Imp|Mood=Ind|Number=Sing|Person=1|VerbForm=Fin|Voice=Act; أقول
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Act; Masc: يقول; Fem: تقول
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Pass; Masc: يقال
Aspect=Imp|Mood=Ind|Number=Plur|Person=1|VerbForm=Fin|Voice=Act; نقول
Aspect=Imp|Mood=Ind|Number=Plur|Person=3|VerbForm=Fin|Voice=Act; Masc: يقولون
Aspect=Imp|Mood=Sub|Number=Sing|Person=1|VerbForm=Fin|Voice=Act; أقول
Aspect=Imp|Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Act; Masc: يقول
Aspect=Perf|Number=Sing|Person=1|Voice=Act; قلت
Aspect=Perf|Number=Sing|Person=3|Voice=Act; Masc: قال; Fem: قالت
Aspect=Perf|Number=Sing|Person=3|Voice=Pass; Masc: قيل
Aspect=Perf|Number=Dual|Person=3|Voice=Act; Masc: قالا
Aspect=Perf|Number=Plur|Person=3|Voice=Act; Masc: قالوا

PRON

10877 PRON tokens (100% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (10877; 100%), Person=3 (10131; 93%), Number=Sing (9002; 83%), Case=Gen (7425; 68%).

PRON tokens may have the following values of Gender:

Paradigm هُوَ (values: Masc, Fem)
Case=Acc|Number=Sing|Person=1; ني
Case=Acc|Number=Sing|Person=2; ك
Case=Acc|Number=Sing|Person=3; Masc: ه; Fem: ها
Case=Acc|Number=Dual|Person=3; Masc: هما; Fem: هما
Case=Acc|Number=Plur|Person=1; نا
Case=Acc|Number=Plur|Person=2; Masc: كم
Case=Acc|Number=Plur|Person=3; Masc: هم; Fem: هن
Case=Gen|Number=Sing|Person=1; ي, ني
Case=Gen|Number=Sing|Person=2; Masc: ك; Fem: ك
Case=Gen|Number=Sing|Person=3; Masc: ه, إدانته, استعداداته, انتشاره, بلاده, تجهيزه, حكومته, زنزانته, طائرته, لاراضيه, مستقبله, والده, وغربه; Fem: ها, أعضائها, أهدافها, إليها, بضمانها, بفقدانها, بهويتها, تجارتها, تجميدها, تخصيصها, مستشفياتها, مواجهتها, نهايتها
Case=Gen|Number=Dual|Person=2; كما
Case=Gen|Number=Dual|Person=3; Masc: هما; Fem: هما
Case=Gen|Number=Plur|Person=1; نا, لمساعدتنا
Case=Gen|Number=Plur|Person=2; Masc: كم
Case=Gen|Number=Plur|Person=3; Masc: هم, استبعادهم, بأنفسهم, بلادهم, بهم, شفائهم, لهم; Fem: هن
Case=Nom|Number=Sing|Person=1; أنا, انا
Case=Nom|Number=Sing|Person=2; أنت
Case=Nom|Number=Sing|Person=3; Masc: هو; Fem: هي, هى, وهي
Case=Nom|Number=Dual|Person=3; Masc: هما; Fem: هما
Case=Nom|Number=Plur|Person=1; نحن
Case=Nom|Number=Plur|Person=2; Masc: انتم, أنتم
Case=Nom|Number=Plur|Person=3; Masc: هم; Fem: هن

DET

4668 DET tokens (79% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4383; 94%), Case=Gen (3107; 67%), PronType=Rel (2532; 54%).

DET tokens may have the following values of Gender:

Paradigm اَلَّذِي (values: Masc, Fem)
Case=Acc|Number=Sing; Masc: الذي, الذى; Fem: التي, التى
Case=Acc|Number=Dual; Masc: اللذين; Fem: اللتين
Case=Acc|Number=Plur; Masc: الذين
Case=Gen|Number=Sing; Masc: الذي, الذى; Fem: التي, التى
Case=Gen|Number=Dual; Masc: اللذين; Fem: اللتين
Case=Gen|Number=Plur; Masc: الذين; Fem: اللواتي, اللاتى, اللاتي
Case=Nom|Number=Sing; Masc: الذي, الذى; Fem: التي, التى
Case=Nom|Number=Dual; Masc: اللذان; Fem: اللتان
Case=Nom|Number=Plur; Masc: الذين

NUM

702 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (702; 100%), Number=EMPTY (702; 100%), Case=Gen (411; 59%), Definite=Cons (401; 57%).

NUM tokens may have the following values of Gender:

Paradigm ثَلَاثَة (values: Masc, Fem)
Case=Acc|Definite=Cons; Masc: ثلاثة; Fem: ثلاث
Case=Acc|Definite=Def; Masc: الثلاثة, الثلاثـــــة; Fem: الثلاث
Case=Acc|Definite=Ind; Masc: ثلاثة; Fem: ثلاثا
Case=Gen|Definite=Com; Masc: الثلاثة
Case=Gen|Definite=Cons; Masc: ثلاثة; Fem: ثلاث
Case=Gen|Definite=Def; Masc: الثلاثة; Fem: الثلاث
Case=Gen|Definite=Ind; Masc: ثلاثة; Fem: ثلاث
Case=Nom|Definite=Cons; Masc: ثلاثة; Fem: ثلاث
Case=Nom|Definite=Def; Masc: الثلاثة; Fem: الثلاث
Case=Nom|Definite=Ind; Masc: ثلاثة; Fem: ثلاث

AUX

685 AUX tokens (100% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=3 (670; 98%), Number=Sing (660; 96%), Voice=Act (628; 92%), Aspect=Perf (391; 57%), Mood=EMPTY (391; 57%), VerbForm=EMPTY (391; 57%).

AUX tokens may have the following values of Gender:

Paradigm كَان (values: Masc, Fem)
Aspect=Imp|Mood=Ind|Number=Sing|Person=1|VerbForm=Fin|Voice=Act; أكون
Aspect=Imp|Mood=Ind|Number=Sing|Person=2|VerbForm=Fin|Voice=Act; Masc: تكون
Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin|Voice=Act; Masc: يكون; Fem: تكون
Aspect=Imp|Mood=Ind|Number=Plur|Person=2|VerbForm=Fin|Voice=Act; Masc: تكونون
Aspect=Imp|Mood=Jus|Number=Sing|Person=3|VerbForm=Fin|Voice=Act; Masc: يكن; Fem: تكن
Aspect=Imp|Mood=Sub|Number=Sing|Person=1|VerbForm=Fin|Voice=Act; اكون
Aspect=Imp|Mood=Sub|Number=Sing|Person=3|VerbForm=Fin|Voice=Act; Masc: يكون; Fem: تكون
Aspect=Imp|Mood=Sub|Number=Dual|Person=3|VerbForm=Fin|Voice=Act; Masc: يكونا
Aspect=Imp|Mood=Sub|Number=Plur|Person=1|VerbForm=Fin|Voice=Act; نكون
Aspect=Perf|Number=Sing|Person=1|Voice=Act; كنت
Aspect=Perf|Number=Sing|Person=3|Voice=Act; Masc: كان; Fem: كانت
Aspect=Perf|Number=Plur|Person=3|Voice=Act; Masc: كانوا
Mood=Imp|Number=Sing|VerbForm=Fin; Masc: كن

NOUN

27 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Case=Gen (22; 81%), Number=Plur (21; 78%).

NOUN tokens may have the following values of Gender:

Gender seems to be a lexical feature of NOUN: 100% of lemmas (21) occur with only one value of Gender.
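The lexical-feature observation rests on a simple check: group the tokens of one part of speech by lemma and see what share of lemmas is attested with exactly one Gender value. A hedged sketch of that check follows. It assumes the input is a list of (lemma, Gender value) pairs for tokens that already have a non-empty Gender. The function name `single_value_lemma_share` is hypothetical, and the lemmas in the sample are placeholders rather than treebank data.

```python
from collections import defaultdict


def single_value_lemma_share(tokens):
    """Return (number of lemmas, share of lemmas seen with exactly one value).

    `tokens` is an iterable of (lemma, gender) pairs with non-empty gender.
    """
    values_by_lemma = defaultdict(set)
    for lemma, gender in tokens:
        values_by_lemma[lemma].add(gender)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return len(values_by_lemma), single / len(values_by_lemma)


# Placeholder lemmas, for illustration only.
sample = [
    ("lemma_a", "Fem"),
    ("lemma_a", "Fem"),
    ("lemma_b", "Masc"),
    ("lemma_c", "Masc"),
    ("lemma_c", "Fem"),   # lemma_c varies, so it does not count as lexical
]
n_lemmas, share = single_value_lemma_share(sample)
print(f"{100 * share:.0f}% of lemmas ({n_lemmas}) occur with only one value")
```

A share close to 100% suggests the feature is a property of the lemma, while a low share suggests it is inflectional for that part of speech.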

PROPN

3 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: VERB –[conj]–> VERB (1929; 75%), VERB –[nsubj]–> DET (1566; 74%), VERB –[ccomp]–> VERB (1503; 58%), ADJ –[conj]–> ADJ (891; 99%), VERB –[advcl]–> VERB (761; 67%), VERB –[obj]–> PRON (723; 55%), VERB –[xcomp]–> VERB (628; 98%), VERB –[nsubj]–> PRON (626; 99%), VERB –[xcomp]–> ADJ (505; 95%), VERB –[obl]–> ADJ (430; 59%).
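Agreement figures like these can be approximated by walking every dependency edge and comparing the Gender of parent and child. The sketch below is an illustration, not the generating script. It assumes the percentage is the share of agreeing pairs among edges of that type where both nodes have a non-empty Gender, which is one reading of the numbers and is not stated on this page. The token dictionaries, their keys, and the name `gender_agreement` are hypothetical.

```python
from collections import Counter


def gender_agreement(sentences):
    """Count Gender agreement per (parent UPOS, relation, child UPOS).

    Each sentence is a list of dicts with keys: id, head, upos, deprel,
    gender (None when the token has no Gender).
    Returns (agree, total) Counters keyed by the relation triple.
    """
    agree = Counter()
    total = Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root (head 0) or dangling head
            if child["gender"] is None or parent["gender"] is None:
                continue  # assumption: only pairs where both have Gender
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if child["gender"] == parent["gender"]:
                agree[key] += 1
    return agree, total


# Made-up sentence, for illustration only.
sample = [[
    {"id": 1, "head": 0, "upos": "VERB", "deprel": "root", "gender": "Masc"},
    {"id": 2, "head": 1, "upos": "PRON", "deprel": "nsubj", "gender": "Masc"},
    {"id": 3, "head": 1, "upos": "VERB", "deprel": "conj", "gender": "Fem"},
    {"id": 4, "head": 1, "upos": "VERB", "deprel": "conj", "gender": "Masc"},
    {"id": 5, "head": 1, "upos": "NOUN", "deprel": "obj", "gender": None},
]]
agree, total = gender_agreement(sample)
for (parent, rel, child), n in agree.most_common(10):
    pct = 100 * n / total[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({n}; {pct:.0f}%)")
```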