home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PUD: Features: Number

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing.

12295 tokens (59%) have a non-empty value of Number. 6332 types (93%) occur at least once with a non-empty value of Number. 4380 lemmas (92%) occur at least once with a non-empty value of Number. The feature is used with 7 part-of-speech tags: NOUN (5450; 26% instances), ADJ (1943; 9% instances), VERB (1746; 8% instances), PROPN (1658; 8% instances), PRON (1226; 6% instances), AUX (184; 1% instances), NUM (88; 0% instances).

NOUN

5450 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=Def (4216; 77%), Case=Gen (3779; 69%), Gender=Masc (3635; 67%).

NOUN tokens may have the following values of Number:

Paradigm مِنطَقَةSingDualPlur
Case=Acc|Definite=Defمنطقة, المنطقةالمناطق
Case=Gen|Definite=Defالمنطقة, منطقة, لمنطقة, منطقتالمناطق, مناطق
Case=Gen|Definite=Indمنطقة
Case=Nom|Definite=Defالمنطقة, منطقةمنطقتا

ADJ

1943 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Definite=Def (1216; 63%), Case=Gen (1184; 61%), Gender=Fem (1017; 52%).

ADJ tokens may have the following values of Number:

Paradigm أُستُرالِيّSingDualPlur
Case=Gen|Definite=Def|Gender=Femالأسترالية
Case=Nom|Definite=Def|Gender=Mascالأستراليون
Case=Nom|Definite=Def|Gender=Femالأسترالية
Case=Nom|Definite=Ind|Gender=Mascأستراليانأستراليون

Number seems to be lexical feature of ADJ. 96% lemmas (792) occur only with one value of Number.

VERB

1746 VERB tokens (100% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Person=3 (1657; 95%), Voice=Act (1560; 89%), Gender=Masc (1039; 60%), Aspect=Imp (903; 52%), Tense=Past (883; 51%).

VERB tokens may have the following values of Number:

Paradigm كَانSingDualPlur
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Futيكون
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Presيكون, يكن
Aspect=Imp|Gender=Masc|Mood=Jus|Tense=Pastيكن
Aspect=Imp|Gender=Masc|Mood=Sub|Tense=Futيكون
Aspect=Imp|Gender=Masc|Mood=Sub|Tense=Presيكون
Aspect=Imp|Gender=Fem|Mood=Ind|Tense=Presتكن
Aspect=Perf|Gender=Masc|Tense=Pastكانكانوا
Aspect=Perf|Gender=Fem|Tense=Pastكانتكانتا

Number seems to be lexical feature of VERB. 90% lemmas (568) occur only with one value of Number.

PROPN

1658 PROPN tokens (96% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Definite=EMPTY (1335; 81%), Case=EMPTY (1170; 71%), Gender=Masc (837; 50%).

PROPN tokens may have the following values of Number:

Paradigm وِلايَةSingPlur
Case=Accالولايات
Case=Genولايةالولايات, لولايات
Case=Nomالولايات

Number seems to be lexical feature of PROPN. 99% lemmas (1035) occur only with one value of Number.

PRON

1226 PRON tokens (94% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Case=Gen (820; 67%), Person=3 (756; 62%), Gender=Masc (681; 56%).

PRON tokens may have the following values of Number:

Paradigm هُوَSingDualPlur
Case=Acc|Gender=Masc|Person=2ك
Case=Acc|Gender=Masc|Person=3ههم
Case=Acc|Gender=Fem|Person=3ها
Case=Acc|Person=1نينا
Case=Acc|Person=2ك
Case=Acc|Person=3هما
Case=Gen|Gender=Masc|Person=2ك
Case=Gen|Gender=Masc|Person=3ههم
Case=Gen|Gender=Fem|Person=3هاهن, هم
Case=Gen|Person=1ي, نانا
Case=Gen|Person=2كهماكم
Case=Gen|Person=3هماهم
Case=Nom|Gender=Masc|Person=3هوهم
Case=Nom|Gender=Fem|Person=3هي
Case=Nom|Person=1أنانحن
Gender=Masc|Person=3هو, ه
Gender=Fem|Person=3هي
Person=3هما

AUX

184 AUX tokens (99% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (184; 100%), Person=3 (178; 97%), Tense=Past (161; 88%), Mood=EMPTY (152; 83%), Aspect=Perf (151; 82%), Gender=Masc (114; 62%).

AUX tokens may have the following values of Number:

Paradigm كَانSingPlur
Aspect=Imp|Gender=Masc|Mood=Ind|Person=3|Tense=Futيكون
Aspect=Imp|Gender=Masc|Mood=Ind|Person=3|Tense=Presيكون
Aspect=Imp|Gender=Masc|Mood=Jus|Person=3|Tense=Pastيكن
Aspect=Imp|Gender=Masc|Mood=Sub|Person=3|Tense=Presيكون
Aspect=Imp|Gender=Fem|Mood=Ind|Person=3|Tense=Presتكون
Aspect=Imp|Gender=Fem|Mood=Jus|Person=3|Tense=Pastتكن
Aspect=Imp|Gender=Fem|Mood=Sub|Person=3|Tense=Presتكون
Aspect=Imp|Mood=Jus|Person=3|Tense=Pastأكن
Aspect=Perf|Gender=Masc|Person=2|Tense=Pastكنت
Aspect=Perf|Gender=Masc|Person=3|Tense=Pastكانكانوا
Aspect=Perf|Gender=Fem|Person=3|Tense=Pastكانت
Aspect=Perf|Person=1|Tense=Pastكنتكنا

NUM

88 NUM tokens (24% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: Case=Gen (51; 58%), Gender=Masc (47; 53%).

NUM tokens may have the following values of Number:

Paradigm عشرSingPlur
Case=Accعشر
Case=Genعشرعشر

Number seems to be lexical feature of NUM. 94% lemmas (30) occur only with one value of Number.

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[nmod]–> NOUN (1177; 62%), NOUN –[amod]–> ADJ (914; 67%), VERB –[obl]–> NOUN (818; 75%), VERB –[nsubj]–> NOUN (442; 70%), NOUN –[nmod]–> PROPN (366; 70%), VERB –[obj]–> NOUN (278; 65%), NOUN –[nmod]–> PRON (276; 66%), PROPN –[flat]–> PROPN (232; 96%), PROPN –[amod]–> ADJ (206; 85%), NOUN –[conj]–> NOUN (200; 76%).