
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic: Features: Definite

This feature is universal, but the value Cons is language-specific. It occurs with 4 different values: Com, Cons, Def, Ind.

123453 tokens (44%) have a non-empty value of Definite. 14971 types (57%) occur at least once with a non-empty value of Definite. 6324 lemmas (38%) occur at least once with a non-empty value of Definite. The feature is used with 3 part-of-speech tags: NOUN (92032; 33% instances), ADJ (29216; 10% instances), NUM (2205; 1% instances).
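Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten-column CoNLL-U layout (FEATS in column 6, UPOS in column 4) and assumes that the figures are computed over syntactic words, so multiword-token ranges and empty nodes are skipped. The function names and the sample sentence are hypothetical.

```python
from collections import Counter

def parse_feats(feats: str) -> dict:
    """Turn a CoNLL-U FEATS string such as 'Case=Gen|Definite=Def' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def definite_counts(conllu_text: str):
    """Return (words with Definite, total words, Counter of UPOS among words with Definite)."""
    with_feature = 0
    total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Definite" in parse_feats(cols[5]):
            with_feature += 1
            by_upos[cols[3]] += 1
    return with_feature, total, by_upos

# Hypothetical sample, not taken from the treebank.
SAMPLE = (
    "# text = hypothetical sample\n"
    "1\tاليوم\tيَوم\tNOUN\t_\tCase=Nom|Definite=Def|Number=Sing\t0\troot\t_\t_\n"
    "2\tالأول\tأَوَّل\tADJ\t_\tCase=Nom|Definite=Def|Gender=Masc|Number=Sing\t1\tamod\t_\t_\n"
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_\n"
)
with_feature, total, by_upos = definite_counts(SAMPLE)
print(with_feature, total, dict(by_upos))
```

The share of tokens with the feature is then `with_feature / total`; type and lemma counts could be obtained the same way by collecting the FORM and LEMMA columns into sets.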

NOUN

92032 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (70839; 77%), Case=Gen (66157; 72%).

NOUN tokens may have the following values of Definite:

Paradigm يَوم (values of Definite: Ind, Def, Cons)
Case=Acc|Number=Sing: Ind يوما, يوماً; Def اليوم; Cons يوم
Case=Acc|Number=Dual: Ind يومين; Cons يومي, يومى
Case=Acc|Number=Plur: Ind اياما; Def الأيام, الايام; Cons ايام, أيام
Case=Gen|Number=Sing: Ind يوم; Def اليوم; Cons يوم
Case=Gen|Number=Dual: Ind يومين; Def اليومين
Case=Gen|Number=Plur: Ind أيام, ايام; Def الايام, الأيام; Cons أيام
Case=Nom|Number=Sing: Ind يوم; Def اليوم; Cons يوم
Case=Nom|Number=Plur: Def الايام, الأيام; Cons أيام
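A paradigm of this shape can be assembled by grouping the word forms of one lemma by their remaining features and by the value of Definite. The sketch below is illustrative only: `build_paradigm` is a hypothetical helper, it uses all remaining features as the row key (the page itself may select only some of them, such as Case and Number), and the input rows are a small hand-picked subset of the forms listed above.

```python
from collections import defaultdict

def build_paradigm(rows, lemma, upos):
    """Map (other features, Definite value) to the sorted word forms seen for one lemma.

    `rows` is an iterable of (form, lemma, upos, feats) tuples, where `feats`
    is a CoNLL-U FEATS string.
    """
    table = defaultdict(set)
    for form, row_lemma, row_upos, feats in rows:
        if row_lemma != lemma or row_upos != upos or feats == "_":
            continue
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        definite = pairs.pop("Definite", None)
        if definite is None:
            continue  # only forms that carry the feature enter the paradigm
        # Assumption: every remaining feature becomes part of the row key.
        row_key = "|".join(f"{k}={v}" for k, v in sorted(pairs.items()))
        table[(row_key, definite)].add(form)
    return {key: sorted(forms) for key, forms in table.items()}

ROWS = [
    ("اليوم", "يَوم", "NOUN", "Case=Gen|Definite=Def|Number=Sing"),
    ("يوم", "يَوم", "NOUN", "Case=Gen|Definite=Cons|Number=Sing"),
    ("يوم", "يَوم", "NOUN", "Case=Gen|Definite=Ind|Number=Sing"),
    ("الأيام", "يَوم", "NOUN", "Case=Gen|Definite=Def|Number=Plur"),
]
paradigm = build_paradigm(ROWS, "يَوم", "NOUN")
for (row_key, definite), forms in sorted(paradigm.items()):
    print(row_key, definite, ", ".join(forms))
```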

ADJ

29216 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Number=Sing (27490; 94%), Case=Gen (19099; 65%), Gender=Masc (15035; 51%).

ADJ tokens may have the following values of Definite:

Paradigm أَوَّل (column order: Ind, Def, Com, Cons; cells are separated by " / " and empty cells are omitted)
Case=Acc|Gender=Masc|Number=Sing: أول, أولاً, اول, اولا, اولاً, أولا / الاول, الأول / أول, اول
Case=Acc|Gender=Masc|Number=Plur: الأولى
Case=Acc|Gender=Fem|Number=Sing: الاولى, الأولى / أولى, اولى
Case=Gen|Gender=Masc|Number=Sing: أول, اول / الاول, الأول / أول, اول
Case=Gen|Gender=Masc|Number=Plur: الأول, الاوائل / الأولى / أوائل, اوائل
Case=Gen|Gender=Fem|Number=Sing: أولى, اولى / الاولى, الأولى / أولى
Case=Gen|Gender=Fem|Number=Dual: الاوليين
Case=Gen|Gender=Fem|Number=Plur: الاوليات
Case=Nom|Gender=Masc|Number=Sing: اول / الأول, الاول / اول, أول
Case=Nom|Gender=Fem|Number=Sing: اولى, أولى / الاولى, الأولى / أولى, اولى

NUM

2205 NUM tokens (28% of all NUM tokens) have a non-empty value of Definite.

The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (2205; 100%), Case=Gen (1233; 56%), Number=Sing (1154; 52%).

NUM tokens may have the following values of Definite:

Paradigm ثَلَاثَة (column order: Ind, Def, Com, Cons; cells are separated by " / " and empty cells are omitted)
Case=Acc|Gender=Masc: ثلاثة / الثلاثة, الثلاثـــــة / ثلاثة
Case=Acc|Gender=Fem: ثلاثا / الثلاث / ثلاث
Case=Gen|Gender=Masc: ثلاثة / الثلاثة / الثلاثة / ثلاثة
Case=Gen|Gender=Fem: ثلاث / الثلاث / ثلاث
Case=Nom|Gender=Masc: ثلاثة / الثلاثة / ثلاثة
Case=Nom|Gender=Fem: ثلاث / الثلاث / ثلاث

Relations with Agreement in Definite

The 10 most frequent relations where parent and child node agree in Definite: NOUN –[amod]–> ADJ (19082; 84%), NOUN –[conj]–> NOUN (3731; 71%), ADJ –[conj]–> ADJ (832; 98%), NOUN –[appos]–> NOUN (313; 79%), ADJ –[amod]–> ADJ (234; 82%), NOUN –[conj]–> ADJ (122; 66%), ADJ –[conj]–> NOUN (90; 52%), NOUN –[orphan]–> NOUN (66; 75%), NOUN –[nsubj]–> ADJ (56; 84%), NOUN –[case]–> NOUN (40; 55%).
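Agreement figures like these can be approximated by walking the dependency tree of each sentence and comparing the Definite value of every node with that of its parent. The sketch below is a hedged illustration rather than the page's actual method: it assumes that only parent-child pairs in which both nodes carry Definite are counted, and that the percentage is the share of those pairs with identical values. `definite_of`, `agreement_by_relation`, and the sample sentence are hypothetical.

```python
from collections import Counter

def definite_of(feats: str):
    """Return the Definite value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Definite":
            return value
    return None

def agreement_by_relation(sentence_lines):
    """Per (parent UPOS, deprel, child UPOS): pairs where both have Definite, and pairs that agree."""
    nodes = {}
    for line in sentence_lines:
        cols = line.split("\t")
        # Keep syntactic words only (integer IDs, ten columns).
        if len(cols) == 10 and cols[0].isdigit():
            nodes[cols[0]] = (cols[3], definite_of(cols[5]), cols[6], cols[7])
    both = Counter()
    agree = Counter()
    for upos, definite, head, deprel in nodes.values():
        parent = nodes.get(head)  # None for the root, whose HEAD is 0
        if parent is None or definite is None or parent[1] is None:
            continue
        key = (parent[0], deprel, upos)
        both[key] += 1
        if parent[1] == definite:
            agree[key] += 1
    return both, agree

# Hypothetical sentence: one definite noun with two adjectival modifiers.
SENTENCE = [
    "1\tاليوم\tيَوم\tNOUN\t_\tCase=Nom|Definite=Def|Number=Sing\t0\troot\t_\t_",
    "2\tالأول\tأَوَّل\tADJ\t_\tCase=Nom|Definite=Def|Gender=Masc|Number=Sing\t1\tamod\t_\t_",
    "3\tأول\tأَوَّل\tADJ\t_\tCase=Nom|Definite=Ind|Gender=Masc|Number=Sing\t1\tamod\t_\t_",
]
both, agree = agreement_by_relation(SENTENCE)
key = ("NOUN", "amod", "ADJ")
print(key, agree[key], both[key])
```

Summing `both` and `agree` over all sentences and dividing gives a per-relation agreement rate comparable in form to the figures above.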