home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: Features: Definite

This feature is universal but the values Cons are language-specific. It occurs with 4 different values: Com, Cons, Def, Ind.

125257 tokens (44%) have a non-empty value of Definite. 15283 types (61%) occur at least once with a non-empty value of Definite. 6545 lemmas (43%) occur at least once with a non-empty value of Definite. The feature is used with 4 part-of-speech tags: NOUN (93680; 33% instances), ADJ (29346; 10% instances), NUM (2207; 1% instances), PROPN (24; 0% instances).

NOUN

93680 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (72077; 77%), Case=Gen (66786; 71%).

NOUN tokens may have the following values of Definite:

Paradigm يَومIndDefCons
Case=Acc|Number=Singيوما, يوماًاليوميوم
Case=Acc|Number=Dualيومينيومي, يومى
Case=Acc|Number=Plurاياماالأيام, الايامايام, أيام
Case=Gen|Number=Singيوماليوميوم
Case=Gen|Number=Dualيوميناليومين
Case=Gen|Number=Plurأيام, ايامالايام, الأيامأيام
Case=Nom|Number=Singيوماليوميوم
Case=Nom|Number=Plurالايام, الأيامأيام

ADJ

29346 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Number=Sing (27609; 94%), Case=Gen (19119; 65%), Gender=Masc (15118; 52%).

ADJ tokens may have the following values of Definite:

Paradigm أَوَّلIndDefComCons
Case=Acc|Gender=Masc|Number=Singأول, أولاً, اول, اولا, أولا, اولاًالاول, الأولأول, اول
Case=Acc|Gender=Masc|Number=Plurالأولى
Case=Acc|Gender=Fem|Number=Singالاولى, الأولىأولى, اولى
Case=Gen|Gender=Masc|Number=Singأول, اولالاول, الأولأول, اول
Case=Gen|Gender=Masc|Number=Plurالأول, الاوائلالأولىأوائل, اوائل
Case=Gen|Gender=Fem|Number=Singأولى, اولىالاولى, الأولىأولى
Case=Gen|Gender=Fem|Number=Dualالاوليين
Case=Gen|Gender=Fem|Number=Plurالاوليات
Case=Nom|Gender=Masc|Number=Singاولالأول, الاولاول, أول
Case=Nom|Gender=Fem|Number=Singاولى, أولىالاولى, الأولىأولى, اولى

NUM

2207 NUM tokens (28% of all NUM tokens) have a non-empty value of Definite.

The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (2207; 100%), Case=Gen (1234; 56%), Number=Sing (1154; 52%).

NUM tokens may have the following values of Definite:

Paradigm ثَلَاثَةIndDefComCons
Case=Acc|Gender=Mascثلاثةالثلاثة, الثلاثـــــةثلاثة
Case=Acc|Gender=Femثلاثاالثلاثثلاث
Case=Gen|Gender=Mascثلاثةالثلاثةالثلاثةثلاثة
Case=Gen|Gender=Femثلاثالثلاثثلاث
Case=Nom|Gender=Mascثلاثةالثلاثةثلاثة
Case=Nom|Gender=Femثلاثالثلاثثلاث

PROPN

24 PROPN tokens (10% of all PROPN tokens) have a non-empty value of Definite.

PROPN tokens may have the following values of Definite:

Definite seems to be lexical feature of PROPN. 100% lemmas (19) occur only with one value of Definite.

Relations with Agreement in Definite

The 10 most frequent relations where parent and child node agree in Definite: NOUN –[amod]–> ADJ (19201; 83%), NOUN –[conj]–> NOUN (4041; 70%), ADJ –[conj]–> ADJ (881; 98%), NOUN –[appos]–> NOUN (311; 81%), ADJ –[amod]–> ADJ (236; 82%), NOUN –[conj]–> ADJ (135; 67%), ADJ –[conj]–> NOUN (98; 54%), NOUN –[orphan]–> NOUN (67; 72%), NOUN –[nsubj]–> ADJ (56; 82%), ADJ –[obj]–> ADJ (20; 100%).