This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PUD: Features: Definite

This feature is universal. It occurs with 2 different values: Def, Ind.

7881 tokens (38%) have a non-empty value of Definite. 4441 types (65%) occur at least once with a non-empty value of Definite. 2916 lemmas (61%) occur at least once with a non-empty value of Definite. The feature is used with 5 part-of-speech tags (percentages are shares of all tokens in the treebank): NOUN (5529; 27%), ADJ (2019; 10%), PROPN (323; 2%), VERB (8; 0%), AUX (2; 0%).
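Counts of this kind can be reproduced from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the four sample tokens and the function names `parse_feats` and `definite_by_upos` are made up for illustration.

```python
from collections import Counter

# Hypothetical sample sentence; each row is one CoNLL-U token line (10 columns).
ROWS = [
    ["1", "العام", "عَام", "NOUN", "_", "Case=Nom|Definite=Def|Number=Sing", "0", "root", "_", "_"],
    ["2", "الأول", "أَوَّل", "ADJ", "_", "Case=Nom|Definite=Def|Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
    ["3", "في", "فِي", "ADP", "_", "_", "4", "case", "_", "_"],
    ["4", "عام", "عَام", "NOUN", "_", "Case=Gen|Definite=Ind|Number=Sing", "1", "nmod", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(field):
    """Turn a FEATS string such as 'Case=Gen|Definite=Ind' into a dict."""
    if field == "_":
        return {}
    return dict(item.split("=", 1) for item in field.split("|"))


def definite_by_upos(conllu_text):
    """Return (total token count, Counter of UPOS tags for tokens with Definite)."""
    total = 0
    with_feature = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword-token ranges and empty nodes are not counted
        total += 1
        if "Definite" in parse_feats(cols[5]):
            with_feature[cols[3]] += 1
    return total, with_feature


total, with_feature = definite_by_upos(SAMPLE)
for upos, count in with_feature.most_common():
    print(f"{upos} ({count}; {100 * count / total:.0f}% of all tokens)")
```

For a real treebank, the same function could be applied to the text of the `.conllu` file read with `open(path, encoding="utf-8").read()`.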

NOUN

5529 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (3941; 71%), Case=Gen (3832; 69%), Gender=Masc (3651; 66%).

NOUN tokens may have the following values of Definite:

Paradigm عَام (values: Ind, Def)
Case=Acc|Number=Sing: عاماً, عام
Case=Gen|Number=Sing: Ind عام; Def عام, العام
Case=Gen|Number=Dual: Ind عامين; Def عامي, العامين
Case=Nom|Number=Sing: العام, عام

ADJ

2019 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Number=Sing (1861; 92%), Case=Gen (1230; 61%), Gender=Fem (1017; 50%).

ADJ tokens may have the following values of Definite:

Paradigm أَوَّل (values: Ind, Def)
Case=Acc|Gender=Masc|Number=Sing: Ind أول; Def الأول
Case=Acc|Gender=Fem|Number=Sing: Ind أولى; Def الأولى
Case=Gen: أول
Case=Gen|Gender=Masc|Number=Sing: Ind أول; Def الأول
Case=Gen|Gender=Masc|Number=Plur: أوائل
Case=Gen|Gender=Fem|Number=Sing: Ind أولى; Def الأولى
Case=Nom|Gender=Masc|Number=Sing: الأول, أول
Case=Nom|Gender=Fem|Number=Sing: أولى, الأولى
Case=Nom|Gender=Fem|Number=Plur: أولى

PROPN

323 PROPN tokens (19% of all PROPN tokens) have a non-empty value of Definite.

The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (271; 84%), Case=Gen (266; 82%), Gender=Masc (167; 52%).

PROPN tokens may have the following values of Definite:

Paradigm جَزِيرَة (values: Ind, Def)
Case=Gen|Number=Sing: Ind جزيرة; Def الجزيرة, جزيرة
Case=Gen|Number=Plur: الجزر, جزر
Case=Nom|Number=Plur: جزر

Definite seems to be a lexical feature of PROPN: 98% of lemmas (153) occur with only one value of Definite.
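The "lexical feature" test above amounts to checking how many lemmas are attested with a single value of the feature. A minimal sketch of that check follows; the function name `single_value_share` and the sample lemma/value pairs are hypothetical, and the exact procedure used to generate this page may differ.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Given (lemma, Definite value) pairs, return
    (number of lemmas seen with exactly one value, number of lemmas)."""
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical observations: one lemma varies, three do not.
observations = [
    ("lemma_a", "Def"), ("lemma_a", "Ind"),
    ("lemma_b", "Def"), ("lemma_b", "Def"),
    ("lemma_c", "Ind"),
    ("lemma_d", "Def"),
]
single, lemmas = single_value_share(observations)
print(f"{100 * single / lemmas:.0f}% of lemmas ({single}) occur with only one value")
```

A high share suggests that the value is determined by the lemma rather than by inflection, which is the sense in which the feature is called lexical here.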

VERB

8 VERB tokens (0% of all VERB tokens) have a non-empty value of Definite.

The most frequent other feature values with which VERB and Definite co-occurred: Aspect=EMPTY (8; 100%), Mood=EMPTY (8; 100%), Person=EMPTY (8; 100%), Tense=EMPTY (8; 100%), Voice=EMPTY (7; 88%), Gender=Masc (6; 75%).

VERB tokens may have the following values of Definite:

AUX

2 AUX tokens (1% of all AUX tokens) have a non-empty value of Definite.

The most frequent other feature values with which AUX and Definite co-occurred: Aspect=EMPTY (2; 100%), Gender=Masc (2; 100%), Mood=EMPTY (2; 100%), Number=EMPTY (2; 100%), Person=EMPTY (2; 100%), Tense=EMPTY (2; 100%), Voice=EMPTY (2; 100%).

AUX tokens may have the following values of Definite:

Relations with Agreement in Definite

The 10 most frequent relations where parent and child node agree in Definite: NOUN –[nmod]–> NOUN (1575; 83%), NOUN –[amod]–> ADJ (1357; 99%), NOUN –[conj]–> NOUN (240; 91%), ADJ –[obl]–> NOUN (160; 58%), PROPN –[amod]–> ADJ (154; 64%), NOUN –[ccomp]–> ADJ (88; 92%), ADJ –[conj]–> ADJ (33; 100%), NOUN –[appos]–> NOUN (27; 82%), NOUN –[acl]–> NOUN (16; 94%), NOUN –[nmod]–> ADJ (9; 75%).
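Agreement counts like these can be sketched as a pass over parent-child pairs in each dependency tree. The example below is illustrative only: it assumes that a pair is counted only when both parent and child carry a value of Definite, which is a guess about how the percentages are computed, and the token tuples and the name `agreement_counts` are hypothetical.

```python
from collections import Counter


def agreement_counts(tokens):
    """tokens: list of (id, head, upos, deprel, definite_or_None) for one sentence.
    Returns (agree, total) Counters keyed by (parent UPOS, deprel, child UPOS).
    Assumption: only pairs where both nodes have a Definite value are counted."""
    definite = {tid: value for tid, _, _, _, value in tokens}
    upos = {tid: pos for tid, _, pos, _, _ in tokens}
    agree, total = Counter(), Counter()
    for tid, head, pos, deprel, value in tokens:
        parent_value = definite.get(head)  # None for the root (head 0)
        if value is None or parent_value is None:
            continue
        key = (upos[head], deprel, pos)
        total[key] += 1
        if value == parent_value:
            agree[key] += 1
    return agree, total


# Hypothetical five-token sentence.
sentence = [
    (1, 0, "NOUN", "root", "Def"),
    (2, 1, "ADJ", "amod", "Def"),
    (3, 1, "NOUN", "nmod", "Ind"),
    (4, 3, "ADJ", "amod", "Ind"),
    (5, 3, "ADP", "case", None),
]
agree, total = agreement_counts(sentence)
for (parent, rel, child), n in total.items():
    k = agree[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({k}; {100 * k / n:.0f}%)")
```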