Treebank Statistics: UD_Arabic-NYUAD: Features: Definite
This feature is universal.
It occurs with 3 different values: Com, Def, Ind.
417286 tokens (56%) have a non-empty value of Definite.
1 types (0) occur at least once with a non-empty value of Definite.
4543 lemmas (90%) occur at least once with a non-empty value of Definite.
The feature is used with 16 part-of-speech tags: NOUN (221551; 30% instances), ADJ (69167; 9% instances), PROPN (53581; 7% instances), PRON (43051; 6% instances), ADV (19507; 3% instances), DET (6060; 1% instances), NUM (3524; 0% instances), X (281; 0% instances), ADP (151; 0% instances), PUNCT (147; 0% instances), VERB (136; 0% instances), CCONJ (79; 0% instances), PART (17; 0% instances), SCONJ (16; 0% instances), AUX (15; 0% instances), INTJ (3; 0% instances).
NOUN
221551 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (197241; 89%), Gender=Masc (154565; 70%), Case=Gen (142652; 64%).
NOUN tokens may have the following values of Definite:
Com(83991; 38% of non-emptyDefinite): _Def(96157; 43% of non-emptyDefinite): _Ind(41403; 19% of non-emptyDefinite): _EMPTY(348): _
ADJ
69167 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Number=Sing (66115; 96%), Case=Gen (40733; 59%), Gender=Masc (36996; 53%).
ADJ tokens may have the following values of Definite:
Com(2397; 3% of non-emptyDefinite): _Def(45841; 66% of non-emptyDefinite): _Ind(20929; 30% of non-emptyDefinite): _EMPTY(188): _
PROPN
53581 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (53128; 99%), Gender=Masc (50581; 94%), Case=EMPTY (42821; 80%).
PROPN tokens may have the following values of Definite:
Com(3307; 6% of non-emptyDefinite): _Def(9949; 19% of non-emptyDefinite): _Ind(40325; 75% of non-emptyDefinite): _EMPTY(3840): _
Definite seems to be lexical feature of PROPN. 99% lemmas (4476) occur only with one value of Definite.
PRON
43051 PRON tokens (99% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Number=Sing (36362; 84%), PronType=Prs (30458; 71%), Person=3 (29793; 69%), Gender=Masc (27279; 63%).
PRON tokens may have the following values of Definite:
Com(94; 0% of non-emptyDefinite): _Def(30207; 70% of non-emptyDefinite): _Ind(12750; 30% of non-emptyDefinite): _EMPTY(444): _
Definite seems to be lexical feature of PRON. 92% lemmas (12) occur only with one value of Definite.
ADV
19507 ADV tokens (81% of all ADV tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: Number=Sing (19507; 100%), Polarity=EMPTY (19507; 100%), Gender=Masc (19486; 100%), Case=Acc (13032; 67%).
ADV tokens may have the following values of Definite:
Com(15109; 77% of non-emptyDefinite): _Def(9; 0% of non-emptyDefinite): _Ind(4389; 22% of non-emptyDefinite): _EMPTY(4560): _
DET
6060 DET tokens (95% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Number=Sing (5876; 97%), Gender=Masc (3818; 63%).
DET tokens may have the following values of Definite:
Com(3; 0% of non-emptyDefinite): _Def(2; 0% of non-emptyDefinite): _Ind(6055; 100% of non-emptyDefinite): _EMPTY(303): _
NUM
3524 NUM tokens (23% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (3328; 94%), Number=Sing (3123; 89%), Gender=Masc (2155; 61%), Case=Gen (2114; 60%).
NUM tokens may have the following values of Definite:
Com(2440; 69% of non-emptyDefinite): _Def(341; 10% of non-emptyDefinite): _Ind(743; 21% of non-emptyDefinite): _EMPTY(11853): _
X
281 X tokens (30% of all X tokens) have a non-empty value of Definite.
The most frequent other feature values with which X and Definite co-occurred: Mood=EMPTY (281; 100%), Person=EMPTY (281; 100%), Voice=EMPTY (281; 100%), Number=Sing (248; 88%), Gender=Masc (217; 77%).
X tokens may have the following values of Definite:
Com(18; 6% of non-emptyDefinite): _Def(56; 20% of non-emptyDefinite): _Ind(207; 74% of non-emptyDefinite): _EMPTY(646): _
Definite seems to be lexical feature of X. 92% lemmas (23) occur only with one value of Definite.
ADP
151 ADP tokens (0% of all ADP tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADP and Definite co-occurred: AdpType=Prep (141; 93%).
ADP tokens may have the following values of Definite:
Com(6; 4% of non-emptyDefinite): _Def(93; 62% of non-emptyDefinite): _Ind(52; 34% of non-emptyDefinite): _EMPTY(91592): _
PUNCT
147 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Definite.
PUNCT tokens may have the following values of Definite:
Com(15; 10% of non-emptyDefinite): _Def(88; 60% of non-emptyDefinite): _Ind(44; 30% of non-emptyDefinite): _EMPTY(75119): _
VERB
136 VERB tokens (0% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Aspect=EMPTY (136; 100%), Mood=EMPTY (136; 100%), Voice=EMPTY (136; 100%), Number=Sing (131; 96%), Person=EMPTY (127; 93%), Gender=Masc (118; 87%).
VERB tokens may have the following values of Definite:
Com(31; 23% of non-emptyDefinite): _Def(19; 14% of non-emptyDefinite): _Ind(86; 63% of non-emptyDefinite): _EMPTY(55333): _
CCONJ
79 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Definite.
CCONJ tokens may have the following values of Definite:
Com(12; 15% of non-emptyDefinite): _Def(46; 58% of non-emptyDefinite): _Ind(21; 27% of non-emptyDefinite): _EMPTY(49082): _
PART
17 PART tokens (1% of all PART tokens) have a non-empty value of Definite.
PART tokens may have the following values of Definite:
Com(2; 12% of non-emptyDefinite): _Def(6; 35% of non-emptyDefinite): _Ind(9; 53% of non-emptyDefinite): _EMPTY(2504): _
SCONJ
16 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Definite.
SCONJ tokens may have the following values of Definite:
Com(2; 13% of non-emptyDefinite): _Def(6; 38% of non-emptyDefinite): _Ind(8; 50% of non-emptyDefinite): _EMPTY(16598): _
AUX
15 AUX tokens (0% of all AUX tokens) have a non-empty value of Definite.
The most frequent other feature values with which AUX and Definite co-occurred: Mood=EMPTY (15; 100%), Voice=EMPTY (15; 100%), Number=Sing (14; 93%), Person=EMPTY (13; 87%), Gender=Masc (10; 67%).
AUX tokens may have the following values of Definite:
Def(7; 47% of non-emptyDefinite): _Ind(8; 53% of non-emptyDefinite): _EMPTY(9140): _
INTJ
3 INTJ tokens (5% of all INTJ tokens) have a non-empty value of Definite.
INTJ tokens may have the following values of Definite:
Ind(3; 100% of non-emptyDefinite): _EMPTY(53): _
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[amod]–> ADJ (46661; 85%),
PROPN –[flat]–> PROPN (11043; 77%),
ADJ –[obj]–> ADJ (2342; 95%),
PROPN –[obj]–> PROPN (1936; 71%),
ADJ –[amod]–> ADJ (1109; 85%),
PROPN –[nmod]–> PRON (401; 69%),
NOUN –[obj]–> ADJ (397; 52%),
ADV –[obj]–> ADV (342; 91%),
ADV –[nmod]–> PROPN (323; 74%),
ADJ –[nsubj]–> PRON (229; 53%).