Treebank Statistics: UD_Arabic-NYUAD: Features: Case
This feature is universal.
It occurs with 3 different values: Acc, Gen, Nom.
334414 tokens (45%) have a non-empty value of Case.
1 types (0) occur at least once with a non-empty value of Case.
230 lemmas (5%) occur at least once with a non-empty value of Case.
The feature is used with 16 part-of-speech tags: NOUN (213749; 29% instances), ADJ (65605; 9% instances), PRON (24351; 3% instances), ADV (16050; 2% instances), PROPN (10760; 1% instances), NUM (3387; 0% instances), ADP (137; 0% instances), PUNCT (126; 0% instances), CCONJ (73; 0% instances), VERB (60; 0% instances), DET (46; 0% instances), X (38; 0% instances), AUX (12; 0% instances), SCONJ (11; 0% instances), PART (8; 0% instances), INTJ (1; 0% instances).
NOUN
213749 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (189550; 89%), Gender=Masc (147050; 69%).
NOUN tokens may have the following values of Case:
Acc(41522; 19% of non-emptyCase): _Gen(142652; 67% of non-emptyCase): _Nom(29575; 14% of non-emptyCase): _EMPTY(8150): _
ADJ
65605 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (62554; 95%), Definite=Def (43275; 66%), Gender=Masc (33608; 51%).
ADJ tokens may have the following values of Case:
Acc(13586; 21% of non-emptyCase): _Gen(40733; 62% of non-emptyCase): _Nom(11286; 17% of non-emptyCase): _EMPTY(3750): _
PRON
24351 PRON tokens (56% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: PronType=Prs (22577; 93%), Person=3 (21806; 90%), Definite=Def (21312; 88%), Number=Sing (19722; 81%), Gender=Masc (15770; 65%).
PRON tokens may have the following values of Case:
Acc(4778; 20% of non-emptyCase): _Gen(16699; 69% of non-emptyCase): _Nom(2874; 12% of non-emptyCase): _EMPTY(19144): _
ADV
16050 ADV tokens (67% of all ADV tokens) have a non-empty value of Case.
The most frequent other feature values with which ADV and Case co-occurred: Number=Sing (16050; 100%), Polarity=EMPTY (16050; 100%), Gender=Masc (16029; 100%), Definite=Com (15102; 94%).
ADV tokens may have the following values of Case:
Acc(13032; 81% of non-emptyCase): _Gen(2640; 16% of non-emptyCase): _Nom(378; 2% of non-emptyCase): _EMPTY(8017): _
PROPN
10760 PROPN tokens (19% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (10345; 96%), Gender=Masc (9152; 85%).
PROPN tokens may have the following values of Case:
Acc(2300; 21% of non-emptyCase): _Gen(6548; 61% of non-emptyCase): _Nom(1912; 18% of non-emptyCase): _EMPTY(46661): _
Case seems to be lexical feature of PROPN. 100% lemmas (214) occur only with one value of Case.
NUM
3387 NUM tokens (22% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (3198; 94%), Number=Sing (2986; 88%), Definite=Com (2440; 72%), Gender=Masc (2019; 60%).
NUM tokens may have the following values of Case:
Acc(951; 28% of non-emptyCase): _Gen(2114; 62% of non-emptyCase): _Nom(322; 10% of non-emptyCase): _EMPTY(11990): _
ADP
137 ADP tokens (0% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (128; 93%).
ADP tokens may have the following values of Case:
Acc(42; 31% of non-emptyCase): _Gen(80; 58% of non-emptyCase): _Nom(15; 11% of non-emptyCase): _EMPTY(91606): _
PUNCT
126 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Case.
PUNCT tokens may have the following values of Case:
Acc(14; 11% of non-emptyCase): _Gen(100; 79% of non-emptyCase): _Nom(12; 10% of non-emptyCase): _EMPTY(75140): _
CCONJ
73 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Case.
CCONJ tokens may have the following values of Case:
Acc(19; 26% of non-emptyCase): _Gen(48; 66% of non-emptyCase): _Nom(6; 8% of non-emptyCase): _EMPTY(49088): _
VERB
60 VERB tokens (0% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (60; 100%), Mood=EMPTY (60; 100%), Voice=EMPTY (60; 100%), Number=Sing (56; 93%), Person=EMPTY (51; 85%), Gender=Masc (49; 82%).
VERB tokens may have the following values of Case:
Acc(24; 40% of non-emptyCase): _Gen(21; 35% of non-emptyCase): _Nom(15; 25% of non-emptyCase): _EMPTY(55409): _
DET
46 DET tokens (1% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Definite=Ind (41; 89%), Number=Dual (39; 85%), Gender=Masc (25; 54%).
DET tokens may have the following values of Case:
Acc(33; 72% of non-emptyCase): _Gen(4; 9% of non-emptyCase): _Nom(9; 20% of non-emptyCase): _EMPTY(6317): _
X
38 X tokens (4% of all X tokens) have a non-empty value of Case.
The most frequent other feature values with which X and Case co-occurred: Mood=EMPTY (38; 100%), Person=EMPTY (38; 100%), Voice=EMPTY (38; 100%), Gender=Masc (35; 92%), Number=Dual (24; 63%).
X tokens may have the following values of Case:
Acc(25; 66% of non-emptyCase): _Gen(1; 3% of non-emptyCase): _Nom(12; 32% of non-emptyCase): _EMPTY(889): _
AUX
12 AUX tokens (0% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Mood=EMPTY (12; 100%), Voice=EMPTY (12; 100%), Number=Sing (11; 92%), Person=EMPTY (10; 83%), Gender=Masc (7; 58%).
AUX tokens may have the following values of Case:
Acc(3; 25% of non-emptyCase): _Gen(6; 50% of non-emptyCase): _Nom(3; 25% of non-emptyCase): _EMPTY(9143): _
SCONJ
11 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Case.
SCONJ tokens may have the following values of Case:
Acc(3; 27% of non-emptyCase): _Gen(4; 36% of non-emptyCase): _Nom(4; 36% of non-emptyCase): _EMPTY(16603): _
PART
8 PART tokens (0% of all PART tokens) have a non-empty value of Case.
PART tokens may have the following values of Case:
Acc(1; 13% of non-emptyCase): _Gen(6; 75% of non-emptyCase): _Nom(1; 13% of non-emptyCase): _EMPTY(2513): _
INTJ
1 INTJ tokens (2% of all INTJ tokens) have a non-empty value of Case.
INTJ tokens may have the following values of Case:
Gen(1; 100% of non-emptyCase): _EMPTY(55): _
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (48285; 88%),
NOUN –[nmod:poss]–> NOUN (36195; 66%),
NOUN –[obj]–> NOUN (29062; 65%),
NOUN –[nmod:poss]–> PRON (9419; 61%),
NOUN –[nmod]–> NOUN (5836; 51%),
ADJ –[obj]–> ADJ (2236; 91%),
NOUN –[iobj]–> NOUN (1515; 53%),
ADJ –[amod]–> ADJ (1020; 78%),
NOUN –[nmod:poss]–> ADJ (678; 57%),
NOUN –[nsubj]–> PRON (669; 60%).