Treebank Statistics: UD_Arabic-NYUAD: Features: Number
This feature is universal.
It occurs with 3 different values: Dual
, Plur
, Sing
.
477701 tokens (65%) have a non-empty value of Number
.
1 types (0) occur at least once with a non-empty value of Number
.
4838 lemmas (96%) occur at least once with a non-empty value of Number
.
The feature is used with 16 part-of-speech tags: NOUN (221645; 30% instances), ADJ (69179; 9% instances), VERB (55373; 7% instances), PROPN (54272; 7% instances), PRON (43070; 6% instances), ADV (19509; 3% instances), DET (6065; 1% instances), AUX (4101; 1% instances), NUM (3526; 0% instances), X (482; 0% instances), ADP (192; 0% instances), PUNCT (154; 0% instances), CCONJ (88; 0% instances), SCONJ (25; 0% instances), PART (17; 0% instances), INTJ (3; 0% instances).
NOUN
221645 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Masc (154639; 70%), Case=Gen (142652; 64%).
NOUN
tokens may have the following values of Number
:
Dual
(2594; 1% of non-emptyNumber
): _Plur
(21729; 10% of non-emptyNumber
): _Sing
(197322; 89% of non-emptyNumber
): _EMPTY
(254): _
ADJ
69179 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Definite=Def (45840; 66%), Case=Gen (40733; 59%), Gender=Masc (37008; 53%).
ADJ
tokens may have the following values of Number
:
Dual
(620; 1% of non-emptyNumber
): _Plur
(2436; 4% of non-emptyNumber
): _Sing
(66123; 96% of non-emptyNumber
): _EMPTY
(176): _
VERB
55373 VERB tokens (100% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Person=3 (51943; 94%), Voice=Act (51452; 93%), Mood=Ind (50158; 91%), Gender=Masc (37018; 67%), Aspect=Perf (28891; 52%).
VERB
tokens may have the following values of Number
:
Dual
(603; 1% of non-emptyNumber
): _Plur
(5038; 9% of non-emptyNumber
): _Sing
(49732; 90% of non-emptyNumber
): _EMPTY
(96): _
PROPN
54272 PROPN tokens (95% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Gender=Masc (51122; 94%), Case=EMPTY (43512; 80%), Definite=Ind (40325; 74%).
PROPN
tokens may have the following values of Number
:
Dual
(462; 1% of non-emptyNumber
): _Plur
(229; 0% of non-emptyNumber
): _Sing
(53581; 99% of non-emptyNumber
): _EMPTY
(3149): _
Number
seems to be lexical feature of PROPN
. 100% lemmas (4785) occur only with one value of Number
.
PRON
43070 PRON tokens (99% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=Prs (30458; 71%), Definite=Def (30207; 70%), Person=3 (29809; 69%), Gender=Masc (27294; 63%).
PRON
tokens may have the following values of Number
:
Dual
(726; 2% of non-emptyNumber
): _Plur
(5966; 14% of non-emptyNumber
): _Sing
(36378; 84% of non-emptyNumber
): _EMPTY
(425): _
Number
seems to be lexical feature of PRON
. 92% lemmas (12) occur only with one value of Number
.
ADV
19509 ADV tokens (81% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: Polarity=EMPTY (19509; 100%), Gender=Masc (19487; 100%), Definite=Com (15109; 77%), Case=Acc (13032; 67%).
ADV
tokens may have the following values of Number
:
Plur
(1; 0% of non-emptyNumber
): _Sing
(19508; 100% of non-emptyNumber
): _EMPTY
(4558): _
DET
6065 DET tokens (95% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=Ind (6055; 100%), Gender=Masc (3821; 63%).
DET
tokens may have the following values of Number
:
Dual
(39; 1% of non-emptyNumber
): _Plur
(145; 2% of non-emptyNumber
): _Sing
(5881; 97% of non-emptyNumber
): _EMPTY
(298): _
AUX
4101 AUX tokens (45% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Voice=Act (4076; 99%), Person=3 (3924; 96%), Mood=Ind (3347; 82%), Gender=Masc (2726; 66%).
AUX
tokens may have the following values of Number
:
Dual
(42; 1% of non-emptyNumber
): _Plur
(235; 6% of non-emptyNumber
): _Sing
(3824; 93% of non-emptyNumber
): _EMPTY
(5054): _
NUM
3526 NUM tokens (23% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (3328; 94%), Definite=Com (2440; 69%), Gender=Masc (2157; 61%), Case=Gen (2114; 60%).
NUM
tokens may have the following values of Number
:
Dual
(111; 3% of non-emptyNumber
): _Plur
(290; 8% of non-emptyNumber
): _Sing
(3125; 89% of non-emptyNumber
): _EMPTY
(11851): _
X
482 X tokens (52% of all X
tokens) have a non-empty value of Number
.
The most frequent other feature values with which X
and Number
co-occurred: Gender=Masc (409; 85%), Mood=EMPTY (285; 59%), Person=EMPTY (277; 57%), Voice=EMPTY (277; 57%).
X
tokens may have the following values of Number
:
Dual
(32; 7% of non-emptyNumber
): _Plur
(26; 5% of non-emptyNumber
): _Sing
(424; 88% of non-emptyNumber
): _EMPTY
(445): _
Number
seems to be lexical feature of X
. 93% lemmas (25) occur only with one value of Number
.
ADP
192 ADP tokens (0% of all ADP
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADP
and Number
co-occurred: AdpType=Prep (176; 92%).
ADP
tokens may have the following values of Number
:
Dual
(3; 2% of non-emptyNumber
): _Plur
(29; 15% of non-emptyNumber
): _Sing
(160; 83% of non-emptyNumber
): _EMPTY
(91551): _
PUNCT
154 PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Number
.
PUNCT
tokens may have the following values of Number
:
Dual
(1; 1% of non-emptyNumber
): _Plur
(13; 8% of non-emptyNumber
): _Sing
(140; 91% of non-emptyNumber
): _EMPTY
(75112): _
CCONJ
88 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Number
.
CCONJ
tokens may have the following values of Number
:
Dual
(5; 6% of non-emptyNumber
): _Plur
(14; 16% of non-emptyNumber
): _Sing
(69; 78% of non-emptyNumber
): _EMPTY
(49073): _
Paradigm w | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Com | _ | ||
Definite=Com | _ | ||
Mood=Ind|Person=3|Voice=Act | _ | _ |
SCONJ
25 SCONJ tokens (0% of all SCONJ
tokens) have a non-empty value of Number
.
SCONJ
tokens may have the following values of Number
:
Plur
(4; 16% of non-emptyNumber
): _Sing
(21; 84% of non-emptyNumber
): _EMPTY
(16589): _
PART
17 PART tokens (1% of all PART
tokens) have a non-empty value of Number
.
PART
tokens may have the following values of Number
:
Plur
(2; 12% of non-emptyNumber
): _Sing
(15; 88% of non-emptyNumber
): _EMPTY
(2504): _
INTJ
3 INTJ tokens (5% of all INTJ
tokens) have a non-empty value of Number
.
INTJ
tokens may have the following values of Number
:
Sing
(3; 100% of non-emptyNumber
): _EMPTY
(53): _
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (46087; 84%),
NOUN –[nmod:poss]–> NOUN (45604; 83%),
NOUN –[obj]–> NOUN (37305; 83%),
VERB –[obj]–> NOUN (26365; 80%),
VERB –[nsubj]–> NOUN (16145; 86%),
PROPN –[flat]–> PROPN (13747; 96%),
NOUN –[nmod:poss]–> PRON (11788; 75%),
VERB –[iobj]–> NOUN (11534; 83%),
NOUN –[nmod]–> NOUN (10483; 91%),
ADV –[nmod:poss]–> NOUN (9070; 83%).