home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: Features: Number

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing.

477701 tokens (65%) have a non-empty value of Number. 1 types (0) occur at least once with a non-empty value of Number. 4839 lemmas (96%) occur at least once with a non-empty value of Number. The feature is used with 16 part-of-speech tags: NOUN (217040; 29% instances), ADJ (67102; 9% instances), VERB (54927; 7% instances), PROPN (54782; 7% instances), PRON (31064; 4% instances), ADV (24659; 3% instances), SCONJ (11439; 2% instances), DET (6040; 1% instances), AUX (4442; 1% instances), NUM (3454; 0% instances), ADP (926; 0% instances), PUNCT (712; 0% instances), CCONJ (562; 0% instances), X (474; 0% instances), PART (75; 0% instances), INTJ (3; 0% instances).

NOUN

217040 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (150830; 69%), Case=Gen (142071; 65%).

NOUN tokens may have the following values of Number:

Number seems to be lexical feature of NOUN. 93% lemmas (39) occur only with one value of Number.

ADJ

67102 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Definite=Def (45521; 68%), Case=Gen (40502; 60%), Gender=Masc (35347; 53%).

ADJ tokens may have the following values of Number:

VERB

54927 VERB tokens (99% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Person=3 (51358; 94%), Voice=Act (50838; 93%), Mood=Ind (49568; 90%), Gender=Masc (36749; 67%), Aspect=Perf (28875; 53%).

VERB tokens may have the following values of Number:

PROPN

54782 PROPN tokens (94% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (51610; 94%), Case=EMPTY (43287; 79%), Definite=Ind (40714; 74%).

PROPN tokens may have the following values of Number:

Number seems to be lexical feature of PROPN. 100% lemmas (4758) occur only with one value of Number.

PRON

31064 PRON tokens (99% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (30458; 98%), Definite=Def (28709; 92%), Person=3 (27619; 89%), Gender=Masc (20076; 65%), Case=Gen (16343; 53%).

PRON tokens may have the following values of Number:

ADV

24659 ADV tokens (93% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: Gender=Masc (23668; 96%), Case=Acc (18316; 74%), Definite=Com (15629; 63%).

ADV tokens may have the following values of Number:

SCONJ

11439 SCONJ tokens (44% of all SCONJ tokens) have a non-empty value of Number.

The most frequent other feature values with which SCONJ and Number co-occurred: Definite=Ind (10387; 91%), Gender=Masc (6788; 59%).

SCONJ tokens may have the following values of Number:

Number seems to be lexical feature of SCONJ. 92% lemmas (12) occur only with one value of Number.

DET

6040 DET tokens (95% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=Ind (6005; 99%), Gender=Masc (3808; 63%).

DET tokens may have the following values of Number:

AUX

4442 AUX tokens (58% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Person=3 (4054; 91%), Voice=Act (4004; 90%), Mood=Ind (3284; 74%), Gender=Masc (3012; 68%).

AUX tokens may have the following values of Number:

Number seems to be lexical feature of AUX. 91% lemmas (10) occur only with one value of Number.

NUM

3454 NUM tokens (23% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (3330; 96%), Definite=Com (2317; 67%), Gender=Masc (2099; 61%), Case=Gen (2039; 59%).

NUM tokens may have the following values of Number:

ADP

926 ADP tokens (1% of all ADP tokens) have a non-empty value of Number.

The most frequent other feature values with which ADP and Number co-occurred: AdpType=Prep (926; 100%).

ADP tokens may have the following values of Number:

Number seems to be lexical feature of ADP. 91% lemmas (29) occur only with one value of Number.

PUNCT

712 PUNCT tokens (1% of all PUNCT tokens) have a non-empty value of Number.

PUNCT tokens may have the following values of Number:

CCONJ

562 CCONJ tokens (1% of all CCONJ tokens) have a non-empty value of Number.

CCONJ tokens may have the following values of Number:

Paradigm wSingDualPlur
Case=Acc|Definite=Com_
Definite=Com_
Mood=Ind|Person=3|Voice=Act__

X

474 X tokens (52% of all X tokens) have a non-empty value of Number.

The most frequent other feature values with which X and Number co-occurred: Gender=Masc (400; 84%), Mood=EMPTY (284; 60%), Voice=EMPTY (275; 58%), Person=EMPTY (274; 58%).

X tokens may have the following values of Number:

Paradigm NoneSingDualPlur
Case=Acc|Definite=Com|Gender=Masc___
Case=Acc|Definite=Def|Gender=Masc__
Case=Acc|Definite=Ind|Gender=Masc__
Case=Acc|Definite=Ind|Gender=Fem_
Case=Gen|Definite=Com|Gender=Masc_
Case=Nom|Definite=Def|Gender=Masc_
Case=Nom|Definite=Ind|Gender=Masc_
Definite=Com|Gender=Masc_
Definite=Def|Gender=Masc_
Definite=Def|Gender=Fem_
Definite=Ind|Gender=Masc_
Definite=Ind|Gender=Fem_
Gender=Masc|Mood=Ind|Person=1|Voice=Act__
Gender=Masc|Mood=Ind|Person=2|Voice=Act_
Gender=Masc|Mood=Ind|Person=3|Voice=Act___
Gender=Masc|Mood=Ind|Person=3|Voice=Pass_
Gender=Masc|Mood=Jus|Person=1|Voice=Act_
Gender=Masc|Mood=Jus|Person=3|Voice=Act_
Gender=Masc|Mood=Sub|Person=1|Voice=Act__
Gender=Masc|Mood=Sub|Person=2|Voice=Act_
Gender=Masc|Mood=Sub|Person=3|Voice=Act_
Gender=Masc|Person=3|Voice=Act_
Gender=Fem|Mood=Ind|Person=3|Voice=Act__
Gender=Fem|Person=2|Voice=Act_
Gender=Fem|Person=3|Voice=Act_

PART

75 PART tokens (1% of all PART tokens) have a non-empty value of Number.

The most frequent other feature values with which PART and Number co-occurred: Polarity=EMPTY (75; 100%).

PART tokens may have the following values of Number:

INTJ

3 INTJ tokens (5% of all INTJ tokens) have a non-empty value of Number.

INTJ tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (46096; 84%), NOUN –[nmod:poss]–> NOUN (44964; 82%), NOUN –[nmod]–> NOUN (33906; 84%), VERB –[nmod]–> NOUN (28638; 84%), VERB –[nsubj]–> NOUN (15740; 86%), VERB –[obj]–> NOUN (14122; 78%), PROPN –[flat:name]–> PROPN (13320; 95%), NOUN –[conj]–> NOUN (11834; 83%), NOUN –[nmod:poss]–> PRON (11112; 74%), VERB –[advmod]–> ADV (9948; 80%).