home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

477701 tokens (65%) have a non-empty value of Gender. 1 types (0) occur at least once with a non-empty value of Gender. 4839 lemmas (96%) occur at least once with a non-empty value of Gender. The feature is used with 16 part-of-speech tags: NOUN (217040; 29% instances), ADJ (67102; 9% instances), VERB (54927; 7% instances), PROPN (54782; 7% instances), PRON (31064; 4% instances), ADV (24659; 3% instances), SCONJ (11439; 2% instances), DET (6040; 1% instances), AUX (4442; 1% instances), NUM (3454; 0% instances), ADP (926; 0% instances), PUNCT (712; 0% instances), CCONJ (562; 0% instances), X (474; 0% instances), PART (75; 0% instances), INTJ (3; 0% instances).

NOUN

217040 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (192855; 89%), Case=Gen (142071; 65%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 93% lemmas (39) occur only with one value of Gender.

ADJ

67102 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (64167; 96%), Definite=Def (45521; 68%), Case=Gen (40502; 60%).

ADJ tokens may have the following values of Gender:

VERB

54927 VERB tokens (99% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (51358; 94%), Voice=Act (50838; 93%), Mood=Ind (49568; 90%), Number=Sing (49350; 90%), Aspect=Perf (28875; 53%).

VERB tokens may have the following values of Gender:

PROPN

54782 PROPN tokens (94% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (54081; 99%), Case=EMPTY (43287; 79%), Definite=Ind (40714; 74%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (4762) occur only with one value of Gender.

PRON

31064 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (30458; 98%), Definite=Def (28709; 92%), Person=3 (27619; 89%), Number=Sing (25445; 82%), Case=Gen (16343; 53%).

PRON tokens may have the following values of Gender:

ADV

24659 ADV tokens (93% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Number=Sing (24448; 99%), Case=Acc (18316; 74%), Definite=Com (15629; 63%).

ADV tokens may have the following values of Gender:

SCONJ

11439 SCONJ tokens (44% of all SCONJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which SCONJ and Gender co-occurred: Number=Sing (10521; 92%), Definite=Ind (10387; 91%).

SCONJ tokens may have the following values of Gender:

DET

6040 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=Ind (6005; 99%), Number=Sing (5854; 97%).

DET tokens may have the following values of Gender:

AUX

4442 AUX tokens (58% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=3 (4054; 91%), Number=Sing (4046; 91%), Voice=Act (4004; 90%), Mood=Ind (3284; 74%).

AUX tokens may have the following values of Gender:

Gender seems to be lexical feature of AUX. 91% lemmas (10) occur only with one value of Gender.

NUM

3454 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (3330; 96%), Number=Sing (3057; 89%), Definite=Com (2317; 67%), Case=Gen (2039; 59%).

NUM tokens may have the following values of Gender:

ADP

926 ADP tokens (1% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Prep (926; 100%).

ADP tokens may have the following values of Gender:

Gender seems to be lexical feature of ADP. 94% lemmas (30) occur only with one value of Gender.

PUNCT

712 PUNCT tokens (1% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

CCONJ

562 CCONJ tokens (1% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

Gender seems to be lexical feature of CCONJ. 97% lemmas (28) occur only with one value of Gender.

X

474 X tokens (52% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (416; 88%), Mood=EMPTY (284; 60%), Voice=EMPTY (275; 58%), Person=EMPTY (274; 58%).

X tokens may have the following values of Gender:

Paradigm NoneMascFem
Case=Acc|Definite=Com|Number=Sing_
Case=Acc|Definite=Com|Number=Dual_
Case=Acc|Definite=Com|Number=Plur_
Case=Acc|Definite=Def|Number=Sing_
Case=Acc|Definite=Def|Number=Dual_
Case=Acc|Definite=Ind|Number=Sing_
Case=Acc|Definite=Ind|Number=Dual__
Case=Gen|Definite=Com|Number=Sing_
Case=Nom|Definite=Def|Number=Dual_
Case=Nom|Definite=Ind|Number=Plur_
Definite=Com|Number=Sing_
Definite=Def|Number=Sing__
Definite=Ind|Number=Sing__
Mood=Ind|Number=Sing|Person=1|Voice=Act_
Mood=Ind|Number=Sing|Person=3|Voice=Act__
Mood=Ind|Number=Sing|Person=3|Voice=Pass_
Mood=Ind|Number=Dual|Person=3|Voice=Act_
Mood=Ind|Number=Plur|Person=1|Voice=Act_
Mood=Ind|Number=Plur|Person=2|Voice=Act_
Mood=Ind|Number=Plur|Person=3|Voice=Act__
Mood=Jus|Number=Sing|Person=3|Voice=Act_
Mood=Jus|Number=Plur|Person=1|Voice=Act_
Mood=Sub|Number=Sing|Person=1|Voice=Act_
Mood=Sub|Number=Sing|Person=2|Voice=Act_
Mood=Sub|Number=Sing|Person=3|Voice=Act_
Mood=Sub|Number=Plur|Person=1|Voice=Act_
Number=Sing|Person=2|Voice=Act_
Number=Dual|Person=3|Voice=Act_
Number=Plur|Person=3|Voice=Act_

PART

75 PART tokens (1% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (75; 100%).

PART tokens may have the following values of Gender:

INTJ

3 INTJ tokens (5% of all INTJ tokens) have a non-empty value of Gender.

INTJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (44638; 81%), NOUN –[nmod:poss]–> NOUN (30688; 56%), NOUN –[nmod]–> NOUN (23733; 59%), VERB –[nmod]–> NOUN (19089; 56%), VERB –[nsubj]–> NOUN (16017; 88%), PROPN –[flat:name]–> PROPN (12836; 92%), VERB –[obj]–> NOUN (9967; 55%), NOUN –[conj]–> NOUN (9293; 65%), NOUN –[nmod:poss]–> PRON (8542; 57%), ADV –[nmod:poss]–> NOUN (8039; 70%).