home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kadiweu-Unicamp: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

This is a layered feature with the following layers: Gender, Gender[obj].

148 tokens (47%) have a non-empty value of Gender. 66 types (70%) occur at least once with a non-empty value of Gender. 49 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (102; 32% instances), DET (31; 10% instances), PRON (11; 3% instances), VERB (3; 1% instances), PROPN (1; 0% instances).

NOUN

102 NOUN tokens (92% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (95; 93%), Number[psor]=EMPTY (91; 89%), Degree=EMPTY (88; 86%).

NOUN tokens may have the following values of Gender:

Paradigm binieMascFem
Number=Singlibinienigilibiniena
Number=Plurlibinienigipi

Gender seems to be lexical feature of NOUN. 97% lemmas (34) occur only with one value of Gender.

DET

31 DET tokens (94% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (31; 100%), PronType=Dem (31; 100%).

DET tokens may have the following values of Gender:

Paradigm ijoMascFem
ijoajo

PRON

11 PRON tokens (61% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (10; 91%), PronType=Dem (10; 91%).

PRON tokens may have the following values of Gender:

VERB

3 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Gender[obj]=EMPTY (3; 100%), Mood=EMPTY (3; 100%), Person=3 (3; 100%), Person[erg]=EMPTY (3; 100%), Person[obj]=EMPTY (3; 100%), VerbForm=EMPTY (3; 100%), Voice=EMPTY (3; 100%).

VERB tokens may have the following values of Gender:

PROPN

1 PROPN tokens (25% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (30; 94%), NOUN –[nsubj]–> NOUN (12; 75%), NOUN –[nmod:poss]–> NOUN (6; 60%).