home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: Features: Gender

This feature is universal. It occurs with 2 different values: Com, Neut. Some words have combined values of the feature; 1 combinations have been observed: Com|Neut.

51171 tokens (17%) have a non-empty value of Gender. 12867 types (40%) occur at least once with a non-empty value of Gender. 12016 lemmas (46%) occur at least once with a non-empty value of Gender. The feature is used with 2 part-of-speech tags: NOUN (36679; 12% instances), PROPN (14492; 5% instances).

NOUN

36679 NOUN tokens (74% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (36679; 100%).

NOUN tokens may have the following values of Gender:

Paradigm soortCom,NeutNeutCom
soortsoortsoort

Gender seems to be lexical feature of NOUN. 99% lemmas (8344) occur only with one value of Gender.

PROPN

14492 PROPN tokens (48% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14492; 100%).

PROPN tokens may have the following values of Gender:

Paradigm BelgiëNeutCom
ExtPos=PROPNBelgië
België, BELGIËBelgië

Gender seems to be lexical feature of PROPN. 99% lemmas (3581) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[conj]–> NOUN (1223; 52%), PROPN –[conj]–> PROPN (1172; 80%), NOUN –[appos]–> NOUN (234; 57%), PROPN –[flat]–> PROPN (89; 67%), NOUN –[fixed]–> NOUN (34; 53%), NOUN –[advcl]–> NOUN (10; 53%), NOUN –[case]–> NOUN (7; 70%), NOUN –[csubj]–> NOUN (4; 57%), PROPN –[nsubj]–> PROPN (3; 75%), NOUN –[amod]–> PROPN (1; 100%).