Treebank Statistics: UD_Dutch-LassySmall: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
Some words have combined values of the feature; 1 combination has been observed: Com|Neut.
17757 tokens (18%) have a non-empty value of Gender.
6154 types (40%) occur at least once with a non-empty value of Gender.
5734 lemmas (44%) occur at least once with a non-empty value of Gender.
The feature is used with 2 part-of-speech tags: NOUN (11819; 12% of all tokens), PROPN (5938; 6% of all tokens).
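Per-tag counts like the ones above can be derived from the FEATS column of a CoNLL-U file. The following is a minimal sketch, not the script that generated this page; the function names `parse_feats` and `gender_counts_by_upos` and the four sample rows are hypothetical and only illustrate the idea.

```python
from collections import Counter

# Hand-made CoNLL-U rows for illustration (ID, FORM, LEMMA, UPOS, XPOS, FEATS,
# HEAD, DEPREL, DEPS, MISC); they are not copied from the treebank files.
ROWS = [
    ("1", "De", "de", "DET", "_", "Definite=Def", "2", "det", "_", "_"),
    ("2", "stad", "stad", "NOUN", "_", "Gender=Com|Number=Sing", "0", "root", "_", "_"),
    ("3", "in", "in", "ADP", "_", "_", "4", "case", "_", "_"),
    ("4", "België", "België", "PROPN", "_", "Gender=Neut|Number=Sing", "2", "nmod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Com|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Count tokens with a non-empty Gender value, grouped by UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword token ranges and empty nodes
        if "Gender" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts


counts = gender_counts_by_upos(SAMPLE)
print(dict(counts))  # {'NOUN': 1, 'PROPN': 1}
```

Run over the full treebank, the same loop would give the raw counts per tag; the percentages additionally need the total number of tokens.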
NOUN
11819 NOUN tokens (74% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (11819; 100%).
NOUN tokens may have the following values of Gender:
Com (8190; 69% of non-empty Gender): partij, stad, eeuw, naam, koning, regering, finale, provincie, politie, reeks
Com,Neut (15; 0% of non-empty Gender): soort, mout, boord, katoen, krijtpoeder, sorghum, tin, wort
Neut (3614; 31% of non-empty Gender): jaar, deel, aantal, werk, begin, land, bier, gewest, centrum, gebied
EMPTY (4138): jaren, verkiezingen, gemeenten, partijen, inwoners, leden, links, zetels, verhalen, provincies
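The "% of non-empty Gender" figures follow from the counts listed for NOUN. A short sketch of the arithmetic, assuming the page rounds to the nearest whole percent (the variable names are illustrative):

```python
# Counts copied from the NOUN value list; EMPTY is excluded because the
# percentages are relative to tokens with a non-empty Gender.
noun_gender_counts = {"Com": 8190, "Com,Neut": 15, "Neut": 3614}

total = sum(noun_gender_counts.values())
shares = {
    value: round(100 * count / total)
    for value, count in noun_gender_counts.items()
}

print(total)   # 11819
print(shares)  # {'Com': 69, 'Com,Neut': 0, 'Neut': 31}
```

The 15 Com,Neut tokens amount to about 0.13% of the total, which is why they appear as 0%.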
| Paradigm wort | Com,Neut | Neut | Com |
|---|---|---|---|
| | wort | wort | wort |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (3846) occur with only one value of Gender.
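The "lexical feature" judgement rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows, assuming that a combined value such as Com,Neut counts as its own value (the page does not say how it is treated); `single_value_share` is a hypothetical helper, and the observations are illustrative rather than treebank counts.

```python
from collections import defaultdict


def single_value_share(lemma_gender_pairs):
    """Return (lemmas seen with exactly one Gender value, their share of all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in lemma_gender_pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Illustrative (lemma, Gender) observations: one pair per token.
pairs = [
    ("stad", "Com"), ("stad", "Com"),
    ("jaar", "Neut"),
    ("partij", "Com"),
    ("wort", "Com"), ("wort", "Neut"), ("wort", "Com,Neut"),
]

single, share = single_value_share(pairs)
print(single, share)  # 3 0.75
```

Here only wort varies, so three of the four lemmas count as having a fixed gender.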
PROPN
5938 PROPN tokens (43% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (5938; 100%).
PROPN tokens may have the following values of Gender:
Com (2703; 46% of non-empty Gender): Wiske, Suske, juni, oktober, Ensor, Vandersteen, Kuifje, VLD, CVP, D66
Com,Neut (80; 1% of non-empty Gender): Spirit, Vivant, Parijs-Roubaix, Euronext, SPIRIT, Dexia, Fortis, Giroux, Mobistar, Prego
Neut (3155; 53% of non-empty Gender): België, Brussel, Antwerpen, Vlaanderen, Hasselt, Nederland, Bel, Limburg, Luik, Gent
EMPTY (7854): van, de, Vlaams, Gewest, Jan, Gemeenschap, der, II, Wereldoorlog, Nederlanden
| Paradigm Vandersteen | Com,Neut | Com |
|---|---|---|
| | Vandersteen | Vandersteen |
Gender seems to be a lexical feature of PROPN. 99% of lemmas (1862) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PROPN –[conj]–> PROPN (559; 81%),
NOUN –[conj]–> NOUN (475; 52%),
PROPN –[flat]–> PROPN (73; 74%),
NOUN –[advcl]–> NOUN (9; 75%),
NOUN –[orphan]–> PROPN (2; 100%),
PROPN –[nsubj]–> PROPN (2; 67%),
NOUN –[amod]–> PROPN (1; 100%),
NOUN –[case]–> NOUN (1; 100%),
NOUN –[obj]–> NOUN (1; 100%),
NOUN –[obj]–> PROPN (1; 100%).
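Agreement counts of this kind can be computed by comparing each child with its parent along the dependency edge. The sketch below assumes that the percentage is taken over parent-child pairs in which both nodes have a non-empty Gender; the page does not state its denominator, so treat that as an assumption. The token dictionaries and the function `agreement_by_relation` are hypothetical.

```python
from collections import Counter


def agreement_by_relation(tokens):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, share of pairs).

    Only pairs where both parent and child carry a Gender value are counted.
    """
    by_id = {tok["id"]: tok for tok in tokens}
    agree, total = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


# Illustrative sentence "Suske en Wiske": both names are Com.
sentence = [
    {"id": 1, "head": 0, "upos": "PROPN", "deprel": "root", "gender": "Com"},
    {"id": 2, "head": 3, "upos": "CCONJ", "deprel": "cc", "gender": None},
    {"id": 3, "head": 1, "upos": "PROPN", "deprel": "conj", "gender": "Com"},
]

print(agreement_by_relation(sentence))
# {('PROPN', 'conj', 'PROPN'): (1, 1.0)}
```

Each key corresponds to one line of the list above, read as parent –[relation]–> child.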