Gender
: gender
This document is a placeholder for the language-specific documentation
for Gender
.
Treebank Statistics (UD_Dutch)
This feature is universal.
It occurs with 4 different values: Com
, Fem
, Masc
, Neut
.
5049 tokens (2%) have a non-empty value of Gender
.
139 types (1%) occur at least once with a non-empty value of Gender
.
139 lemmas (1%) occur at least once with a non-empty value of Gender
.
The feature is used with 4 part-of-speech tags: DET (4323; 2% instances), X (613; 0% instances), PRON (61; 0% instances), ADP (52; 0% instances).
DET
4323 DET tokens (20% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Definite=Def (4319; 100%), PronType=Art (4319; 100%), Number=EMPTY (4286; 99%).
DET
tokens may have the following values of Gender
:
Com
(1; 0% of non-emptyGender
): denFem
(22; 1% of non-emptyGender
): la, As, Les, AS, Nostra, nuestraMasc
(15; 0% of non-emptyGender
): el, los, Els, tot, osNeut
(4285; 99% of non-emptyGender
): het, ‘tEMPTY
(17527): de, een, veel, meer, der, vele, meeste, weinig, minder, ‘n
Gender
seems to be lexical feature of DET
. 100% lemmas (18) occur only with one value of Gender
.
X
613 X tokens (13% of all X
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which X
and Gender
co-occurred: Definite=EMPTY (477; 78%), Degree=EMPTY (392; 64%), Number=Sing (365; 60%).
X
tokens may have the following values of Gender
:
Com
(18; 3% of non-emptyGender
): den, duur, op, aan, dag, treure, uitNeut
(595; 97% of non-emptyGender
): het, aan, van, voor, eerst, op, leven, gebied, om, kaderEMPTY
(4022): van, flo, op, ten, met, voor, een, onder, te, ‘s
Paradigm op | Neut | Com |
---|---|---|
Degree=Cmp | op | |
Number=Sing | op | op |
Gender
seems to be lexical feature of X
. 97% lemmas (110) occur only with one value of Gender
.
PRON
61 PRON tokens (0% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Person=EMPTY (61; 100%), Number=Sing (61; 100%), PronType=Tot (61; 100%), Case=EMPTY (61; 100%).
PRON
tokens may have the following values of Gender
:
Masc
(61; 100% of non-emptyGender
): totEMPTY
(17063): die, hij, ik, het, dat, zijn, wat, welke, zich, dit
ADP
52 ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADP
and Gender
co-occurred: AdpType=Preppron (52; 100%).
ADP
tokens may have the following values of Gender
:
Fem
(6; 12% of non-emptyGender
): da, DasMasc
(46; 88% of non-emptyGender
): als, al, del, doEMPTY
(23626): van, in, te, op, voor, met, aan, door, bij, naar
Gender
seems to be lexical feature of ADP
. 100% lemmas (10) occur only with one value of Gender
.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
X –[compound]–> X (317; 100%),
X –[mark]–> X (21; 100%),
X –[conj]–> X (1; 100%),
X –[cop]–> X (1; 100%),
ADP –[det]–> DET (1; 100%),
X –[nsubj]–> X (1; 100%).
Treebank Statistics (UD_Dutch-LassySmall)
This feature is universal.
It occurs with 2 different values: Com
, Neut
.
Some words have combined values of the feature; 1 combinations have been observed: Com|Neut
.
18356 tokens (19%) have a non-empty value of Gender
.
6251 types (41%) occur at least once with a non-empty value of Gender
.
5827 lemmas (45%) occur at least once with a non-empty value of Gender
.
The feature is used with 2 part-of-speech tags: NOUN (12372; 13% instances), PROPN (5984; 6% instances).
NOUN
12372 NOUN tokens (74% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (12372; 100%).
NOUN
tokens may have the following values of Gender
:
Com
(8522; 69% of non-emptyGender
): partij, stad, eeuw, naam, regering, koning, finale, gemeenschap, provincie, politieCom,Neut
(31; 0% of non-emptyGender
): keer, soort, mout, Salon, cement, katoen, natuursteen, tinNeut
(3819; 31% of non-emptyGender
): jaar, gewest, deel, aantal, werk, parlement, museum, begin, centrum, landEMPTY
(4269): jaren, verkiezingen, gemeenten, partijen, inwoners, leden, links, zetels, verhalen, provincies
Paradigm stad | Neut | Com |
---|---|---|
stadje | stad |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (3913) occur only with one value of Gender
.
PROPN
5984 PROPN tokens (49% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (5984; 100%).
PROPN
tokens may have the following values of Gender
:
Com
(2707; 45% of non-emptyGender
): Wiske, Suske, juni, oktober, Ensor, Vandersteen, Boudewijn, Kuifje, VLD, CVPCom,Neut
(43; 1% of non-emptyGender
): Spirit, Giroux, Vivant, Andras, Bouckaert, Brouckère, Dekeyser, Den, Flickr, MarineNeut
(3234; 54% of non-emptyGender
): België, Brussel, Vlaanderen, Antwerpen, Hasselt, Nederland, Limburg, Luik, Gent, Sint-NiklaasEMPTY
(6256): van, de, Jan, II, Nederlanden, Vlaams, Kim, I, Clijsters, der
Paradigm Vivant | Com,Neut | Neut | Com |
---|---|---|---|
Vivant | Vivant | Vivant |
Gender
seems to be lexical feature of PROPN
. 99% lemmas (1873) occur only with one value of Gender
.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
PROPN –[conj]–> PROPN (545; 83%),
NOUN –[conj]–> NOUN (478; 52%),
NOUN –[appos]–> NOUN (103; 52%),
PROPN –[mwe]–> PROPN (82; 51%),
NOUN –[advcl]–> NOUN (11; 61%),
PROPN –[dobj]–> PROPN (4; 80%),
PROPN –[nsubj]–> PROPN (3; 60%),
NOUN –[advcl]–> PROPN (3; 60%),
PROPN –[advmod]–> NOUN (1; 100%).
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]