Treebank Statistics: UD_Dutch-LassySmall: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
Some words have combined values of the feature; 1 combinations have been observed: Com|Neut.
51192 tokens (17%) have a non-empty value of Gender.
12868 types (40%) occur at least once with a non-empty value of Gender.
12018 lemmas (46%) occur at least once with a non-empty value of Gender.
The feature is used with 2 part-of-speech tags: NOUN (36690; 12% instances), PROPN (14502; 5% instances).
NOUN
36690 NOUN tokens (74% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (36690; 100%).
NOUN tokens may have the following values of Gender:
Com(25003; 68% of non-emptyGender): oorlog, tijd, eeuw, stad, partij, koning, naam, plaats, film, regeringCom,Neut(81; 0% of non-emptyGender): soort, boord, opzet, katoen, mout, sorghum, proviand, aspirinetablet, falsetto, krijtpoederNeut(11606; 32% of non-emptyGender): jaar, deel, aantal, land, leger, begin, gebied, album, werk, eindEMPTY(12779): jaren, mensen, landen, troepen, partijen, leden, verkiezingen, inwoners, tanks, gemeenten
| Paradigm soort | Com,Neut | Neut | Com |
|---|---|---|---|
| soort | soort | soort |
Gender seems to be lexical feature of NOUN. 99% lemmas (8346) occur only with one value of Gender.
PROPN
14502 PROPN tokens (48% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14502; 100%).
PROPN tokens may have the following values of Gender:
Com(7098; 49% of non-emptyGender): juni, september, mei, oktober, Ron, augustus, maart, Prince, november, juliCom,Neut(117; 1% of non-emptyGender): Spirit, Vivant, Parijs-Roubaix, IGE, Osram, SEM, Euronext, SPIRIT, Barbarossa, DexiaNeut(7287; 50% of non-emptyGender): België, Brussel, Duitsland, Frankrijk, Vlaanderen, Antwerpen, Nederland, Europa, Israël, WolderEMPTY(15834): van, de, Wereldoorlog, II, Verenigde, staten, Tweede, Vlaams, Duitsers, Eerste
| Paradigm België | Neut | Com |
|---|---|---|
| ExtPos=PROPN | België | |
| België, BELGIË | België |
Gender seems to be lexical feature of PROPN. 99% lemmas (3581) occur only with one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[conj]–> NOUN (1219; 53%),
PROPN –[conj]–> PROPN (1170; 80%),
NOUN –[appos]–> NOUN (235; 58%),
PROPN –[flat]–> PROPN (89; 70%),
NOUN –[fixed]–> NOUN (38; 54%),
NOUN –[case]–> NOUN (7; 70%),
NOUN –[csubj]–> NOUN (4; 57%),
PROPN –[nsubj]–> PROPN (3; 75%),
NOUN –[amod]–> PROPN (1; 100%),
NOUN –[obj]–> NOUN (1; 100%).