Treebank Statistics: UD_Low_Saxon-LSDC: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
Some words have combined values of the feature; 4 combinations have been observed: Fem|Masc, Fem|Masc|Neut, Fem|Neut, Masc|Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
6951 tokens (31%) have a non-empty value of Gender.
2222 types (46%) occur at least once with a non-empty value of Gender.
1627 lemmas (50%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (2816; 12% instances), DET (2152; 10% instances), PRON (1314; 6% instances), ADJ (630; 3% instances), PROPN (29; 0% instances), NUM (10; 0% instances).
NOUN
2816 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2284; 81%).
NOUN tokens may have the following values of Gender:
Fem(754; 27% of non-emptyGender): vrouwe, tyd, stad, döäre, syde, hand, werld, nacht, aerde, dochterFem,Masc(23; 1% of non-emptyGender): tyd, heyrskop, nachts, tyden, Ploege, gek, gråten, kansen, last, leavtydenFem,Masc,Neut(2; 0% of non-emptyGender): andächtige, gardynenFem,Neut(1; 0% of non-emptyGender): jakkeMasc(1250; 44% of non-emptyGender): dag, man, god, her, buur, åvend, doud, junge, möller, kearlMasc,Neut(16; 1% of non-emptyGender): menske, minske, minsken, bast, hokuspokus, lyv, mensken, minsker, noorden, vlasNeut(770; 27% of non-emptyGender): lüde, huus, kinder, mål, geld, ougen, woord, ende, jår, leavenEMPTY(41): pår, lüde, Auguste, Belsebul, Düveken, Hilleken, Holtfräter, Kopernikus, Sikkebård, Smalbek
| Paradigm tyd | Fem,Masc | Masc | Fem |
|---|---|---|---|
| Case=Acc,Dat|Number=Sing | tyd, tyden | tyd | tyd |
| Case=Acc,Dat|Number=Plur | tyden, tyd | ||
| Case=Acc|Number=Sing | tyd | tyd | |
| Case=Acc|Number=Plur | tyden | ||
| Case=Dat|Number=Sing | tyd | ||
| Case=Dat|Number=Plur | tyden | ||
| Case=Nom|Number=Sing | tyd | tyd |
Gender seems to be lexical feature of NOUN. 95% lemmas (1264) occur only with one value of Gender.
DET
2152 DET tokens (97% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (1860; 86%), Number=Sing (1839; 85%), PronType=Art (1680; 78%), Definite=Def (1343; 62%).
DET tokens may have the following values of Gender:
Fem(587; 27% of non-emptyGender): de, der, en, ne, dee, syne, eyne, myn, syn, düsseFem,Masc(23; 1% of non-emptyGender): de, dee, en, Eynes, des, dissen, gyn, neFem,Masc,Neut(2; 0% of non-emptyGender): Myne, synFem,Neut(1; 0% of non-emptyGender): enMasc(974; 45% of non-emptyGender): de, den, en, dem, dee, synen, myn, eynen, syne, synMasc,Neut(10; 0% of non-emptyGender): keyn, de, Alle, al, en, neyn, synenNeut(555; 26% of non-emptyGender): dat, en, et, de, syn, myn, det, dem, ‘n, denEMPTY(69): de, en, al, alle, syn, den, myn, eare, Syne, dee
| Paradigm en | Fem,Masc | Fem,Neut | Masc | Masc,Neut | Fem | Neut |
|---|---|---|---|---|---|---|
| Case=Acc,Dat|Definite=Def|Number=Sing|PronType=Art | den, en | en | ||||
| Case=Acc,Dat|Definite=Def|Number=Plur|PronType=Art | den | |||||
| Case=Acc,Dat|Definite=Ind|Number=Sing|PronType=Art | en | en, nen | en, eyne, ne | en | ||
| Case=Acc,Dat|Definite=Ind|Number=Plur|PronType=Dem | 'ne | |||||
| Case=Acc|Definite=Def|Number=Sing|PronType=Art | den, eynen | en | en | |||
| Case=Acc|Definite=Def|Number=Sing|PronType=Prs | eynen | |||||
| Case=Acc|Definite=Ind|Number=Sing | en | |||||
| Case=Acc|Definite=Ind|Number=Sing|PronType=Art | en, ne | en | en, eynen, ne, nen, e, eyne | en, ne, eyne, e | en, Eyn, e | |
| Case=Acc|Definite=Ind|Number=Plur|PronType=Art | en, ne | |||||
| Case=Acc|Number=Sing|PronType=Art | en | ne | ||||
| Case=Acc|Number=Sing|PronType=Prs | en | |||||
| Case=Dat|Definite=Def|Number=Sing|PronType=Art | en | en | ||||
| Case=Dat|Definite=Ind|Number=Sing|PronType=Art | eynem | eyner | eynem | |||
| Case=Gen|Definite=Def|Number=Sing|PronType=Art | en | en | ||||
| Case=Nom|Definite=Def|Number=Sing|PronType=Art | ne | ne | en | |||
| Case=Nom|Definite=Ind|Number=Sing|PronType=Art | en, ne, eyn, nen | en | en, ne, eyne | en, eyn, e | ||
| Case=Nom|Definite=Ind|Number=Sing|PronType=Prs | ne | |||||
| Case=Nom|Number=Sing|PronType=Art | en | |||||
| Definite=Ind|Number=Sing | en |
PRON
1314 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1298; 99%), Case=Nom (872; 66%), Person=3 (781; 59%), PronType=Prs (776; 59%).
PRON tokens may have the following values of Gender:
Fem(102; 8% of non-emptyGender): see, dee, ear, höär, haar, andere, diaser, mynde, änderFem,Masc(37; 3% of non-emptyGender): dee, eyne, Geyn, ander, andere, eyn, wekMasc(590; 45% of non-emptyGender): hee, dee, em, den, man, hum, en, üm, iame, eynerMasc,Neut(1; 0% of non-emptyGender): geynNeut(584; 44% of non-emptyGender): et, dat, wat, niks, det, alles, allens, dee, dit, nistEMPTY(1279): ik, see, my, sik, wy, y, du, jy, dee, dy
| Paradigm dee | Fem,Masc | Masc | Fem | Neut |
|---|---|---|---|---|
| Case=Acc,Dat|Number=Sing|PronType=Dem | den | |||
| Case=Acc,Dat|Number=Sing|PronType=Rel | den | dee | ||
| Case=Acc|Number=Sing|PronType=Dem | Dee | den, dean | Dee | dee |
| Case=Acc|Number=Sing|PronType=Rel | den | dee | ||
| Case=Acc|Number=Plur|PronType=Rel | dee | |||
| Case=Dat|Number=Sing|PronType=Dem | dem, deane | |||
| Case=Dat|Number=Sing|PronType=Rel | dem | |||
| Case=Nom|Number=Sing|Person=3|PronType=Dem | dee | |||
| Case=Nom|Number=Sing|Person=3|PronType=Rel | dee | dee | ||
| Case=Nom|Number=Sing|PronType=Dem | dee | dee | dee | dee |
| Case=Nom|Number=Sing|PronType=Prs | dee | |||
| Case=Nom|Number=Sing|PronType=Rel | dee | dee, den | dee | |
| Case=Nom|Number=Plur|PronType=Dem | dee | |||
| Case=Nom|Number=Plur|PronType=Rel | dee | dee |
ADJ
630 ADJ tokens (48% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (545; 87%), Number=Sing (518; 82%).
ADJ tokens may have the following values of Gender:
Fem(162; 26% of non-emptyGender): olde, ganse, groute, ander, gode, grout, houge, lest, olden, andereFem,Masc(10; 2% of non-emptyGender): olde, Golden, heyle, lest, olden, smalle, vorweerden, weinigFem,Neut(1; 0% of non-emptyGender): lakenskMasc(262; 42% of non-emptyGender): goden, grouten, olde, olden, anderen, gode, groute, andere, beiden, eyrsteMasc,Neut(3; 0% of non-emptyGender): krütslike, olde, vorstandigNeut(192; 30% of non-emptyGender): eyrste, old, ander, andere, anders, gode, grout, heyle, leste, leveEMPTY(684): eyrst, good, gans, gerade, recht, meyr, richtig, vul, doud, gelyk
| Paradigm old | Fem,Masc | Masc | Masc,Neut | Fem | Neut |
|---|---|---|---|---|---|
| Case=Acc,Dat|Degree=Pos|Number=Sing | olden | olde | |||
| Case=Acc,Dat|Degree=Pos|Number=Plur | olden | ||||
| Case=Acc|Degree=Pos|Number=Sing | olde, olden | olde | old | ||
| Case=Acc|Number=Sing | old | ||||
| Case=Acc|Number=Plur | olden | ||||
| Case=Dat|Degree=Pos|Number=Plur | olden | olden | |||
| Case=Dat|Number=Sing | olden | ||||
| Case=Gen|Degree=Pos|Number=Sing | öldesten | ||||
| Case=Nom|Degree=Pos|Number=Sing | olde | olde, old, ol, olden, older | olde, old | old, olde | |
| Case=Nom|Degree=Pos|Number=Plur | olde | olde | olde, olden | ||
| Case=Nom|Number=Sing | olde | olde |
PROPN
29 PROPN tokens (6% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (29; 100%).
PROPN tokens may have the following values of Gender:
Fem(10; 34% of non-emptyGender): Hente, CDU, Havel, Luoden-heide, Marigge, Nicolaikarke, Slaumayerske, St., TrinaFem,Masc(1; 3% of non-emptyGender): StrüwingkenMasc(16; 55% of non-emptyGender): Hiärmen, Andrees, Bennad, Claus, Friedrich, Gravenes, Harms, Hein, Henrick, KrisjaonNeut(2; 7% of non-emptyGender): Eykertyn, Grote-OogEMPTY(437): Pölz, Anna, Hiärmen, Koch, Andries, Gassen, Jesus, Willem, Annegyn, Diekes
Gender seems to be lexical feature of PROPN. 100% lemmas (26) occur only with one value of Gender.
NUM
10 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem(1; 10% of non-emptyGender): eyneMasc(3; 30% of non-emptyGender): eynen, veerNeut(6; 60% of non-emptyGender): eyn, eynen, tweyEMPTY(92): twey, dree, veer, eyn, tein, 14, acht, dusend, pår, sös
| Paradigm eyn | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc,Dat|Number=Sing | eynen | eyn | |
| Case=Acc,Dat|NumType=Card | eyn | ||
| Case=Acc|NumType=Card | eynen | ||
| Case=Dat|Number=Sing | eyne | eynen | |
| Case=Nom|NumType=Card | eyn |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (1978; 97%),
NOUN –[amod]–> ADJ (532; 94%),
ADJ –[det]–> DET (54; 96%),
PRON –[det]–> DET (18; 86%),
NOUN –[det:poss]–> DET (14; 64%),
ADJ –[conj]–> ADJ (9; 90%),
NOUN –[flat]–> NOUN (8; 89%),
NOUN –[orphan]–> NOUN (6; 60%),
NOUN –[det]–> PRON (4; 67%),
PRON –[conj]–> PRON (4; 67%).