
This page pertains to UD version 2.

Treebank Statistics: UD_Low_Saxon-LSDC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 4 combinations have been observed: Fem|Masc, Fem|Masc|Neut, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

6949 tokens (31%) have a non-empty value of Gender. 2222 types (46%) occur at least once with a non-empty value of Gender. 1705 lemmas (50%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (2813; 12% of all tokens), DET (2145; 9% of all tokens), PRON (1318; 6% of all tokens), ADJ (628; 3% of all tokens), PROPN (35; 0% of all tokens), NUM (10; 0% of all tokens).
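Counts like the ones above could be reproduced from the treebank's CoNLL-U files roughly as follows. This is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `iter_tokens`, `gender_by_upos`) are hypothetical, and the `SAMPLE` sentence is a toy annotation made up for illustration rather than a sentence taken from UD_Low_Saxon-LSDC.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (upos, feats_dict) for each syntactic word.

    Comment lines, blank lines, multiword-token ranges (IDs like 1-2)
    and empty nodes (IDs like 1.1) are skipped.
    """
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[3], parse_feats(cols[5])


def gender_by_upos(conllu_text):
    """Return (total token count, Counter of UPOS tags for tokens with Gender)."""
    total = 0
    with_gender = Counter()
    for upos, feats in iter_tokens(conllu_text):
        total += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender


# Toy sentence (illustrative annotation only, not from the treebank).
SAMPLE = "\n".join(
    "\t".join(row)
    for row in [
        ("1", "en", "en", "DET", "_", "Definite=Ind|Gender=Fem|Number=Sing|PronType=Art", "3", "det", "_", "_"),
        ("2", "olde", "old", "ADJ", "_", "Degree=Pos|Gender=Fem|Number=Sing", "3", "amod", "_", "_"),
        ("3", "tyd", "tyd", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"),
        ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
    ]
)

total, with_gender = gender_by_upos(SAMPLE)
for upos, count in with_gender.most_common():
    print(f"{upos}: {count} of {total} tokens")
```

Dividing each per-tag count by the total number of tokens, or by the number of tokens carrying that tag, gives the two kinds of percentages reported on this page.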

NOUN

2813 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2281; 81%).

NOUN tokens may have the following values of Gender:

Paradigm tyd (columns: Fem,Masc | Masc | Fem)
Case=Acc,Dat|Number=Sing: tyd, tydentydtyd
Case=Acc,Dat|Number=Plur: tyden, tyd
Case=Acc|Number=Sing: tydtyd
Case=Acc|Number=Plur: tyden
Case=Dat|Number=Sing: tyd
Case=Dat|Number=Plur: tyden
Case=Nom|Number=Sing: tydtyd

Gender seems to be a lexical feature of NOUN. 95% of lemmas (1319) occur with only one value of Gender.
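The "lexical feature" test above amounts to checking how many lemmas of a given part of speech are ever seen with more than one Gender value. A minimal sketch of that check follows. It assumes tokens are already available as `(lemma, upos, feats)` tuples and that a combined value such as `Fem,Masc` counts as one value of its own; both are assumptions, and the function name `single_gender_share` and the lemma `hus` in the toy data are hypothetical.

```python
from collections import defaultdict


def single_gender_share(tokens, upos):
    """Return (number, share) of lemmas with tag `upos` seen with exactly one Gender value.

    Only lemmas that occur at least once with a non-empty Gender are considered.
    """
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag != upos or feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "Gender":
                values[lemma].add(value)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Toy data (illustrative only): one lemma with two genders, one with a single gender.
toy_tokens = [
    ("tyd", "NOUN", "Gender=Fem|Number=Sing"),
    ("tyd", "NOUN", "Gender=Masc|Number=Sing"),
    ("hus", "NOUN", "Gender=Neut|Number=Sing"),
    ("hus", "NOUN", "Gender=Neut|Number=Plur"),
    ("old", "ADJ", "Degree=Pos|Gender=Fem|Number=Sing"),
]

count, share = single_gender_share(toy_tokens, "NOUN")
print(f"{count} NOUN lemmas ({share:.0%}) occur with only one value of Gender")
```

A share close to 100% suggests that gender is a property of the lemma rather than of the individual word form, which is the reading given for NOUN here.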

DET

2145 DET tokens (97% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (1853; 86%), Number=Sing (1834; 86%), PronType=Art (1690; 79%), Definite=Def (1345; 63%).

DET tokens may have the following values of Gender:

Paradigm en (columns: Fem,Masc | Fem,Neut | Masc | Masc,Neut | Fem | Neut)
Case=Acc,Dat|Definite=Def|Number=Sing|PronType=Art: den, enen
Case=Acc,Dat|Definite=Def|Number=Plur|PronType=Art: den
Case=Acc,Dat|Definite=Ind|Number=Sing|PronType=Art: enen, nenen, eyne, neen
Case=Acc,Dat|Definite=Ind|Number=Plur|PronType=Dem: 'ne
Case=Acc|Definite=Def|Number=Sing|PronType=Art: den, eynenenen
Case=Acc|Definite=Def|Number=Sing|PronType=Prs: eynen
Case=Acc|Definite=Ind|Number=Sing: en
Case=Acc|Definite=Ind|Number=Sing|PronType=Art: en, neenen, eynen, ne, nen, e, eyneen, ne, eyne, een, Eyn, e
Case=Acc|Definite=Ind|Number=Plur|PronType=Art: en, ne
Case=Acc|Number=Sing|PronType=Art: ne
Case=Acc|Number=Sing|PronType=Prs: en
Case=Dat|Definite=Def|Number=Sing|PronType=Art: enen
Case=Dat|Definite=Ind|Number=Sing|PronType=Art: eynemeynereynem
Case=Gen|Definite=Def|Number=Sing|PronType=Art: enen
Case=Nom|Definite=Def|Number=Sing|PronType=Art: neneen
Case=Nom|Definite=Ind|Number=Sing|PronType=Art: en, ne, eyn, nenenen, ne, eyneen, e, eyn
Case=Nom|Definite=Ind|Number=Sing|PronType=Prs: ne
Case=Nom|Number=Sing|PronType=Art: en
Definite=Ind|Number=Sing: en

PRON

1318 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1301; 99%), Case=Nom (875; 66%), Person=3 (781; 59%), PronType=Prs (776; 59%).

PRON tokens may have the following values of Gender:

Paradigm dee (columns: Fem,Masc | Masc | Fem | Neut)
Case=Acc,Dat|Number=Sing|PronType=Dem: den
Case=Acc,Dat|Number=Sing|PronType=Rel: dendee
Case=Acc|Number=Sing|PronType=Dem: Deeden, deandee
Case=Acc|Number=Sing|PronType=Rel: dendee
Case=Acc|Number=Plur|PronType=Rel: dee
Case=Dat|Number=Sing|PronType=Dem: dem, deane
Case=Nom|Number=Sing|Person=3|PronType=Dem: dee
Case=Nom|Number=Sing|Person=3|PronType=Rel: deedee
Case=Nom|Number=Sing|PronType=Dem: deedeedeedee
Case=Nom|Number=Sing|PronType=Prs: dee
Case=Nom|Number=Sing|PronType=Rel: deedee, dendee
Case=Nom|Number=Plur|PronType=Dem: dee
Case=Nom|Number=Plur|PronType=Rel: deedee

ADJ

628 ADJ tokens (49% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (543; 86%), Number=Sing (515; 82%).

ADJ tokens may have the following values of Gender:

Paradigm old (columns: Fem,Masc | Masc | Masc,Neut | Fem | Neut)
Case=Acc,Dat|Degree=Pos|Number=Sing: oldenolde
Case=Acc,Dat|Degree=Pos|Number=Plur: olden
Case=Acc|Degree=Pos|Number=Sing: olde, oldenoldeold
Case=Acc|Number=Sing: old
Case=Acc|Number=Plur: olden
Case=Dat|Degree=Pos|Number=Plur: oldenolden
Case=Dat|Number=Sing: olden
Case=Gen|Degree=Pos|Number=Sing: öldesten
Case=Nom|Degree=Pos|Number=Sing: oldeolde, old, ol, olden, olderolde, oldold, olde
Case=Nom|Degree=Pos|Number=Plur: oldeoldeolde, olden
Case=Nom|Number=Sing: oldeolde

PROPN

35 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (35; 100%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (32) occur with only one value of Gender.

NUM

10 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm eyn (columns: Masc | Fem | Neut)
Case=Acc,Dat|Number=Sing: eyneneyn
Case=Acc,Dat|NumType=Card: eyn
Case=Acc|NumType=Cardeynen: eynen
Case=Dat|Number=Sing: eyneeynen
Case=Nom|NumType=Card: eyn

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1971; 97%), NOUN –[amod]–> ADJ (532; 93%), ADJ –[det]–> DET (52; 96%), PRON –[det]–> DET (16; 89%), NOUN –[det:poss]–> DET (14; 64%), ADJ –[conj]–> ADJ (9; 90%), NOUN –[flat]–> NOUN (8; 89%), NOUN –[orphan]–> NOUN (6; 60%), PRON –[conj]–> PRON (4; 67%), PRON –[nmod]–> PRON (4; 80%).
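Agreement figures of this kind could be computed along the following lines. The sketch assumes that a relation is counted only when both the parent and the child carry a Gender value, and that the percentage is the share of those pairs whose values are identical; that reading of the statistic is an assumption, as are the names `gender_of` and `agreement_by_relation` and the toy sentence.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_by_relation(sentence):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    `sentence` is a list of (id, upos, feats, head, deprel) tuples for one tree.
    Returns two Counters: pairs that agree, and pairs where both nodes have Gender.
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _tid, upos, feats, head, deprel in sentence:
        parent = by_id.get(head)
        if parent is None:  # the root (head 0) has no parent node
            continue
        child_gender = gender_of(feats)
        parent_gender = gender_of(parent[2])
        if child_gender is None or parent_gender is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, both


# Toy tree (illustrative annotation only): the determiner agrees, the adjective does not.
toy_sentence = [
    (1, "DET", "Definite=Ind|Gender=Fem|Number=Sing|PronType=Art", 3, "det"),
    (2, "ADJ", "Degree=Pos|Gender=Masc|Number=Sing", 3, "amod"),
    (3, "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    (4, "PUNCT", "_", 3, "punct"),
]

agree, both = agreement_by_relation(toy_sentence)
for key, total in both.items():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {total} agree")
```

Summing the two counters over all sentences of the treebank and sorting by the agreeing count would give a ranking comparable to the list above.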