Treebank Statistics: UD_Danish-DDT: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
29231 tokens (29%) have a non-empty value of Gender.
10157 types (57%) occur at least once with a non-empty value of Gender.
7063 lemmas (53%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (18611; 18% instances), PRON (4802; 5% instances), DET (4203; 4% instances), ADJ (1605; 2% instances), NUM (8; 0% instances), VERB (2; 0% instances).
NOUN
18611 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Definite=Ind (13455; 72%), Number=Sing (13404; 72%).
NOUN tokens may have the following values of Gender:
Com(12925; 69% of non-emptyGender): kr., gang, dag, tid, del, mand, måde, verden, dage, gangeNeut(5686; 31% of non-emptyGender): år, folk, går, par, børn, mennesker, stedet, fald, arbejde, stedEMPTY(115): lov, Jordens, forvejen, vest, øst, Jorden, fjor, mahogni, slut, Nord
| Paradigm dag | Neut | Com |
|---|---|---|
| Case=Gen|Definite=Def|Number=Sing | dagens | |
| Case=Gen|Definite=Ind|Number=Sing | dags | dags |
| Case=Gen|Definite=Ind|Number=Plur | dages | |
| Definite=Def|Number=Sing | dagen | |
| Definite=Ind|Number=Sing | dag | |
| Definite=Ind|Number=Plur | dage |
Gender seems to be lexical feature of NOUN. 100% lemmas (6540) occur only with one value of Gender.
PRON
4802 PRON tokens (67% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: PartType=EMPTY (4802; 100%), PronType=Prs (3965; 83%), Number=Sing (3868; 81%), Person=3 (2488; 52%), Case=Nom (2455; 51%).
PRON tokens may have the following values of Gender:
Com(3305; 69% of non-emptyGender): han, jeg, vi, man, hun, den, du, ham, mig, osNeut(1497; 31% of non-emptyGender): det, noget, andet, dette, et, hvilket, hvert, intet, a., détEMPTY(2400): der, de, sig, som, hvad, selv, dem, andre, hinanden, nogle
| Paradigm den | Neut | Com |
|---|---|---|
| Case=Acc|Person=3|PronType=Prs | den | |
| PronType=Dem | det | den |
DET
4203 DET tokens (76% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4203; 100%), Number[psor]=EMPTY (3822; 91%), Person=EMPTY (3822; 91%), Poss=EMPTY (3822; 91%), PronType=Ind (2431; 58%).
DET tokens may have the following values of Gender:
Com(2843; 68% of non-emptyGender): en, den, sin, denne, min, ingen, anden, nogen, én, dinNeut(1360; 32% of non-emptyGender): et, det, sit, noget, mit, dette, andet, intet, vort, ethvertEMPTY(1301): de, deres, hans, andre, nogle, hendes, sine, vores, disse, vore
| Paradigm en | Neut | Com |
|---|---|---|
| Case=Gen | ens | |
| et, ét | en, én, een |
ADJ
1605 ADJ tokens (24% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (1605; 100%), Number=Sing (1605; 100%), Definite=Ind (1489; 93%).
ADJ tokens may have the following values of Gender:
Com(1020; 64% of non-emptyGender): stor, ny, klar, lang, god, egen, sådan, al, almindelig, friNeut(585; 36% of non-emptyGender): alt, stort, godt, nyt, svært, muligt, eget, klart, vigtigt, halvtEMPTY(4954): alle, mange, danske, store, samme, flere, hele, første, nye, sidste
| Paradigm stor | Neut | Com |
|---|---|---|
| stort | stor |
NUM
8 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (8; 100%).
NUM tokens may have the following values of Gender:
Com(6; 75% of non-emptyGender): halv, en, halvanden, énNeut(2; 25% of non-emptyGender): halvtEMPTY(1497): to, tre, fire, 20, fem, seks, 10, otte, 100, 1
| Paradigm halv | Neut | Com |
|---|---|---|
| halvt | halv |
VERB
2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Definite=Def (2; 100%), Mood=EMPTY (2; 100%), Number=Sing (2; 100%), Tense=Past (2; 100%), VerbForm=Part (2; 100%), Voice=EMPTY (2; 100%).
VERB tokens may have the following values of Gender:
Com(2; 100% of non-emptyGender): foretrukne, udskårneEMPTY(10896): er, har, siger, var, få, får, fik, sagde, bliver, kommer
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3743; 75%),
NOUN –[nmod]–> NOUN (1852; 57%),
NOUN –[conj]–> NOUN (628; 64%),
NOUN –[nmod:poss]–> NOUN (340; 55%),
ADJ –[nsubj]–> PRON (177; 57%),
NOUN –[nmod]–> PRON (93; 52%),
NOUN –[nsubj]–> NOUN (91; 57%),
PRON –[nmod]–> NOUN (78; 58%),
NOUN –[appos]–> NOUN (33; 59%),
PRON –[nmod]–> PRON (29; 57%).