Treebank Statistics: UD_Norwegian-Nynorsk: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
Some words have combined values of the feature; 1 combination has been observed: Fem|Masc.
88741 tokens (29%) have a non-empty value of Gender.
20137 types (65%) occur at least once with a non-empty value of Gender.
13962 lemmas (61%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (58293; 19% of all tokens), DET (11254; 4% of all tokens), PRON (10383; 3% of all tokens), ADJ (7889; 3% of all tokens), VERB (863; 0% of all tokens), NUM (57; 0% of all tokens), AUX (2; 0% of all tokens).
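Counts like these can be reproduced from a CoNLL-U file by checking the FEATS column of each token. The following is a minimal sketch, not the script that generated this page: the function names and the four-token sample are hypothetical, and only the standard 10-column CoNLL-U layout is assumed.

```python
from collections import Counter

# Tiny hand-made sample (hypothetical, not taken from the treebank).
# An empty row stands for the blank line between two sentences.
ROWS = [
    ["1", "ei", "ein", "DET", "_", "Gender=Fem|Number=Sing|PronType=Art", "2", "det", "_", "_"],
    ["2", "bok", "bok", "NOUN", "_", "Definite=Ind|Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    [],
    ["1", "dei", "dei", "PRON", "_", "Number=Plur|PronType=Prs", "2", "nsubj", "_", "_"],
    ["2", "les", "lese", "VERB", "_", "Mood=Ind|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections that follow (for example, "97% of all NOUN tokens").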
NOUN
58293 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40820; 70%), Definite=Ind (33320; 57%).
NOUN tokens may have the following values of Gender:
Fem (13647; 23% of non-empty Gender): tid, kroner, regjeringa, saka, verda, boka, tida, meldinga, lov, grad
Masc (27518; 47% of non-empty Gender): dag, prosent, del, millionar, gong, grunn, Olav, leiar, bruk, kommunen
Neut (17128; 29% of non-empty Gender): år, folk, språk, stortinget, landet, land, Framstegspartiet, tillegg, departementet, arbeidet
EMPTY (1737): USA, SV, kap., Ap, EU, lag, fjor, Sp, OECD, nr.
Paradigm lov | Masc | Fem | Neut |
---|---|---|---|
Definite=Def\|Number=Sing | lova, lovi | | |
Definite=Def\|Number=Plur | lovene | | |
Definite=Ind\|Number=Sing | lov | lov | lov |
Definite=Ind\|Number=Plur | lover | | |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (12247) occur with only one value of Gender.
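The "lexical feature" observation rests on a simple count: for each lemma, collect the distinct Gender values it occurs with, then measure how many lemmas have exactly one. A minimal sketch of that count follows; the function name and the input format (a list of lemma/value pairs) are assumptions made for illustration, and the sample uses a handful of lemmas from the lists above.

```python
from collections import defaultdict


def single_valued_share(observations):
    """observations: iterable of (lemma, gender_value) pairs, non-empty Gender only.

    Returns (number of lemmas seen with exactly one value, their share of all lemmas).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# "lov" occurs with all three values (see its paradigm); the others with one.
obs = [
    ("tid", "Fem"), ("tid", "Fem"),
    ("dag", "Masc"),
    ("år", "Neut"),
    ("lov", "Fem"), ("lov", "Masc"), ("lov", "Neut"),
]
single, share = single_valued_share(obs)
print(single, f"{share:.0%}")  # prints: 3 75%
```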
DET
11254 DET tokens (75% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (11254; 100%), PronType=Art (5926; 53%).
DET tokens may have the following values of Gender:
Fem (2508; 22% of non-empty Gender): ei, den, denne, anna, slik, eiga, all, inga, kvar, noko
Masc (4558; 41% of non-empty Gender): ein, den, denne, kvar, eigen, annan, nokon, ingen, all, slik
Neut (4188; 37% of non-empty Gender): eit, det, anna, noko, dette, kvart, eitt, eige, alt, slikt
EMPTY (3752): dei, andre, alle, same, desse, nokre, sjølv, slike, kva, sjølve
Paradigm ein | Masc | Fem | Neut |
---|---|---|---|
| | ein, en | ei | eit, eitt, eir |
PRON
10383 PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Animacy=EMPTY (10381; 100%), Number=Sing (10379; 100%), PronType=Prs (10056; 97%), Person=3 (9223; 89%), Case=EMPTY (8096; 78%).
PRON tokens may have the following values of Gender:
Fem (978; 9% of non-empty Gender): ho, si, henne, vår, hans, mi, hennar, di, deira, ei
Fem,Masc (171; 2% of non-empty Gender): den, denne, dén
Masc (2129; 21% of non-empty Gender): han, sin, vår, min, hans, nokon, hennar, deira, ingen, din
Neut (7105; 68% of non-empty Gender): det, dette, noko, sitt, alt, vårt, mitt, hans, slikt, hennar
EMPTY (12419): som, dei, eg, vi, seg, ein, me, du, kva, oss
Paradigm sin | Masc | Fem | Neut |
---|---|---|---|
| | sin | si | sitt |
ADJ
7889 ADJ tokens (29% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (7889; 100%), Number=Sing (7888; 100%), Degree=Pos (7396; 94%).
ADJ tokens may have the following values of Gender:
Fem (16; 0% of non-empty Gender): lita, bundi, opa
Masc (107; 1% of non-empty Gender): liten, open, kristen, oppteken, god, lunken, medfaren, sliten, velkomen, Forboden
Neut (7766; 98% of non-empty Gender): mykje, godt, heilt, langt, svært, litt, rett, veldig, viktig, norsk
EMPTY (19260): meir, mange, fleire, nye, store, heile, norske, siste, mest, stor
Paradigm liten | Masc | Fem | Neut |
---|---|---|---|
| | liten | lita | lite, smått |
Gender seems to be a lexical feature of ADJ: 99% of lemmas (1388) occur with only one value of Gender.
VERB
863 VERB tokens (3% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (863; 100%), Tense=EMPTY (863; 100%), VerbForm=Part (863; 100%).
VERB tokens may have the following values of Gender:
Fem (2; 0% of non-empty Gender): teki, vedteki
Neut (861; 100% of non-empty Gender): lagt, gjort, sett, sagt, vedteke, gjeve, teke, vist, halde, sendt
EMPTY (29860): har, seier, er, få, kjem, får, meiner, ha, går, fekk
Paradigm ta | Fem | Neut |
---|---|---|
| | teki | teke, tatt |
Gender seems to be a lexical feature of VERB: 99% of lemmas (319) occur with only one value of Gender.
NUM
57 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (57; 100%), Number=Sing (57; 100%).
NUM tokens may have the following values of Gender:
Fem (12; 21% of non-empty Gender): éi, annakvar, ei
Masc (41; 72% of non-empty Gender): éin, en, annankvar, èin
Neut (4; 7% of non-empty Gender): annakvart, halvanna, halvtanna, noe
EMPTY (3975): to, tre, fire, ti, fem, 20, 1, seks, 2005, 2006
Paradigm annankvar | Masc | Fem | Neut |
---|---|---|---|
| | annankvar | annakvar | annakvart |
AUX
2 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Part (2; 100%).
AUX tokens may have the following values of Gender:
Neut (2; 100% of non-empty Gender): blitt
EMPTY (16722): er, har, var, kan, skal, vil, må, vart, vore, blir
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (9667; 76%),
NOUN –[nmod]–> PRON (1164; 64%),
ADJ –[expl]–> PRON (550; 86%),
NOUN –[flat:name]–> NOUN (246; 78%),
NOUN –[appos]–> NOUN (240; 51%),
ADJ –[conj]–> ADJ (216; 67%),
DET –[nmod]–> NOUN (147; 69%),
PRON –[acl:relcl]–> ADJ (31; 74%),
PRON –[det]–> DET (28; 67%),
DET –[conj]–> DET (23; 85%).
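Agreement counts of this kind can be sketched by pairing each token with its head and comparing their Gender values, grouped by parent tag, relation, and child tag. The page does not state how the percentages are normalised; the sketch below assumes the denominator is the number of parent-child pairs in which both nodes carry a Gender value. The function name, the token dictionaries, and the second sample sentence (a deliberate mismatch) are hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentences):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, pairs where both have Gender)."""
    agree, both = Counter(), Counter()
    for tokens in sentences:
        by_id = {tok["id"]: tok for tok in tokens}
        for child in tokens:
            parent = by_id.get(child["head"])  # head 0 is the root: no parent node
            if parent is None or not child["gender"] or not parent["gender"]:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if child["gender"] == parent["gender"]:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Two toy sentences; the second pairs a Neut determiner with a Masc noun on purpose.
sentences = [
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    ],
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Neut"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Masc"},
    ],
]
result = agreement_by_relation(sentences)
for (parent_upos, deprel, child_upos), (n_agree, n_both) in result.items():
    print(parent_upos, deprel, child_upos, n_agree, f"{n_agree / n_both:.0%}")
```

Exact string comparison treats a combined value such as Fem,Masc as different from Fem; whether such pairs should count as agreeing is a design choice this sketch leaves open.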