home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Pomak-Philotis: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

11433 tokens (33%) have a non-empty value of Gender. 4331 types (69%) occur at least once with a non-empty value of Gender. 2140 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (5055; 15% instances), VERB (2346; 7% instances), PRON (1401; 4% instances), DET (1173; 3% instances), ADJ (855; 2% instances), PROPN (311; 1% instances), NUM (189; 1% instances), AUX (103; 0% instances).

NOUN

5055 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3970; 79%), Case=Acc (3317; 66%), Definite=Ind (2790; 55%), Deixis=EMPTY (2790; 55%).

NOUN tokens may have the following values of Gender:

Paradigm kópelMascFemNeut
Case=Acc|Definite=Def|Degree=Dim|Deixis=Remtkópelčeno
Case=Acc|Definite=Def|Deixis=Remtkópelane
Case=Acc|Definite=Ind|Degree=Dimkópelčekópelče
Case=Acc|Definite=Indkópela
Case=Gen|Definite=Def|Degree=Dim|Deixis=Remtkópelčotune
Case=Nom|Definite=Def|Degree=Dim|Deixis=Remtkópelčeno
Case=Nom|Definite=Def|Deixis=Remtkópelon, Kópeløn
Case=Nom|Definite=Ind|Degree=Dimkópelče
Case=Nom|Definite=Indkópelkópela

Gender seems to be lexical feature of NOUN. 94% lemmas (1064) occur only with one value of Gender.

VERB

2346 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2346; 100%), VerbForm=Part (2346; 100%), Person=EMPTY (2345; 100%), Tense=Past (2154; 92%), Voice=Act (2153; 92%), Aspect=Perf (1842; 79%), Number=Sing (1828; 78%).

VERB tokens may have the following values of Gender:

Paradigm réčemMascFemNeut
Animacy=Hum|Aspect=Perf|Number=Plurreklíli, reklí, raklí
Aspect=Imp|Number=Singrékla
Aspect=Perf|Number=Singreklól, rekólreklála, reklá, réklareklólo
Aspect=Perf|Number=Plurreklýly

PRON

1401 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1401; 100%), Person=3 (1397; 100%), PronType=Prs (1396; 100%), Number=Sing (1332; 95%).

PRON tokens may have the following values of Gender:

Paradigm jaMascFemNeut
Animacy=Hum|Case=Acc|Number=Plur|PronType=Prstæh
Animacy=Hum|Case=Nom|Number=Plur|PronType=Prstíje
Animacy=Nhum|Case=Nom|Number=Plur|PronType=Prsto
Case=Acc|Number=Sing
Case=Acc|Number=Sing|PronType=Prsgo, tóga, gu, tógujé, týje, jo, jago, to, gu
Case=Acc|Number=Plur|PronType=Prsto
Case=Gen|Number=Sing|Number[psor]=Sing|Poss=Yes|PronType=Prsmu
Case=Gen|Number=Sing|PronType=Prsmú, mo, tómují, himú, mo, gu, tómu
Case=Nom|Number=Sing|PronType=Prstoj, tómutja, te, tæto
Case=Nom|Number=Plur|PronType=Prstoto

DET

1173 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: DeixisRef=EMPTY (1041; 89%), Animacy=EMPTY (1034; 88%), Number=Sing (955; 81%), Case=Acc (717; 61%), Deixis=EMPTY (653; 56%).

DET tokens may have the following values of Gender:

Paradigm adínMascFemNeut
Animacy=Hum|Case=Acc|Definite=Ind|Number=Sing|PronType=Indannóga
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|PronType=Indanní
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|PronType=Indennókteannóto
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|PronType=Indannókne, annóganekannóno
Case=Acc|Definite=Ind|Number=Sing|NumType=Cardannókannó
Case=Acc|Definite=Ind|Number=Sing|PronType=Indannók, ennók, edín, annóga, annómuannó, ennó, annój, jennóannó, ennó
Case=Acc|Definite=Ind|Number=Plur|PronType=Indannýanný
Case=Acc|Deixis=Remt|Number=Sing|PronType=Demannǽh
Case=Gen|Definite=Ind|Number=Sing|PronType=Indannómu, annójannój
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|PronType=Indadínyjenannóno
Case=Nom|Definite=Ind|Number=Sing|NumType=Cardadín
Case=Nom|Definite=Ind|Number=Sing|PronType=Indadín, edín, annómuanná, enná, jennóannó, anná, ennó

ADJ

855 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (633; 74%), Deixis=EMPTY (501; 59%), Definite=Ind (499; 58%), Case=Acc (481; 56%).

ADJ tokens may have the following values of Gender:

Paradigm gulǽmMascFemNeut
Animacy=Hum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plurgulǽmehne
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plurgolémi, gulǽmeh
Animacy=Nhum|Case=Acc|Definite=Ind|Number=Plurgulǽmy
Animacy=Nhum|Case=Nom|Definite=Def|Deixis=Remt|Number=Plurgolǽmyne
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singgulǽmoto
Case=Acc|Definite=Def|Deixis=Remt|Number=Singgulǽmono
Case=Acc|Definite=Ind|Number=Singgulǽma, goléma, golémigulǽmo
Case=Acc|Definite=Ind|Number=Plurgulǽmigulǽmy
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singgulǽmata
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plurgulǽmite
Case=Nom|Definite=Def|Deixis=Remt|Number=SinggulǽmyjenGulǽmanagulǽmono
Case=Nom|Definite=Def|Deixis=Remt|Number=Plurgulǽmyne
Case=Nom|Definite=Ind|Number=Singgulǽm, Golémgulǽma
Case=Nom|Definite=Ind|Number=PlurGolǽmy, gulǽmy

PROPN

311 PROPN tokens (62% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (267; 86%), Definite=Ind (244; 78%), Case=Nom (172; 55%).

PROPN tokens may have the following values of Gender:

Paradigm NedéleFemNeut
Number=SingNedéleNedéle
Number=PlurNedéleta

Gender seems to be lexical feature of PROPN. 96% lemmas (114) occur only with one value of Gender.

NUM

189 NUM tokens (35% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (168; 89%), Definite=Ind (136; 72%), Deixis=EMPTY (136; 72%), Animacy=EMPTY (132; 70%), Case=Acc (122; 65%), Number=Sing (114; 60%).

NUM tokens may have the following values of Gender:

Paradigm adínMascFemNeut
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|NumType=Cardannóga
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|NumType=Cardannógateannóto, ennóto
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|NumType=Cardannókne
Case=Acc|Definite=Ind|Number=Singannókannó, ennóannó
Case=Acc|Definite=Ind|Number=Sing|NumType=Cardannók, ennók, jedínannóannó, jennó
Case=Gen|Definite=Def|Deixis=Remt|Number=Sing|NumType=Cardannómune
Case=Nom|Definite=Def|Deixis=Remt|Number=Singadínyjen
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|NumType=Cardadínyjen
Case=Nom|Definite=Ind|Degree=Dim|Number=Sing|NumType=Cardanníčka
Case=Nom|Definite=Ind|Number=Singadínannó
Case=Nom|Definite=Ind|Number=Sing|NumType=Cardadín, jedínanná, ennáannó

AUX

103 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Perf (103; 100%), Mood=EMPTY (103; 100%), Person=EMPTY (103; 100%), Tense=Past (103; 100%), VerbForm=Part (103; 100%), Voice=Act (103; 100%), Number=Sing (86; 83%).

AUX tokens may have the following values of Gender:

Paradigm býdomMascFemNeut
Animacy=Hum|Number=Plurbúli, býli, bíli
Number=Singbul, byl, bilbúla, býla, bíla, bílubúlo, býlo
Number=Plurbýly, búly

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (703; 82%), VERB –[conj]–> VERB (585; 64%), VERB –[nsubj]–> NOUN (579; 57%), NOUN –[amod]–> ADJ (487; 82%), NOUN –[amod]–> VERB (52; 84%), VERB –[nsubj]–> ADJ (35; 56%), ADJ –[conj]–> ADJ (30; 100%), ADJ –[det]–> DET (19; 61%), PROPN –[flat]–> PROPN (17; 89%), ADJ –[nsubj]–> NOUN (15; 65%).