home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Pomak-Philotis: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

29075 tokens (34%) have a non-empty value of Gender. 7842 types (72%) occur at least once with a non-empty value of Gender. 2695 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (12751; 15% instances), VERB (6246; 7% instances), PRON (3580; 4% instances), DET (2897; 3% instances), ADJ (2216; 3% instances), PROPN (734; 1% instances), NUM (438; 1% instances), AUX (213; 0% instances).

NOUN

12751 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (10025; 79%), Case=Acc (8056; 63%), Deixis=EMPTY (6884; 54%), Definite=Ind (6883; 54%).

NOUN tokens may have the following values of Gender:

Paradigm žanáFemNeut
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Singžanóso
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singžanóto
Case=Acc|Definite=Def|Deixis=Remt|Number=Singžanóno, ženóno, žónana
Case=Acc|Definite=Ind|Number=Singžónožóno
Case=Acc|Definite=Ind|Number=Pluržóny
Case=Gen|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Singžanójse
Case=Gen|Definite=Def|Deixis=Remt|Number=Singžanójne, žónajne
Case=Gen|Definite=Ind|Number=Singžónoj
Case=Gen|Definite=Ind|Number=Pluržónom
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Singženása
Case=Nom|Definite=Def|Deixis=Remt|Number=Singžanána, ženána
Case=Nom|Definite=Def|Deixis=Remt|Number=Pluržónyne
Case=Nom|Definite=Ind|Number=Singžaná, žéna
Case=Nom|Definite=Ind|Number=Pluržóny
Case=Voc|Definite=Ind|Number=Singžóno

Gender seems to be lexical feature of NOUN. 97% lemmas (1176) occur only with one value of Gender.

VERB

6246 VERB tokens (42% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6246; 100%), Person=EMPTY (6246; 100%), VerbForm=Part (6245; 100%), Tense=Past (5775; 92%), Voice=Act (5775; 92%), Aspect=Perf (4961; 79%), Number=Sing (4869; 78%).

VERB tokens may have the following values of Gender:

Paradigm réčemMascFemNeut
Animacy=Hum|Number=Plurreklíli, reklí, raklí
Animacy=Nhum|Number=Plurreklýly
Number=Singreklól, rekól, reklólareklála, reklá, réklareklólo, rekló
Number=Plurreklýlyreklý, reklýly

PRON

3580 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3580; 100%), Person=3 (3576; 100%), PronType=Prs (3575; 100%), Number=Sing (3378; 94%).

PRON tokens may have the following values of Gender:

Paradigm jaMascFemNeut
Animacy=Hum|Case=Acc|Number=Plur|PronType=Prstæh
Animacy=Hum|Case=Nom|Number=Plur|PronType=Prstíje
Animacy=Nhum|Case=Acc|Number=Plur|PronType=Prsto
Animacy=Nhum|Case=Nom|Number=Plur|PronType=Prsto
Case=Acc|Number=Sing
Case=Acc|Number=Sing|PronType=Prsgo, tóga, gu, tógu, négajé, týje, jo, ja, néje, týjogo, to, gu
Case=Acc|Number=Plur|PronType=Prstototo
Case=Gen|Number=Sing|PronType=Prsmú, mo, tómují, hi, je, tójmú, mo, tómu, mó
Case=Gen|Number=Plur|PronType=PrsTæm
Case=Nom|Number=Sing|PronType=Prstojtja, te, tje, tæto
Case=Nom|Number=Plur|PronType=Prstoto

DET

2897 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: DeixisRef=EMPTY (2540; 88%), Number=Sing (2399; 83%), Case=Acc (1667; 58%), Deixis=EMPTY (1628; 56%).

DET tokens may have the following values of Gender:

Paradigm adínMascFemNeut
Animacy=Hum|Case=Acc|Definite=Ind|Number=Singannóga
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plurannǽh
Animacy=Hum|Case=Nom|Definite=Ind|Number=Pluranní
Animacy=Nhum|Case=Acc|Definite=Ind|Number=Pluranný
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=SingEnnósa
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singennókteannótoannóto
Case=Acc|Definite=Def|Deixis=Remt|Number=Singannókne, annóganekannóno, ennóna, jennóna
Case=Acc|Definite=Ind|Degree=Dim|Number=Singanníčko
Case=Acc|Definite=Ind|Number=Singannók, ennók, edín, jedín, adínannó, ennó, enná, jennóannó, ennó, jennó
Case=Acc|Definite=Ind|Number=Pluranný
Case=Gen|Definite=Def|Deixis=Remt|Number=Singannómune
Case=Gen|Definite=Ind|Number=Singannómuannójannómu
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singadínyjet
Case=Nom|Definite=Def|Deixis=Remt|Number=Singadínyjen, edínijon, adínajen, adínenannána, jennónaannóno
Case=Nom|Definite=Def|Deixis=Remt|Number=Plurannýne
Case=Nom|Definite=Ind|Number=Singadín, edín, jedínanná, enná, ennó, jenná, jennóannó, ennó
Case=Nom|Definite=Ind|Number=Pluranný

ADJ

2216 ADJ tokens (85% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1712; 77%), Definite=Ind (1297; 59%), Deixis=EMPTY (1297; 59%), Case=Acc (1144; 52%).

ADJ tokens may have the following values of Gender:

Paradigm starMascFemNeut
Animacy=Hum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plurstárehne, stárene
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plurstáreh
Animacy=Hum|Case=Gen|Definite=Def|Deixis=Remt|Number=Plurstáremne
Animacy=Hum|Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=PlurStárite
Animacy=Hum|Case=Nom|Definite=Def|Deixis=Remt|Number=Plurstárine
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plurstári
Animacy=Nhum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plurstáryne
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=SingStároso
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singstároktestárotostároto
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plurstáryte
Case=Acc|Definite=Def|Deixis=Remt|Number=Singstárokne, stáranestárono
Case=Acc|Definite=Def|Deixis=Remt|Number=Plurstáryne
Case=Acc|Definite=Ind|Number=Singstára, stárokstáro, stára
Case=Acc|Definite=Ind|Number=Plurstáry
Case=Gen|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singstárumute
Case=Gen|Definite=Def|Deixis=Remt|Number=Singstárumune, stáromunestárojne
Case=Gen|Definite=Ind|Number=Singstárustároj
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Singstáryjetstaráta
Case=Nom|Definite=Def|Deixis=Remt|Number=Singstáryjen, stárijon, stáryenstarána
Case=Nom|Definite=Ind|Number=Singstarstará
Case=Nom|Definite=Ind|Number=Plurstáry
Case=Voc|Definite=Ind|Degree=Dim|Number=Singstárku

PROPN

734 PROPN tokens (67% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (624; 85%), Definite=Ind (554; 75%), Case=Nom (428; 58%).

PROPN tokens may have the following values of Gender:

Paradigm HóǧeFemNeut
Case=Acc|Definite=IndHóǧa
Case=Nom|Definite=Def|Deixis=RemtHóǧanaHóǧena
Case=Nom|Definite=IndHóǧe, HÓǦE, Hóǧa
Case=Voc|Definite=IndHóǧa

Gender seems to be lexical feature of PROPN. 99% lemmas (137) occur only with one value of Gender.

NUM

438 NUM tokens (37% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (438; 100%), Definite=Ind (318; 73%), Deixis=EMPTY (318; 73%), Animacy=EMPTY (288; 66%), Case=Acc (258; 59%), Number=Sing (237; 54%).

NUM tokens may have the following values of Gender:

Paradigm adínMascFemNeut
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1annóso
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2annógateannóto, ennóto
Case=Acc|Definite=Def|Deixis=Remtannókneannónoannóno
Case=Acc|Definite=Indannók, ennók, jennók, jedínannó, ennóannó, ennó, jennó
Case=Gen|Definite=Def|Deixis=Remtannómune
Case=Gen|Definite=Indannój
Case=Nom|Definite=Def|Deixis=Remtadínyjen, edíņonannánaannóno
Case=Nom|Definite=Ind|Degree=Dimanníčekanníčka
Case=Nom|Definite=Indadín, jedínanná, ennáannó

AUX

213 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Perf (213; 100%), Mood=EMPTY (213; 100%), Person=EMPTY (213; 100%), Tense=Past (213; 100%), VerbForm=Part (213; 100%), Voice=Act (213; 100%), Number=Sing (179; 84%).

AUX tokens may have the following values of Gender:

Paradigm býdomMascFemNeut
Animacy=Hum|Number=Plurbúli, býli, bíli
Animacy=Nhum|Number=Plurbúly
Number=Singbul, byl, bilbýla, búla, bíla, bylá, bulábúlo, býlo, buló, bílo, bílu
Number=Plurbýly, búlybýly, búly

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1797; 82%), VERB –[conj]–> VERB (1522; 61%), NOUN –[amod]–> ADJ (1400; 88%), VERB –[nsubj]–> NOUN (1260; 53%), VERB –[nsubj]–> PRON (270; 52%), NOUN –[amod]–> VERB (143; 85%), VERB –[nsubj]–> ADJ (86; 52%), ADJ –[det]–> DET (62; 76%), ADJ –[conj]–> ADJ (55; 90%), PROPN –[nmod]–> PROPN (48; 71%).