Treebank Statistics: UD_Pomak-Philotis: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
11433 tokens (33%) have a non-empty value of Gender
.
4331 types (69%) occur at least once with a non-empty value of Gender
.
2140 lemmas (66%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (5055; 15% instances), VERB (2346; 7% instances), PRON (1401; 4% instances), DET (1173; 3% instances), ADJ (855; 2% instances), PROPN (311; 1% instances), NUM (189; 1% instances), AUX (103; 0% instances).
NOUN
5055 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (3970; 79%), Case=Acc (3317; 66%), Definite=Ind (2790; 55%), Deixis=EMPTY (2790; 55%).
NOUN
tokens may have the following values of Gender
:
Fem
(2057; 41% of non-emptyGender
): godíny, májka, kóštono, rábato, vódo, goróno, žóno, astinomíjena, rábaty, rábataMasc
(2182; 43% of non-emptyGender
): déne, čulǽkon, čulǽka, bubájko, hašíše, pláden, vakýt, mesecáte, véčera, póteneNeut
(816; 16% of non-emptyGender
): vréme, kópeløno, mómičeno, sélo, mómiče, mǽsto, sélono, kúčeno, evró, déteEMPTY
(41): gün, keré, korf, DEIno, kerét, dumá, Kerém, TV, cm, dokús
Paradigm kópel | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Def|Degree=Dim|Deixis=Remt | kópelčeno | ||
Case=Acc|Definite=Def|Deixis=Remt | kópelane | ||
Case=Acc|Definite=Ind|Degree=Dim | kópelče | kópelče | |
Case=Acc|Definite=Ind | kópela | ||
Case=Gen|Definite=Def|Degree=Dim|Deixis=Remt | kópelčotune | ||
Case=Nom|Definite=Def|Degree=Dim|Deixis=Remt | kópelčeno | ||
Case=Nom|Definite=Def|Deixis=Remt | kópelon, Kópeløn | ||
Case=Nom|Definite=Ind|Degree=Dim | kópelče | ||
Case=Nom|Definite=Ind | kópel | kópela |
Gender
seems to be lexical feature of NOUN
. 94% lemmas (1064) occur only with one value of Gender
.
VERB
2346 VERB tokens (40% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (2346; 100%), VerbForm=Part (2346; 100%), Person=EMPTY (2345; 100%), Tense=Past (2154; 92%), Voice=Act (2153; 92%), Aspect=Perf (1842; 79%), Number=Sing (1828; 78%).
VERB
tokens may have the following values of Gender
:
Fem
(631; 27% of non-emptyGender
): reklála, zǿla, vídela, atišlála, stánala, tórnala, imǽla, stórila, dála, kázalaMasc
(1352; 58% of non-emptyGender
): reklól, zøl, atišlól, vídel, stánal, imǽl, tórnal, zǿli, advórnal, rekólNeut
(363; 15% of non-emptyGender
): imǽlo, stánalo, reklólo, skrýto, zǿlo, vídelo, atišlólo, dašlólo, dálo, paminóloEMPTY
(3514): víka, trǽbava, móža, íma, hódi, právi, dam, fáti, vídi, stánava
Paradigm réčem | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Aspect=Perf|Number=Plur | reklíli, reklí, raklí | ||
Aspect=Imp|Number=Sing | rékla | ||
Aspect=Perf|Number=Sing | reklól, rekól | reklála, reklá, rékla | reklólo |
Aspect=Perf|Number=Plur | reklýly |
PRON
1401 PRON tokens (41% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (1401; 100%), Person=3 (1397; 100%), PronType=Prs (1396; 100%), Number=Sing (1332; 95%).
PRON
tokens may have the following values of Gender
:
Fem
(421; 30% of non-emptyGender
): jé, tja, jí, ji, týje, hi, jo, te, to, tæMasc
(717; 51% of non-emptyGender
): go, mú, toj, mu, tóga, tíje, tæh, mo, tómu, toNeut
(263; 19% of non-emptyGender
): go, to, mu, mú, gu, mo, tómu, žónoEMPTY
(1990): só, sí, mí, gi, kaná, ja, tí, ty, sa, mi
Paradigm ja | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Number=Plur|PronType=Prs | tæh | ||
Animacy=Hum|Case=Nom|Number=Plur|PronType=Prs | tíje | ||
Animacy=Nhum|Case=Nom|Number=Plur|PronType=Prs | to | ||
Case=Acc|Number=Sing | jé | ||
Case=Acc|Number=Sing|PronType=Prs | go, tóga, gu, tógu | jé, týje, jo, ja | go, to, gu |
Case=Acc|Number=Plur|PronType=Prs | to | ||
Case=Gen|Number=Sing|Number[psor]=Sing|Poss=Yes|PronType=Prs | mu | ||
Case=Gen|Number=Sing|PronType=Prs | mú, mo, tómu | jí, hi | mú, mo, gu, tómu |
Case=Nom|Number=Sing|PronType=Prs | toj, tómu | tja, te, tæ | to |
Case=Nom|Number=Plur|PronType=Prs | to | to |
DET
1173 DET tokens (85% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: DeixisRef=EMPTY (1041; 89%), Animacy=EMPTY (1034; 88%), Number=Sing (955; 81%), Case=Acc (717; 61%), Deixis=EMPTY (653; 56%).
DET
tokens may have the following values of Gender
:
Fem
(295; 25% of non-emptyGender
): annó, anná, žýne, ennó, isózi, drúgy, kakvó, isázi, drúgono, ennáMasc
(642; 55% of non-emptyGender
): annók, adín, kutrí, žýjen, žíne, vrítsi, žókne, kotrí, drúgyjen, edínNeut
(236; 20% of non-emptyGender
): annó, inazí, drúgo, ennó, žóno, žýne, drúgono, isazí, inakvóne, itazíEMPTY
(211): bir, nǽko, vrit, kólko, inélkus, kač, kólkoto, sǽko, bu, her
Paradigm adín | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Definite=Ind|Number=Sing|PronType=Ind | annóga | ||
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|PronType=Ind | anní | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|PronType=Ind | ennókte | annóto | |
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|PronType=Ind | annókne, annóganek | annóno | |
Case=Acc|Definite=Ind|Number=Sing|NumType=Card | annók | annó | |
Case=Acc|Definite=Ind|Number=Sing|PronType=Ind | annók, ennók, edín, annóga, annómu | annó, ennó, annój, jennó | annó, ennó |
Case=Acc|Definite=Ind|Number=Plur|PronType=Ind | anný | anný | |
Case=Acc|Deixis=Remt|Number=Sing|PronType=Dem | annǽh | ||
Case=Gen|Definite=Ind|Number=Sing|PronType=Ind | annómu, annój | annój | |
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|PronType=Ind | adínyjen | annóno | |
Case=Nom|Definite=Ind|Number=Sing|NumType=Card | adín | ||
Case=Nom|Definite=Ind|Number=Sing|PronType=Ind | adín, edín, annómu | anná, enná, jennó | annó, anná, ennó |
ADJ
855 ADJ tokens (84% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (633; 74%), Deixis=EMPTY (501; 59%), Definite=Ind (499; 58%), Case=Acc (481; 56%).
ADJ
tokens may have the following values of Gender
:
Fem
(329; 38% of non-emptyGender
): gulǽma, cǽlo, gladná, gulǽmo, starána, altóneny, górnono, míčko, altóneno, bǽlyMasc
(373; 44% of non-emptyGender
): stáryjen, cǽla, móske, mládyjen, gulǽma, húbava, míčkyjen, stári, stárine, čárckyjenNeut
(153; 18% of non-emptyGender
): kámatno, Pomácko, právo, húbavo, lóšo, míčko, altóneno, cǽlo, kámatnono, parátikoEMPTY
(162): mlógo, razý, málko, ájni, mlógu, mífko, meǧbúr, pišmén, birinǧíto, halál
Paradigm gulǽm | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plur | gulǽmehne | ||
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plur | golémi, gulǽmeh | ||
Animacy=Nhum|Case=Acc|Definite=Ind|Number=Plur | gulǽmy | ||
Animacy=Nhum|Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | golǽmyne | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | gulǽmoto | ||
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing | gulǽmono | ||
Case=Acc|Definite=Ind|Number=Sing | gulǽma, goléma, golémi | gulǽmo | |
Case=Acc|Definite=Ind|Number=Plur | gulǽmi | gulǽmy | |
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | gulǽmata | ||
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plur | gulǽmite | ||
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | gulǽmyjen | Gulǽmana | gulǽmono |
Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | gulǽmyne | ||
Case=Nom|Definite=Ind|Number=Sing | gulǽm, Golém | gulǽma | |
Case=Nom|Definite=Ind|Number=Plur | Golǽmy, gulǽmy |
PROPN
311 PROPN tokens (62% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (267; 86%), Definite=Ind (244; 78%), Case=Nom (172; 55%).
PROPN
tokens may have the following values of Gender
:
Fem
(109; 35% of non-emptyGender
): Aminǽ, Srǽdo, Sóboto, Ǧemilǽ, Kavála, Dráma, Hilmijá, Mára, Máro, GalínkaMasc
(140; 45% of non-emptyGender
): Alí, Ají, Isén, Asíp, Panedélnik, Jerím, Tórnik, Nasradín, Orhán, AzraílNeut
(62; 20% of non-emptyGender
): Kélčeno, Kélčetune, Nedéle, Pašavík, Jasǿren, Mustáfčevo, Basájkovo, Bunár, Démirǧik, GøkčéEMPTY
(194): Ksánti, Elláda, Ǧumágün, Aleksandrúpoli, Néa, Siría, Komotiní, Vulgaría, Évro, Dimokratía
Paradigm Nedéle | Fem | Neut |
---|---|---|
Number=Sing | Nedéle | Nedéle |
Number=Plur | Nedéleta |
Gender
seems to be lexical feature of PROPN
. 96% lemmas (114) occur only with one value of Gender
.
NUM
189 NUM tokens (35% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (168; 89%), Definite=Ind (136; 72%), Deixis=EMPTY (136; 72%), Animacy=EMPTY (132; 70%), Case=Acc (122; 65%), Number=Sing (114; 60%).
NUM
tokens may have the following values of Gender
:
Fem
(35; 19% of non-emptyGender
): annó, ennó, anná, anníčka, ennáMasc
(118; 62% of non-emptyGender
): annók, dvamínana, dva, dvamína, dvomínana, trimínana, ennók, trimína, adínyjen, dvánaNeut
(36; 19% of non-emptyGender
): annó, annómune, annóto, drúgono, ennóto, jennóEMPTY
(351): tri, dve, jedí, dvéne, kyrk, 6, 10, tríne, 5, 8
Paradigm adín | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|NumType=Card | annóga | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|NumType=Card | annógate | annóto, ennóto | |
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | annókne | ||
Case=Acc|Definite=Ind|Number=Sing | annók | annó, ennó | annó |
Case=Acc|Definite=Ind|Number=Sing|NumType=Card | annók, ennók, jedín | annó | annó, jennó |
Case=Gen|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | annómune | ||
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | adínyjen | ||
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | adínyjen | ||
Case=Nom|Definite=Ind|Degree=Dim|Number=Sing|NumType=Card | anníčka | ||
Case=Nom|Definite=Ind|Number=Sing | adín | annó | |
Case=Nom|Definite=Ind|Number=Sing|NumType=Card | adín, jedín | anná, enná | annó |
AUX
103 AUX tokens (2% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Aspect=Perf (103; 100%), Mood=EMPTY (103; 100%), Person=EMPTY (103; 100%), Tense=Past (103; 100%), VerbForm=Part (103; 100%), Voice=Act (103; 100%), Number=Sing (86; 83%).
AUX
tokens may have the following values of Gender
:
Fem
(27; 26% of non-emptyGender
): búla, býla, býly, búly, bíla, bíluMasc
(46; 45% of non-emptyGender
): bul, byl, búli, býli, bil, bíliNeut
(30; 29% of non-emptyGender
): búlo, býlo, bíloEMPTY
(4074): je, da, so, še, li, si, som, sa, ša, jo
Paradigm býdom | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Number=Plur | búli, býli, bíli | ||
Number=Sing | bul, byl, bil | búla, býla, bíla, bílu | búlo, býlo |
Number=Plur | býly, búly |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (703; 82%),
VERB –[conj]–> VERB (585; 64%),
VERB –[nsubj]–> NOUN (579; 57%),
NOUN –[amod]–> ADJ (487; 82%),
NOUN –[amod]–> VERB (52; 84%),
VERB –[nsubj]–> ADJ (35; 56%),
ADJ –[conj]–> ADJ (30; 100%),
ADJ –[det]–> DET (19; 61%),
PROPN –[flat]–> PROPN (17; 89%),
ADJ –[nsubj]–> NOUN (15; 65%).