Treebank Statistics: UD_Pomak-Philotis: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
11433 tokens (33%) have a non-empty value of Gender.
4331 types (69%) occur at least once with a non-empty value of Gender.
2140 lemmas (66%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (5055; 15% instances), VERB (2346; 7% instances), PRON (1401; 4% instances), DET (1173; 3% instances), ADJ (855; 2% instances), PROPN (311; 1% instances), NUM (189; 1% instances), AUX (103; 0% instances).
NOUN
5055 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3970; 79%), Case=Acc (3317; 66%), Definite=Ind (2790; 55%), Deixis=EMPTY (2790; 55%).
NOUN tokens may have the following values of Gender:
Fem(2057; 41% of non-emptyGender): godíny, májka, kóštono, rábato, vódo, goróno, žóno, astinomíjena, rábaty, rábataMasc(2182; 43% of non-emptyGender): déne, čulǽkon, čulǽka, bubájko, hašíše, pláden, vakýt, mesecáte, véčera, póteneNeut(816; 16% of non-emptyGender): vréme, kópeløno, mómičeno, sélo, mómiče, mǽsto, sélono, kúčeno, evró, déteEMPTY(41): gün, keré, korf, DEIno, kerét, dumá, Kerém, TV, cm, dokús
| Paradigm kópel | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Degree=Dim|Deixis=Remt | kópelčeno | ||
| Case=Acc|Definite=Def|Deixis=Remt | kópelane | ||
| Case=Acc|Definite=Ind|Degree=Dim | kópelče | kópelče | |
| Case=Acc|Definite=Ind | kópela | ||
| Case=Gen|Definite=Def|Degree=Dim|Deixis=Remt | kópelčotune | ||
| Case=Nom|Definite=Def|Degree=Dim|Deixis=Remt | kópelčeno | ||
| Case=Nom|Definite=Def|Deixis=Remt | kópelon, Kópeløn | ||
| Case=Nom|Definite=Ind|Degree=Dim | kópelče | ||
| Case=Nom|Definite=Ind | kópel | kópela |
Gender seems to be lexical feature of NOUN. 94% lemmas (1064) occur only with one value of Gender.
VERB
2346 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2346; 100%), VerbForm=Part (2346; 100%), Person=EMPTY (2345; 100%), Tense=Past (2154; 92%), Voice=Act (2153; 92%), Aspect=Perf (1842; 79%), Number=Sing (1828; 78%).
VERB tokens may have the following values of Gender:
Fem(631; 27% of non-emptyGender): reklála, zǿla, vídela, atišlála, stánala, tórnala, imǽla, stórila, dála, kázalaMasc(1352; 58% of non-emptyGender): reklól, zøl, atišlól, vídel, stánal, imǽl, tórnal, zǿli, advórnal, rekólNeut(363; 15% of non-emptyGender): imǽlo, stánalo, reklólo, skrýto, zǿlo, vídelo, atišlólo, dašlólo, dálo, paminóloEMPTY(3514): víka, trǽbava, móža, íma, hódi, právi, dam, fáti, vídi, stánava
| Paradigm réčem | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Aspect=Perf|Number=Plur | reklíli, reklí, raklí | ||
| Aspect=Imp|Number=Sing | rékla | ||
| Aspect=Perf|Number=Sing | reklól, rekól | reklála, reklá, rékla | reklólo |
| Aspect=Perf|Number=Plur | reklýly |
PRON
1401 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1401; 100%), Person=3 (1397; 100%), PronType=Prs (1396; 100%), Number=Sing (1332; 95%).
PRON tokens may have the following values of Gender:
Fem(421; 30% of non-emptyGender): jé, tja, jí, ji, týje, hi, jo, te, to, tæMasc(717; 51% of non-emptyGender): go, mú, toj, mu, tóga, tíje, tæh, mo, tómu, toNeut(263; 19% of non-emptyGender): go, to, mu, mú, gu, mo, tómu, žónoEMPTY(1990): só, sí, mí, gi, kaná, ja, tí, ty, sa, mi
| Paradigm ja | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Case=Acc|Number=Plur|PronType=Prs | tæh | ||
| Animacy=Hum|Case=Nom|Number=Plur|PronType=Prs | tíje | ||
| Animacy=Nhum|Case=Nom|Number=Plur|PronType=Prs | to | ||
| Case=Acc|Number=Sing | jé | ||
| Case=Acc|Number=Sing|PronType=Prs | go, tóga, gu, tógu | jé, týje, jo, ja | go, to, gu |
| Case=Acc|Number=Plur|PronType=Prs | to | ||
| Case=Gen|Number=Sing|Number[psor]=Sing|Poss=Yes|PronType=Prs | mu | ||
| Case=Gen|Number=Sing|PronType=Prs | mú, mo, tómu | jí, hi | mú, mo, gu, tómu |
| Case=Nom|Number=Sing|PronType=Prs | toj, tómu | tja, te, tæ | to |
| Case=Nom|Number=Plur|PronType=Prs | to | to |
DET
1173 DET tokens (85% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: DeixisRef=EMPTY (1041; 89%), Animacy=EMPTY (1034; 88%), Number=Sing (955; 81%), Case=Acc (717; 61%), Deixis=EMPTY (653; 56%).
DET tokens may have the following values of Gender:
Fem(295; 25% of non-emptyGender): annó, anná, žýne, ennó, isózi, drúgy, kakvó, isázi, drúgono, ennáMasc(642; 55% of non-emptyGender): annók, adín, kutrí, žýjen, žíne, vrítsi, žókne, kotrí, drúgyjen, edínNeut(236; 20% of non-emptyGender): annó, inazí, drúgo, ennó, žóno, žýne, drúgono, isazí, inakvóne, itazíEMPTY(211): bir, nǽko, vrit, kólko, inélkus, kač, kólkoto, sǽko, bu, her
| Paradigm adín | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Case=Acc|Definite=Ind|Number=Sing|PronType=Ind | annóga | ||
| Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|PronType=Ind | anní | ||
| Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|PronType=Ind | ennókte | annóto | |
| Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|PronType=Ind | annókne, annóganek | annóno | |
| Case=Acc|Definite=Ind|Number=Sing|NumType=Card | annók | annó | |
| Case=Acc|Definite=Ind|Number=Sing|PronType=Ind | annók, ennók, edín, annóga, annómu | annó, ennó, annój, jennó | annó, ennó |
| Case=Acc|Definite=Ind|Number=Plur|PronType=Ind | anný | anný | |
| Case=Acc|Deixis=Remt|Number=Sing|PronType=Dem | annǽh | ||
| Case=Gen|Definite=Ind|Number=Sing|PronType=Ind | annómu, annój | annój | |
| Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|PronType=Ind | adínyjen | annóno | |
| Case=Nom|Definite=Ind|Number=Sing|NumType=Card | adín | ||
| Case=Nom|Definite=Ind|Number=Sing|PronType=Ind | adín, edín, annómu | anná, enná, jennó | annó, anná, ennó |
ADJ
855 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (633; 74%), Deixis=EMPTY (501; 59%), Definite=Ind (499; 58%), Case=Acc (481; 56%).
ADJ tokens may have the following values of Gender:
Fem(329; 38% of non-emptyGender): gulǽma, cǽlo, gladná, gulǽmo, starána, altóneny, górnono, míčko, altóneno, bǽlyMasc(373; 44% of non-emptyGender): stáryjen, cǽla, móske, mládyjen, gulǽma, húbava, míčkyjen, stári, stárine, čárckyjenNeut(153; 18% of non-emptyGender): kámatno, Pomácko, právo, húbavo, lóšo, míčko, altóneno, cǽlo, kámatnono, parátikoEMPTY(162): mlógo, razý, málko, ájni, mlógu, mífko, meǧbúr, pišmén, birinǧíto, halál
| Paradigm gulǽm | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plur | gulǽmehne | ||
| Animacy=Hum|Case=Acc|Definite=Ind|Number=Plur | golémi, gulǽmeh | ||
| Animacy=Nhum|Case=Acc|Definite=Ind|Number=Plur | gulǽmy | ||
| Animacy=Nhum|Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | golǽmyne | ||
| Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | gulǽmoto | ||
| Case=Acc|Definite=Def|Deixis=Remt|Number=Sing | gulǽmono | ||
| Case=Acc|Definite=Ind|Number=Sing | gulǽma, goléma, golémi | gulǽmo | |
| Case=Acc|Definite=Ind|Number=Plur | gulǽmi | gulǽmy | |
| Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | gulǽmata | ||
| Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plur | gulǽmite | ||
| Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | gulǽmyjen | Gulǽmana | gulǽmono |
| Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | gulǽmyne | ||
| Case=Nom|Definite=Ind|Number=Sing | gulǽm, Golém | gulǽma | |
| Case=Nom|Definite=Ind|Number=Plur | Golǽmy, gulǽmy |
PROPN
311 PROPN tokens (62% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (267; 86%), Definite=Ind (244; 78%), Case=Nom (172; 55%).
PROPN tokens may have the following values of Gender:
Fem(109; 35% of non-emptyGender): Aminǽ, Srǽdo, Sóboto, Ǧemilǽ, Kavála, Dráma, Hilmijá, Mára, Máro, GalínkaMasc(140; 45% of non-emptyGender): Alí, Ají, Isén, Asíp, Panedélnik, Jerím, Tórnik, Nasradín, Orhán, AzraílNeut(62; 20% of non-emptyGender): Kélčeno, Kélčetune, Nedéle, Pašavík, Jasǿren, Mustáfčevo, Basájkovo, Bunár, Démirǧik, GøkčéEMPTY(194): Ksánti, Elláda, Ǧumágün, Aleksandrúpoli, Néa, Siría, Komotiní, Vulgaría, Évro, Dimokratía
| Paradigm Nedéle | Fem | Neut |
|---|---|---|
| Number=Sing | Nedéle | Nedéle |
| Number=Plur | Nedéleta |
Gender seems to be lexical feature of PROPN. 96% lemmas (114) occur only with one value of Gender.
NUM
189 NUM tokens (35% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (168; 89%), Definite=Ind (136; 72%), Deixis=EMPTY (136; 72%), Animacy=EMPTY (132; 70%), Case=Acc (122; 65%), Number=Sing (114; 60%).
NUM tokens may have the following values of Gender:
Fem(35; 19% of non-emptyGender): annó, ennó, anná, anníčka, ennáMasc(118; 62% of non-emptyGender): annók, dvamínana, dva, dvamína, dvomínana, trimínana, ennók, trimína, adínyjen, dvánaNeut(36; 19% of non-emptyGender): annó, annómune, annóto, drúgono, ennóto, jennóEMPTY(351): tri, dve, jedí, dvéne, kyrk, 6, 10, tríne, 5, 8
| Paradigm adín | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur|NumType=Card | annóga | ||
| Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing|NumType=Card | annógate | annóto, ennóto | |
| Case=Acc|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | annókne | ||
| Case=Acc|Definite=Ind|Number=Sing | annók | annó, ennó | annó |
| Case=Acc|Definite=Ind|Number=Sing|NumType=Card | annók, ennók, jedín | annó | annó, jennó |
| Case=Gen|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | annómune | ||
| Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | adínyjen | ||
| Case=Nom|Definite=Def|Deixis=Remt|Number=Sing|NumType=Card | adínyjen | ||
| Case=Nom|Definite=Ind|Degree=Dim|Number=Sing|NumType=Card | anníčka | ||
| Case=Nom|Definite=Ind|Number=Sing | adín | annó | |
| Case=Nom|Definite=Ind|Number=Sing|NumType=Card | adín, jedín | anná, enná | annó |
AUX
103 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Perf (103; 100%), Mood=EMPTY (103; 100%), Person=EMPTY (103; 100%), Tense=Past (103; 100%), VerbForm=Part (103; 100%), Voice=Act (103; 100%), Number=Sing (86; 83%).
AUX tokens may have the following values of Gender:
Fem(27; 26% of non-emptyGender): búla, býla, býly, búly, bíla, bíluMasc(46; 45% of non-emptyGender): bul, byl, búli, býli, bil, bíliNeut(30; 29% of non-emptyGender): búlo, býlo, bíloEMPTY(4074): je, da, so, še, li, si, som, sa, ša, jo
| Paradigm býdom | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Hum|Number=Plur | búli, býli, bíli | ||
| Number=Sing | bul, byl, bil | búla, býla, bíla, bílu | búlo, býlo |
| Number=Plur | býly, búly |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (703; 82%),
VERB –[conj]–> VERB (585; 64%),
VERB –[nsubj]–> NOUN (579; 57%),
NOUN –[amod]–> ADJ (487; 82%),
NOUN –[amod]–> VERB (52; 84%),
VERB –[nsubj]–> ADJ (35; 56%),
ADJ –[conj]–> ADJ (30; 100%),
ADJ –[det]–> DET (19; 61%),
PROPN –[flat]–> PROPN (17; 89%),
ADJ –[nsubj]–> NOUN (15; 65%).