Treebank Statistics: UD_Pomak-Philotis: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
29075 tokens (34%) have a non-empty value of Gender
.
7842 types (72%) occur at least once with a non-empty value of Gender
.
2695 lemmas (68%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (12751; 15% instances), VERB (6246; 7% instances), PRON (3580; 4% instances), DET (2897; 3% instances), ADJ (2216; 3% instances), PROPN (734; 1% instances), NUM (438; 1% instances), AUX (213; 0% instances).
NOUN
12751 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (10025; 79%), Case=Acc (8056; 63%), Deixis=EMPTY (6884; 54%), Definite=Ind (6883; 54%).
NOUN
tokens may have the following values of Gender
:
Fem
(5035; 39% of non-emptyGender
): godíny, májka, kóštono, rábato, vódo, rábaty, žanána, rábata, momána, parýMasc
(5548; 44% of non-emptyGender
): déne, čulǽkon, čulǽk, čulǽka, bubájko, vakýt, pláden, bubájka, dǽdo, bratNeut
(2168; 17% of non-emptyGender
): kópeløno, vréme, mómičeno, mǽsto, mómiče, sélo, sélono, vratána, píle, magárenoEMPTY
(108): gün, keré, korf, dumá, kerét, DEIno, sredénošt, senǽ, i.d., gündé
Paradigm žaná | Fem | Neut |
---|---|---|
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | žanóso | |
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | žanóto | |
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing | žanóno, ženóno, žónana | |
Case=Acc|Definite=Ind|Number=Sing | žóno | žóno |
Case=Acc|Definite=Ind|Number=Plur | žóny | |
Case=Gen|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | žanójse | |
Case=Gen|Definite=Def|Deixis=Remt|Number=Sing | žanójne, žónajne | |
Case=Gen|Definite=Ind|Number=Sing | žónoj | |
Case=Gen|Definite=Ind|Number=Plur | žónom | |
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | ženása | |
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | žanána, ženána | |
Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | žónyne | |
Case=Nom|Definite=Ind|Number=Sing | žaná, žéna | |
Case=Nom|Definite=Ind|Number=Plur | žóny | |
Case=Voc|Definite=Ind|Number=Sing | žóno |
Gender
seems to be lexical feature of NOUN
. 97% lemmas (1176) occur only with one value of Gender
.
VERB
6246 VERB tokens (42% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (6246; 100%), Person=EMPTY (6246; 100%), VerbForm=Part (6245; 100%), Tense=Past (5775; 92%), Voice=Act (5775; 92%), Aspect=Perf (4961; 79%), Number=Sing (4869; 78%).
VERB
tokens may have the following values of Gender
:
Fem
(1591; 25% of non-emptyGender
): reklála, zǿla, atišlála, vídela, stánala, reklá, papýtala, dála, tórnala, ískalaMasc
(3619; 58% of non-emptyGender
): reklól, zøl, atišlól, imǽl, papýtal, vídel, dal, advórnal, tórnal, stánalNeut
(1036; 17% of non-emptyGender
): imǽlo, stánalo, skrýto, reklólo, zǿlo, atišlólo, dašlólo, advórnalo, tórnalo, láhaloEMPTY
(8764): víka, íma, trǽbava, móža, hódi, právi, fáti, íde, stánava, íšte
Paradigm réčem | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Number=Plur | reklíli, reklí, raklí | ||
Animacy=Nhum|Number=Plur | reklýly | ||
Number=Sing | reklól, rekól, reklóla | reklála, reklá, rékla | reklólo, rekló |
Number=Plur | reklýly | reklý, reklýly |
PRON
3580 PRON tokens (41% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (3580; 100%), Person=3 (3576; 100%), PronType=Prs (3575; 100%), Number=Sing (3378; 94%).
PRON
tokens may have the following values of Gender
:
Fem
(1106; 31% of non-emptyGender
): jé, tja, ji, jí, týje, hi, jo, to, te, jaMasc
(1843; 51% of non-emptyGender
): go, mú, toj, mu, tóga, tíje, tæh, mo, to, tómuNeut
(631; 18% of non-emptyGender
): go, to, mu, mú, gu, mo, tómu, móEMPTY
(5190): só, sí, mí, gi, kaná, ja, tí, ty, mó, mi
Paradigm ja | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Number=Plur|PronType=Prs | tæh | ||
Animacy=Hum|Case=Nom|Number=Plur|PronType=Prs | tíje | ||
Animacy=Nhum|Case=Acc|Number=Plur|PronType=Prs | to | ||
Animacy=Nhum|Case=Nom|Number=Plur|PronType=Prs | to | ||
Case=Acc|Number=Sing | jé | ||
Case=Acc|Number=Sing|PronType=Prs | go, tóga, gu, tógu, néga | jé, týje, jo, ja, néje, týjo | go, to, gu |
Case=Acc|Number=Plur|PronType=Prs | to | to | to |
Case=Gen|Number=Sing|PronType=Prs | mú, mo, tómu | jí, hi, je, tój | mú, mo, tómu, mó |
Case=Gen|Number=Plur|PronType=Prs | Tæm | ||
Case=Nom|Number=Sing|PronType=Prs | toj | tja, te, tje, tæ | to |
Case=Nom|Number=Plur|PronType=Prs | to | to |
DET
2897 DET tokens (85% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: DeixisRef=EMPTY (2540; 88%), Number=Sing (2399; 83%), Case=Acc (1667; 58%), Deixis=EMPTY (1628; 56%).
DET
tokens may have the following values of Gender
:
Fem
(813; 28% of non-emptyGender
): annó, anná, ennó, žýne, žána, isózi, kakvó, drúgono, drúgy, isáziMasc
(1538; 53% of non-emptyGender
): annók, adín, kutrí, žýjen, vrítsi, žíne, žókne, ennók, badín, edínNeut
(546; 19% of non-emptyGender
): annó, inazí, drúgo, ennó, isazí, kakvó, itazí, žóno, žýne, drúgonoEMPTY
(526): bir, vrit, nǽko, kólko, inélkus, her, kač, sǽko, kólkono, bu
Paradigm adín | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Definite=Ind|Number=Sing | annóga | ||
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plur | annǽh | ||
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur | anní | ||
Animacy=Nhum|Case=Acc|Definite=Ind|Number=Plur | anný | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | Ennósa | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | ennókte | annóto | annóto |
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing | annókne, annóganek | annóno, ennóna, jennóna | |
Case=Acc|Definite=Ind|Degree=Dim|Number=Sing | anníčko | ||
Case=Acc|Definite=Ind|Number=Sing | annók, ennók, edín, jedín, adín | annó, ennó, enná, jennó | annó, ennó, jennó |
Case=Acc|Definite=Ind|Number=Plur | anný | ||
Case=Gen|Definite=Def|Deixis=Remt|Number=Sing | annómune | ||
Case=Gen|Definite=Ind|Number=Sing | annómu | annój | annómu |
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | adínyjet | ||
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | adínyjen, edínijon, adínajen, adínen | annána, jennóna | annóno |
Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | annýne | ||
Case=Nom|Definite=Ind|Number=Sing | adín, edín, jedín | anná, enná, ennó, jenná, jennó | annó, ennó |
Case=Nom|Definite=Ind|Number=Plur | anný |
ADJ
2216 ADJ tokens (85% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1712; 77%), Definite=Ind (1297; 59%), Deixis=EMPTY (1297; 59%), Case=Acc (1144; 52%).
ADJ
tokens may have the following values of Gender
:
Fem
(785; 35% of non-emptyGender
): starána, cǽlo, gulǽma, gulǽmo, górnono, Pomácka, hubavá, čárckono, altóneny, húbavoMasc
(984; 44% of non-emptyGender
): stáryjen, cǽla, mládyjen, gulǽma, míčkyjen, gulǽmyjen, húbava, móske, čárckyjen, gulǽmNeut
(447; 20% of non-emptyGender
): Pomácko, kámatno, parátiko, altónenono, húbavo, míčko, lóšo, cǽlo, právo, gulǽmoEMPTY
(396): mlógo, razý, málko, ájni, bajá, mlógu, mífko, nétekin, üčünǧǘno, ájnisi
Paradigm star | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plur | stárehne, stárene | ||
Animacy=Hum|Case=Acc|Definite=Ind|Number=Plur | stáreh | ||
Animacy=Hum|Case=Gen|Definite=Def|Deixis=Remt|Number=Plur | stáremne | ||
Animacy=Hum|Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plur | Stárite | ||
Animacy=Hum|Case=Nom|Definite=Def|Deixis=Remt|Number=Plur | stárine | ||
Animacy=Hum|Case=Nom|Definite=Ind|Number=Plur | stári | ||
Animacy=Nhum|Case=Acc|Definite=Def|Deixis=Remt|Number=Plur | stáryne | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1|Number=Sing | Stároso | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | stárokte | stároto | stároto |
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Plur | stáryte | ||
Case=Acc|Definite=Def|Deixis=Remt|Number=Sing | stárokne, stárane | stárono | |
Case=Acc|Definite=Def|Deixis=Remt|Number=Plur | stáryne | ||
Case=Acc|Definite=Ind|Number=Sing | stára, stárok | stáro, stára | |
Case=Acc|Definite=Ind|Number=Plur | stáry | ||
Case=Gen|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | stárumute | ||
Case=Gen|Definite=Def|Deixis=Remt|Number=Sing | stárumune, stáromune | stárojne | |
Case=Gen|Definite=Ind|Number=Sing | stáru | stároj | |
Case=Nom|Definite=Def|Deixis=Prox|DeixisRef=2|Number=Sing | stáryjet | staráta | |
Case=Nom|Definite=Def|Deixis=Remt|Number=Sing | stáryjen, stárijon, stáryen | starána | |
Case=Nom|Definite=Ind|Number=Sing | star | stará | |
Case=Nom|Definite=Ind|Number=Plur | stáry | ||
Case=Voc|Definite=Ind|Degree=Dim|Number=Sing | stárku |
PROPN
734 PROPN tokens (67% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (624; 85%), Definite=Ind (554; 75%), Case=Nom (428; 58%).
PROPN
tokens may have the following values of Gender
:
Fem
(244; 33% of non-emptyGender
): Aminǽ, Galínka, Ǧemilǽ, Hilmijá, Kavála, Mára, Srǽdo, Jurké, Melihá, MároMasc
(370; 50% of non-emptyGender
): Isén, Alí, Nasradín, Panedélnik, Ají, Asán, Jerím, Orhán, Asíp, EnésNeut
(120; 16% of non-emptyGender
): Kélčeno, Nedéle, Kélčetune, Iskéče, Pašavík, Basájkovo, Lýǧeno, Siníkovo, Bratánkovo, BunárEMPTY
(364): Ksánti, Elláda, Ǧumágün, Komotiní, Siría, Aleksandrúpoli, Vulgaría, Rodópi, Évro, Néa
Paradigm Hóǧe | Fem | Neut |
---|---|---|
Case=Acc|Definite=Ind | Hóǧa | |
Case=Nom|Definite=Def|Deixis=Remt | Hóǧana | Hóǧena |
Case=Nom|Definite=Ind | Hóǧe, HÓǦE, Hóǧa | |
Case=Voc|Definite=Ind | Hóǧa |
Gender
seems to be lexical feature of PROPN
. 99% lemmas (137) occur only with one value of Gender
.
NUM
438 NUM tokens (37% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (438; 100%), Definite=Ind (318; 73%), Deixis=EMPTY (318; 73%), Animacy=EMPTY (288; 66%), Case=Acc (258; 59%), Number=Sing (237; 54%).
NUM
tokens may have the following values of Gender
:
Fem
(81; 18% of non-emptyGender
): annó, anná, ennó, enná, annána, anníčka, annój, annóno, annósoMasc
(306; 70% of non-emptyGender
): annók, dva, dvamínana, dvamína, dvomínana, trimínana, dvomína, ennók, trimína, adínNeut
(51; 12% of non-emptyGender
): annó, annóno, annóto, dvémne, annómune, ennó, ennóto, jennóEMPTY
(738): tri, dve, dvéne, kyrk, beš, on, tríne, 6, jedí, 5
Paradigm adín | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=1 | annóso | ||
Case=Acc|Definite=Def|Deixis=Prox|DeixisRef=2 | annógate | annóto, ennóto | |
Case=Acc|Definite=Def|Deixis=Remt | annókne | annóno | annóno |
Case=Acc|Definite=Ind | annók, ennók, jennók, jedín | annó, ennó | annó, ennó, jennó |
Case=Gen|Definite=Def|Deixis=Remt | annómune | ||
Case=Gen|Definite=Ind | annój | ||
Case=Nom|Definite=Def|Deixis=Remt | adínyjen, edíņon | annána | annóno |
Case=Nom|Definite=Ind|Degree=Dim | anníček | anníčka | |
Case=Nom|Definite=Ind | adín, jedín | anná, enná | annó |
AUX
213 AUX tokens (2% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Aspect=Perf (213; 100%), Mood=EMPTY (213; 100%), Person=EMPTY (213; 100%), Tense=Past (213; 100%), VerbForm=Part (213; 100%), Voice=Act (213; 100%), Number=Sing (179; 84%).
AUX
tokens may have the following values of Gender
:
Fem
(61; 29% of non-emptyGender
): býla, búla, býly, bíla, búly, bylá, buláMasc
(95; 45% of non-emptyGender
): bul, byl, bil, búli, býli, bíli, búlyNeut
(57; 27% of non-emptyGender
): búlo, býlo, býly, búly, buló, bílo, bíluEMPTY
(10225): je, da, so, še, si, som, sa, ša, jo, sme
Paradigm býdom | Masc | Fem | Neut |
---|---|---|---|
Animacy=Hum|Number=Plur | búli, býli, bíli | ||
Animacy=Nhum|Number=Plur | búly | ||
Number=Sing | bul, byl, bil | býla, búla, bíla, bylá, bulá | búlo, býlo, buló, bílo, bílu |
Number=Plur | býly, búly | býly, búly |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (1797; 82%),
VERB –[conj]–> VERB (1522; 61%),
NOUN –[amod]–> ADJ (1400; 88%),
VERB –[nsubj]–> NOUN (1260; 53%),
VERB –[nsubj]–> PRON (270; 52%),
NOUN –[amod]–> VERB (143; 85%),
VERB –[nsubj]–> ADJ (86; 52%),
ADJ –[det]–> DET (62; 76%),
ADJ –[conj]–> ADJ (55; 90%),
PROPN –[nmod]–> PROPN (48; 71%).