Treebank Statistics: UD_Macedonian-MTB: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
339 tokens (25%) have a non-empty value of Gender.
234 types (43%) occur at least once with a non-empty value of Gender.
210 lemmas (44%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (177; 13% instances), PRON (68; 5% instances), ADJ (35; 3% instances), PROPN (26; 2% instances), DET (15; 1% instances), VERB (14; 1% instances), NUM (3; 0% instances), AUX (1; 0% instances).
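Per-tag counts of this kind can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: it assumes standard 10-column CoNLL-U input, and the `SAMPLE` sentence and the helper names `parse_feats`, `read_tokens` and `gender_by_upos` are hypothetical, made up for illustration.

```python
from collections import Counter

# Hypothetical CoNLL-U sample (not taken from the treebank); columns are tab-separated.
SAMPLE = """\
1\tТаа\tтаа\tPRON\t_\tGender=Fem|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\tчита\tчита\tVERB\t_\tNumber=Sing|Person=3\t0\troot\t_\t_
3\tнова\tнов\tADJ\t_\tDefinite=Ind|Gender=Fem|Number=Sing\t4\tamod\t_\t_
4\tкнига\tкнига\tNOUN\t_\tDefinite=Ind|Gender=Fem|Number=Sing\t2\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=1|B=2' into {'A': '1', 'B': '2'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def read_tokens(text):
    """Yield (upos, feats) for every regular word line."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[3], parse_feats(cols[5])


def gender_by_upos(text):
    """Map each UPOS tag to (tokens with Gender, all tokens)."""
    total, with_gender = Counter(), Counter()
    for upos, feats in read_tokens(text):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}


if __name__ == "__main__":
    for upos, (n, all_n) in gender_by_upos(SAMPLE).items():
        print(f"{upos}: {n} of {all_n} tokens have Gender")
```

Dividing the first number by the second gives a share per tag, comparable to the "% of all NOUN tokens" figures in the sections below.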
NOUN
177 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (151; 85%), Definite=Ind (112; 63%).
NOUN tokens may have the following values of Gender:
Fem (67; 38% of non-empty Gender): јакна, година, авантури, бронза, книгата, колата, пари, снимка, собата, торта
Masc (76; 43% of non-empty Gender): Натпреварот, крајот, автомобил, дена, испитот, компјутерот, облаците, професорот, син, сладолед
Neut (34; 19% of non-empty Gender): кино, Детето, дете, злато, место, писмо, Луѓето, Сонцето, време, времето
EMPTY (2): Рим, Спа
| Paradigm сонце | Fem | Neut |
|---|---|---|
| | Сонцето | Сонцето |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (124) occur with only one value of Gender.
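The "lexical feature" statement rests on counting how many lemmas are seen with exactly one value of the feature. A minimal sketch of that count, assuming the input is a list of (lemma, value) pairs; the function name `single_value_share` and the sample pairs are hypothetical and not drawn from the treebank counts:

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (lemmas seen with exactly one value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


if __name__ == "__main__":
    # Hypothetical observations: one lemma appears with two different values.
    observed = [
        ("книга", "Fem"),
        ("книга", "Fem"),
        ("дете", "Neut"),
        ("сонце", "Fem"),
        ("сонце", "Neut"),
    ]
    single, total = single_value_share(observed)
    print(f"{single} of {total} lemmas occur with only one value")
```

A share close to 100% suggests the value is fixed by the lemma rather than chosen by inflection.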
PRON
68 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (68; 100%), Number=Sing (67; 99%), Definite=EMPTY (55; 81%), Person=3 (51; 75%), PronType=Prs (51; 75%).
PRON tokens may have the following values of Gender:
Fem (15; 22% of non-empty Gender): ја, Таа, сите, ѝ
Masc (41; 60% of non-empty Gender): го, му, Тој, кој, Неговата, каков, којшто, него, нему, јас
Neut (12; 18% of non-empty Gender): тоа, го, којшто, Што, нешто, ништо
EMPTY (112): се, ми, ме, тие, ти, ги, ние, си, што, Сѐ
| Paradigm го | Masc | Neut |
|---|---|---|
| Definite=Def\|Person=3 | го | |
| Person=3 | го | Го |
| го |
ADJ
35 ADJ tokens (80% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (35; 100%), Degree=Pos (32; 91%), Definite=Ind (21; 60%).
ADJ tokens may have the following values of Gender:
Fem (12; 34% of non-empty Gender): голема, нова, учебната, добра, мала, мила, минатата, првата, убава
Masc (16; 46% of non-empty Gender): утрешниот, вознемирен, главниот, дрзок, зелениот, кинески, минатиот, незадоволен, позабавен, познат
Neut (7; 20% of non-empty Gender): добро, корисно, одлично, прекрасно, светлото, слободно, случено
EMPTY (9): болен, глупави, долги, играни, нови, одбрани, последниве, презадоволни, скапа
| Paradigm добар | Fem | Neut |
|---|---|---|
| | добра | добро |
PROPN
26 PROPN tokens (96% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (26; 100%), Definite=EMPTY (15; 58%).
PROPN tokens may have the following values of Gender:
Fem (8; 31% of non-empty Gender): Мери, Марија, Џејн, Браун, Франција
Masc (17; 65% of non-empty Gender): Петар, Јован, Марко, Бетовен, Вардар, Лудвиг, Париз, Сем, Смит, Тинекс
Neut (1; 4% of non-empty Gender): Игуацу
EMPTY (1): ван
Gender seems to be a lexical feature of PROPN. 100% of lemmas (17) occur with only one value of Gender.
DET
15 DET tokens (75% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (15; 100%), Case=EMPTY (11; 73%), Person=EMPTY (10; 67%), Poss=EMPTY (9; 60%), Definite=EMPTY (8; 53%).
DET tokens may have the following values of Gender:
Fem (4; 27% of non-empty Gender): Оваа, една, некоја, нејзиниот
Masc (6; 40% of non-empty Gender): мојот, каков, неговиот, својот, твојот
Neut (5; 33% of non-empty Gender): она, Ова, такво, тоа
EMPTY (5): моите, Тие, некои
| Paradigm ова | Fem | Neut |
|---|---|---|
| Case=Nom | Оваа | |
| | | Ова |
Gender seems to be a lexical feature of DET. 92% of lemmas (11) occur with only one value of Gender.
VERB
14 VERB tokens (5% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (14; 100%), Mood=EMPTY (13; 93%), Person=EMPTY (12; 86%), VerbForm=Part (12; 86%), Aspect=Perf (10; 71%), Tense=EMPTY (10; 71%).
VERB tokens may have the following values of Gender:
Fem (3; 21% of non-empty Gender): јакна, прочитана
Masc (9; 64% of non-empty Gender): одземен, возбуден, гледал, казнет, можел, напишал, оставил, совладан
Neut (2; 14% of non-empty Gender): испорачано, случено
EMPTY (262): дојде, облеков, студеше, сакам, јави, Мислам, дојдеш, воодушеви, гледав, дојди
Gender seems to be a lexical feature of VERB. 100% of lemmas (12) occur with only one value of Gender.
NUM
3 NUM tokens (43% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Definite=Ind (3; 100%), NumType=Card (3; 100%), Number=Plur (2; 67%).
NUM tokens may have the following values of Gender:
Fem (1; 33% of non-empty Gender): една
Masc (2; 67% of non-empty Gender): два
EMPTY (4): 15, неколку, пет, три
AUX
1 AUX token (2% of all AUX tokens) has a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (1; 100%), Mood=Ind (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=EMPTY (1; 100%), VerbForm=Part (1; 100%), Voice=Act (1; 100%).
AUX tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): бил
EMPTY (61): ќе, е, беше, бев, биде, сте, Бевме, Сум, би, бидат
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (18; 75%),
NOUN –[det]–> DET (10; 63%),
PROPN –[flat]–> PROPN (3; 75%),
VERB –[nsubj:pass]–> NOUN (3; 100%),
ADJ –[conj]–> ADJ (2; 67%),
ADJ –[nsubj]–> NOUN (2; 100%),
ADJ –[nsubj]–> PRON (2; 67%),
ADJ –[nsubj]–> PROPN (2; 100%),
NOUN –[expl]–> PRON (2; 100%),
PROPN –[appos]–> NOUN (2; 100%).
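Agreement counts like those above can be approximated by walking each dependency edge and comparing the Gender of parent and child. This is a minimal sketch under stated assumptions: the input is standard CoNLL-U with integer HEAD values, and the percentage is taken to be the share of agreeing pairs among pairs where both nodes carry Gender (the page does not state its denominator). The sample sentence and the names `sentences` and `gender_agreement` are hypothetical.

```python
from collections import Counter

# Hypothetical CoNLL-U sample (not taken from the treebank); columns are tab-separated.
SAMPLE = """\
1\tТаа\tтаа\tPRON\t_\tGender=Fem|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\tчита\tчита\tVERB\t_\tNumber=Sing|Person=3\t0\troot\t_\t_
3\tнова\tнов\tADJ\t_\tDefinite=Ind|Gender=Fem|Number=Sing\t4\tamod\t_\t_
4\tкнига\tкнига\tNOUN\t_\tDefinite=Ind|Gender=Fem|Number=Sing\t2\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=1|B=2' into {'A': '1', 'B': '2'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def sentences(text):
    """Yield one {id: (upos, feats, head, deprel)} dict per sentence."""
    sent = {}
    for line in text.splitlines():
        if not line.strip():  # a blank line ends the current sentence
            if sent:
                yield sent
                sent = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges and empty nodes
        # Assumption: HEAD is always an integer in the input.
        sent[int(cols[0])] = (cols[3], parse_feats(cols[5]), int(cols[6]), cols[7])
    if sent:
        yield sent


def gender_agreement(text):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, pairs with Gender on both)."""
    agree, both = Counter(), Counter()
    for sent in sentences(text):
        for upos, feats, head, deprel in sent.values():
            if head not in sent:  # the root (head 0) has no parent
                continue
            parent_upos, parent_feats, _, _ = sent[head]
            if "Gender" not in feats or "Gender" not in parent_feats:
                continue
            key = (parent_upos, deprel, upos)
            both[key] += 1
            if feats["Gender"] == parent_feats["Gender"]:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


if __name__ == "__main__":
    for (parent, rel, child), (n, total) in gender_agreement(SAMPLE).items():
        print(f"{parent} -[{rel}]-> {child} ({n}; {100 * n // total}%)")
```

In the sample, only the `amod` edge has Gender on both ends, so it is the only relation reported.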