Treebank Statistics: UD_Old_East_Slavic-Birchbark: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
9985 tokens (37%) have a non-empty value of Gender.
7304 types (64%) occur at least once with a non-empty value of Gender.
3024 lemmas (64%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (4748; 17% instances), PROPN (2528; 9% instances), ADJ (844; 3% instances), VERB (816; 3% instances), DET (596; 2% instances), NUM (286; 1% instances), PRON (145; 1% instances), AUX (22; 0% instances).
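Per-tag counts of this kind can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats`, `gender_by_upos` and `row` are hypothetical, and the sample sentence is a toy with illustrative feature annotations (fields that are not needed are left as `_`).

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Gender."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, total


def row(idx, form, upos, feats):
    """Build one CoNLL-U token line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, "_", upos, "_", feats, "_", "_", "_", "_"])


# Toy sample: the annotations are illustrative, not copied from the treebank.
SAMPLE = "\n".join([
    "# toy sentence",
    row(1, "грамота", "NOUN", "Case=Nom|Gender=Fem|Number=Sing"),
    row(2, "далъ", "VERB", "Gender=Masc|Number=Sing|VerbForm=PartRes"),
    row(3, "даи", "VERB", "Mood=Imp|Number=Sing"),
])

with_gender, total = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the share of tokens of a tag that carry Gender, which is the kind of figure reported at the start of each per-tag section below.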
NOUN
4748 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3381; 71%).
NOUN tokens may have the following values of Gender:
Fem (2203; 46% of non-empty Gender): ржи, кѹно, соли, кѹнъ, грамота, гривьнѣ, гривьнъ, рожи, гривено, гривна
Masc (1941; 41% of non-empty Gender): поклонъ, гн҃е, поклоно, конь, попа, рубль, сн҃а, кони, приказъ, господине
Neut (604; 13% of non-empty Gender): покланѧние, жита, села, серебра, слово, село, челомъ, серебро, целомъ, цоломъ
EMPTY (117): люди, дети, дѣтемъ, дѣтемь, людие, людми, людье, людьми, дитьи, дѣтеи
Gender seems to be a lexical feature of NOUN. 100% of lemmas (959) occur with only one value of Gender.
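The "lexical feature" observation rests on a simple check: for every lemma, collect the set of Gender values it occurs with and see how many lemmas have exactly one. A minimal sketch of that check follows; the function name `share_single_gender` and the placeholder lemmas are hypothetical, and the input is assumed to be (lemma, Gender) pairs already extracted from the treebank.

```python
from collections import defaultdict


def share_single_gender(pairs):
    """Return the share of lemmas that occur with exactly one Gender value.

    pairs: iterable of (lemma, gender) tuples; gender is None or '' when empty.
    Lemmas that never carry a Gender value are ignored.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:
            values[lemma].add(gender)
    if not values:
        return 0.0
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single / len(values)


# Placeholder data, not taken from the treebank.
toy_pairs = [
    ("lemma_a", "Fem"),
    ("lemma_a", "Fem"),
    ("lemma_b", "Masc"),
    ("lemma_c", None),
]
print(share_single_gender(toy_pairs))
```

A result of 1.0 corresponds to the 100% reported above; a lemma that appeared with two different Gender values would pull the share below 1.0.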
PROPN
2528 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2419; 96%), NameType=Giv (2119; 84%).
PROPN tokens may have the following values of Gender:
Fem (258; 10% of non-empty Gender): мариѧ, ѧна, маренѣ, настасиѧ, куролѣ, лаидиколѣ, лугу, марие, мароѳу, милославь
Masc (2197; 87% of non-empty Gender): ивана, петра, бориса, евана, степана, павла, смена, завида, лѹкѣ, михалѧ
Neut (73; 3% of non-empty Gender): городищи, ——скь, —–ина, –руньского, -остер…, [гр]од[и], [сл]а[в]…, бабине, бологожь, болъсинѣ
EMPTY (236): в…, ж…, кшетахъ, ли…, малѧтѣ, м…, ньжѧть, нѣжѧтѣ, са…, стоють
Gender seems to be a lexical feature of PROPN. 100% of lemmas (1261) occur with only one value of Gender.
ADJ
844 ADJ tokens (90% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (699; 83%), Variant=EMPTY (579; 69%), Poss=EMPTY (571; 68%).
ADJ tokens may have the following values of Gender:
Fem (271; 32% of non-empty Gender): ст҃ѣ, бѣлонога, дворнюю, десѧтаꙗ, добре, другую, новаѧ, пуста, пѧте, пѧть
Masc (436; 52% of non-empty Gender): велика, гнѣд, пѧта, ст҃го, бж҃и, ворон, добра, добръ, желѣзен, зелоного
Neut (137; 16% of non-empty Gender): добро, проста, здорово, борзи, борзѣ, годьнъ, лживꙑѧ, (д)анилово, (до)ро[г]а[ѧ], (т)[о]тарьского
EMPTY (96): коневꙑхъ, страднꙑх, (д)митрову, (ко)невꙑхъ, (л)юдними, (с)тар[ѣ]шимъ, (с)траднꙑх, (св)[об]однꙑхо, :ѕ҃:, [б]орови[ц]ки
Paradigm свѧтыи | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | ст҃го | свѧтое | |
Case=Dat|Number=Sing | [ст҃](м)[у], ст҃ому | свѧтее, ст҃ѣ | |
Case=Dat|Number=Sing|Variant=Short | ст҃ѣ | ||
Case=Gen|Number=Sing | ст҃го, (св҃)[т]аго, свгто, свѧ[т]-[го, св҃ѧ[т]о | ст҃ | |
Case=Gen|Number=Sing|Variant=Short | ст҃ѣ | ||
Case=Gen|Number=Plur | ст҃хъ, ст҃ꙑх | ||
Case=Nom|Number=Sing | свѧтꙑ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи | ||
Case=Voc|Number=Sing | (свѧ) |
VERB
816 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (805; 99%), Mood=EMPTY (789; 97%), Voice=Act (738; 90%), Tense=Past (723; 89%), Number=Sing (668; 82%), VerbForm=PartRes (640; 78%).
VERB tokens may have the following values of Gender:
Fem (83; 10% of non-empty Gender): дала, задѣла, пеюци, рекла, ѧла, (-)шьла, (се)[д]ѧци, —–ила, [бꙑл]а, ѿана
Masc (668; 82% of non-empty Gender): далъ, възѧле, дале, послале, въдале, взѧле, велѣлъ, взѧлъ, взѧлѣ, възѧлъ
Neut (65; 8% of non-empty Gender): шло, бꙑло, пошло, (шло, диꙗлось, куплено, погибло, (пог)[ꙑб]ло, (соро)слосѧ, [п]окладено
EMPTY (1773): даи, возми, възьми, пришли, посли, присъли, далъ, иди, кланѧюсѧ, купи
Paradigm взѧти | Masc | Fem | Neut |
---|---|---|---|
Analyt=Yes|Number=Sing|Person=2|Tense=Past|VerbForm=PartRes|Voice=Act | взѧла | ||
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Act | вь:зѧ | ||
Analyt=Yes|Number=Sing|Tense=Pqp|VerbForm=PartRes|Voice=Act | возѧлъ | ||
Analyt=Yes|Number=Dual|Tense=Past|VerbForm=PartRes|Voice=Act | [в]ъзѧла | ||
Case=Nom|Number=Sing|Person=3|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Act | въземо | ||
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Act | взьмъ, взѧв·ъ, воземо, възъмъ | възьмъши | |
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass | взѧтъ | возѧто, возѧтъ, възѧто | |
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Act | въз | ||
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Act | възѧле, взѧле, взѧлъ, възѧлъ, взѧ | ||
Number=Plur|Tense=Past|VerbForm=PartRes|Voice=Act | взѧлѣ, взѧли, взѧл[и], взѧл, ѹзѧлѣ |
DET
596 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (536; 90%).
DET tokens may have the following values of Gender:
Fem (132; 22% of non-empty Gender): моѧ, свою, мою, ту, моѥи, сама, своеи, твоѧ, моѥӏ, своѥи
Masc (282; 47% of non-empty Gender): своѥму, мои, свои, твои, того, моѥго, моѥму, саме, самь, сего
Neut (182; 31% of non-empty Gender): то, томъ, того, томо, все, моего, томь, [т]о, вохо, мое
EMPTY (126): же, никомѹ, всихъ, всѣхъ, моими, мъихъ, нашихо, никому, ницимъ, своими
Paradigm тотъ | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | того, тото | ту, тꙋ, [т]ѹ | то, [т]о, (т)о, т(о, т[о] |
Case=Acc|Number=Dual | тѣ | ||
Case=Acc|Number=Plur | тꙑ, тꙑи, тꙑхъ | тои, тѣ, тꙑи | |
Case=Dat|Number=Sing | томѹ, тому, томуо, тъмѹ | тои, то:е | тому, то |
Case=Gen|Number=Sing | того | тоѣ | того, тога, (т)[о]го, [тог]о, то[го] |
Case=Ins|Number=Sing | т[ꙑ](мъ), тиме | ||
Case=Loc|Number=Sing | томо, томъ, томь | томъ, томо, томь, (т)омо, (то) | |
Case=Nom | та | ||
Case=Nom|Number=Sing | тъ, те, то, тъто | то, [т]о, [то | |
Case=Nom|Number=Plur | те, ти, тѣ | тѣ |
NUM
286 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (283; 99%), Case=Nom (174; 61%), Number=Sing (158; 55%), NumType=EMPTY (146; 51%).
NUM tokens may have the following values of Gender:
Fem (97; 34% of non-empty Gender): три, двь, дви, дове, двѣ, дъвѣ, две, дови, полутори, в҃е
Masc (181; 63% of non-empty Gender): поло, полъ, пло, два, дова, полѹ, пол, три, (п)олъ, по
Neut (8; 3% of non-empty Gender): три, д[ъ]ва, два, дова, дъва, одиномо, сотъ
EMPTY (975): ·в҃·, ·г҃·, :в҃:, :в:, ·г·, в҃, ·ӏ҃·, три, г҃, :г҃:
Paradigm два | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | два, дова, дъв(а) | дви, двь, двѣ, дъвь, дъвѣ, (д)[в]е, дове, дъ | д[ъ]ва, два, дова |
Case=Nom | два, дова | двь, дви, дове, две, дови, двѣ, довѣ, дъвѣ, д | дъва |
PRON
145 PRON tokens (11% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (145; 100%), PronType=EMPTY (143; 99%), Number=Sing (139; 96%), Person=EMPTY (138; 95%).
PRON tokens may have the following values of Gender:
Fem (20; 14% of non-empty Gender): ю, ее, …ю, е и, е]ѥ, еи, еѣ, єи, ѥи, ѥӏ
Masc (106; 73% of non-empty Gender): ѥго, его, емѹ, него, емꙋ, ѥму, нь, и, немь, нимо
Neut (19; 13% of non-empty Gender): то, е, томъ, что, [ц]ьто, [ѥ]же, его, не, ничимо, томо
EMPTY (1120): ми, сѧ, тꙑ, ти, ѧ, мене, тобѣ, мнѣ, ѧзъ, мѧ
Paradigm и | Masc | Fem | Neut |
---|---|---|---|
Case=Acc,Gen|Number=Sing | ѥго | ||
Case=Acc|Number=Sing | ѥго, его, нь, и, [ѥг]о, ·и·, н(его), не, него, ї, ѥг | ю, … | е, его, не |
Case=Acc|Number=Dual | нѧ | ||
Case=Acc|Number=Plur | ихъ, ӏ, ӏх | ||
Case=Dat|Number=Sing | емѹ, емꙋ, ѥму, (-)емꙋ, емо, ему, емѫ, ному, ньмуо, ѥм | е | |
Case=Gen | ѥго | ||
Case=Gen|Number=Sing | ѥго, его, него, ного, єго, ево, егъ, не | е]ѥ, ее, еи, еѣ, єи, ѥи, ѥӏ | |
Case=Ins|Number=Sing | нимо, нимь, имо, нимъ | ||
Case=Ins|Number=Plur | ним[и | ||
Case=Loc|Number=Sing | немь, не |
AUX
22 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (22; 100%), Tense=Past (22; 100%), Voice=Act (22; 100%), Number=Sing (21; 95%), VerbForm=PartRes (21; 95%), Analyt=EMPTY (12; 55%).
AUX tokens may have the following values of Gender:
Fem (2; 9% of non-empty Gender): бꙑла
Masc (14; 64% of non-empty Gender): бꙑлъ, бꙑле, бꙑло, б]ꙑ[лъ], былъ, бꙑли, бꙑл
Neut (6; 27% of non-empty Gender): бꙑло, (бꙑ)ло, б[ꙑ] ло
EMPTY (353): еси, ѥси, есмь, бꙑ, ѥсмь, есте, есть, есемо, ѥсме, есме
Paradigm быти | Masc | Fem | Neut |
---|---|---|---|
Analyt=Yes|Number=Sing|VerbForm=Fin | былъ | ||
Analyt=Yes|Number=Sing|VerbForm=PartRes | бꙑлъ, бꙑле, бꙑло | бꙑла | бꙑло |
Analyt=Yes|Number=Plur|VerbForm=PartRes | бꙑли | ||
Number=Sing|VerbForm=PartRes | бꙑлъ, б]ꙑ[лъ], бꙑле, бꙑло, бꙑл | бꙑла | бꙑло, (бꙑ) |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PROPN –[conj]–> PROPN (377; 71%),
NOUN –[amod]–> ADJ (371; 88%),
NOUN –[conj]–> NOUN (294; 55%),
NOUN –[det]–> DET (287; 84%),
NOUN –[appos]–> PROPN (118; 94%),
PROPN –[orphan]–> PROPN (110; 81%),
VERB –[conj]–> VERB (95; 61%),
VERB –[nsubj]–> PROPN (92; 55%),
PROPN –[flat:name]–> ADJ (84; 95%),
PROPN –[flat:name]–> PROPN (79; 90%).
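Agreement counts like those in the list above can be obtained by walking each dependency tree and comparing the Gender of every child with that of its parent. The sketch below is one possible reading, not the page's own method: it assumes that the percentage is taken over pairs in which both nodes have a non-empty Gender, which this page does not state. The function name `gender_agreement` and the token dictionaries are hypothetical.

```python
from collections import Counter


def gender_agreement(sentence):
    """Count parent-child pairs per (parent UPOS, deprel, child UPOS).

    sentence: list of dicts with keys id, head, upos, deprel, gender
    (gender is None when the token has no Gender value).
    Returns (agree, both): pairs that agree in Gender, and all pairs in which
    both nodes have a Gender value (the assumed denominator).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent token
        if parent["gender"] is None or child["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Toy tree, not taken from the treebank: a noun with two dependents.
toy_sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 1, "upos": "DET", "deprel": "det", "gender": "Masc"},
]

agree, both = gender_agreement(toy_sentence)
for key in sorted(both):
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```

Summing the two counters over all sentences and dividing `agree[key]` by `both[key]` would yield a percentage per relation under the stated assumption.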