
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

9954 tokens (37%) have a non-empty value of Gender. 7282 types (64%) occur at least once with a non-empty value of Gender. 3011 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (4736; 17% instances), PROPN (2517; 9% instances), ADJ (842; 3% instances), VERB (812; 3% instances), DET (596; 2% instances), NUM (286; 1% instances), PRON (143; 1% instances), AUX (22; 0% instances).
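Counts of this kind can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the function names and toy sample rows are hypothetical.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_coverage(conllu_text, feature="Gender"):
    """Count tokens carrying `feature`, overall and per UPOS tag."""
    total = 0
    with_feature = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only ordinary word lines: integer IDs exclude
        # multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if feature in parse_feats(cols[5]):
            with_feature += 1
            by_upos[cols[3]] += 1
    return with_feature, total, by_upos

# Hypothetical toy rows (placeholder forms, not taken from the treebank).
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "a", "a", "NOUN", "_", "Case=Nom|Gender=Masc|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "b", "b", "VERB", "_", "Mood=Ind|Number=Sing", "0", "root", "_", "_"],
    ["3", "c", "c", "ADJ", "_", "Gender=Fem|Number=Sing", "1", "amod", "_", "_"],
    ["4", "d", "d", "ADP", "_", "_", "1", "case", "_", "_"],
])

with_gender, total, by_upos = feature_coverage(SAMPLE)
print(with_gender, total, dict(by_upos))
```

Dividing `with_gender` by `total` gives the overall share of tokens with the feature. Counting types and lemmas would work the same way, using sets of forms and lemmas instead of token counters.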

NOUN

4736 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3371; 71%).

NOUN tokens may have the following values of Gender:

Gender seems to be a lexical feature of NOUN: 100% of lemmas (954) occur with only one value of Gender.
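The lexical-feature observation amounts to checking, for each lemma of a given part of speech, how many distinct Gender values it occurs with. A minimal sketch of that check follows, again assuming standard CoNLL-U columns (LEMMA in column 3); the function names and sample rows are hypothetical, and the page's own criterion may differ in detail.

```python
from collections import defaultdict

def lemma_gender_values(conllu_text, upos):
    """Map each lemma with the given UPOS to the set of Gender values seen."""
    values = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword tokens and empty nodes.
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            if pair.startswith("Gender="):
                values[cols[2]].add(pair.split("=", 1)[1])
    return values

def single_gender_share(values):
    """Return (lemmas with exactly one Gender value, all lemmas with Gender)."""
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical toy rows (placeholder forms and lemmas).
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "x1", "x", "NOUN", "_", "Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["2", "x2", "x", "NOUN", "_", "Gender=Masc|Number=Plur", "1", "conj", "_", "_"],
    ["3", "y1", "y", "NOUN", "_", "Gender=Fem|Number=Sing", "1", "conj", "_", "_"],
    ["4", "z1", "z", "ADJ", "_", "Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
    ["5", "z2", "z", "ADJ", "_", "Gender=Fem|Number=Sing", "3", "amod", "_", "_"],
])

noun_share = single_gender_share(lemma_gender_values(SAMPLE, "NOUN"))
adj_share = single_gender_share(lemma_gender_values(SAMPLE, "ADJ"))
print(noun_share, adj_share)
```

In the toy data, both noun lemmas keep a single Gender value while the adjective lemma varies, which is the contrast between a lexical and an inflectional use of the feature.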

PROPN

2517 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2408; 96%), NameType=Giv (2108; 84%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN: 100% of lemmas (1256) occur with only one value of Gender.

ADJ

842 ADJ tokens (90% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (698; 83%), Variant=EMPTY (577; 69%), Poss=EMPTY (569; 68%).

ADJ tokens may have the following values of Gender:

Paradigm свѧтыи: Masc, Fem, Neut
Case=Acc|Number=Singст҃госвѧтое
Case=Dat|Number=Sing[ст҃](м)[у], ст҃омусвѧтее, ст҃ѣ
Case=Dat|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Number=Singст҃го, (св҃)[т]аго, свгто, свѧ[т]-[го, св҃ѧ[т]о(го, сгто, ст҃ого, ст[о]гост҃ѣ, ст҃ее, ст҃ье, ст҃ѣ
Case=Gen|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Number=Plurст҃хъ, ст҃ꙑх
Case=Nom|Number=Singсвѧтꙑ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи
Case=Voc|Number=Sing(свѧ)тꙑе

VERB

812 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (801; 99%), Mood=EMPTY (785; 97%), Voice=Act (734; 90%), Tense=Past (719; 89%), Number=Sing (664; 82%), VerbForm=PartRes (636; 78%).

VERB tokens may have the following values of Gender:

Paradigm взѧти: Masc, Fem, Neut
Analyt=Yes|Number=Sing|Person=2|Tense=Past|VerbForm=PartRes|Voice=Actвзѧла
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвь:зѧлъ
Analyt=Yes|Number=Sing|Tense=Pqp|VerbForm=PartRes|Voice=Actвозѧлъ
Analyt=Yes|Number=Dual|Tense=Past|VerbForm=PartRes|Voice=Act[в]ъзѧла
Case=Nom|Number=Sing|Person=3|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Actвъземо
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Actвзьмъ, взѧв·ъ, воземо, възъмъвъзьмъши
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passвзѧтъвозѧто, возѧтъ, възѧто
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actвъзмѧ, възьмѧ
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвъзѧле, взѧле, взѧлъ, възѧлъ, взѧлъ, возѧле, возѧло, (в)зѧло, [въз]ѧ[ле, взѧ, взѧл, взѧло, возѧль, всѧло, въ(зѧ)ль, възалъ, възьль, възѧл[е, възѧль, възѧль], ѹвзѧлъ, ѹзѧле, ꙋзѧле
Number=Plur|Tense=Past|VerbForm=PartRes|Voice=Actвзѧлѣ, взѧли, взѧл[и], взѧл, ѹзѧлѣ

DET

596 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (536; 90%).

DET tokens may have the following values of Gender:

Paradigm тотъ: Masc, Fem, Neut
Case=Acc|Number=Singтого, тототу, тꙋ, [т]ѹто, [т]о, (т)о, т(о, т[о]
Case=Acc|Number=Dualтѣ
Case=Acc|Number=Plurтꙑ, тꙑи, тꙑхътои, тѣ, тꙑи
Case=Dat|Number=Singтомѹ, тому, томуо, тъмѹтои, то:етому, томѹ
Case=Gen|Number=Singтоготоѣтого, тога, (т)[о]го, [тог]о, то[го]
Case=Ins|Number=Singт[ꙑ](мъ), тиме
Case=Loc|Number=Singтомо, томъ, томьтомъ, томо, томь, (т)омо, (то)мъ, [т]омь, томо, том
Case=Nomта
Case=Nom|Number=Singтъ, те, то, тътото, [т]о, [то
Case=Nom|Number=Plurте, ти, тѣтѣ

NUM

286 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (283; 99%), Case=Nom (174; 61%), Number=Sing (158; 55%), NumType=EMPTY (146; 51%).

NUM tokens may have the following values of Gender:

Paradigm два: Masc, Fem, Neut
Case=Accдва, дова, дъв(а)дви, двь, двѣ, дъвь, дъвѣ, (д)[в]е, дове, дъвѣд[ъ]ва, два, дова
Case=Nomдва, довадвь, дви, дове, две, дови, двѣ, довѣ, дъвѣ, дви, д[ъв]…, д{в}овѣ, девѣ, дъвѣ, дъв[ѣ]дъва

PRON

143 PRON tokens (11% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (143; 100%), PronType=EMPTY (142; 99%), Number=Sing (137; 96%), Person=EMPTY (136; 95%).

PRON tokens may have the following values of Gender:

Paradigm и: Masc, Fem, Neut
Case=Acc,Gen|Number=Singѥго
Case=Acc|Number=Singѥго, его, нь, и, [ѥг]о, ·и·, н(его), не, него, ї, ѥгю, …ю, ѫе, его, не
Case=Acc|Number=Dualнѧ
Case=Acc|Number=Plurихъ, ӏ, ӏхъ
Case=Dat|Number=Singемѹ, емꙋ, ѥму, (-)емꙋ, емо, ему, емѫ, ному, ньмуо, ѥмеи, ее
Case=Genѥго
Case=Gen|Number=Singѥго, его, него, ного, єго, ево, егъ, него, ньго, єво, ѥго, ѥг, ѥгоое]ѥ, ее, еи, еѣ, єи, ѥи, ѥӏ
Case=Ins|Number=Singнимо, нимь, имо, нимъ
Case=Ins|Number=Plurним[и
Case=Loc|Number=Singнемь, немь‐, немо, немъ, нь[мо], нѣмѣ

AUX

22 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (22; 100%), Tense=Past (22; 100%), Voice=Act (22; 100%), Number=Sing (21; 95%), VerbForm=PartRes (21; 95%), Analyt=EMPTY (12; 55%).

AUX tokens may have the following values of Gender:

Paradigm быти: Masc, Fem, Neut
Analyt=Yes|Number=Sing|VerbForm=Finбылъ
Analyt=Yes|Number=Sing|VerbForm=PartResбꙑлъ, бꙑле, бꙑлобꙑлабꙑло
Analyt=Yes|Number=Plur|VerbForm=PartResбꙑли
Number=Sing|VerbForm=PartResбꙑлъ, б]ꙑ[лъ], бꙑле, бꙑло, бꙑлбꙑлабꙑло, (бꙑ)ло, б[ꙑ]ло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[conj]–> PROPN (374; 71%), NOUN –[amod]–> ADJ (371; 88%), NOUN –[conj]–> NOUN (293; 54%), NOUN –[det]–> DET (287; 84%), NOUN –[appos]–> PROPN (118; 95%), PROPN –[orphan]–> PROPN (108; 81%), VERB –[conj]–> VERB (95; 61%), VERB –[nsubj]–> PROPN (91; 54%), PROPN –[flat:name]–> ADJ (84; 95%), PROPN –[flat:name]–> PROPN (79; 90%).
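Agreement counts like these can be approximated by walking each dependency tree and comparing the Gender value of every node with that of its parent. The sketch below is only an illustration under stated assumptions: it uses standard CoNLL-U columns (HEAD in column 7, DEPREL in column 8), sentences separated by blank lines, and a denominator of pairs where both nodes carry Gender, which may not be how the percentages on this page were computed. All names and sample rows are hypothetical.

```python
from collections import Counter

def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None

def agreement_counts(conllu_text):
    """Count parent-child pairs with Gender on both sides, and those that agree."""
    agree, total = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            # id -> (UPOS, Gender, HEAD, DEPREL)
            nodes[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
        for upos, gender, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (HEAD is 0)
            if gender is None or parent is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            total[key] += 1
            if gender == parent[1]:
                agree[key] += 1
    return agree, total

# Hypothetical toy sentence (placeholder forms).
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "n", "n", "NOUN", "_", "Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["2", "a", "a", "ADJ", "_", "Gender=Masc|Number=Sing", "1", "amod", "_", "_"],
    ["3", "d", "d", "DET", "_", "Gender=Fem|Number=Sing", "1", "det", "_", "_"],
])

agree, total = agreement_counts(SAMPLE)
for key in total:
    print(key, agree[key], total[key])
```

Sorting the keys by `agree[key]` and taking the top ten would give a ranking in the same shape as the list above.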