home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

9985 tokens (37%) have a non-empty value of Gender. 7304 types (64%) occur at least once with a non-empty value of Gender. 3024 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (4748; 17% instances), PROPN (2528; 9% instances), ADJ (844; 3% instances), VERB (816; 3% instances), DET (596; 2% instances), NUM (286; 1% instances), PRON (145; 1% instances), AUX (22; 0% instances).

NOUN

4748 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3381; 71%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 100% lemmas (959) occur only with one value of Gender.

PROPN

2528 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2419; 96%), NameType=Giv (2119; 84%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (1261) occur only with one value of Gender.

ADJ

844 ADJ tokens (90% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (699; 83%), Variant=EMPTY (579; 69%), Poss=EMPTY (571; 68%).

ADJ tokens may have the following values of Gender:

Paradigm свѧтыиMascFemNeut
Case=Acc|Number=Singст҃госвѧтое
Case=Dat|Number=Sing[ст҃](м)[у], ст҃омусвѧтее, ст҃ѣ
Case=Dat|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Number=Singст҃го, (св҃)[т]аго, свгто, свѧ[т]-[го, св҃ѧ[т]о(го, сгто, ст҃ого, ст[о]гост҃ѣ, ст҃ее, ст҃ье, ст҃ѣ
Case=Gen|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Number=Plurст҃хъ, ст҃ꙑх
Case=Nom|Number=Singсвѧтꙑ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи
Case=Voc|Number=Sing(свѧ)тꙑе

VERB

816 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (805; 99%), Mood=EMPTY (789; 97%), Voice=Act (738; 90%), Tense=Past (723; 89%), Number=Sing (668; 82%), VerbForm=PartRes (640; 78%).

VERB tokens may have the following values of Gender:

Paradigm взѧтиMascFemNeut
Analyt=Yes|Number=Sing|Person=2|Tense=Past|VerbForm=PartRes|Voice=Actвзѧла
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвь:зѧлъ
Analyt=Yes|Number=Sing|Tense=Pqp|VerbForm=PartRes|Voice=Actвозѧлъ
Analyt=Yes|Number=Dual|Tense=Past|VerbForm=PartRes|Voice=Act[в]ъзѧла
Case=Nom|Number=Sing|Person=3|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Actвъземо
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Actвзьмъ, взѧв·ъ, воземо, възъмъвъзьмъши
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passвзѧтъвозѧто, возѧтъ, възѧто
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actвъзмѧ, възьмѧ
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвъзѧле, взѧле, взѧлъ, възѧлъ, взѧлъ, возѧле, возѧло, (в)зѧло, [въз]ѧ[ле, взѧ, взѧл, взѧло, возѧль, всѧло, въ(зѧ)ль, възалъ, възьль, възѧл[е, възѧль, възѧль], ѹвзѧлъ, ѹзѧле, ꙋзѧле
Number=Plur|Tense=Past|VerbForm=PartRes|Voice=Actвзѧлѣ, взѧли, взѧл[и], взѧл, ѹзѧлѣ

DET

596 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (536; 90%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Case=Acc|Number=Singтого, тототу, тꙋ, [т]ѹто, [т]о, (т)о, т(о, т[о]
Case=Acc|Number=Dualтѣ
Case=Acc|Number=Plurтꙑ, тꙑи, тꙑхътои, тѣ, тꙑи
Case=Dat|Number=Singтомѹ, тому, томуо, тъмѹтои, то:етому, томѹ
Case=Gen|Number=Singтоготоѣтого, тога, (т)[о]го, [тог]о, то[го]
Case=Ins|Number=Singт[ꙑ](мъ), тиме
Case=Loc|Number=Singтомо, томъ, томьтомъ, томо, томь, (т)омо, (то)мъ, [т]омь, томо, том
Case=Nomта
Case=Nom|Number=Singтъ, те, то, тътото, [т]о, [то
Case=Nom|Number=Plurте, ти, тѣтѣ

NUM

286 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (283; 99%), Case=Nom (174; 61%), Number=Sing (158; 55%), NumType=EMPTY (146; 51%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Case=Accдва, дова, дъв(а)дви, двь, двѣ, дъвь, дъвѣ, (д)[в]е, дове, дъвѣд[ъ]ва, два, дова
Case=Nomдва, довадвь, дви, дове, две, дови, двѣ, довѣ, дъвѣ, дви, д[ъв]…, д{в}овѣ, девѣ, дъвѣ, дъв[ѣ]дъва

PRON

145 PRON tokens (11% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (145; 100%), PronType=EMPTY (143; 99%), Number=Sing (139; 96%), Person=EMPTY (138; 95%).

PRON tokens may have the following values of Gender:

Paradigm иMascFemNeut
Case=Acc,Gen|Number=Singѥго
Case=Acc|Number=Singѥго, его, нь, и, [ѥг]о, ·и·, н(его), не, него, ї, ѥгю, …ю, ѫе, его, не
Case=Acc|Number=Dualнѧ
Case=Acc|Number=Plurихъ, ӏ, ӏхъ
Case=Dat|Number=Singемѹ, емꙋ, ѥму, (-)емꙋ, емо, ему, емѫ, ному, ньмуо, ѥмеи, ее
Case=Genѥго
Case=Gen|Number=Singѥго, его, него, ного, єго, ево, егъ, него, ньго, єво, ѥго, ѥг, ѥгоое]ѥ, ее, еи, еѣ, єи, ѥи, ѥӏ
Case=Ins|Number=Singнимо, нимь, имо, нимъ
Case=Ins|Number=Plurним[и
Case=Loc|Number=Singнемь, немь‐, немо, немъ, нь[мо], нѣмѣ

AUX

22 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (22; 100%), Tense=Past (22; 100%), Voice=Act (22; 100%), Number=Sing (21; 95%), VerbForm=PartRes (21; 95%), Analyt=EMPTY (12; 55%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Number=Sing|VerbForm=Finбылъ
Analyt=Yes|Number=Sing|VerbForm=PartResбꙑлъ, бꙑле, бꙑлобꙑлабꙑло
Analyt=Yes|Number=Plur|VerbForm=PartResбꙑли
Number=Sing|VerbForm=PartResбꙑлъ, б]ꙑ[лъ], бꙑле, бꙑло, бꙑлбꙑлабꙑло, (бꙑ)ло, б[ꙑ]ло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[conj]–> PROPN (377; 71%), NOUN –[amod]–> ADJ (371; 88%), NOUN –[conj]–> NOUN (294; 55%), NOUN –[det]–> DET (287; 84%), NOUN –[appos]–> PROPN (118; 94%), PROPN –[orphan]–> PROPN (110; 81%), VERB –[conj]–> VERB (95; 61%), VERB –[nsubj]–> PROPN (92; 55%), PROPN –[flat:name]–> ADJ (84; 95%), PROPN –[flat:name]–> PROPN (79; 90%).