home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Masc|Neut.

10337 tokens (37%) have a non-empty value of Gender. 7551 types (65%) occur at least once with a non-empty value of Gender. 3093 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (4970; 18% instances), PROPN (2607; 9% instances), ADJ (850; 3% instances), VERB (827; 3% instances), DET (610; 2% instances), NUM (294; 1% instances), PRON (154; 1% instances), AUX (25; 0% instances).

NOUN

4970 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3466; 70%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 100% lemmas (990) occur only with one value of Gender.

PROPN

2607 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2494; 96%), NameType=Giv (2175; 83%).

PROPN tokens may have the following values of Gender:

Paradigm КолѣньцеMascNeut
NameType=Geoколинца
NameType=Givколенеча

Gender seems to be lexical feature of PROPN. 100% lemmas (1293) occur only with one value of Gender.

ADJ

850 ADJ tokens (89% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (706; 83%), Variant=EMPTY (583; 69%), Poss=EMPTY (577; 68%).

ADJ tokens may have the following values of Gender:

Paradigm свѧтыиMascFemNeut
Case=Acc|Number=Singст҃госвѧтое
Case=Dat|Number=Sing[ст҃](м)[у], ст҃омусвѧтее, ст҃ѣ
Case=Dat|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Degree=Pos|Number=Singст҃го, ст҃ого
Case=Gen|Number=Singст҃го, (св҃)[т]аго, свгто, свѧ[т]-[го, св҃ѧ[т]о(го, сгто, ст҃ого, ст[о]гост҃ѣ, ст҃ее, ст҃ье, ст҃ѣ
Case=Gen|Number=Sing|Variant=Shortст҃ѣ
Case=Gen|Number=Plurст҃хъ, ст҃ꙑх
Case=Nom|Number=Singсвѧтꙑ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи
Case=Voc|Number=Sing(свѧ)тꙑе

VERB

827 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (826; 100%), Mood=EMPTY (801; 97%), Voice=Act (745; 90%), Tense=Past (732; 89%), Number=Sing (679; 82%), VerbForm=PartRes (646; 78%).

VERB tokens may have the following values of Gender:

Paradigm взѧтиMascFemNeut
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвь:зѧлъвзѧла
Analyt=Yes|Number=Sing|Tense=Pqp|VerbForm=PartRes|Voice=Actвозѧлъ
Analyt=Yes|Number=Dual|Tense=Past|VerbForm=PartRes|Voice=Act[в]ъзѧла
Case=Nom|Number=Sing|Person=3|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Actвъземо
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Actвзьмъ, взѧв·ъ, воземо, възъмъвъзьмъши
Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Passвзѧтъвозѧто, возѧтъ, възѧто
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actвъзмѧ, възьмѧ
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвъзѧле, взѧле, взѧлъ, възѧлъ, взѧлъ, возѧле, возѧло, (в)зѧло, [въз]ѧ[ле, взѧ, взѧл, взѧло, возѧль, всѧло, въ(зѧ)ль, възалъ, възьль, възѧл[е, възѧль, възѧль], ѹвзѧлъ, ѹзѧле, ꙋзѧле
Number=Plur|Tense=Past|VerbForm=PartRes|Voice=Actвзѧлѣ, взѧли, взѧл[и], взѧл, ѹзѧлѣ

DET

610 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (550; 90%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Case=Acc|Number=Singтого, тототу, тꙋ, [т]ѹто, [т]о, (т)о, т(о, т[о]
Case=Acc|Number=Sing|PronType=Demто
Case=Acc|Number=Dualтѣ
Case=Acc|Number=Plurтꙑ, тꙑи, тꙑхътои, тѣ, тꙑи
Case=Dat|Number=Singтому, томуо, томѹ, тъмѹтои, то:етому, томѹ
Case=Dat|Number=Sing|PronType=Demтомѹ
Case=Gen|Number=Singтоготоѣтого, тога, (т)[о]го, [тог]о, то[го]
Case=Ins|Number=Singт[ꙑ](мъ), тиме
Case=Ins|Number=Sing|PronType=Demтомо, томъ
Case=Loc|Number=Singтомо, томъ, томьтомъ, томо, томь, (т)омо, (то)мъ, [т]омь, томо, том
Case=Nomта
Case=Nom|Number=Singтъ, те, то, тътото, [т]о, [то
Case=Nom|Number=Sing|PronType=Demто
Case=Nom|Number=Plurте, ти, тѣтѣ

NUM

294 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (283; 96%), Case=Nom (180; 61%), Number=Sing (158; 54%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Case=Accдва, дова, дъв(а)дви, двь, двѣ, дъвь, дъвѣ, (д)[в]е, дове, дъвѣд[ъ]ва, два, дова
Case=Loc|NumForm=Wordдовѹ
Case=Nom|NumForm=Wordд)ви
Case=Nomдва, довадвь, дви, дове, две, дови, двѣ, довѣ, дъвѣ, дви, д[ъв]…, д{в}овѣ, девѣ, дъвѣ, дъв[ѣ]дъва

PRON

154 PRON tokens (12% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (154; 100%), Number=Sing (148; 96%), Person=EMPTY (143; 93%), PronType=EMPTY (143; 93%).

PRON tokens may have the following values of Gender:

Paradigm иMascFemNeut
Case=Acc,Gen|Number=Singѥго
Case=Acc,Gen|Number=Sing|Person=3|PronType=Prsего, ѥго
Case=Acc|Number=Singѥго, его, нь, и, [ѥг]о, ·и·, н(его), не, него, ї, ѥгю, …ю, ѫе, его, не
Case=Acc|Number=Dualнѧ
Case=Acc|Number=Plurихъ, ӏ, ӏхъ
Case=Dat|Number=Singемѹ, емꙋ, ѥму, (-)емꙋ, емо, ему, емѫ, ному, ньмуо, ѥмеи, ее
Case=Genѥго
Case=Gen|Number=Singѥго, его, него, ного, єго, ево, егъ, него, ньго, єво, ѥго, ѥг, ѥгоое]ѥ, ее, еи, еѣ, єи, ѥи, ѥӏ
Case=Gen|Number=Sing|Person=3|PronType=Prsего
Case=Ins|Number=Singнимо, нимь, имо, нимъ
Case=Ins|Number=Sing|Person=3|PronType=Prsнемъ
Case=Ins|Number=Plurним[и
Case=Loc|Number=Singнемь, немь‐, немо, немъ, нь[мо], нѣмѣ

AUX

25 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (25; 100%), Tense=Past (25; 100%), Voice=Act (25; 100%), Number=Sing (24; 96%), VerbForm=PartRes (24; 96%), Analyt=EMPTY (15; 60%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Number=Sing|VerbForm=Finбылъ
Analyt=Yes|Number=Sing|VerbForm=PartResбꙑлъ, бꙑле, бꙑлобꙑлабꙑло
Analyt=Yes|Number=Plur|VerbForm=PartResбꙑли
Fragment=Yes|Number=Sing|VerbForm=PartResлъб(ꙑло
Number=Sing|VerbForm=PartResбꙑлъ, б]ꙑ[лъ], бꙑле, бꙑло, бꙑлбꙑлабꙑло, (бꙑ)ло, [бꙑ]ло, б[ꙑ]ло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: PROPN –[conj]–> PROPN (391; 71%), NOUN –[amod]–> ADJ (377; 87%), NOUN –[conj]–> NOUN (305; 55%), NOUN –[det]–> DET (294; 81%), NOUN –[appos]–> PROPN (121; 95%), PROPN –[orphan]–> PROPN (113; 81%), VERB –[conj]–> VERB (96; 60%), VERB –[nsubj]–> PROPN (92; 53%), PROPN –[flat:name]–> ADJ (84; 95%), PROPN –[flat:name]–> PROPN (81; 90%).