Treebank Statistics: UD_Old_East_Slavic-Birchbark: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
Some words have combined values of the feature; 1 combinations have been observed: Masc|Neut.
10337 tokens (37%) have a non-empty value of Gender.
7551 types (65%) occur at least once with a non-empty value of Gender.
3093 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (4970; 18% instances), PROPN (2607; 9% instances), ADJ (850; 3% instances), VERB (827; 3% instances), DET (610; 2% instances), NUM (294; 1% instances), PRON (154; 1% instances), AUX (25; 0% instances).
NOUN
4970 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3466; 70%).
NOUN tokens may have the following values of Gender:
Fem(2258; 45% of non-emptyGender): ржи, кѹно, соли, грамота, кѹнъ, гривьнѣ, гривьнъ, рожи, гривено, гривнаMasc(2043; 41% of non-emptyGender): поклонъ, поклоно, гн҃е, конь, попа, рубль, сн҃а, кони, приказъ, рублѧNeut(669; 13% of non-emptyGender): покланѧние, жита, села, серебра, слово, село, челомъ, целомъ, дети, сереброEMPTY(3): —–(о)[у]·, вѣверъ…, лю…
Gender seems to be lexical feature of NOUN. 100% lemmas (990) occur only with one value of Gender.
PROPN
2607 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2494; 96%), NameType=Giv (2175; 83%).
PROPN tokens may have the following values of Gender:
Fem(265; 10% of non-emptyGender): мариѧ, ѧна, маренѣ, настасиѧ, анѣ, куролѣ, лаидиколѣ, лугу, марие, мароѳуMasc(2265; 87% of non-emptyGender): ивана, петра, бориса, евана, павла, степана, смена, завида, лѹкѣ, михалѧNeut(77; 3% of non-emptyGender): городищи, курьѥго, ——скь, —–ина, –руньского, -остер…, [гр]од[и], [сл]а[в]…, бабине, бологожь EMPTY(236): в…, ж…, кшетахъ, ли…, малѧтѣ, м…, ньжѧть, нѣжѧтѣ, са…, стоють
| Paradigm Колѣньце | Masc | Neut |
|---|---|---|
| NameType=Geo | колинца | |
| NameType=Giv | коленеча |
Gender seems to be lexical feature of PROPN. 100% lemmas (1293) occur only with one value of Gender.
ADJ
850 ADJ tokens (89% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (706; 83%), Variant=EMPTY (583; 69%), Poss=EMPTY (577; 68%).
ADJ tokens may have the following values of Gender:
Fem(272; 32% of non-emptyGender): ст҃ѣ, бѣлонога, дворнюю, десѧтаꙗ, добре, другую, новаѧ, пуста, пѧте, пѧтьMasc(439; 52% of non-emptyGender): ст҃го, велика, гнѣд, пѧта, бж҃и, виновате, ворон, добра, добръ, желѣзенMasc,Neut(1; 0% of non-emptyGender): пътровоNeut(138; 16% of non-emptyGender): добро, проста, здорово, борзи, борзѣ, годьнъ, лживꙑѧ, (д)анилово, (до)ро[г]а[ѧ], (т)[о]тарьскогоEMPTY(102): коневꙑхъ, страднꙑх, (д)митрову, (ко)невꙑхъ, (л)юдними, (с)тар[ѣ]шимъ, (с)траднꙑх, (св)[об]однꙑхо, :ѕ҃:, [б]орови[ц]ки
| Paradigm свѧтыи | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | ст҃го | свѧтое | |
| Case=Dat|Number=Sing | [ст҃](м)[у], ст҃ому | свѧтее, ст҃ѣ | |
| Case=Dat|Number=Sing|Variant=Short | ст҃ѣ | ||
| Case=Gen|Degree=Pos|Number=Sing | ст҃го, ст҃ого | ||
| Case=Gen|Number=Sing | ст҃го, (св҃)[т]аго, свгто, свѧ[т]-[го, св҃ѧ[т]о | ст҃ | |
| Case=Gen|Number=Sing|Variant=Short | ст҃ѣ | ||
| Case=Gen|Number=Plur | ст҃хъ, ст҃ꙑх | ||
| Case=Nom|Number=Sing | свѧтꙑ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи | ||
| Case=Voc|Number=Sing | (свѧ) |
VERB
827 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (826; 100%), Mood=EMPTY (801; 97%), Voice=Act (745; 90%), Tense=Past (732; 89%), Number=Sing (679; 82%), VerbForm=PartRes (646; 78%).
VERB tokens may have the following values of Gender:
Fem(83; 10% of non-emptyGender): дала, задѣла, пеюци, рекла, ѧла, (-)шьла, (се)[д]ѧци, —–ила, [бꙑл]а, ѿанаMasc(678; 82% of non-emptyGender): далъ, възѧле, дале, послале, въдале, взѧле, взѧлъ, велѣлъ, взѧлѣ, възѧлъNeut(66; 8% of non-emptyGender): шло, бꙑло, пошло, (шло, диꙗлось, куплено, погибло, (пог)[ꙑб]ло, (соро)слосѧ, [п]окладено EMPTY(1814): возми, даи, възьми, пришли, посли, присъли, далъ, иди, кланѧюсѧ, купи
| Paradigm взѧти | Masc | Fem | Neut |
|---|---|---|---|
| Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Act | вь:зѧ | взѧла | |
| Analyt=Yes|Number=Sing|Tense=Pqp|VerbForm=PartRes|Voice=Act | возѧлъ | ||
| Analyt=Yes|Number=Dual|Tense=Past|VerbForm=PartRes|Voice=Act | [в]ъзѧла | ||
| Case=Nom|Number=Sing|Person=3|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Act | въземо | ||
| Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Act | взьмъ, взѧв·ъ, воземо, възъмъ | възьмъши | |
| Case=Nom|Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass | взѧтъ | возѧто, возѧтъ, възѧто | |
| Case=Nom|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Act | въз | ||
| Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Act | възѧле, взѧле, взѧлъ, възѧлъ, взѧ | ||
| Number=Plur|Tense=Past|VerbForm=PartRes|Voice=Act | взѧлѣ, взѧли, взѧл[и], взѧл, ѹзѧлѣ |
DET
610 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (550; 90%).
DET tokens may have the following values of Gender:
Fem(133; 22% of non-emptyGender): моѧ, свою, мою, ту, моѥи, сама, своеи, твоѧ, моѥӏ, своѥиMasc(288; 47% of non-emptyGender): своѥму, мои, свои, твои, того, моѥго, моѥму, саме, самь, сегоNeut(189; 31% of non-emptyGender): то, томъ, томо, того, все, моего, томь, [т]о, вохо, моеEMPTY(129): же, никомѹ, всихъ, всѣхъ, моими, мъихъ, нашихо, никому, ницимъ, своими
| Paradigm тотъ | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | того, тото | ту, тꙋ, [т]ѹ | то, [т]о, (т)о, т(о, т[о] |
| Case=Acc|Number=Sing|PronType=Dem | то | ||
| Case=Acc|Number=Dual | тѣ | ||
| Case=Acc|Number=Plur | тꙑ, тꙑи, тꙑхъ | тои, тѣ, тꙑи | |
| Case=Dat|Number=Sing | тому, томуо, томѹ, тъмѹ | тои, то:е | тому, то |
| Case=Dat|Number=Sing|PronType=Dem | томѹ | ||
| Case=Gen|Number=Sing | того | тоѣ | того, тога, (т)[о]го, [тог]о, то[го] |
| Case=Ins|Number=Sing | т[ꙑ](мъ), тиме | ||
| Case=Ins|Number=Sing|PronType=Dem | томо, томъ | ||
| Case=Loc|Number=Sing | томо, томъ, томь | томъ, томо, томь, (т)омо, (то) | |
| Case=Nom | та | ||
| Case=Nom|Number=Sing | тъ, те, то, тъто | то, [т]о, [то | |
| Case=Nom|Number=Sing|PronType=Dem | то | ||
| Case=Nom|Number=Plur | те, ти, тѣ | тѣ |
NUM
294 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (283; 96%), Case=Nom (180; 61%), Number=Sing (158; 54%).
NUM tokens may have the following values of Gender:
Fem(98; 33% of non-emptyGender): три, двь, дви, дове, двѣ, дъвѣ, две, дови, полутори, в҃еMasc(188; 64% of non-emptyGender): поло, полъ, пло, два, дова, полѹ, пол, три, полътора, (п)олъNeut(8; 3% of non-emptyGender): три, д[ъ]ва, два, дова, дъва, одиномо, сотъEMPTY(991): ·в҃·, ·г҃·, :в҃:, :в:, ·г·, ·ӏ҃·, три, в҃, г҃, :г҃:
| Paradigm два | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | два, дова, дъв(а) | дви, двь, двѣ, дъвь, дъвѣ, (д)[в]е, дове, дъ | д[ъ]ва, два, дова |
| Case=Loc|NumForm=Word | довѹ | ||
| Case=Nom|NumForm=Word | д) | ||
| Case=Nom | два, дова | двь, дви, дове, две, дови, двѣ, довѣ, дъвѣ, д | дъва |
PRON
154 PRON tokens (12% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (154; 100%), Number=Sing (148; 96%), Person=EMPTY (143; 93%), PronType=EMPTY (143; 93%).
PRON tokens may have the following values of Gender:
Fem(20; 13% of non-emptyGender): ю, ее, …ю, е и, е]ѥ, еи, еѣ, єи, ѥи, ѥӏ Masc(111; 72% of non-emptyGender): ѥго, его, емѹ, него, емꙋ, ѥму, нь, и, немъ, немьNeut(23; 15% of non-emptyGender): то, е, томъ, что, [ц]ьто, [цего], [цто, [ѥ]же, его, неEMPTY(1142): ми, сѧ, тꙑ, ти, ѧ, мене, тобѣ, мнѣ, ѧзъ, мѧ
| Paradigm и | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc,Gen|Number=Sing | ѥго | ||
| Case=Acc,Gen|Number=Sing|Person=3|PronType=Prs | его, ѥго | ||
| Case=Acc|Number=Sing | ѥго, его, нь, и, [ѥг]о, ·и·, н(его), не, него, ї, ѥг | ю, … | е, его, не |
| Case=Acc|Number=Dual | нѧ | ||
| Case=Acc|Number=Plur | ихъ, ӏ, ӏх | ||
| Case=Dat|Number=Sing | емѹ, емꙋ, ѥму, (-)емꙋ, емо, ему, емѫ, ному, ньмуо, ѥм | е | |
| Case=Gen | ѥго | ||
| Case=Gen|Number=Sing | ѥго, его, него, ного, єго, ево, егъ, не | е]ѥ, ее, еи, еѣ, єи, ѥи, ѥӏ | |
| Case=Gen|Number=Sing|Person=3|PronType=Prs | его | ||
| Case=Ins|Number=Sing | нимо, нимь, имо, нимъ | ||
| Case=Ins|Number=Sing|Person=3|PronType=Prs | немъ | ||
| Case=Ins|Number=Plur | ним[и | ||
| Case=Loc|Number=Sing | немь, не |
AUX
25 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (25; 100%), Tense=Past (25; 100%), Voice=Act (25; 100%), Number=Sing (24; 96%), VerbForm=PartRes (24; 96%), Analyt=EMPTY (15; 60%).
AUX tokens may have the following values of Gender:
Fem(2; 8% of non-emptyGender): бꙑлаMasc(15; 60% of non-emptyGender): бꙑлъ, бꙑле, бꙑло, б]ꙑ[лъ], былъ, бꙑли, бꙑл, …лъ Neut(8; 32% of non-emptyGender): бꙑло, (бꙑ)ло, [бꙑ]ло, б(ꙑло, б[ꙑ] ло EMPTY(359): еси, ѥси, есмь, ѥсмь, бꙑ, есте, есть, есемо, ѥсме, есме
| Paradigm быти | Masc | Fem | Neut |
|---|---|---|---|
| Analyt=Yes|Number=Sing|VerbForm=Fin | былъ | ||
| Analyt=Yes|Number=Sing|VerbForm=PartRes | бꙑлъ, бꙑле, бꙑло | бꙑла | бꙑло |
| Analyt=Yes|Number=Plur|VerbForm=PartRes | бꙑли | ||
| Fragment=Yes|Number=Sing|VerbForm=PartRes | … | б(ꙑло | |
| Number=Sing|VerbForm=PartRes | бꙑлъ, б]ꙑ[лъ], бꙑле, бꙑло, бꙑл | бꙑла | бꙑло, (бꙑ) |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PROPN –[conj]–> PROPN (391; 71%),
NOUN –[amod]–> ADJ (377; 87%),
NOUN –[conj]–> NOUN (305; 55%),
NOUN –[det]–> DET (294; 81%),
NOUN –[appos]–> PROPN (121; 95%),
PROPN –[orphan]–> PROPN (113; 81%),
VERB –[conj]–> VERB (96; 60%),
VERB –[nsubj]–> PROPN (92; 53%),
PROPN –[flat:name]–> ADJ (84; 95%),
PROPN –[flat:name]–> PROPN (81; 90%).