
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-TOROT: POS Tags: ADJ

There are 2226 ADJ lemmas (16%), 9199 ADJ types (17%) and 17987 ADJ tokens (7%). Out of 14 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 7 in number of tokens.
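The lemma, type and token counts and the per-tag ranks can be reproduced from a CoNLL-U file. The sketch below is only an illustration under stated assumptions: the sample sentences are toy data (the adjective forms are taken from this page, while `n1`, `n1a` and `v1` are placeholders), the helper names `pos_stats` and `rank` are hypothetical, and forms are compared case-sensitively, which may differ from how the official statistics were generated.

```python
from collections import defaultdict

# Toy CoNLL-U fragment (hypothetical data, not the real treebank).
SAMPLE = """\
1\tвеликъ\tвеликыи\tADJ\t_\t_\t2\tamod\t_\t_
2\tn1\tn1\tNOUN\t_\t_\t0\troot\t_\t_

1\tвелика\tвеликыи\tADJ\t_\t_\t2\tamod\t_\t_
2\tn1a\tn1\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tv1\tv1\tVERB\t_\t_\t0\troot\t_\t_

1\tмало\tмалыи\tADJ\t_\t_\t0\troot\t_\t_
"""


def pos_stats(conllu_text):
    """Return {UPOS: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(stats, upos, index):
    """1-based rank of `upos` by column `index` (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(stats, key=lambda tag: stats[tag][index], reverse=True)
    return ordered.index(upos) + 1


stats = pos_stats(SAMPLE)
print(stats["ADJ"], rank(stats, "ADJ", 2))
```

Ties between tags are broken by order of first appearance here; the real statistics may resolve them differently.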

The 10 most frequent ADJ lemmas: святыи, великыи, мъногыи, вьсякыи, божии, другыи, русьскыи, блаженыи, добрыи, малыи

The 10 most frequent ADJ types: много, ст҃го, мало, велика, великъ, мнози, стг҃о, ст҃ѣи, блаженыи, многы

The 10 most frequent ambiguous lemmas: мъногыи (ADJ 735, NOUN 1), другыи (ADJ 303, NUM 1), прѣподобьныи (ADJ 153, NOUN 1), чужии (ADJ 43, NOUN 1), володимѣрь (PROPN 122, ADJ 35), вьсеволожь (ADJ 30, PROPN 4), святославль (ADJ 26, PROPN 1), гюргевъ (ADJ 22, PROPN 16), матерьнии (ADJ 22, NOUN 1), четвьртыи (ADJ 21, NOUN 1)

The 10 most frequent ambiguous types: много (ADJ 192, ADV 63), мало (ADJ 83, ADV 59), добро (ADJ 39, NOUN 20, ADV 7), ради (ADP 280, ADJ 36, VERB 1), радъ (ADJ 24, NOUN 1), добра (NOUN 26, ADJ 23), володимерь (PROPN 22, ADJ 21), всѧко (ADJ 21, ADV 3), не (ADV 3555, ADJ 18, PRON 18, CCONJ 14, VERB 5, ADP 1), зла (NOUN 61, ADJ 18)

Morphology

The form / lemma ratio of ADJ is 4.132525 (the average of all parts of speech is 3.947827).
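The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given above. A minimal check, assuming that this is how the ratio is defined:

```python
# Counts as reported on this page for ADJ.
adj_types = 9199
adj_lemmas = 2226

# Assumption: form/lemma ratio = distinct forms (types) / distinct lemmas.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")  # 4.132525
```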

The 1st highest number of forms (158) was observed with the lemma “святыи”: Ст҃моу, оѥ, свтые, свтыѧ, святей, святого, святую, святых, святыя, святѣй, свѧта, свѧтаго, свѧтее, свѧтыими, свѧтѣи, свѧтѣмь, свѧтꙑꙗ, свꙗтаго, свꙗтыи, свꙗтыꙗ, сстых, сс҃тꙑꙗ, стаг, стая, стаꙗ, стга, стго, стг҃, стг҃о, стии, стмъ, стм҃оу, стм҃у, стм҃ь, сто, стог, стог҃, сто҃ую, стх҃ъ, стъ, стъ҃іѧ, стъ҃іꙗ, стыи, стых, стыя, сты҃и, сты҃имъ, сты҃мь, сты҃х, стїи, стѣмь, стѹ҃ю, ст҃, ст҃[ꙑ]и, ст҃а, ст҃ааго, ст҃аг, ст҃аго, ст҃агѡ, ст҃ая, ст҃аѧ, ст҃аꙗ, ст҃г, ст҃го, ст҃е, ст҃еи, ст҃и, ст҃ии, ст҃м, ст҃ми, ст҃му, ст҃мъ, ст҃мь, ст҃мѹ, ст҃о, ст҃ого, ст҃ое, ст҃ои, ст҃ому, ст҃омъ, ст҃омь, ст҃омѹ, ст҃ом҃ь, ст҃оу, ст҃оую, ст҃ою, ст҃оі, ст҃оѣ, ст҃оѥ, ст҃у, ст҃ууму, ст҃ую, ст҃х, ст҃хъ, ст҃ъ, ст҃ы, ст҃ыа, ст҃ые, ст҃ыи, ст҃ыимь, ст҃ыихъ, ст҃ыи҃, ст҃ый, ст҃ым, ст҃ыма, ст҃ыми, ст҃ымъ, ст҃ымь, ст҃ых, ст҃ыхъ, ст҃ыя, ст҃ыѣ, ст҃ыѧ, ст҃ыꙗ, ст҃їи, ст҃ѣи, ст҃ѣишаѧ, ст҃ѣй, ст҃ѣмъ, ст҃ѣмь, ст҃ѹю, ст҃ѹѹмѹ, ст҃ꙋю, ст҃ꙑи, ст҃ꙑимь, ст҃ꙑихъ, ст҃ꙑмъ, ст҃ꙑѧ, ст҃ꙑꙗ, стꙑх҃, стꙑꙗ, с҃таг, с҃таго, с҃тая, с҃таꙗ, с҃тго, с҃темъ, с҃тии, с҃тму, с҃тмъ, с҃тмꙋ, с҃тое, с҃тою, с҃тоѥ, с҃тую, с҃тхъ, с҃ты, с҃тыа, с҃тыи, с҃тымъ, с҃тымь, с҃тых, с҃тыхъ, с҃тыя, с҃тыѧ, с҃тїи, с҃тѣи, с҃тꙋю.

The 2nd highest number of forms (117) was observed with the lemma “русьскыи”: Роускаа, Руской, роускии, роускимь, роускои, роускыхъ, роусскаа, роусскаг, роусскыа, роусскыи, роусьскаꙗ, роусьскои, роусьскымъ, роусьскыхъ, роус҃скоую, роус҃скым, роус҃скыхъ, роус҃стии, руска, рускаго, руская, рускиа, рускии, руским, рускимъ, руских, рускихъ, руския, рускиѣ, руское, рускои, рускому, рускою, руску, рускую, рускыи, рускым, рускымъ, рускыя, рускыѣ, рускіе, рускіх, рускія, рускіѣ, рускїа, рускꙑя, русские, русскии, русскими, русских, русскихъ, русскиѣ, русску, русскую, русскыи, русскымъ, русскых, русскыѣ, русскѣи, русстии, русстѣи, рустие, рустии, рустїи, рустѣи, рустѣмь, русько, руськую, русьская, русьскаꙗ, русьскии, русьскиѣ, русьску, русьскую, русьскы, русьскыи, русьскым, русьскымь, русьскых, русьскыя, русьскыѣ, русьскѣи, русьстии, русьстѣ, русьстѣи, рѹское, рѹскые, рѹсскомѹ, рѹсскꙑхъ, рѹськыи, рѹськѹ, рѹсьскаꙗ, рѹсьскои, рѹсьскоі, рѹсьскую, рѹсьскѣ, рѹсьскѣи, рѹсьскѹ, рѹсьскѹю, рѹсьскꙑи, рѹсьскꙑихъ, рѹсьскꙑꙗ, рѹсьстей, рѹсьстїи, рѹсьстѣи, рѹсьстѣмь, рѹс҃ска, рѹс҃скаꙗ, рѹс҃скои, рѹс҃скѹю, рѹс҃скꙑꙗ, рѹс҃стѣи, рꙋсскаа, рꙋсски, рꙋстѣи, рꙋсьскѹю, сѹрьскꙑи.

The 3rd highest number of forms (114) was observed with the lemma “божии”: Божий, Божіа, бжие, бжиею, бжию, бжия, бжьѧ, бжїа, бжїи, бжїим, бжїимъ, бжїимь, бжїй, бжїю, бжїѧ, бж҃и, бж҃ие, бж҃иею, бж҃ие҃, бж҃ии, бж҃иим, бж҃иимь, бж҃ию, бж҃ия, бж҃иѥ, бж҃иѥмь, бж҃иѥю, бж҃иꙇх, бж҃иꙗ, бж҃хъ, бж҃ье, бж҃ьею, бж҃ьи, бж҃ьим, бж҃ьимъ, бж҃ьимь, бж҃ьихъ, бж҃ью, бж҃ья, бж҃ьѣ, бж҃ьꙗ, бж҃я, бж҃іи, бж҃іѥю, бж҃їа, бж҃їе, бж҃їею, бж҃їи, бж҃їим, бж҃їимъ, бж҃їих, бж҃їихъ, бж҃їю, бж҃їѧ, бж҃ꙇю, би҃емь, би҃и, би҃ю, би҃ѥ, би҃ꙗ, божеꙗ, божии, божию, божия, божиѥю, божиꙗ, божїа, божїи, божїихъ, божїю, божїѧ, бьжмь, б҃жии, б҃жиии, б҃жиимъ, б҃жию, б҃жия, б҃жиѥ, б҃жиѥмь, б҃жиѥю, б҃жиꙗ, б҃жье, б҃жьею, б҃жьи, б҃жьими, б҃жью, б҃жья, б҃жьѣ, б҃жїа, б҃жїе, б҃жїею, б҃жїи, б҃жїим, б҃жїимъ, б҃жїих, б҃жїихъ, б҃жїю, б҃жїѧ, б҃ие, б҃ии, б҃иие, б҃иимь, б҃ию, б҃ия, б҃иѥ, б҃иꙗ, б҃ье, б҃ьею, б҃ьи, б҃ьимъ, б҃ьимь, б҃ью, б҃ья, б҃ьꙗ.

ADJ occurs with 5 features: Case (17956; 100% instances), Number (17956; 100% instances), Gender (17948; 100% instances), Degree (17555; 98% instances), Variant (6227; 35% instances)

ADJ occurs with 20 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Variant=Short

ADJ occurs with 188 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Masc|Number=Sing (1291 tokens). Examples: блаженыи, прпдбныи, ст҃ыи, бл҃женыи, великий, бл҃говѣрныи, великыи, великꙑи, новгородьскꙑи, бл҃гыи
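The three counts above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The following is a sketch only: the helper names `parse_feats` and `feature_stats` and the four toy FEATS strings are assumptions for illustration. A multi-value such as `Gender=Fem,Masc` is kept as a single pair, matching the way it is listed on this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_column):
    """Count features, feature-value pairs and full combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos


# Toy FEATS values (hypothetical), built from pairs listed on this page.
toy = [
    "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing",
    "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing",
    "Case=Acc|Degree=Pos|Gender=Neut|Number=Sing|Variant=Short",
    "Case=Gen|Gender=Fem,Masc|Number=Plur",
]

features, pairs, combos = feature_stats(toy)
print(features["Degree"], len(pairs), combos.most_common(1))
```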

Relations

ADJ nodes are attached to their parents using 23 different relations: amod (12106; 67% instances), conj (1284; 7% instances), root (1053; 6% instances), advmod (660; 4% instances), nsubj (589; 3% instances), obj (519; 3% instances), nmod (474; 3% instances), appos (235; 1% instances), xcomp (192; 1% instances), obl (176; 1% instances), advcl (160; 1% instances), obl:arg (127; 1% instances), orphan (76; 0% instances), dislocated (59; 0% instances), fixed (57; 0% instances), acl (55; 0% instances), advcl:cmp (45; 0% instances), ccomp (44; 0% instances), nsubj:pass (29; 0% instances), vocative (22; 0% instances), parataxis (14; 0% instances), obl:agent (10; 0% instances), dep (1; 0% instances)

Parents of ADJ nodes belong to 14 different categories (13 parts of speech plus the artificial root, which has no tag): NOUN (11943; 66% instances), VERB (2523; 14% instances), ADJ (1120; 6% instances), no parent, i.e. root (1053; 6% instances), PROPN (940; 5% instances), PRON (193; 1% instances), NUM (68; 0% instances), AUX (67; 0% instances), ADV (57; 0% instances), DET (9; 0% instances), SCONJ (8; 0% instances), INTJ (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances)
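Both the incoming relations and the parent tags can be tallied in one pass over a CoNLL-U file. The sketch below uses toy sentences (hypothetical data, with `n1`, `n1a` and `v1` as placeholders) and hypothetical helper names. Nodes attached as `root` have head `0`, which matches no token, so their parent tag is recorded as an empty string; this is one plausible reason why 1053 parents, the same number as the `root` relations, carry no tag.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical data, not the real treebank).
SAMPLE = """\
1\tвеликъ\tвеликыи\tADJ\t_\t_\t2\tamod\t_\t_
2\tn1\tn1\tNOUN\t_\t_\t0\troot\t_\t_

1\tвелика\tвеликыи\tADJ\t_\t_\t2\tamod\t_\t_
2\tn1a\tn1\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tv1\tv1\tVERB\t_\t_\t0\troot\t_\t_

1\tмало\tмалыи\tADJ\t_\t_\t0\troot\t_\t_
"""


def sentences(text):
    """Yield each sentence as a list of token rows (lists of columns)."""
    block = []
    for line in text.splitlines():
        if not line.strip():
            if block:
                yield block
                block = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip ranges and empty nodes
                block.append(cols)
    if block:
        yield block


def attachment_stats(text, upos="ADJ"):
    """Count DEPREL values and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences(text):
        pos_by_id = {row[0]: row[3] for row in sent}
        for row in sent:
            if row[3] == upos:
                relations[row[7]] += 1
                # Head "0" is the artificial root and has no tag.
                parent_pos[pos_by_id.get(row[6], "")] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(SAMPLE)
print(relations, parent_pos)
```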

13458 (75%) ADJ nodes are leaves.

2517 (14%) ADJ nodes have one child.

944 (5%) ADJ nodes have two children.

1068 (6%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 24.
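The four child-count buckets above partition the ADJ nodes, so they should add up to the 17987 ADJ tokens reported at the top of the page. A quick arithmetic check, assuming the percentages are rounded to the nearest integer:

```python
# Figures as reported on this page.
adj_nodes = 17987
by_children = {"0": 13458, "1": 2517, "2": 944, "3+": 1068}

# The buckets should cover every ADJ node exactly once.
total = sum(by_children.values())

# Assumption: shares are rounded to whole percentages.
shares = {k: round(100 * n / adj_nodes) for k, n in by_children.items()}
print(total, shares)
```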

Children of ADJ nodes are attached using 29 different relations: cc (1562; 18% instances), conj (1288; 15% instances), cop (817; 10% instances), nsubj (812; 10% instances), advmod (618; 7% instances), obl (573; 7% instances), case (558; 7% instances), discourse (349; 4% instances), nmod (342; 4% instances), advcl (245; 3% instances), obl:arg (244; 3% instances), det (210; 2% instances), mark (175; 2% instances), orphan (137; 2% instances), appos (125; 1% instances), amod (69; 1% instances), csubj (63; 1% instances), fixed (56; 1% instances), ccomp (52; 1% instances), acl (45; 1% instances), vocative (45; 1% instances), dislocated (43; 1% instances), aux (39; 0% instances), parataxis (25; 0% instances), obl:agent (16; 0% instances), nummod (15; 0% instances), advcl:cmp (14; 0% instances), obj (5; 0% instances), xcomp (1; 0% instances)

Children of ADJ nodes belong to 13 different parts of speech: NOUN (1575; 18% instances), CCONJ (1563; 18% instances), ADJ (1120; 13% instances), ADV (1054; 12% instances), AUX (869; 10% instances), PRON (629; 7% instances), VERB (607; 7% instances), ADP (560; 7% instances), DET (192; 2% instances), PROPN (170; 2% instances), SCONJ (131; 2% instances), NUM (42; 0% instances), INTJ (31; 0% instances)