
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-TOROT: POS Tags: ADJ

There are 1386 ADJ lemmas (15%), 5640 ADJ types (17%) and 15631 ADJ tokens (10%). Out of 14 observed tags, the rank of ADJ is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
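Counts of this kind can be reproduced from the treebank's CoNLL-U files. The sketch below is a minimal illustration, not the script that generated this page. It assumes that a "type" is a distinct, case-sensitive word form and that multiword-token ranges and empty nodes are skipped. The sample rows and the names `read_tokens` and `pos_counts` are hypothetical.

```python
from collections import defaultdict

# Hypothetical two-sentence sample in CoNLL-U column order:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
ROWS = [
    ["1", "великыи", "великыи", "ADJ", "_", "_", "2", "amod", "_", "_"],
    ["2", "князь", "князь", "NOUN", "_", "_", "0", "root", "_", "_"],
    [],  # an empty row becomes the blank line that separates sentences
    ["1", "великаго", "великыи", "ADJ", "_", "_", "2", "amod", "_", "_"],
    ["2", "князя", "князь", "NOUN", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def read_tokens(text):
    """Yield the 10 columns of each regular token line."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def pos_counts(text):
    """Map each UPOS tag to (distinct lemmas, distinct forms, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for cols in read_tokens(text):
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), n) for u, n in tokens.items()}


counts = pos_counts(SAMPLE)
print(counts["ADJ"])  # (lemmas, types, tokens) for ADJ in the sample
```

Run over the full treebank, the ADJ entry would be expected to match the figures above only if the page uses the same definitions of lemma and type.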

The 10 most frequent ADJ lemmas: свои, тыи, святыи, сии, великыи, мъногыи, онъ, нашь, мои, вьсь

The 10 most frequent ADJ types: своѥго, се, то, свои, свою, же, ст҃го, того, много, мои

The 10 most frequent ambiguous lemmas: свои (ADJ 1510, NOUN 1), тыи (ADJ 744, DET 449), сии (ADJ 671, DET 457), мъногыи (ADJ 461, NOUN 1), онъ (ADJ 428, DET 37), мои (ADJ 393, PRON 1), вьсь (DET 903, ADJ 302, NOUN 4), самъ (ADJ 228, DET 70), же (ADV 3124, ADJ 152, PRON 29, SCONJ 7, CCONJ 1, VERB 1), къто (ADJ 144, PRON 93, DET 3)

The 10 most frequent ambiguous types: се (INTJ 359, ADJ 214, DET 66), то (ADV 395, ADJ 174, DET 110), же (ADV 3046, ADJ 152, PRON 29, SCONJ 8, CCONJ 2, VERB 1), того (ADJ 128, DET 69), много (ADJ 124, ADV 33), си (ADJ 107, PRON 74, DET 48, ADV 5, INTJ 4, AUX 2), кто (ADJ 105, PRON 45, DET 2), самъ (ADJ 92, DET 26), сего (DET 100, ADJ 90), томѹ (ADJ 75, DET 15)
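The two ambiguity lists above appear to be ordered by how often each item is tagged ADJ. A minimal sketch of that grouping follows, assuming an item counts as ambiguous when it occurs with ADJ and at least one other tag. The input pairs are invented for illustration and `ambiguous_items` is a hypothetical helper name.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, tag="ADJ"):
    """Return (item, tag counts) for items seen with `tag` and another tag,
    sorted by descending frequency of `tag`."""
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    hits = {i: c for i, c in by_item.items() if tag in c and len(c) > 1}
    return sorted(hits.items(), key=lambda kv: -kv[1][tag])


# Hypothetical (lemma, UPOS) observations; the counts are illustrative only.
pairs = (
    [("тыи", "ADJ")] * 3
    + [("тыи", "DET")] * 2
    + [("вьсь", "DET")] * 4
    + [("вьсь", "ADJ")]
    + [("великыи", "ADJ")] * 5  # unambiguous, so it is filtered out
)

result = ambiguous_items(pairs)
for item, tags in result:
    print(item, tags.most_common())
```

Passing (form, UPOS) pairs instead of (lemma, UPOS) pairs gives the per-type variant of the same list.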

Morphology

The form / lemma ratio of ADJ is 4.069264 (the average of all parts of speech is 3.571475).
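The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given at the top of the page. A short check, assuming that is how the figure is derived:

```python
# Figures taken from the summary at the top of this page.
adj_types = 5640
adj_lemmas = 1386

# Assumption: the form / lemma ratio is types divided by lemmas.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")  # 4.069264
```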

The 1st highest number of forms (109) was observed with the lemma “святыи”: оѥ, свтыѧ, святыя, святѣй, свѧтаго, свѧтее, свѧтыими, свѧтѣи, свѧтѣмь, свѧтꙑꙗ, свꙗтаго, свꙗтыи, свꙗтыꙗ, сстых, сс҃тꙑꙗ, стаг, стая, стаꙗ, стга, стго, стг҃о, стии, стмъ, стм҃у, стм҃ь, сто, стог҃, стх҃ъ, стыя, сты҃и, сты҃имъ, сты҃мь, стѹ҃ю, ст҃[ꙑ]и, ст҃а, ст҃ааго, ст҃аго, ст҃ая, ст҃аѧ, ст҃аꙗ, ст҃го, ст҃и, ст҃ии, ст҃ми, ст҃му, ст҃мъ, ст҃мь, ст҃о, ст҃ого, ст҃ое, ст҃ому, ст҃омѹ, ст҃оую, ст҃ою, ст҃оѣ, ст҃оѥ, ст҃у, ст҃ууму, ст҃ую, ст҃хъ, ст҃ъ, ст҃ыи, ст҃ыимь, ст҃ыихъ, ст҃ыма, ст҃ыми, ст҃ымь, ст҃ыхъ, ст҃ыя, ст҃ыѣ, ст҃ыꙗ, ст҃ѣи, ст҃ѣмь, ст҃ѹю, ст҃ѹѹмѹ, ст҃ꙑи, ст҃ꙑимь, ст҃ꙑихъ, ст҃ꙑмъ, ст҃ꙑѧ, ст҃ꙑꙗ, стꙑх҃, стꙑꙗ, с҃таго, с҃тая, с҃таꙗ, с҃тго, с҃темъ, с҃тии, с҃тму, с҃тмъ, с҃тмꙋ, с҃тое, с҃тою, с҃тоѥ, с҃тую, с҃тхъ, с҃ты, с҃тыа, с҃тыи, с҃тымъ, с҃тымь, с҃тых, с҃тыхъ, с҃тыя, с҃тыѧ, с҃тїи, с҃тѣи, с҃тꙋю.

The 2nd highest number of forms (94) was observed with the lemma “русьскыи”: Руской, Рускыи, Рускыя, роускыхъ, роусьскои, роусьскымъ, роусьскыхъ, роус҃скоую, роус҃скым, роус҃скыхъ, роус҃стии, руска, рускаго, руская, рускиа, рускии, руским, рускимъ, руских, рускихъ, руския, рускиѣ, руское, рускои, рускому, руску, рускую, рускым, рускымъ, рускыѣ, рускіх, рускія, рускіѣ, рускїа, рускꙑя, русскии, русскими, русскиѣ, русску, русскую, русскыи, русскымъ, русскых, русскыѣ, русскѣи, русстии, русстѣи, рустие, рустии, рустїи, рустѣмь, русько, руськую, русьская, русьскаꙗ, русьскии, русьскиѣ, русьску, русьскую, русьскы, русьскыи, русьскым, русьскымь, русьскых, русьскыя, русьскыѣ, русьскѣи, русьстии, русьстѣ, русьстѣи, рѹское, рѹскые, рѹсскомѹ, рѹськыи, рѹсьскаꙗ, рѹсьскои, рѹсьскую, рѹсьскѣ, рѹсьскѣи, рѹсьскѹ, рѹсьскѹю, рѹсьскꙑи, рѹсьскꙑихъ, рѹсьскꙑꙗ, рѹсьстїи, рѹсьстѣи, рѹсьстѣмь, рѹс҃ска, рѹс҃скаꙗ, рѹс҃скои, рѹс҃скꙑꙗ, рѹс҃стѣи, рꙋсьскѹю, сѹрьскꙑи.

The 3rd highest number of forms (80) was observed with the lemma “великыи”: Великий, велика, великааго, великаго, великая, великаѧ, великаꙗ, велики, великиа, великии, великим, великими, великимъ, великимь, великим҃, великих, великихъ, великия, великиѧ, велико, великог, великого, великое, великои, великом, великомȣ, великому, великомъ, великомь, великомѹ, великомꙋ, великою, велику, великууму, великую, великъ, великъмь, великы, великыи, великый, великым, великыма, великыми, великымъ, великых, великыя, великыѧ, великыꙗ, великій, великї, великїа, великїи, великѹ, великѹю, великѹѹмѹ, великꙋ, великꙋю, великꙑ, великꙑи, великꙑй, великꙑмь, великꙑхъ, великꙑꙗ, велице, велицемъ, велици, велиции, велицїи, велицѣ, велицѣи, велицѣмь, величии, велкому, велцѣ, велїка, велїкии, велїку, велѣкъ, вѣликою, вѣликоѥ.

ADJ occurs with 8 features: Case (15609; 100% instances), Number (15609; 100% instances), Gender (15504; 99% instances), Degree (9488; 61% instances), Variant (3606; 23% instances), Person (2700; 17% instances), Poss (2700; 17% instances), Reflex (1510; 10% instances)

ADJ occurs with 23 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, Reflex=Yes, Variant=Short

ADJ occurs with 309 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Masc|Number=Sing (733 tokens). Examples: блаженыи, бл҃говѣрныи, бл҃женыи, великыи, великꙑи, ст҃ыи, безбожныи, бл҃гыи, новгородьскꙑи, бж҃ии
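Feature, feature-value, and combination counts can all be derived from the FEATS column of the ADJ tokens. The sketch below assumes the standard CoNLL-U encoding (`Name=Value` pairs joined by `|`, with `_` for no features). The sample strings are hypothetical, and how the page itself treats tokens without features is not stated here.

```python
from collections import Counter


def parse_feats(feats):
    """Turn 'Case=Nom|Number=Sing' into {'Case': 'Nom', 'Number': 'Sing'}."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of four ADJ tokens.
adj_feats = [
    "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing",
    "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing",
    "Case=Gen|Gender=Masc|Number=Sing|Person=3|Poss=Yes|Reflex=Yes",
    "_",
]

# How many tokens carry each feature.
feature_counts = Counter(name for f in adj_feats for name in parse_feats(f))
# How many tokens carry each feature-value pair.
pair_counts = Counter(
    f"{name}={value}" for f in adj_feats for name, value in parse_feats(f).items()
)
# How many tokens carry each complete combination (the raw FEATS string).
combo_counts = Counter(adj_feats)

print(feature_counts.most_common())
print(combo_counts.most_common(1))
```

A multi-valued entry such as Gender=Fem,Masc is kept as a single value by this parser, which matches how it is listed among the feature-value pairs above.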

Relations

ADJ nodes are attached to their parents using 22 different relations: amod (6486; 41% instances), nmod (2917; 19% instances), nsubj (1810; 12% instances), obj (999; 6% instances), obl (641; 4% instances), conj (578; 4% instances), advmod (559; 4% instances), root (549; 4% instances), iobj (295; 2% instances), flat (198; 1% instances), appos (155; 1% instances), xcomp (109; 1% instances), advcl (88; 1% instances), orphan (65; 0% instances), nsubj:pass (49; 0% instances), dislocated (30; 0% instances), ccomp (28; 0% instances), acl (26; 0% instances), obl:agent (19; 0% instances), dep (15; 0% instances), vocative (13; 0% instances), parataxis (2; 0% instances)

Parents of ADJ nodes belong to 12 different parts of speech: NOUN (9227; 59% instances), VERB (3915; 25% instances), ADJ (747; 5% instances), PROPN (679; 4% instances), [root: no parent] (549; 4% instances), DET (124; 1% instances), ADV (113; 1% instances), AUX (90; 1% instances), CCONJ (88; 1% instances), PRON (56; 0% instances), NUM (42; 0% instances), INTJ (1; 0% instances)
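Relation and parent-POS counts like those above can be gathered by resolving each ADJ token's HEAD within its sentence. The sketch below is an illustration under stated assumptions: tokens whose HEAD is 0 are counted under a placeholder label `ROOT`, which seems to correspond to the parent entry with 549 instances (the same count as the root relation). The sample data and the names `sentences` and `adj_attachments` are hypothetical.

```python
from collections import Counter

# Hypothetical sample in CoNLL-U column order:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
ROWS = [
    ["1", "великыи", "великыи", "ADJ", "_", "_", "2", "amod", "_", "_"],
    ["2", "князь", "князь", "NOUN", "_", "_", "0", "root", "_", "_"],
    [],  # sentence boundary
    ["1", "добръ", "добръ", "ADJ", "_", "_", "0", "root", "_", "_"],
    ["2", "есть", "быти", "AUX", "_", "_", "1", "cop", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def sentences(text):
    """Yield each sentence as a list of regular token rows."""
    sent = []
    for line in text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sent.append(cols)
    if sent:
        yield sent


def adj_attachments(text, upos="ADJ"):
    """Count incoming relations and parent UPOS tags of `upos` tokens."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences(text):
        pos_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # HEAD 0 has no token row, so it falls back to the ROOT label.
            parent_pos[pos_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = adj_attachments(SAMPLE)
print(relations.most_common())
print(parent_pos.most_common())
```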

12807 (82%) ADJ nodes are leaves.

1538 (10%) ADJ nodes have one child.

649 (4%) ADJ nodes have two children.

637 (4%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 11.
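The four child-count figures above sum to the 15631 ADJ tokens, so they partition the ADJ nodes by number of children. A minimal sketch of that bucketing follows, assuming each sentence is available as a list of `(id, upos, head)` tuples. This representation, the sample, and the name `child_degree_buckets` are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="ADJ"):
    """Bucket `upos` nodes by child count (0, 1, 2, 3 = three or more)
    and report the highest child count seen."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of tokens in this sentence that name each id as their head.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tok_upos, _ in sent:
            if tok_upos != upos:
                continue
            degree = n_children[tok_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree


# Hypothetical sample: one leaf ADJ and one ADJ root with two children.
sample = [
    [(1, "ADJ", 2), (2, "NOUN", 0)],
    [(1, "ADJ", 0), (2, "AUX", 1), (3, "NOUN", 1)],
]

buckets, max_degree = child_degree_buckets(sample)
print(dict(buckets), max_degree)
```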

Children of ADJ nodes are attached using 26 different relations: case (743; 14% instances), cc (663; 12% instances), conj (532; 10% instances), cop (503; 9% instances), nsubj (447; 8% instances), advmod (371; 7% instances), nmod (313; 6% instances), discourse (290; 5% instances), obl (245; 5% instances), det (200; 4% instances), iobj (195; 4% instances), advcl (140; 3% instances), acl (117; 2% instances), orphan (117; 2% instances), appos (116; 2% instances), mark (83; 2% instances), flat (75; 1% instances), vocative (57; 1% instances), ccomp (50; 1% instances), amod (48; 1% instances), dislocated (17; 0% instances), nummod (14; 0% instances), obj (10; 0% instances), obl:agent (10; 0% instances), aux (7; 0% instances), parataxis (3; 0% instances)

Children of ADJ nodes belong to 13 different parts of speech: NOUN (983; 18% instances), ADJ (747; 14% instances), ADP (745; 14% instances), ADV (686; 13% instances), CCONJ (669; 12% instances), AUX (516; 10% instances), VERB (374; 7% instances), PRON (243; 5% instances), DET (160; 3% instances), PROPN (106; 2% instances), SCONJ (83; 2% instances), NUM (33; 1% instances), INTJ (21; 0% instances)