
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: ADJ

There are 1403 ADJ lemmas (14%), 4199 ADJ types (16%) and 8973 ADJ tokens (7%). Out of 17 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 9 in number of tokens.

The 10 most frequent ADJ lemmas: полоцкий, великий, рижский, божий, лохвицкий, добрый, святый, милый, будучий, 1655

The 10 most frequent ADJ types: великии, полоцкии, полоцкого, милым, полоцког(о), Бож(ъ)ю, ризког(о), великого, полоцких, ризкого

The 10 most frequent ambiguous lemmas: святый (ADJ 191, NOUN 2), 14 (ADJ 71, NUM 15), 12 (ADJ 42, NUM 15), 10 (ADJ 27, NUM 22), 9 (ADJ 26, NUM 5), 13 (ADJ 25, NUM 3), 4 (ADJ 25, NUM 17), 11 (ADJ 24, NUM 6), 15 (ADJ 24, NUM 10), 8 (ADJ 23, NUM 12)

The 10 most frequent ambiguous types: 14 (ADJ 29, NUM 8), 12 (ADJ 25, NUM 9), 9 (ADJ 22, NUM 3), 13 (ADJ 21, NUM 2), [14] (ADJ 21, NUM 3), 11 (ADJ 19, NUM 4), 2 (ADJ 17, NUM 6), 6 (ADJ 17, NUM 10), ради (ADJ 9, ADP 2), 5 (NUM 17, ADJ 16)

Morphology

The form / lemma ratio of ADJ is 2.992872 (the average of all parts of speech is 2.698737).
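The ratio above appears consistent with dividing the number of ADJ types reported earlier (4199) by the number of ADJ lemmas (1403). The following is a minimal sketch of how such a ratio could be computed, assuming the treebank has already been read into (form, lemma, UPOS) triples. The function name `form_lemma_ratio` and the toy data are hypothetical and are not taken from the script that generated this page.

```python
from collections import defaultdict


def form_lemma_ratio(tokens, upos):
    """Average number of distinct forms per lemma for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples (an assumed input shape).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        raise ValueError(f"no tokens tagged {upos}")
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Toy data: the ADJ forms occur on this page; the NOUN row is invented
# purely to show that other tags are ignored.
toy = [
    ("великии", "великий", "ADJ"),
    ("великого", "великий", "ADJ"),
    ("великии", "великий", "ADJ"),   # repeated form, counted once
    ("полоцкии", "полоцкий", "ADJ"),
    ("князь", "князь", "NOUN"),
]
toy_ratio = form_lemma_ratio(toy, "ADJ")   # 3 distinct forms / 2 lemmas

# Cross-check against the totals reported on this page.
page_ratio = 4199 / 1403
```

On the toy data the result is 1.5; the page totals give a value that rounds to the reported 2.992872.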

The 1st highest number of forms (102) was observed with the lemma “божий”: Б(о)ж(ъ)ю, Б(о)жег(о), Б(о)жего, Б(о)жиа, Б(о)жиего, Б(о)жиеи, Б(о)жии, Б(о)жиимъ, Б(о)жию, Б(о)жия, Б(о)жою, Б(о)жье, Б(о)жьег(о), Б(о)жьего, Б(о)жьее, Б(о)жьеи, Б(о)жьем, Б(о)жьемъ, Б(о)жьемь, Б(о)жьею, Б(о)жьи, Б(о)жью, Б(о)жьѧ, Б(о)жіи, Б(о)жію, Б(о)жія, Б(о)жіѧ, Б(о)зъ, Б(ож)ии, Б(ож)ье, Б(ож)ьею, Б(ож)ьими, Б(ож)ью, Б[о]жіа, Б[о]жіею, Бож(ъ)е, Бож(ъ)его, Бож(ъ)ее, Бож(ъ)еи, Бож(ъ)емъ, Бож(ъ)ею, Бож(ъ)и, Бож(ъ)имъ, Бож(ъ)ю, Бож(ъ)ѧ, Бож(ь)е, Божего, Божее, Божеи, Божею, Божии, Божия, Божое, Божъе, Божъего, Божъеи, Божъю, Божыю, Божьего, Божьем, Божьею, Божьи, Божьим, Божьимъ, Божью, Божю, Божіею, Бѡж(ъ)ю, Бѡж(ь)его, Бѡж(ь)ю, Бѡж(ь)я, бж̃ее, бж̃ей, бж̃емꙋ, бж̃ею, бж̃ого, бж̃ой, бж̃іей, бж̃їа, бж̃їе(й), бж̃їей, бж̃їею, бж̃їи, бж̃їими, бж̃їихъ, бж̃їй, бж̃їю, бж҃ею, бж҃и(и), бж҃иею, бж҃ое, бж҃ою, бж҃у, бж҃ую, бж҃ьемъ, бж҃ьє, бж҃іѧ, бож[ъ]ю, б҃жи(м), б҃жо(и), б҃жую, б҃жію.

The 2nd highest number of forms (92) was observed with the lemma “полоцкий”: Пол]оцкомъ, Полоцкая, Полоцкѡг(о), Полоцого, Полоцъкое, Полоцъкои, Полоцъком, Полоцъкомъ, Полоцькая, Полоцькое, Полочькаѧ, Полочьки, Полочьскаѧ, Полочьскую, Полѡтьцкыи, по[лоцк]ых, пол(о)цких, пол(о)цког(о), пол(о)цкые, пол(о)цкыи, пол(о)цкых, пол(о)цьког(о), пол(о)цькых, пол(оцкии), пол(оцкому), пололоцких, полотскии, полотског(о), полотского, полотьского, полоц(кии), полоцкаг(о), полоцкаѧ, полоцкго, полоцкие, полоцкии, полоцкий, полоцким, полоцкими, полоцкимъ, полоцких, полоцкихъ, полоцкиѣ, полоцког(о), полоцкого, полоцкогѡ, полоцкое, полоцкои, полоцком, полоцкому, полоцкомъ, полоцкомꙋ, полоцкою, полоцкую, полоцкые, полоцкыи, полоцкым, полоцкымъ, полоцкых, полоцкыхъ, полоцкіи, полоцъкаѧ, полоцъкие, полоцъкии, полоцъкиие, полоцъкими, полоцъкимъ, полоцъких, полоцъкихъ, полоцъкого, полоцъкому, полоцъкомꙋ, полоцъкою, полоцъкую, полоцъкы, полоцьки, полоцькии, полоцькими, полоцьких, полоцькихъ, полоцьког(о), полоцького, полоцькыи, полоцькых, полоцькыхъ, полочькии, полочькиих, полочьког(о), полочького, полочьскы, полѡтцкымъ, поцькыи.

The 3rd highest number of forms (74) was observed with the lemma “лохвицкий”: Ло(х)ви(ц)кои, Ло(х)ви(ц)кому, Лови(ц)кого, Лофи(ц)ким(м), Лохвицкого, ло(х)., ло(х)ви(ц)каѧ, ло(х)ви(ц)ки(м), ло(х)ви(ц)ки(м)ъ, ло(х)ви(ц)кимъ, ло(х)ви(ц)ко(г)[о], ло(х)ви(ц)ко(и), ло(х)ви(ц)ко(й), ло(х)ви(ц)ко(м), ло(х)ви(ц)кого, ло(х)ви[ц]кого, ло(х)вицки(й), ло(х)вицки(м), ло(х)вицкий, ло(х)вицкиє, ло(х)вицко(м), ло(х)вицкого, ло(х)вицъкая, ло(х)вицъки(и), ло(х)вицъки(й), ло(х)вицъки(м), ло(х)вицъки(х), ло(х)вицъкихъ, ло(х)вицъкиє, ло(х)вицъко(м), ло(х)вицъкого, ло(х)вицъкомъ, ло(х)вицъкіє, лофи(ц)кая, лофи(ц)ки(и), лофи(ц)ки(й), лофи(ц)ки(м), лофи(ц)ки(м)ъ, лофи(ц)ки(х), лофи(ц)кимъ, лофи(ц)ко(й), лофи(ц)ко(м)ъ, лофи(ц)кого, лофи(ц)комъ, лофицки(м), лофицкимъ, лофицкого, лофицъкая, лофицъки(и), лофицъкимъ, лофицъкого, лофицьки(м, лофицькому, лохви(ц)ки(и), лохви(ц)ки(й), лохви(ц)ки(м), лохви(ц)ки(м)ъ, лохви(ц)ки(х), лохви(ц)ким, лохви(ц)кимъ, лохви(ц)кихъ, лохви(ц)ко(й), лохви(ц)ко(м), лохви(ц)кого, лохви(ц)комъ, лохви(ц)кіє, лохвицки(м), лохвицкимъ, лохвицъки(м), лохвицъки(х), лохвицъкого, лохвыцкого, лохъви(ц)кого, лохъвицъки(х).

ADJ occurs with 11 features: Case (8891; 99% instances), Number (8891; 99% instances), Gender (8889; 99% instances), Degree (7921; 88% instances), NumForm (1024; 11% instances), NumType (1024; 11% instances), Variant (925; 10% instances), Animacy (187; 2% instances), Abbr (27; 0% instances), Poss (3; 0% instances), Typo (2; 0% instances)

ADJ occurs with 26 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Combi, NumForm=Cyril, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Ord, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, Typo=Yes, Variant=Short

ADJ occurs with 166 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Masc|Number=Sing (909 tokens). Examples: великии, полоцкии, троцкии, жомоитскии, виленскии, дворныи, полоцъкии, литовскии, вил(енскии), пол(ь)скии

Relations

ADJ nodes are attached to their parents using 27 different relations: amod (7081; 79% instances), conj (699; 8% instances), dep (197; 2% instances), obl (160; 2% instances), acl (153; 2% instances), root (133; 1% instances), nmod (111; 1% instances), obj (97; 1% instances), advcl (65; 1% instances), xcomp (51; 1% instances), ccomp (38; 0% instances), flat:name (32; 0% instances), flat (27; 0% instances), nsubj (27; 0% instances), iobj (22; 0% instances), parataxis (19; 0% instances), appos (16; 0% instances), acl:relcl (12; 0% instances), obl:depict (9; 0% instances), dislocated (5; 0% instances), orphan (5; 0% instances), reparandum (5; 0% instances), advmod (4; 0% instances), list (2; 0% instances), obl:tmod (1; 0% instances), parataxis:discourse (1; 0% instances), vocative (1; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (7151; 80% instances), ADJ (806; 9% instances), VERB (530; 6% instances), PROPN (283; 3% instances), [no parent; the ADJ node is the root] (133; 1% instances), PRON (29; 0% instances), DET (26; 0% instances), ADV (9; 0% instances), NUM (4; 0% instances), PART (1; 0% instances), X (1; 0% instances)

6806 (76%) ADJ nodes are leaves.

1269 (14%) ADJ nodes have one child.

366 (4%) ADJ nodes have two children.

532 (6%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 12.
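The child-degree figures above (leaves, one child, two children, three or more, and the maximum) can be reproduced in principle by counting, for each ADJ node, how many tokens in the same sentence name it as their head. The sketch below assumes standard tab-separated CoNLL-U input with ID in column 1, UPOS in column 4 and HEAD in column 7; the function name `child_degree_buckets` and the English sample sentence are hypothetical illustrations, not part of the treebank or its tooling.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos="ADJ"):
    """Return (buckets, max_degree) for nodes with the given UPOS.

    buckets maps 0, 1, 2 to exact child counts and 3 to "three or more".
    Multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1) are skipped.
    """
    buckets = Counter()
    max_degree = 0
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            if cols[0].isdigit():          # keep only basic tree nodes
                rows.append(cols)
        children = Counter(cols[6] for cols in rows)   # HEAD id -> child count
        for cols in rows:
            if cols[3] == upos:
                degree = children[cols[0]]
                max_degree = max(max_degree, degree)
                buckets[min(degree, 3)] += 1
    return buckets, max_degree


# Invented sample sentence: "The dog is very big and old ."
SAMPLE_ROWS = [
    ("1", "The", "the", "DET", "2", "det"),
    ("2", "dog", "dog", "NOUN", "5", "nsubj"),
    ("3", "is", "be", "AUX", "5", "cop"),
    ("4", "very", "very", "ADV", "5", "advmod"),
    ("5", "big", "big", "ADJ", "0", "root"),
    ("6", "and", "and", "CCONJ", "7", "cc"),
    ("7", "old", "old", "ADJ", "5", "conj"),
    ("8", ".", ".", "PUNCT", "5", "punct"),
]
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, tag, head, rel in SAMPLE_ROWS
)

buckets, max_degree = child_degree_buckets(SAMPLE)
```

In the sample, "big" has five children and "old" has one, so the buckets hold one node with three or more children and one node with exactly one child. As a consistency check on the page itself, the four reported groups (6806, 1269, 366 and 532) sum to 8973, the total number of ADJ tokens.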

Children of ADJ nodes are attached using 41 different relations: punct (766; 18% instances), conj (707; 16% instances), cc (650; 15% instances), obl (265; 6% instances), advmod (260; 6% instances), case (233; 5% instances), nsubj (203; 5% instances), dep (199; 5% instances), cop (183; 4% instances), iobj (135; 3% instances), det (105; 2% instances), mark (99; 2% instances), nmod (65; 1% instances), advcl (62; 1% instances), compound (57; 1% instances), xcomp (53; 1% instances), flat:name (48; 1% instances), amod (41; 1% instances), csubj (31; 1% instances), acl:relcl (29; 1% instances), obj (26; 1% instances), appos (22; 1% instances), parataxis (19; 0% instances), aux (15; 0% instances), ccomp (9; 0% instances), obl:tmod (8; 0% instances), nummod:gov (6; 0% instances), orphan (6; 0% instances), nsubj:pass (5; 0% instances), reparandum (5; 0% instances), acl (4; 0% instances), nsubj:outer (4; 0% instances), aux:pass (3; 0% instances), dislocated (3; 0% instances), expl (3; 0% instances), flat (3; 0% instances), nummod (3; 0% instances), parataxis:discourse (3; 0% instances), discourse (2; 0% instances), vocative (2; 0% instances), goeswith (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: ADJ (806; 19% instances), PUNCT (766; 18% instances), CCONJ (648; 15% instances), NOUN (528; 12% instances), PRON (244; 6% instances), VERB (229; 5% instances), ADP (224; 5% instances), AUX (202; 5% instances), ADV (169; 4% instances), DET (158; 4% instances), SCONJ (107; 2% instances), PART (103; 2% instances), PROPN (96; 2% instances), NUM (49; 1% instances), X (8; 0% instances), SYM (6; 0% instances)