home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: ADJ

There are 490 ADJ lemmas (10%), 882 ADJ types (7%) and 952 ADJ tokens (3%). Out of 17 observed tags, the rank of ADJ is: 5 in number of lemmas, 5 in number of types and 10 in number of tokens.

The 10 most frequent ADJ lemmas: свѧтыи, третии, добрыи, другыи, великыи, борзыи, пѧтыи, лоньскыи, четвертыи, сторовыи

The 10 most frequent ADJ types: добро, проста, велика, ст҃го, ст҃ѣ, гнѣд, добра, здорово, пѧта, трьтиѧ

The 10 most frequent ambiguous lemmas: другыи (ADJ 19, DET 11), ·ѕ҃· (NUM 48, ADJ 3), Василевъ (ADJ 3, PROPN 1), Ивановъ (ADJ 3, PROPN 1), ·в҃· (NUM 156, ADJ 2), ·д҃· (NUM 56, ADJ 2), двои (NUM 9, ADJ 2), ·г҃· (NUM 109, ADV 3, ADJ 1), ·е҃· (NUM 70, ADJ 1), ·з҃· (NUM 30, ADJ 1)

The 10 most frequent ambiguous types: добро (ADJ 8, NOUN 3, SCONJ 1), добре (ADJ 2, ADV 1), добръ (ADJ 2, NOUN 1), другую (ADJ 2, DET 1), пѧте (NUM 3, ADJ 2), пѧть (NUM 13, ADJ 2), сменова (ADJ 2, PROPN 1), :ѕ҃: (NUM 4, ADJ 1), боле (ADV 2, ADJ 1, NUM 1), вели (VERB 3, ADJ 1)

Morphology

The form / lemma ratio of ADJ is 1.800000 (the average of all parts of speech is 2.421872).

The 1st highest number of forms (25) was observed with the lemma “свѧтыи”: (свѧ)тꙑе, (св҃)[т]аго, ст҃[у], свгто, свѧ[т]-[го, свѧтее, свѧтое, свѧтꙑ, свѧтꙑмъ, свѧтꙑѧ, св҃ѧ[т]о(го, сгто, ст҃ѣ, ст҃го, ст҃ее, ст҃ого, ст҃ому, ст҃хъ, ст҃ье, ст҃ѣ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи, ст҃ꙑх, ст[о]го.

The 2nd highest number of forms (23) was observed with the lemma “третии”: [тре]теее, теретеѧ, тр[ьтиѧ], третиꙗ, треть, тре[тии, трет)[и]ѥ, третеѧ, третиеи, третии, третиѥго, третиѧ, третиӏ, третье, третьемъ, третьи, третьюю, третьѣ, третьѣѣ, трьтие, трьтие——–, трьтиѧ, трьтиѧѧ.

The 3rd highest number of forms (18) was observed with the lemma “другыи”: [д]рѹ[г]…, другии, другого, другои, другоѥ, другую, другꙑ, друогꙑ, дрѹгемо, дрѹги:и, дрѹгии, дрѹгоѥ, дрѹгъхо, дрѹгѹю, дрꙋ(г)ꙋю, дрꙋгаѧ, дрꙋгее, дрꙋгои.

ADJ occurs with 11 features: Case (924; 97% instances), Number (893; 94% instances), Gender (850; 89% instances), Poss (297; 31% instances), Variant (271; 28% instances), Degree (24; 3% instances), NumForm (11; 1% instances), Typo (4; 0% instances), Fragment (3; 0% instances), Animacy (1; 0% instances), NumType (1; 0% instances)

ADJ occurs with 26 feature-value pairs: Animacy=Anim, Case=Acc, Case=Acc,Gen, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Fragment=Yes, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, NumForm=Digit, NumForm=Word, NumType=Ord, Number=Count, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, Typo=Yes, Variant=Short

ADJ occurs with 156 feature combinations. The most frequent feature combination is Case=Gen|Gender=Masc|Number=Sing|Poss=Yes (67 tokens). Examples: сменова, ѥванова, (ѡ)фоносова, [фили]пова, [хр҃т҃о]в, би҃ѧ, богусл]алѧ, бѣшкова, бѹѧкъва, вармина

Relations

ADJ nodes are attached to their parents using 24 different relations: amod (474; 50% instances), flat:name (92; 10% instances), root (86; 9% instances), conj (79; 8% instances), nmod (75; 8% instances), obl (42; 4% instances), nsubj (20; 2% instances), obj (14; 1% instances), dep (13; 1% instances), advcl (12; 1% instances), parataxis (8; 1% instances), acl (7; 1% instances), orphan (6; 1% instances), ccomp (4; 0% instances), xcomp (4; 0% instances), iobj (3; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), flat (2; 0% instances), list (2; 0% instances), vocative (2; 0% instances), compound (1; 0% instances), dislocated (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (493; 52% instances), PROPN (135; 14% instances), VERB (106; 11% instances), (86; 9% instances), NUM (68; 7% instances), ADJ (50; 5% instances), PRON (7; 1% instances), X (5; 1% instances), ADP (1; 0% instances), PART (1; 0% instances)

577 (61%) ADJ nodes are leaves.

202 (21%) ADJ nodes have one child.

78 (8%) ADJ nodes have two children.

95 (10%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 8.

Children of ADJ nodes are attached using 30 different relations: punct (181; 25% instances), case (122; 17% instances), cc (79; 11% instances), conj (68; 9% instances), nsubj (62; 8% instances), advmod (49; 7% instances), cop (33; 5% instances), iobj (27; 4% instances), nmod (16; 2% instances), obl (13; 2% instances), dep (12; 2% instances), mark (12; 2% instances), nummod:gov (7; 1% instances), det (6; 1% instances), advcl (5; 1% instances), appos (5; 1% instances), flat (5; 1% instances), parataxis (5; 1% instances), orphan (4; 1% instances), amod (3; 0% instances), vocative (3; 0% instances), aux (2; 0% instances), flat:name (2; 0% instances), list (2; 0% instances), obj (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), reparandum (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (181; 25% instances), ADP (122; 17% instances), CCONJ (75; 10% instances), NOUN (73; 10% instances), ADJ (50; 7% instances), PART (43; 6% instances), AUX (36; 5% instances), PRON (30; 4% instances), DET (22; 3% instances), PROPN (21; 3% instances), VERB (20; 3% instances), NUM (19; 3% instances), SCONJ (14; 2% instances), X (14; 2% instances), ADV (11; 2% instances)