home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: ADJ

There are 181 ADJ lemmas (6%), 246 ADJ types (4%) and 670 ADJ tokens (4%). Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent ADJ lemmas: од, кодамо, покш, паро, якстере, омбоце, мазый, сэрей, рыжой, пиже

The 10 most frequent ADJ types: од, паро, покш, кодамо, якстере, мазый, сэрей, омбоце, пиже, васень

The 10 most frequent ambiguous lemmas: од (ADJ 33, NOUN 1), кодамо (ADJ 27, PRON 2, ADV 1), покш (ADJ 28, NOUN 2), паро (ADJ 27, NOUN 6), якстере (ADJ 20, NOUN 1), омбоце (ADJ 19, ADV 1), пиже (ADJ 12, NOUN 4), берянь (ADJ 10, NOUN 3), виде (INTJ 9, ADJ 8), стяко (ADJ 8, ADV 6)

The 10 most frequent ambiguous types: паро (ADJ 20, NOUN 1), кодамо (ADJ 15, PRON 2), пиже (ADJ 10, NOUN 1), берянь (ADJ 8, NOUN 2), виде (ADJ 6, INTJ 3), пешксе (ADJ 7, ADV 2), стяко (ADJ 7, ADV 5), лембе (ADJ 6, NOUN 1), кодамояк (ADJ 5, PRON 1), васенце (ADJ 3, DET 1)

Morphology

The form / lemma ratio of ADJ is 1.359116 (the average of all parts of speech is 2.044845).

The 1st highest number of forms (5) was observed with the lemma “омбоце”: омбоце, омбоценть, омбоцес, омбоцесь, омбоцесэнть.

The 2nd highest number of forms (4) was observed with the lemma “кондямо”: кондямо, кондямокс, кондямоль, коньдят.

The 3rd highest number of forms (4) was observed with the lemma “покш”: покш, покшоль, покшось, покшт.

ADJ occurs with 14 features: Number (232; 35% instances), Case (223; 33% instances), Definite (212; 32% instances), Number[subj] (60; 9% instances), Person[subj] (60; 9% instances), Tense (60; 9% instances), PronType (32; 5% instances), Derivation (29; 4% instances), NumType (27; 4% instances), Clitic (7; 1% instances), Style (4; 1% instances), AdpType (1; 0% instances), Number[psor] (1; 0% instances), Person[psor] (1; 0% instances)

ADJ occurs with 30 feature-value pairs: AdpType=Post, Case=Abl, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Tra, Clitic=Add, Definite=Def, Definite=Ind, Derivation=Dimin, Derivation=GenAttr, Derivation=VerbYks, NumType=Ord, Number=Plur, Number=Plur,Sing, Number=Sing, Number[psor]=Sing, Number[subj]=Plur, Number[subj]=Sing, Person[psor]=3, Person[subj]=2, Person[subj]=3, PronType=Ind, PronType=Int, PronType=Rel, Style=Arch, Tense=Past, Tense=Pres

ADJ occurs with 50 feature combinations. The most frequent feature combination is _ (361 tokens). Examples: од, покш, паро, якстере, пиже, кедровой, тусто, идем, мазы, стяко

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (448; 67% instances), root (68; 10% instances), conj (53; 8% instances), nsubj (19; 3% instances), advmod (10; 1% instances), advcl (9; 1% instances), fixed (8; 1% instances), obj (8; 1% instances), xcomp (8; 1% instances), compound (6; 1% instances), ccomp (4; 1% instances), nmod (4; 1% instances), obl (4; 1% instances), parataxis (4; 1% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), appos (3; 0% instances), advmod:deg (1; 0% instances), advmod:tmod (1; 0% instances), flat (1; 0% instances), nmod:gsubj (1; 0% instances), nmod:poss (1; 0% instances), nsubj:cop (1; 0% instances), obl:inst (1; 0% instances), orphan (1; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (459; 69% instances), VERB (81; 12% instances), (68; 10% instances), ADJ (35; 5% instances), PRON (12; 2% instances), ADV (7; 1% instances), DET (2; 0% instances), NUM (2; 0% instances), PROPN (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances)

442 (66%) ADJ nodes are leaves.

99 (15%) ADJ nodes have one child.

45 (7%) ADJ nodes have two children.

84 (13%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 8.

Children of ADJ nodes are attached using 37 different relations: punct (158; 30% instances), nsubj (70; 13% instances), conj (53; 10% instances), advmod:deg (32; 6% instances), obl (25; 5% instances), aux:neg (24; 5% instances), cop (17; 3% instances), advmod (15; 3% instances), cc (14; 3% instances), nsubj:cop (13; 2% instances), advmod:tmod (10; 2% instances), parataxis (8; 2% instances), nmod (7; 1% instances), advcl (6; 1% instances), advmod:comp (6; 1% instances), det (6; 1% instances), nmod:comp (6; 1% instances), appos (5; 1% instances), case (5; 1% instances), csubj (5; 1% instances), discourse (5; 1% instances), compound (4; 1% instances), mark (4; 1% instances), amod (3; 1% instances), fixed (3; 1% instances), obl:lmod (3; 1% instances), vocative (3; 1% instances), xcomp (3; 1% instances), aux:q (2; 0% instances), advmod:foc (1; 0% instances), cc:preconj (1; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), obj (1; 0% instances), obl:lmp (1; 0% instances), obl:tmod (1; 0% instances), orphan (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (158; 30% instances), NOUN (107; 20% instances), ADV (65; 12% instances), VERB (48; 9% instances), AUX (46; 9% instances), ADJ (35; 7% instances), PRON (21; 4% instances), CCONJ (15; 3% instances), PART (7; 1% instances), DET (6; 1% instances), ADP (5; 1% instances), PROPN (4; 1% instances), INTJ (3; 1% instances), NUM (2; 0% instances), SCONJ (1; 0% instances)