home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: ADV

There are 316 ADV lemmas (10%), 336 ADV types (5%) and 1705 ADV tokens (8%). Out of 16 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: ансяк, кода, пек, истя, ней, седе, мейле, уш, прок, яла

The 10 most frequent ADV types: ансяк, кода, пек, истя, мейле, ней, уш, седе, прок, яла

The 10 most frequent ambiguous lemmas: ансяк (ADV 100, CCONJ 1), кода (ADV 80, SCONJ 5), седе (ADV 51, PART 2), прок (ADV 37, PART 12, SCONJ 3, ADP 1, CCONJ 1), ков (ADV 21, NOUN 7), зярдо (ADV 21, SCONJ 16), течи (ADV 19, NOUN 4), нать (ADV 15, PART 2), васоло (ADV 13, ADJ 1), икеле (ADP 14, ADV 13, ADJ 1, NOUN 1)

The 10 most frequent ambiguous types: ансяк (ADV 68, CCONJ 1), кода (ADV 58, SCONJ 5), седе (ADV 33, PRON 3, PART 2), прок (ADV 33, PART 10, SCONJ 3, ADP 1, CCONJ 1), ков (ADV 11, NOUN 4), зярдо (SCONJ 14, ADV 13), седеяк (ADV 18, PRON 3), нать (ADV 12, PART 1), теке (ADV 11, SCONJ 7, PRON 6, DET 2), икеле (ADV 10, ADP 7)

Morphology

The form / lemma ratio of ADV is 1.063291 (the average of all parts of speech is 2.081106).

The 1st highest number of forms (3) was observed with the lemma “аламо”: аламо, аламодо, аламос.

The 2nd highest number of forms (3) was observed with the lemma “алкукс”: алкукс, алкукскак, алкуксонь.

The 3rd highest number of forms (3) was observed with the lemma “весть”: вестешка, весть, вестькак.

ADV occurs with 16 features: AdvType (864; 51% instances), Case (365; 21% instances), PronType (201; 12% instances), Degree (75; 4% instances), Definite (70; 4% instances), Clitic (44; 3% instances), Number (42; 2% instances), NumType (38; 2% instances), Derivation (18; 1% instances), Number[subj] (15; 1% instances), Person[subj] (15; 1% instances), Tense (14; 1% instances), Evident (5; 0% instances), Style (3; 0% instances), Typo (2; 0% instances), Mood (1; 0% instances)

ADV occurs with 48 feature-value pairs: AdvType=Deg, AdvType=Ideoph, AdvType=Loc, AdvType=Man, AdvType=Mod, AdvType=Sta, AdvType=Tim, Case=Abl, Case=Cmp, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Lat, Case=Loc, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Definite=Ind, Degree=Cmp, Degree=Dim, Degree=Sup, Derivation=AdvstO, Derivation=GenAttr, Derivation=Shka, Evident=Nfh, Mood=Imp, NumType=Dist, NumType=Mult, NumType=Ord, NumType=OrdMult, Number=Plur, Number=Plur,Sing, Number=Sing, Number[subj]=Plur, Number[subj]=Sing, Person[subj]=2, Person[subj]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Rel, PronType=Tot, Style=Arch, Tense=Past, Tense=Pres, Typo=Yes

ADV occurs with 133 feature combinations. The most frequent feature combination is _ (473 tokens). Examples: ансяк, прок, истя, нать, парсте, секс, мик, теке, ялатеке, овсе

Relations

ADV nodes are attached to their parents using 38 different relations: advmod (558; 33% instances), advmod:tmod (471; 28% instances), advmod:deg (91; 5% instances), advmod:lmod (90; 5% instances), obl (89; 5% instances), mark (63; 4% instances), root (50; 3% instances), advmod:eval (42; 2% instances), advmod:foc (38; 2% instances), advmod:cmp (32; 2% instances), case (24; 1% instances), conj (20; 1% instances), obl:tmod (20; 1% instances), advcl (19; 1% instances), fixed (15; 1% instances), advmod:mmod (14; 1% instances), obl:lmod (9; 1% instances), orphan (9; 1% instances), acl (7; 0% instances), discourse (7; 0% instances), obl:cmp (5; 0% instances), advcl:tcl (3; 0% instances), ccomp (3; 0% instances), compound:prt (3; 0% instances), amod (2; 0% instances), appos (2; 0% instances), cc (2; 0% instances), cc:preconj (2; 0% instances), compound (2; 0% instances), dep (2; 0% instances), nmod (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), advmod:cau (1; 0% instances), compound:redup (1; 0% instances), obj (1; 0% instances), reparandum (1; 0% instances)

Parents of ADV nodes belong to 12 different parts of speech: VERB (1196; 70% instances), NOUN (159; 9% instances), ADV (127; 7% instances), ADJ (98; 6% instances), (50; 3% instances), PRON (39; 2% instances), DET (11; 1% instances), NUM (10; 1% instances), AUX (7; 0% instances), PROPN (4; 0% instances), ADP (2; 0% instances), SCONJ (2; 0% instances)

1307 (77%) ADV nodes are leaves.

268 (16%) ADV nodes have one child.

77 (5%) ADV nodes have two children.

53 (3%) ADV nodes have three or more children.

The highest child degree of a ADV node is 9.

Children of ADV nodes are attached using 38 different relations: punct (207; 32% instances), aux:neg (61; 10% instances), advmod:deg (42; 7% instances), obl:cmp (40; 6% instances), nsubj (30; 5% instances), obl (29; 5% instances), conj (28; 4% instances), advmod (26; 4% instances), fixed (25; 4% instances), discourse (24; 4% instances), advmod:cmp (22; 3% instances), cc (15; 2% instances), advmod:tmod (14; 2% instances), appos (9; 1% instances), parataxis (9; 1% instances), advcl (7; 1% instances), cop (7; 1% instances), nsubj:cop (5; 1% instances), compound (4; 1% instances), mark (4; 1% instances), orphan (4; 1% instances), advmod:foc (3; 0% instances), csubj (3; 0% instances), advcl:cmp (2; 0% instances), advmod:eval (2; 0% instances), aux:aspect (2; 0% instances), expl (2; 0% instances), obl:lmod (2; 0% instances), vocative (2; 0% instances), acl:relcl (1; 0% instances), advcl:tcl (1; 0% instances), aux:q (1; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), nummod (1; 0% instances), reparandum (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PUNCT (207; 32% instances), ADV (127; 20% instances), NOUN (87; 14% instances), AUX (72; 11% instances), VERB (48; 8% instances), PART (32; 5% instances), PRON (25; 4% instances), CCONJ (14; 2% instances), ADJ (8; 1% instances), INTJ (6; 1% instances), PROPN (5; 1% instances), SCONJ (4; 1% instances), ADP (2; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)