home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: ADV

There are 292 ADV lemmas (10%), 316 ADV types (5%) and 1463 ADV tokens (8%). Out of 16 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: ансяк, кода, пек, истя, седе, мейле, ней, уш, прок, яла

The 10 most frequent ADV types: ансяк, кода, пек, истя, мейле, ней, уш, седе, прок, яла

The 10 most frequent ambiguous lemmas: ансяк (ADV 84, CCONJ 1), кода (ADV 69, SCONJ 5), седе (ADV 51, PART 2), прок (ADV 35, PART 12, SCONJ 3, ADP 1, CCONJ 1), зярдо (ADV 18, SCONJ 16), ков (ADV 17, NOUN 5), течи (ADV 16, NOUN 4), нать (ADV 14, PART 2), икеле (ADP 12, ADV 11, NOUN 1), ламо (DET 16, ADV 11, PRON 2)

The 10 most frequent ambiguous types: ансяк (ADV 57, CCONJ 1), кода (ADV 52, SCONJ 5), седе (ADV 33, PART 2), прок (ADV 32, PART 10, SCONJ 3, ADP 1, CCONJ 1), зярдо (SCONJ 14, ADV 11), ков (ADV 7, NOUN 3), нать (ADV 11, PART 1), ламо (ADV 9, DET 8), икеле (ADV 9, ADP 6), икелев (ADV 8, ADP 5)

Morphology

The form / lemma ratio of ADV is 1.082192 (the average of all parts of speech is 2.044845).

The 1st highest number of forms (3) was observed with the lemma “алкукс”: алкукс, алкукскак, алкуксонь.

The 2nd highest number of forms (3) was observed with the lemma “весть”: вестешка, весть, вестькак.

The 3rd highest number of forms (3) was observed with the lemma “истя”: истя, истянь, истяяк.

ADV occurs with 16 features: AdvType (808; 55% instances), Case (310; 21% instances), PronType (164; 11% instances), Degree (64; 4% instances), Definite (51; 3% instances), Clitic (42; 3% instances), NumType (35; 2% instances), Number (28; 2% instances), Derivation (20; 1% instances), Number[subj] (12; 1% instances), Person[subj] (12; 1% instances), Tense (11; 1% instances), Evident (5; 0% instances), Style (3; 0% instances), Typo (2; 0% instances), Mood (1; 0% instances)

ADV occurs with 47 feature-value pairs: AdvType=Deg, AdvType=Ideoph, AdvType=Loc, AdvType=Man, AdvType=Mod, AdvType=Sta, AdvType=Tim, Case=Abl, Case=Cmp, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Lat, Case=Loc, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Definite=Ind, Degree=Cmp, Degree=Sup, Derivation=AdvstO, Derivation=Dimin, Derivation=GenAttr, Derivation=Shka, Evident=Nfh, Mood=Imp, NumType=Dist, NumType=Mult, NumType=Ord, NumType=OrdMult, Number=Plur,Sing, Number=Sing, Number[subj]=Plur, Number[subj]=Sing, Person[subj]=2, Person[subj]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Rel, PronType=Tot, Style=Arch, Tense=Past, Tense=Pres, Typo=Yes

ADV occurs with 121 feature combinations. The most frequent feature combination is _ (361 tokens). Examples: ансяк, прок, истя, нать, парсте, мик, секс, ламо, стяко, теке

Relations

ADV nodes are attached to their parents using 42 different relations: advmod (445; 30% instances), advmod:tmod (418; 29% instances), obl (88; 6% instances), advmod:deg (81; 6% instances), mark (56; 4% instances), root (43; 3% instances), advmod:foc (38; 3% instances), advmod:lmod (35; 2% instances), advmod:lto (30; 2% instances), advmod:eval (29; 2% instances), advmod:comp (26; 2% instances), case (22; 2% instances), obl:tmod (20; 1% instances), advcl (17; 1% instances), conj (16; 1% instances), fixed (15; 1% instances), advmod:mmod (13; 1% instances), orphan (9; 1% instances), acl (7; 0% instances), advmod:lmp (5; 0% instances), discourse (5; 0% instances), obl:lmp (5; 0% instances), advmod:lfrom (4; 0% instances), nmod:comp (4; 0% instances), advcl:tcl (3; 0% instances), ccomp (3; 0% instances), compound:prt (3; 0% instances), parataxis (3; 0% instances), appos (2; 0% instances), cc (2; 0% instances), cc:preconj (2; 0% instances), compound (2; 0% instances), obl:lto (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), advmod:cau (1; 0% instances), amod (1; 0% instances), compound:redup (1; 0% instances), nmod (1; 0% instances), obl:lfrom (1; 0% instances), obl:lmod (1; 0% instances), reparandum (1; 0% instances)

Parents of ADV nodes belong to 12 different parts of speech: VERB (1031; 70% instances), NOUN (136; 9% instances), ADV (117; 8% instances), ADJ (65; 4% instances), (43; 3% instances), PRON (35; 2% instances), AUX (12; 1% instances), NUM (9; 1% instances), DET (7; 0% instances), PROPN (4; 0% instances), ADP (2; 0% instances), SCONJ (2; 0% instances)

1102 (75%) ADV nodes are leaves.

243 (17%) ADV nodes have one child.

71 (5%) ADV nodes have two children.

47 (3%) ADV nodes have three or more children.

The highest child degree of a ADV node is 9.

Children of ADV nodes are attached using 34 different relations: punct (187; 33% instances), aux:neg (56; 10% instances), advmod:deg (40; 7% instances), nmod:comp (33; 6% instances), nsubj (27; 5% instances), conj (26; 5% instances), fixed (25; 4% instances), obl (25; 4% instances), advmod (24; 4% instances), discourse (23; 4% instances), advmod:comp (21; 4% instances), advmod:tmod (13; 2% instances), cc (13; 2% instances), appos (9; 2% instances), advcl (7; 1% instances), cop (7; 1% instances), parataxis (7; 1% instances), compound (4; 1% instances), orphan (4; 1% instances), advmod:foc (3; 1% instances), mark (3; 1% instances), nsubj:cop (3; 1% instances), aux:aspect (2; 0% instances), aux:q (2; 0% instances), csubj (2; 0% instances), acl:relcl (1; 0% instances), advcl:tcl (1; 0% instances), advmod:eval (1; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), nummod (1; 0% instances), obl:lmp (1; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PUNCT (187; 33% instances), ADV (117; 20% instances), NOUN (74; 13% instances), AUX (68; 12% instances), VERB (43; 7% instances), PART (33; 6% instances), PRON (17; 3% instances), CCONJ (12; 2% instances), ADJ (7; 1% instances), INTJ (5; 1% instances), PROPN (5; 1% instances), SCONJ (3; 1% instances), ADP (2; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)