home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: PRON

There are 59 PRON lemmas (2%), 187 PRON types (3%) and 950 PRON tokens (5%). Out of 16 observed tags, the rank of PRON is: 6 in number of lemmas, 5 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: сон, мон, весе, мезе, тон, те, кона, сонсь, кие, минь

The 10 most frequent PRON types: сон, весе, те, мон, сонзэ, сонсь, мезе, минь, тензэ, сынь

The 10 most frequent ambiguous lemmas: весе (PRON 74, ADV 2, DET 2), те (PRON 58, DET 44), кона (PRON 50, DET 12), истямо (PRON 20, DET 6, ADV 4), кавонест (PRON 11, NUM 1), эрьва (DET 35, PRON 10), се (DET 13, PRON 10), секе (PRON 10, ADV 5, DET 3), неть (PRON 9, DET 1), теке (PRON 8, ADV 7, SCONJ 7, DET 1)

The 10 most frequent ambiguous types: весе (PRON 51, ADV 2, DET 1), те (DET 28, PRON 24), кона (PRON 13, DET 4), конань (PRON 14, DET 3), истямо (PRON 10, ADV 4, DET 4), кавонест (PRON 7, NUM 1), конат (PRON 7, DET 1), неть (PRON 7, DET 2), ки (PRON 4, NOUN 1), мезень (PRON 5, NOUN 1)

Morphology

The form / lemma ratio of PRON is 3.169492 (the average of all parts of speech is 2.044845).

The 1st highest number of forms (14) was observed with the lemma “мезе”: Мейсэ, мезде, мезе, мезекс, мезель, мезем, мезенек, мезень, мезес, мезесь, мезеть, мезть, мейсэль, месть.

The 2nd highest number of forms (14) was observed with the lemma “сон”: Сынсткак, сон, сонгак, сондензэ, сонензэ, сонзо, сонзэ, сонзэяк, сонсь, сыненсткак, сынст, сынь, тензэ, тенст.

The 3rd highest number of forms (14) was observed with the lemma “тон”: Тонгак, Тонтеметь, тенк, теть, тон, тондеть, тонеть, тонсь, тонть, тонь, тыненк, тынк, тынь, тыньгак.

PRON occurs with 19 features: Case (916; 96% instances), Number (906; 95% instances), PronType (850; 89% instances), Person (471; 50% instances), Definite (373; 39% instances), Variant (103; 11% instances), Reflex (71; 7% instances), Animacy (31; 3% instances), NumType (23; 2% instances), Clitic (20; 2% instances), Number[psor] (19; 2% instances), Person[psor] (19; 2% instances), Number[subj] (6; 1% instances), Person[subj] (6; 1% instances), Tense (6; 1% instances), AdvType (5; 1% instances), Derivation (1; 0% instances), Mood (1; 0% instances), Polarity (1; 0% instances)

PRON occurs with 48 feature-value pairs: AdvType=Loc, Animacy=Hum, Case=Abe, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Definite=Def, Definite=Ind, Derivation=PronGak, Mood=Ind, NumType=Card, NumType=Dist, NumType=Sets, Number=Plur, Number=Plur,Sing, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Tense=Past, Tense=Pres, Variant=Long, Variant=Short

PRON occurs with 169 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3|PronType=Prs (77 tokens). Examples: сон

Relations

PRON nodes are attached to their parents using 32 different relations: nsubj (335; 35% instances), det (158; 17% instances), obl (151; 16% instances), obj (95; 10% instances), nmod (59; 6% instances), root (39; 4% instances), obl:agent (23; 2% instances), conj (15; 2% instances), fixed (8; 1% instances), amod (7; 1% instances), nmod:comp (7; 1% instances), nsubj:cop (7; 1% instances), expl (6; 1% instances), advcl (5; 1% instances), ccomp (4; 0% instances), parataxis (4; 0% instances), vocative (4; 0% instances), nmod:poss (3; 0% instances), orphan (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), xcomp (2; 0% instances), advcl:tcl (1; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), dep (1; 0% instances), nmod:gsubj (1; 0% instances), obl:cau (1; 0% instances), obl:inst (1; 0% instances), obl:lmod (1; 0% instances), obl:lmp (1; 0% instances)

Parents of PRON nodes belong to 11 different parts of speech: VERB (580; 61% instances), NOUN (248; 26% instances), (39; 4% instances), PRON (25; 3% instances), ADJ (21; 2% instances), ADV (17; 2% instances), AUX (8; 1% instances), PROPN (8; 1% instances), DET (2; 0% instances), ADP (1; 0% instances), NUM (1; 0% instances)

749 (79%) PRON nodes are leaves.

126 (13%) PRON nodes have one child.

25 (3%) PRON nodes have two children.

50 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 35 different relations: punct (102; 26% instances), nsubj (37; 9% instances), case (32; 8% instances), fixed (23; 6% instances), cop (16; 4% instances), nmod (15; 4% instances), appos (14; 4% instances), conj (14; 4% instances), advmod (13; 3% instances), aux:neg (13; 3% instances), det (11; 3% instances), acl:relcl (10; 3% instances), discourse (9; 2% instances), advcl (8; 2% instances), cc (8; 2% instances), obl (8; 2% instances), advmod:tmod (7; 2% instances), parataxis (7; 2% instances), amod (5; 1% instances), orphan (5; 1% instances), vocative (5; 1% instances), nsubj:cop (4; 1% instances), obl:lmod (4; 1% instances), acl (3; 1% instances), dislocated (3; 1% instances), mark (3; 1% instances), advmod:eval (2; 1% instances), advmod:foc (2; 1% instances), advmod:deg (1; 0% instances), aux:q (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances), nummod (1; 0% instances), xcomp (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: PUNCT (102; 26% instances), NOUN (75; 19% instances), ADP (39; 10% instances), ADV (35; 9% instances), AUX (32; 8% instances), VERB (29; 7% instances), PRON (25; 6% instances), PART (16; 4% instances), ADJ (12; 3% instances), CCONJ (8; 2% instances), PROPN (7; 2% instances), DET (4; 1% instances), INTJ (3; 1% instances), SCONJ (2; 1% instances), NUM (1; 0% instances)