Treebank Statistics: UD_Erzya-JR: POS Tags: PRON
There are 62 PRON lemmas (2%), 210 PRON types (3%) and 1199 PRON tokens (6%).
Out of 16 observed tags, the rank of PRON is: 6 in number of lemmas, 6 in number of types and 5 in number of tokens.
The 10 most frequent PRON lemmas: сон, мон, тон, весе, мезе, те, кона, сонсь, кие, кияк
The 10 most frequent PRON types: сон, весе, те, мон, сонзэ, тон, мезе, сонсь, минь, сынь
The 10 most frequent ambiguous lemmas: весе (PRON 80, DET 3, ADV 2), те (PRON 70, DET 58), кона (PRON 54, DET 11), се (PRON 20, DET 13), секе (PRON 20, DET 2), истямо (DET 17, PRON 17, ADV 4, ADJ 3), вейке (NUM 37, PRON 14), ламо (ADV 14, PRON 13, DET 10), неть (PRON 12, DET 1), лия (DET 11, PRON 10, ADV 2)
The 10 most frequent ambiguous types: весе (PRON 56, ADV 2, DET 2), те (DET 39, PRON 30), сынь (PRON 17, VERB 1), конань (PRON 16, DET 1), кона (PRON 13, DET 4), неть (PRON 8, DET 2), секе (PRON 12, DET 1), ки (PRON 6, NOUN 1), истямо (DET 9, PRON 9, ADV 4, ADJ 1), конат (PRON 7, DET 2)
- весе
- те
- сынь
- конань
- кона
- неть
- секе
- ки
- истямо
- конат
Morphology
The form / lemma ratio of PRON is 3.387097 (the average of all parts of speech is 2.079051).
The 1st highest number of forms (15) was observed with the lemma “мезе”: мезде, мезе, мезекс, мезель, мезем, мезенек, мезень, мезес, мезесь, мезеть, мезть, мейсэ, мейсэль, мень, месть.
The 2nd highest number of forms (14) was observed with the lemma “сон”: Сынсткак, сон, сонгак, сондензэ, сонензэ, сонзо, сонзэ, сонзэяк, сонсь, сыненсткак, сынст, сынь, тензэ, тенст.
The 3rd highest number of forms (14) was observed with the lemma “тон”: Тонгак, Тонтеметь, тенк, теть, тон, тондеть, тонеть, тонсь, тонть, тонь, тыненк, тынк, тынь, тыньгак.
PRON occurs with 18 features: PronType (1199; 100% instances), Case (1161; 97% instances), Number (1155; 96% instances), Person (609; 51% instances), Definite (465; 39% instances), Variant (135; 11% instances), Reflex (85; 7% instances), NumType (35; 3% instances), Animacy (31; 3% instances), Clitic (30; 3% instances), ExtPos (22; 2% instances), Number[psor] (14; 1% instances), Person[psor] (14; 1% instances), Number[subj] (6; 1% instances), Person[subj] (6; 1% instances), Tense (6; 1% instances), Derivation (1; 0% instances), Polarity (1; 0% instances)
PRON occurs with 49 feature-value pairs: Animacy=Hum, Case=Abe, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Definite=Def, Definite=Ind, Derivation=PronGak, ExtPos=ADV, ExtPos=PRON, NumType=Card, NumType=Dist, NumType=Sets, Number=Plur, Number=Plur,Sing, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Tense=Past, Tense=Pres, Variant=Long, Variant=Short
PRON occurs with 200 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3|PronType=Prs (100 tokens).
Examples: сон
Relations
PRON nodes are attached to their parents using 29 different relations: nsubj (443; 37% instances), obl (198; 17% instances), det (137; 11% instances), obj (116; 10% instances), nmod (66; 6% instances), nmod:poss (52; 4% instances), root (40; 3% instances), obl:agent (25; 2% instances), conj (23; 2% instances), nsubj:cop (18; 2% instances), obl:cmp (16; 1% instances), advmod (8; 1% instances), expl (8; 1% instances), fixed (7; 1% instances), obl:own (6; 1% instances), advcl (5; 0% instances), amod (5; 0% instances), ccomp (4; 0% instances), orphan (4; 0% instances), vocative (4; 0% instances), acl (2; 0% instances), appos (2; 0% instances), nmod:cmp (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), dep (1; 0% instances), nmod:gsubj (1; 0% instances)
Parents of PRON nodes belong to 11 different parts of speech: VERB (744; 62% instances), NOUN (290; 24% instances), ADJ (46; 4% instances), (40; 3% instances), PRON (33; 3% instances), ADV (26; 2% instances), PROPN (8; 1% instances), AUX (4; 0% instances), ADP (3; 0% instances), NUM (3; 0% instances), DET (2; 0% instances)
934 (78%) PRON nodes are leaves.
181 (15%) PRON nodes have one child.
31 (3%) PRON nodes have two children.
53 (4%) PRON nodes have three or more children.
The highest child degree of a PRON node is 7.
Children of PRON nodes are attached using 33 different relations: punct (112; 24% instances), case (45; 10% instances), appos (33; 7% instances), nsubj (32; 7% instances), advmod (29; 6% instances), nmod (28; 6% instances), fixed (22; 5% instances), aux:neg (21; 5% instances), conj (15; 3% instances), det (14; 3% instances), discourse (11; 2% instances), acl:relcl (10; 2% instances), cc (10; 2% instances), cop (10; 2% instances), advcl (9; 2% instances), orphan (8; 2% instances), parataxis (7; 2% instances), vocative (7; 2% instances), amod (6; 1% instances), nsubj:cop (6; 1% instances), obl (6; 1% instances), acl (5; 1% instances), mark (5; 1% instances), dislocated (4; 1% instances), nmod:cmp (2; 0% instances), nmod:poss (2; 0% instances), aux (1; 0% instances), cc:preconj (1; 0% instances), ccomp (1; 0% instances), compound:redup (1; 0% instances), nummod (1; 0% instances), obl:tmod (1; 0% instances), xcomp (1; 0% instances)
Children of PRON nodes belong to 15 different parts of speech: PUNCT (112; 24% instances), NOUN (95; 20% instances), ADP (53; 11% instances), ADV (37; 8% instances), AUX (33; 7% instances), PRON (33; 7% instances), VERB (30; 6% instances), PART (20; 4% instances), PROPN (18; 4% instances), ADJ (13; 3% instances), CCONJ (10; 2% instances), DET (6; 1% instances), INTJ (3; 1% instances), SCONJ (2; 0% instances), NUM (1; 0% instances)