Treebank Statistics: UD_Erzya-JR: POS Tags: PRON
There are 64 PRON lemmas (2%), 204 PRON types (3%) and 1175 PRON tokens (6%).
Among the 16 observed tags, PRON ranks 6th in number of lemmas, 6th in number of types and 5th in number of tokens.
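The lemma, type and token counts and the token rank above could be reproduced along the following lines. This is a minimal sketch, not the script that generated this page. The function names `pos_summary` and `rank_by_tokens`, the toy `ROWS` data, the case-sensitive treatment of word forms and the tie-handling rule for ranks are all assumptions made for illustration.

```python
from collections import Counter

# Toy (form, lemma, upos) rows; illustrative only, not real treebank counts.
ROWS = [
    ("сон", "сон", "PRON"),
    ("сонзэ", "сон", "PRON"),
    ("сон", "сон", "PRON"),
    ("те", "те", "PRON"),
    ("те", "те", "DET"),
    ("весе", "весе", "ADV"),
]

def pos_summary(rows, upos):
    """Return (lemma count, type count, token count) for one tag."""
    selected = [(form, lemma) for form, lemma, tag in rows if tag == upos]
    lemmas = {lemma for _, lemma in selected}
    types = {form for form, _ in selected}  # assumption: forms are case-sensitive
    return len(lemmas), len(types), len(selected)

def rank_by_tokens(rows, upos):
    """Assumed rank rule: 1 + number of tags with strictly more tokens."""
    counts = Counter(tag for _, _, tag in rows)
    return 1 + sum(1 for n in counts.values() if n > counts[upos])

print(pos_summary(ROWS, "PRON"))     # (2, 3, 4)
print(rank_by_tokens(ROWS, "PRON"))  # 1
```

On real data the rows would come from the FORM, LEMMA and UPOS columns of the treebank's CoNLL-U files.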
The 10 most frequent PRON lemmas: сон, мон, тон, весе, мезе, те, кона, сонсь, кие, кияк
The 10 most frequent PRON types: сон, весе, те, мон, сонзэ, тон, мезе, сонсь, минь, сынь
The 10 most frequent ambiguous lemmas: весе (PRON 80, DET 3, ADV 2), те (PRON 70, DET 58), кона (PRON 53, DET 13), се (PRON 20, DET 13), секе (PRON 20, DET 2), истямо (DET 17, PRON 17, ADV 4, ADJ 3), неть (PRON 12, DET 1), тона (PRON 10, DET 1), теке (ADV 12, PRON 9, SCONJ 7, DET 2), вейке (NUM 45, PRON 6)
The 10 most frequent ambiguous types: весе (PRON 56, ADV 2, DET 2), те (DET 39, PRON 30), сынь (PRON 17, VERB 1), кона (PRON 14, DET 4), конань (PRON 15, DET 2), неть (PRON 8, DET 2), секе (PRON 12, DET 1), ки (PRON 6, NOUN 1), истямо (DET 9, PRON 9, ADV 4, ADJ 1), конат (PRON 7, DET 2)
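An ambiguous lemma here is one that occurs with PRON and with at least one other tag. The sketch below shows one way such a list could be built. The function name `ambiguous_lemmas` and the toy rows are hypothetical, and ordering by the count for the tag of interest is an assumption (the lemma list above appears to follow that order, but the page does not state its sort key).

```python
from collections import Counter, defaultdict

# Toy (lemma, upos) pairs; illustrative only, not real treebank counts.
PAIRS = [
    ("те", "PRON"), ("те", "PRON"), ("те", "DET"),
    ("весе", "PRON"), ("весе", "PRON"), ("весе", "PRON"), ("весе", "ADV"),
    ("сон", "PRON"),
]

def ambiguous_lemmas(pairs, upos):
    """List lemmas seen with `upos` and at least one other tag."""
    tags_by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        tags_by_lemma[lemma][tag] += 1
    ambiguous = [
        (lemma, tags)
        for lemma, tags in tags_by_lemma.items()
        if upos in tags and len(tags) > 1
    ]
    # Assumed ordering: most frequent with the tag of interest first.
    return sorted(ambiguous, key=lambda item: -item[1][upos])

for lemma, tags in ambiguous_lemmas(PAIRS, "PRON"):
    print(lemma, dict(tags))
```

The same function applies to ambiguous types if word forms are passed in place of lemmas.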
Morphology
The form / lemma ratio of PRON is 3.187500 (the average of all parts of speech is 2.080547).
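The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given at the top of this page. The check below assumes that this is how the ratio is defined, which the page does not state. The variable names are illustrative.

```python
# Counts reported at the top of this page.
pron_types = 204
pron_lemmas = 64

# Assumed definition: distinct word forms divided by distinct lemmas.
ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 3.187500
```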
The highest number of forms (15) was observed with the lemma “мезе”: мезде, мезе, мезекс, мезель, мезем, мезенек, мезень, мезес, мезесь, мезеть, мезть, мейсэ, мейсэль, мень, месть.
The second-highest number of forms (14) was observed with the lemma “сон”: Сынсткак, сон, сонгак, сондензэ, сонензэ, сонзо, сонзэ, сонзэяк, сонсь, сыненсткак, сынст, сынь, тензэ, тенст.
The lemma “тон” also has 14 forms: Тонгак, Тонтеметь, тенк, теть, тон, тондеть, тонеть, тонсь, тонть, тонь, тыненк, тынк, тынь, тыньгак.
PRON occurs with 20 features: Case (1130; 96% instances), Number (1122; 95% instances), PronType (1063; 90% instances), Person (609; 52% instances), Definite (434; 37% instances), Variant (135; 11% instances), Reflex (85; 7% instances), Animacy (31; 3% instances), Clitic (28; 2% instances), NumType (28; 2% instances), ExtPos (23; 2% instances), Number[psor] (19; 2% instances), Person[psor] (19; 2% instances), Number[subj] (7; 1% instances), Person[subj] (7; 1% instances), Tense (7; 1% instances), AdvType (5; 0% instances), Derivation (1; 0% instances), Mood (1; 0% instances), Polarity (1; 0% instances)
PRON occurs with 50 feature-value pairs: AdvType=Loc, Animacy=Hum, Case=Abe, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Definite=Def, Definite=Ind, Derivation=PronGak, ExtPos=ADV, ExtPos=PRON, Mood=Ind, NumType=Card, NumType=Dist, NumType=Sets, Number=Plur, Number=Plur,Sing, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Tense=Past, Tense=Pres, Variant=Long, Variant=Short
PRON occurs with 200 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3|PronType=Prs (100 tokens).
Examples: сон
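The three levels counted above (features, feature-value pairs and whole feature combinations) can all be derived from the FEATS column of a CoNLL-U file, where pairs are joined with "|" and an empty feature set is written "_". The following is a minimal sketch under that assumption. The function name `feature_stats` and the toy input are hypothetical. A multi-valued entry such as Number=Plur,Sing is kept as a single pair, which matches how it is listed on this page.

```python
from collections import Counter

# Toy FEATS values; illustrative only, not real treebank counts.
FEATS = [
    "Case=Nom|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|Number=Sing|Person=3|PronType=Prs",
    "Case=Gen|Number=Sing|Person=3|PronType=Prs",
    "_",
]

def feature_stats(feats_column):
    """Count feature names, feature=value pairs and full combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        if feats == "_":  # token without any features
            continue
        for pair in feats.split("|"):
            name, _value = pair.split("=", 1)
            features[name] += 1
            pairs[pair] += 1  # "Number=Plur,Sing" would stay one pair
    return features, pairs, combos

features, pairs, combos = feature_stats(FEATS)
print(features["Case"])       # 3
print(len(pairs))             # 5
print(combos.most_common(1))  # [('Case=Nom|Number=Sing|Person=3|PronType=Prs', 2)]
```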
Relations
PRON nodes are attached to their parents using 28 different relations: nsubj (422; 36% instances), obl (202; 17% instances), det (138; 12% instances), obj (112; 10% instances), nmod (64; 5% instances), nmod:poss (52; 4% instances), root (40; 3% instances), obl:agent (25; 2% instances), conj (20; 2% instances), nsubj:cop (18; 2% instances), obl:cmp (18; 2% instances), advmod (8; 1% instances), expl (8; 1% instances), fixed (7; 1% instances), obl:own (6; 1% instances), advcl (5; 0% instances), amod (5; 0% instances), ccomp (4; 0% instances), orphan (4; 0% instances), vocative (4; 0% instances), parataxis (3; 0% instances), acl (2; 0% instances), appos (2; 0% instances), xcomp (2; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), dep (1; 0% instances), nmod:gsubj (1; 0% instances)
Parents of PRON nodes belong to 11 different parts of speech: VERB (729; 62% instances), NOUN (288; 25% instances), ADJ (42; 4% instances), [root] (40; 3% instances), PRON (33; 3% instances), ADV (25; 2% instances), PROPN (8; 1% instances), AUX (4; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), NUM (1; 0% instances)
920 (78%) PRON nodes are leaves.
173 (15%) PRON nodes have one child.
30 (3%) PRON nodes have two children.
52 (4%) PRON nodes have three or more children.
The highest child degree of a PRON node is 7.
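The child-degree figures above sum to the 1175 PRON tokens, and the percentages follow from rounding each share. The sketch below checks that arithmetic and shows one way the degrees could be computed from the ID and HEAD columns of a sentence. The function name `child_degrees` and the toy sentence are hypothetical and are not taken from the treebank.

```python
from collections import Counter

# Figures reported above: leaves, one child, two children, three or more.
reported = [920, 173, 30, 52]
total = sum(reported)
print(total)                                     # 1175
print([round(100 * n / total) for n in reported])  # [78, 15, 3, 4]

def child_degrees(sentence, upos):
    """sentence: list of (id, upos, head); return child counts of `upos` nodes."""
    children = Counter(head for _, _, head in sentence)
    return [children[node_id] for node_id, tag, _ in sentence if tag == upos]

# Toy dependency structure (IDs, tags and heads only); illustrative.
TOY = [
    (1, "PRON", 2),   # leaf
    (2, "VERB", 0),   # root
    (3, "PRON", 2),   # has one child (node 4)
    (4, "ADP", 3),
    (5, "PUNCT", 2),
]
print(child_degrees(TOY, "PRON"))  # [0, 1]
```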
Children of PRON nodes are attached using 32 different relations: punct (111; 25% instances), case (44; 10% instances), appos (32; 7% instances), nsubj (32; 7% instances), advmod (26; 6% instances), fixed (23; 5% instances), aux:neg (20; 4% instances), nmod (18; 4% instances), conj (14; 3% instances), det (14; 3% instances), obl (12; 3% instances), discourse (11; 2% instances), acl:relcl (10; 2% instances), cop (10; 2% instances), advcl (9; 2% instances), cc (9; 2% instances), orphan (7; 2% instances), parataxis (7; 2% instances), vocative (7; 2% instances), amod (6; 1% instances), nsubj:cop (6; 1% instances), acl (5; 1% instances), mark (5; 1% instances), dislocated (4; 1% instances), nmod:poss (2; 0% instances), obl:cmp (2; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), compound:redup (1; 0% instances), nummod (1; 0% instances), obl:tmod (1; 0% instances), xcomp (1; 0% instances)
Children of PRON nodes belong to 15 different parts of speech: PUNCT (111; 25% instances), NOUN (92; 20% instances), ADP (48; 11% instances), ADV (34; 8% instances), PRON (33; 7% instances), AUX (32; 7% instances), VERB (30; 7% instances), PART (20; 4% instances), PROPN (18; 4% instances), ADJ (12; 3% instances), CCONJ (9; 2% instances), DET (7; 2% instances), INTJ (3; 1% instances), SCONJ (2; 0% instances), NUM (1; 0% instances)