Treebank Statistics: UD_Erzya-JR: POS Tags: PRON
There are 63 PRON
lemmas (2%), 205 PRON
types (3%) and 1179 PRON
tokens (6%).
Out of 16 observed tags, the rank of PRON
is: 6 in number of lemmas, 6 in number of types and 5 in number of tokens.
The 10 most frequent PRON
lemmas: сон, мон, тон, весе, мезе, те, кона, сонсь, кие, кияк
The 10 most frequent PRON
types: сон, весе, те, мон, сонзэ, тон, мезе, сонсь, минь, сынь
The 10 most frequent ambiguous lemmas: весе (PRON 80, DET 3, ADV 2), те (PRON 70, DET 58), кона (PRON 53, DET 13), истямо (PRON 20, DET 17, ADV 4), се (PRON 20, DET 13), секе (PRON 15, ADV 5, DET 2), неть (PRON 12, DET 1), эрьва (DET 39, PRON 10), тона (PRON 10, DET 1), теке (ADV 12, PRON 9, SCONJ 7, DET 2)
The 10 most frequent ambiguous types: весе (PRON 56, ADV 2, DET 2), те (DET 39, PRON 30), сынь (PRON 17, VERB 1), кона (PRON 14, DET 4), конань (PRON 15, DET 2), неть (PRON 8, DET 2), истямо (PRON 10, DET 9, ADV 4), ки (PRON 6, NOUN 1), конат (PRON 7, DET 2), мезень (PRON 6, NOUN 1)
- весе
- те
- сынь
- кона
- конань
- неть
- истямо
- ки
- конат
- мезень
Morphology
The form / lemma ratio of PRON
is 3.253968 (the average of all parts of speech is 2.080194).
The 1st highest number of forms (15) was observed with the lemma “мезе”: мезде, мезе, мезекс, мезель, мезем, мезенек, мезень, мезес, мезесь, мезеть, мезть, мейсэ, мейсэль, мень, месть.
The 2nd highest number of forms (14) was observed with the lemma “сон”: Сынсткак, сон, сонгак, сондензэ, сонензэ, сонзо, сонзэ, сонзэяк, сонсь, сыненсткак, сынст, сынь, тензэ, тенст.
The 3rd highest number of forms (14) was observed with the lemma “тон”: Тонгак, Тонтеметь, тенк, теть, тон, тондеть, тонеть, тонсь, тонть, тонь, тыненк, тынк, тынь, тыньгак.
PRON
occurs with 19 features: Case (1136; 96% instances), Number (1128; 96% instances), PronType (1063; 90% instances), Person (609; 52% instances), Definite (443; 38% instances), Variant (135; 11% instances), Reflex (85; 7% instances), Animacy (31; 3% instances), Clitic (28; 2% instances), NumType (28; 2% instances), Number[psor] (19; 2% instances), Person[psor] (19; 2% instances), Number[subj] (7; 1% instances), Person[subj] (7; 1% instances), Tense (7; 1% instances), AdvType (5; 0% instances), Derivation (1; 0% instances), Mood (1; 0% instances), Polarity (1; 0% instances)
PRON
occurs with 48 feature-value pairs: AdvType=Loc
, Animacy=Hum
, Case=Abe
, Case=Abl
, Case=Dat
, Case=Ela
, Case=Gen
, Case=Ill
, Case=Ine
, Case=Nom
, Case=Prl
, Case=Tra
, Clitic=Add
, Definite=Def
, Definite=Ind
, Derivation=PronGak
, Mood=Ind
, NumType=Card
, NumType=Dist
, NumType=Sets
, Number=Plur
, Number=Plur,Sing
, Number=Sing
, Number[psor]=Plur
, Number[psor]=Sing
, Number[subj]=Sing
, Person=1
, Person=2
, Person=3
, Person[psor]=1
, Person[psor]=2
, Person[psor]=3
, Person[subj]=1
, Person[subj]=2
, Person[subj]=3
, Polarity=Neg
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Prs
, PronType=Rcp
, PronType=Rel
, PronType=Tot
, Reflex=Yes
, Tense=Past
, Tense=Pres
, Variant=Long
, Variant=Short
PRON
occurs with 191 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3|PronType=Prs
(100 tokens).
Examples: сон
Relations
PRON
nodes are attached to their parents using 33 different relations: nsubj (422; 36% instances), obl (195; 17% instances), det (146; 12% instances), obj (111; 9% instances), nmod (63; 5% instances), nmod:poss (52; 4% instances), root (40; 3% instances), obl:agent (25; 2% instances), conj (20; 2% instances), nsubj:cop (18; 2% instances), obl:cmp (18; 2% instances), expl (8; 1% instances), fixed (8; 1% instances), amod (7; 1% instances), obl:own (6; 1% instances), advcl (5; 0% instances), ccomp (4; 0% instances), obl:lmod (4; 0% instances), orphan (4; 0% instances), vocative (4; 0% instances), advmod (3; 0% instances), acl (2; 0% instances), appos (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), advcl:tcl (1; 0% instances), compound:redup (1; 0% instances), csubj:cop (1; 0% instances), dep (1; 0% instances), nmod:gsubj (1; 0% instances), nmod:lmod (1; 0% instances), obl:cau (1; 0% instances), obl:inst (1; 0% instances)
Parents of PRON
nodes belong to 11 different parts of speech: VERB (725; 61% instances), NOUN (295; 25% instances), ADJ (42; 4% instances), (40; 3% instances), PRON (34; 3% instances), ADV (25; 2% instances), PROPN (8; 1% instances), AUX (4; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), NUM (1; 0% instances)
925 (78%) PRON
nodes are leaves.
172 (15%) PRON
nodes have one child.
31 (3%) PRON
nodes have two children.
51 (4%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 7.
Children of PRON
nodes are attached using 38 different relations: punct (110; 24% instances), case (42; 9% instances), appos (32; 7% instances), nsubj (32; 7% instances), fixed (24; 5% instances), aux:neg (20; 4% instances), nmod (18; 4% instances), advmod (14; 3% instances), conj (14; 3% instances), det (13; 3% instances), discourse (11; 2% instances), acl:relcl (10; 2% instances), cop (10; 2% instances), advcl (9; 2% instances), cc (9; 2% instances), obl (8; 2% instances), advmod:tmod (7; 2% instances), orphan (7; 2% instances), parataxis (7; 2% instances), vocative (7; 2% instances), amod (6; 1% instances), nsubj:cop (6; 1% instances), acl (5; 1% instances), dislocated (4; 1% instances), obl:lmod (4; 1% instances), advmod:eval (3; 1% instances), mark (3; 1% instances), advmod:foc (2; 0% instances), aux:q (2; 0% instances), nmod:poss (2; 0% instances), obl:cmp (2; 0% instances), advmod:deg (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances), nummod (1; 0% instances), xcomp (1; 0% instances)
Children of PRON
nodes belong to 15 different parts of speech: PUNCT (110; 24% instances), NOUN (83; 18% instances), ADP (48; 11% instances), ADV (39; 9% instances), AUX (34; 8% instances), PRON (34; 8% instances), VERB (30; 7% instances), PART (18; 4% instances), PROPN (18; 4% instances), ADJ (15; 3% instances), CCONJ (9; 2% instances), DET (6; 1% instances), INTJ (3; 1% instances), SCONJ (2; 0% instances), NUM (1; 0% instances)