home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Uyghur-UDT: POS Tags: PRON

There are 29 PRON lemmas (1%), 188 PRON types (1%) and 2692 PRON tokens (7%). Out of 16 observed tags, the rank of PRON is: 7 in number of lemmas, 5 in number of types and 4 in number of tokens.

The 10 most frequent PRON lemmas: ئۇ، _، بۇ، مەن، ئۇلار، ئۆز، سەن، بىز، شۇ، ئەڭ

The 10 most frequent PRON types: بۇ، ئۇ، مەن، ئۇنىڭ، سەن، ئۇلار، شۇ، ئۇنى، بىز، ئەڭ

The 10 most frequent ambiguous lemmas: _ (VERB 4247, NOUN 4246, AUX 501, PRON 479, PUNCT 396, ADJ 326, ADV 157, PART 119, NUM 77, CCONJ 75, X 64, ADP 56, INTJ 47, DET 28), سىلى (PRON 8, VERB 1), ھېچ (PRON 6, ADV 1)

The 10 most frequent ambiguous types: ئۇنىڭ (PRON 127, CCONJ 3), ئۆزىنى (PRON 30, NOUN 1), شۇنىڭ (PRON 15, CCONJ 12), ئەنە (PRON 13, PART 2), قانداق (ADV 30, DET 17, PRON 7), بەزىدە (ADJ 8, PRON 6), ھېچ (PRON 6, ADV 1), مانا (PART 14, NOUN 11, PRON 5), ھەممىڭلار (PRON 5, NOUN 1, VERB 1), ھېچنەرسە (PRON 4, NOUN 1)

Morphology

The form / lemma ratio of PRON is 6.482759 (the average of all parts of speech is 4.182394).

The 1st highest number of forms (104) was observed with the lemma “_”: ئاشۇنداق, ئۆز-ئۆزىگە, ئۆزلىرى, ئۆزلىرىنىڭ, ئۆزلىرىگە, ئۆزىلا, ئۆزىڭىزنى, ئۆزىگە, ئۇمۇ, ئۇنداقتا, ئۇنىڭدىكى, ئۇنىڭدىن, ئۇنىڭدىنمۇ, ئۇنىڭدەك, ئۇنىڭمۇ, ئۇنچىۋالا, ئۇياقتىن, ئۇياققا, بىزدە, بىزگە, بۇلارغا, بۇلارنى, بۇمۇ, بۇنى, بۇنىڭدا, بۇنىڭدىكى, بۇنىڭدىنمۇ, بۇنىڭغا, بۇياققا, بەزىدە, دېگىنىڭنى, سىزگە, سىلىگە, سېنىڭ, شۇكى, شۇلارنى, شۇنداقتىمۇ, شۇنداقمۇ, شۇندىلا, شۇنى, شۇنىڭدىن, شۇنىڭغا, شۇنچىۋالا, قانداق, قانداقتۇر, قاياققا, قايسىسىنى, قەيەردىن, كىمدۇر, كىمكى, كىملەرنىڭ, كىمنى, مانا, مۇشۇلارنى, مۇشۇنداق, مۇنداقچە, مۇنچە, مۇنۇ, مۇنۇلارنى, مېنى, مېنىمۇ, مېنىڭمۇ, مەشەدە, مەنمۇ, نىمىشقا, نىمە, نېمانداق, نېمانچە, نېمىدەپ, نېمىشقىدۇر, نېمىشقىمۇ, نېمىلا, نېمىلىكىنى, نېمىلەرنىدۇر, نېمە, نەدىن, نەدىندۇر, نەلەردە, نەلەرگىدۇر, نەگىدۇر, نەگە, ھېلىقى, ھېچقانچە, ھېچكىمنى, ھېچكىمگە, ھېچنىمە, ھېچنېمىنى, ھېچنېمىگە, ھېچنەرسىنى, ھېچنەرسە, ھەربىرىنىڭ, ھەرخىل, ھەركىم, ھەركۈنى, ھەممىسى, ھەممىسىلا, ھەممىسىنى, ھەممىلا, ھەممىمىز, ھەممىنى, ھەممىڭلار, ھەممىگە, ھەممەيلەننى, ھەممەيلەنگە.

The 2nd highest number of forms (17) was observed with the lemma “ئۆز”: ئۆز, ئۆزى, ئۆزىدىن, ئۆزىدە, ئۆزىمۇ, ئۆزىنى, ئۆزىنىڭ, ئۆزۈم, ئۆزۈمدىن, ئۆزۈممۇ, ئۆزۈمنى, ئۆزۈمنىڭ, ئۆزۈمگە, ئۆزۈڭ, ئۆزۈڭمۇ, ئۆزۈڭنى, ئۆزۈڭنىڭ.

The 3rd highest number of forms (7) was observed with the lemma “ئۇلار”: ئۇلار, ئۇلاردا, ئۇلاردىن, ئۇلارغا, ئۇلارمۇ, ئۇلارنى, ئۇلارنىڭ.

PRON occurs with 8 features: Case (2213; 82% instances), PronType (1945; 72% instances), Number (1364; 51% instances), Person (1358; 50% instances), Reflex (147; 5% instances), Number[psor] (110; 4% instances), Person[psor] (110; 4% instances), Polite (50; 2% instances)

PRON occurs with 20 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Plur,Sing, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polite=Form, PronType=Dem, PronType=Prs, Reflex=Yes

PRON occurs with 55 feature combinations. The most frequent feature combination is Case=Nom|PronType=Dem (557 tokens). Examples: بۇ، شۇ، ئاشۇ، مۇشۇ، ماۋۇ، ئاۋۇ

Relations

PRON nodes are attached to their parents using 30 different relations: nsubj (967; 36% instances), det (599; 22% instances), nmod:poss (305; 11% instances), obj (248; 9% instances), obl (239; 9% instances), advmod (103; 4% instances), nmod (63; 2% instances), compound (38; 1% instances), nmod:cau (17; 1% instances), cc (12; 0% instances), dep (12; 0% instances), appos (10; 0% instances), compound:redup (9; 0% instances), root (8; 0% instances), case (7; 0% instances), conj (7; 0% instances), discourse (7; 0% instances), mark (7; 0% instances), fixed (6; 0% instances), amod (5; 0% instances), parataxis (5; 0% instances), ccomp (4; 0% instances), nmod:comp (4; 0% instances), cop (2; 0% instances), nmod:abl (2; 0% instances), nmod:tmod (2; 0% instances), aux (1; 0% instances), compound:lvc (1; 0% instances), nmod:clas (1; 0% instances), nmod:ins (1; 0% instances)

Parents of PRON nodes belong to 12 different parts of speech: VERB (1392; 52% instances), NOUN (1098; 41% instances), ADJ (109; 4% instances), ADP (26; 1% instances), PRON (21; 1% instances), ADV (18; 1% instances), NUM (8; 0% instances), (8; 0% instances), DET (6; 0% instances), AUX (3; 0% instances), INTJ (2; 0% instances), PROPN (1; 0% instances)

2472 (92%) PRON nodes are leaves.

165 (6%) PRON nodes have one child.

34 (1%) PRON nodes have two children.

21 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 28 different relations: punct (114; 36% instances), case (32; 10% instances), cop (22; 7% instances), fixed (22; 7% instances), nmod:poss (20; 6% instances), nsubj (19; 6% instances), nmod (14; 4% instances), compound:redup (13; 4% instances), advmod (10; 3% instances), amod (10; 3% instances), conj (10; 3% instances), mark (5; 2% instances), dep (4; 1% instances), advmod:emph (3; 1% instances), nmod:part (3; 1% instances), appos (2; 1% instances), cc (2; 1% instances), compound (2; 1% instances), det (2; 1% instances), flat (2; 1% instances), acl (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), nmod:tmod (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: PUNCT (114; 36% instances), NOUN (73; 23% instances), ADP (41; 13% instances), PRON (21; 7% instances), AUX (17; 5% instances), VERB (15; 5% instances), ADV (13; 4% instances), ADJ (8; 3% instances), NUM (8; 3% instances), X (4; 1% instances), INTJ (3; 1% instances), PART (2; 1% instances)