home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Xibe-XDT: POS Tags: PRON

There are 62 PRON lemmas (3%), 66 PRON types (2%) and 629 PRON tokens (4%). Out of 17 observed tags, the rank of PRON is: 7 in number of lemmas, 7 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: ᠪᡞ, ᡤᡠᠪᠴᡞ, ᠮᡞᠨᡞ, ᠰᡞ, ᡤᡝᠷᡝᠨ, ᡨᡝᠷᡝ, ᡨᡝᠰᡝ, ᠮᡠᠰᡝ, ᠪᡝᠶᡝ, ᠰᡞᠨᡞ

The 10 most frequent PRON types: ᠪᡞ, ᡤᡠᠪᠴᡞ, ᠮᡞᠨᡞ, ᠰᡞ, ᡤᡝᠷᡝᠨ, ᡨᡝᠷᡝ, ᡨᡝᠰᡝ, ᠰᡞᠨᡞ, ᡞᠨᡞ, ᠠᡞ

The 10 most frequent ambiguous lemmas: ᠪᡞ (PRON 63, VERB 23, AUX 8), ᡤᡠᠪᠴᡞ (PRON 49, ADJ 8), ᠰᡞ (PRON 44, PROPN 24, X 3), ᡤᡝᠷᡝᠨ (PRON 41, DET 10, ADJ 4, NOUN 1), ᡨᡝᠷᡝ (PRON 41, DET 13), ᠪᡝᠶᡝ (PRON 24, NOUN 8), ᠠᡞ (PRON 19, NOUN 3), (ADP 159, PRON 16, PART 11, X 1), ᠪᡝ (ADP 742, PRON 13), ᡨᡠᡨᡨᡠ (PRON 7, ADV 1)

The 10 most frequent ambiguous types: ᠪᡞ (PRON 63, VERB 17, AUX 8), ᡤᡠᠪᠴᡞ (PRON 49, ADJ 8), ᠰᡞ (PRON 44, PROPN 25, X 3), ᡤᡝᠷᡝᠨ (PRON 41, DET 10, ADJ 4, NOUN 1), ᡨᡝᠷᡝ (PRON 30, DET 13), ᠠᡞ (PRON 19, NOUN 3), (ADP 158, PRON 17, PART 10, PROPN 1, X 1), ᠪᡝᠶᡝ (PRON 15, NOUN 8), ᠪᡝ (ADP 742, PRON 13), ᡨᡠᡨᡨᡠ (PRON 7, ADV 1)

Morphology

The form / lemma ratio of PRON is 1.064516 (the average of all parts of speech is 1.310593).

The 1st highest number of forms (4) was observed with the lemma “ᡨᡝᠷᡝ”: ᡨᡝᠷᡝ, ᡨᡝᠷᡝᠪᡝ, ᡨᡝᠷᡝᡞ, ᡨᡝᠷᡝᡠ.

The 2nd highest number of forms (3) was observed with the lemma “ᠠᡞᠪᡞ”: ᠠᡞᠪᡞ, ᠠᡞᠪᡞᠴᡞ, ᠠᡞᠪᡞᡩᡝᠷᡞ.

The 3rd highest number of forms (2) was observed with the lemma “ᠪᡝᠶᡝ”: ᠪᡝᠶᡝ, ᠪᡝᠶᡝᡞ.

PRON occurs with 10 features: PronType (595; 95% instances), Number (417; 66% instances), Person (405; 64% instances), Poss (102; 16% instances), Case (91; 14% instances), Clusivity (30; 5% instances), Reflex (24; 4% instances), Abbr (3; 0% instances), NumType (3; 0% instances), Polite (1; 0% instances)

PRON occurs with 23 feature-value pairs: Abbr=Yes, Case=Abl, Case=Acc, Case=Cmp, Case=Dat, Case=Gen, Case=Lat, Clusivity=Ex, Clusivity=In, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Elev, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Tot, Reflex=Yes

PRON occurs with 53 feature combinations. The most frequent feature combination is PronType=Tot (99 tokens). Examples: ᡤᡠᠪᠴᡞ, ᡤᡝᠷᡝᠨ, ᡨᠣᠮᡝ, ᠶᠠᠶᠠᠮᡠ, ᠶᠠᠶᠠ, ᡥᠠᠷᡨᡠᡢᡤᠠ, ᠮᡝᡞᠮᡝᠨᡞ, ᡤᡝᠷᡝᠨᠣᡫᡞ, ᡩᠠᠷᡞ

Relations

PRON nodes are attached to their parents using 13 different relations: nsubj (243; 39% instances), nmod:poss (108; 17% instances), det (103; 16% instances), obl (74; 12% instances), amod (32; 5% instances), obj (30; 5% instances), nmod (14; 2% instances), compound (9; 1% instances), root (6; 1% instances), advcl (4; 1% instances), appos (4; 1% instances), conj (1; 0% instances), iobj (1; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (327; 52% instances), NOUN (255; 41% instances), PRON (19; 3% instances), ADJ (16; 3% instances), (6; 1% instances), PROPN (3; 0% instances), ADV (2; 0% instances), X (1; 0% instances)

548 (87%) PRON nodes are leaves.

65 (10%) PRON nodes have one child.

9 (1%) PRON nodes have two children.

7 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 5.

Children of PRON nodes are attached using 16 different relations: case (42; 39% instances), punct (12; 11% instances), clf (10; 9% instances), nmod:poss (8; 7% instances), advmod (7; 6% instances), compound (7; 6% instances), acl (5; 5% instances), appos (4; 4% instances), mark (4; 4% instances), amod (3; 3% instances), aux (2; 2% instances), conj (1; 1% instances), cop (1; 1% instances), discourse (1; 1% instances), nsubj (1; 1% instances), obl (1; 1% instances)

Children of PRON nodes belong to 10 different parts of speech: ADP (42; 39% instances), PRON (19; 17% instances), NOUN (14; 13% instances), PUNCT (12; 11% instances), ADV (7; 6% instances), VERB (5; 5% instances), SCONJ (4; 4% instances), AUX (3; 3% instances), NUM (2; 2% instances), PART (1; 1% instances)