home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Xibe-XDT: POS Tags: VERB

There are 658 VERB lemmas (28%), 1337 VERB types (43%) and 3090 VERB tokens (20%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: ᠣᠮᠪᡞ, ᠰᡝᠮᠪᡞ, ᡪᡞᠮᠪᡞ, ᡤᡝᠨᡝᠮᠪᡞ, ᠪᡞᠮᠪᡞ, ᠠᠷᠠᠮᠪᡞ, ᠶᠠᠪᡠᠮᠪᡞ, ᡨᡠᠸᠠᠮᠪᡞ, ᠠᠴᠠᠮᠪᡞ, ᡤᡞᠰᡠᠷᡝᠮᠪᡞ

The 10 most frequent VERB types: ᠰᡝᠮᡝ, ᡠᠯᠠᠮᡝ, ᠠᡣᡡ, ᠠᠷᠠᠮᡝ, ᠣᡥᠣ, ᠪᠠᡩᠠᠷᠠᠷᠠ, ᡞᠯᡞᠪᡠᠮᡝ, ᠰᡝᠷᡝ, ᠰᡝᡥᡝ, ᠰᡝᠯᡤᡞᠶᡝᠷᡝ

The 10 most frequent ambiguous lemmas: ᠣᠮᠪᡞ (VERB 108, AUX 8, SCONJ 1), ᠰᡝᠮᠪᡞ (VERB 104, AUX 18, ADP 5, SCONJ 2), ᠪᡞᠮᠪᡞ (VERB 55, AUX 18), ᠠᡣᡡ (VERB 30, ADV 7), ᠪᡞ (PRON 63, VERB 23, AUX 8), ᡨᠣᡣᡨᠣᠮᠪᡞ (VERB 19, ADV 2), ᠰᠣᡢᡤᠣᠮᠪᡞ (VERB 4, ADJ 1), ᡪᡞᡩᡝᠮᠪᡞ (VERB 2, NOUN 1), ᠠᠷᠪᡠᠨ (NOUN 29, VERB 1), ᠣᡫᡞ (SCONJ 13, VERB 1)

The 10 most frequent ambiguous types: ᠰᡝᠮᡝ (VERB 39, SCONJ 6, AUX 3), ᠠᡣᡡ (VERB 28, ADV 7), ᠣᡥᠣ (VERB 24, AUX 4), ᠰᡝᠷᡝ (VERB 20, AUX 1), ᠣᠮᠪᡞ (VERB 17, AUX 2), ᠪᡞ (PRON 63, VERB 17, AUX 8), ᠪᡞᠮᡝ (VERB 15, SCONJ 5, CCONJ 4, AUX 1), ᠪᡞᡥᡝ (VERB 11, AUX 6), ᠰᡝᠮᠪᡞ (AUX 11, VERB 9), ᠣᠮᡝ (VERB 8, AUX 1)

Morphology

The form / lemma ratio of VERB is 2.031915 (the average of all parts of speech is 1.310593).

The 1st highest number of forms (23) was observed with the lemma “ᠣᠮᠪᡞ”: ᠣᠪᡠᠮᠪᡞ, ᠣᠪᡠᠮᡝ, ᠣᠪᡠᠷᡝ, ᠣᠪᡠᡥᠠ, ᠣᠪᡠᡫᡞ, ᠣᠮᠪᡞ, ᠣᠮᠪᡞᠣ, ᠣᠮᠪᡞᡠ, ᠣᠮᡝ, ᠣᠰᠣ, ᠣᠴᡞ, ᠣᡣᡞ, ᠣᡣᡞᠨᡞ, ᠣᡥᠠᡣᡡ, ᠣᡥᠣ, ᠣᡥᠣᠪᡞ, ᠣᡨᠣᠯᠣ, ᠣᡪᠣᠷᠠᡣᡡ, ᠣᡪᠣᠷᠠᡣᡡᠨ, ᠣᡪᠣᠷᠠᡥᡡ, ᠣᡪᠣᠷᠣ, ᠣᡪᠣᠷᡣᡡ, ᠣᡫᡞ.

The 2nd highest number of forms (20) was observed with the lemma “ᠶᠠᠪᡠᠮᠪᡞ”: ᠶᠠᠪᡠ, ᠶᠠᠪᡠᠪᡠᠷᡝ, ᠶᠠᠪᡠᠪᡠᡥᠠ, ᠶᠠᠪᡠᠮᠠᡥᠠ, ᠶᠠᠪᡠᠮᠪᡞ, ᠶᠠᠪᡠᠮᠪᡞᡠ, ᠶᠠᠪᡠᠮᠰᠠᡣᠠ, ᠶᠠᠪᡠᠮᡝ, ᠶᠠᠪᡠᠴᡞ, ᠶᠠᠪᡠᠷᠠᠯᠠᠮᡝ, ᠶᠠᠪᡠᠷᠠᡣᡡ, ᠶᠠᠪᡠᠷᠠᡣᡡᠨ, ᠶᠠᠪᡠᠷᡝ, ᠶᠠᠪᡠᠷᡣᡡ, ᠶᠠᠪᡠᡣᡞ, ᠶᠠᠪᡠᡣᡞᠨᡞ, ᠶᠠᠪᡠᡥᠠ, ᠶᠠᠪᡠᡥᠠᡞ, ᠶᠠᠪᡠᡥᠠᡣᡡ, ᠶᠠᠪᡠᡥᠠᡣᡡᠨ.

The 3rd highest number of forms (19) was observed with the lemma “ᡤᡝᠨᡝᠮᠪᡞ”: ᡤᡝᠨᡝ, ᡤᡝᠨᡝᠮᠪᡞ, ᡤᡝᠨᡝᠮᠪᡞᡠ, ᡤᡝᠨᡝᠮᠪᡞᡥᡝ, ᡤᡝᠨᡝᠮᡝ, ᡤᡝᠨᡝᠴᡞ, ᡤᡝᠨᡝᠴᡞᠨᠠ, ᡤᡝᠨᡝᠷᠠᡣᡡ, ᡤᡝᠨᡝᠷᠠᡣᡡᠨ, ᡤᡝᠨᡝᠷᡝ, ᡤᡝᠨᡝᡣᡞ, ᡤᡝᠨᡝᡣᡞᠨᡞ, ᡤᡝᠨᡝᡥᡝ, ᡤᡝᠨᡝᡥᡝᠪᡞ, ᡤᡝᠨᡝᡥᡝᡠ, ᡤᡝᠨᡝᡥᡝᡢᡤᡝ, ᡤᡝᠨᡝᡥᡝᡣᡡ, ᡤᡝᠨᡝᡥᡝᡣᡡᠨ, ᡤᡝᠨᡝᡫᡞ.

VERB occurs with 8 features: VerbForm (2963; 96% instances), Aspect (2120; 69% instances), Tense (814; 26% instances), Mood (231; 7% instances), Polarity (183; 6% instances), Voice (172; 6% instances), Case (11; 0% instances), Typo (1; 0% instances)

VERB occurs with 23 feature-value pairs: Aspect=Imp, Aspect=Perf, Aspect=Prog, Case=Dat, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Opt, Mood=Sub, Polarity=Neg, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Vnoun, Voice=Act, Voice=Cau, Voice=Pass, Voice=Rcp

VERB occurs with 100 feature combinations. The most frequent feature combination is Aspect=Imp|VerbForm=Conv (968 tokens). Examples: ᠰᡝᠮᡝ, ᡠᠯᠠᠮᡝ, ᠠᠷᠠᠮᡝ, ᡞᠯᡞᠪᡠᠮᡝ, ᠪᡞᠮᡝ, ᡠᠷᡝᠪᡠᠮᡝ, ᠠᠴᠠᠮᡝ, ᠪᠠᡥᠠᠮᡝ, ᠰᡞᠪᡣᡞᠮᡝ, ᠸᡝᠮᡝ

Relations

VERB nodes are attached to their parents using 20 different relations: advcl (1304; 42% instances), root (702; 23% instances), acl (282; 9% instances), acl:relcl (254; 8% instances), obj (122; 4% instances), conj (114; 4% instances), parataxis (84; 3% instances), obl (83; 3% instances), ccomp (59; 2% instances), nsubj (38; 1% instances), xcomp (18; 1% instances), csubj (10; 0% instances), compound (6; 0% instances), appos (4; 0% instances), obl:tmod (3; 0% instances), flat (2; 0% instances), nmod (2; 0% instances), amod (1; 0% instances), nsubj:pass (1; 0% instances), obl:lmod (1; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: VERB (1747; 57% instances), (702; 23% instances), NOUN (578; 19% instances), ADJ (42; 1% instances), X (6; 0% instances), PRON (5; 0% instances), ADV (4; 0% instances), AUX (3; 0% instances), PROPN (3; 0% instances)

617 (20%) VERB nodes are leaves.

694 (22%) VERB nodes have one child.

509 (16%) VERB nodes have two children.

1270 (41%) VERB nodes have three or more children.

The highest child degree of a VERB node is 10.

Children of VERB nodes are attached using 33 different relations: punct (1365; 20% instances), advcl (1289; 19% instances), obj (1097; 16% instances), nsubj (797; 12% instances), obl (660; 10% instances), advmod (551; 8% instances), case (212; 3% instances), obl:tmod (124; 2% instances), xcomp (124; 2% instances), obl:lmod (101; 1% instances), conj (100; 1% instances), parataxis (96; 1% instances), ccomp (62; 1% instances), mark (51; 1% instances), compound (45; 1% instances), discourse (43; 1% instances), aux (36; 1% instances), nmod (34; 0% instances), cc (26; 0% instances), amod (11; 0% instances), iobj (10; 0% instances), csubj (8; 0% instances), nummod (8; 0% instances), nsubj:pass (7; 0% instances), cop (6; 0% instances), mark:adv (6; 0% instances), acl (5; 0% instances), acl:relcl (4; 0% instances), appos (3; 0% instances), flat (3; 0% instances), vocative (3; 0% instances), det (2; 0% instances), nmod:poss (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (2150; 31% instances), VERB (1747; 25% instances), PUNCT (1365; 20% instances), ADV (438; 6% instances), PRON (327; 5% instances), ADJ (281; 4% instances), ADP (213; 3% instances), PROPN (97; 1% instances), NUM (67; 1% instances), SCONJ (52; 1% instances), PART (47; 1% instances), AUX (46; 1% instances), CCONJ (23; 0% instances), X (22; 0% instances), DET (13; 0% instances), INTJ (2; 0% instances)