home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Uyghur-UDT: POS Tags: ADV

There are 104 ADV lemmas (3%), 161 ADV types (1%) and 981 ADV tokens (2%). Out of 16 observed tags, the rank of ADV is: 4 in number of lemmas, 6 in number of types and 8 in number of tokens.

The 10 most frequent ADV lemmas: _، يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل

The 10 most frequent ADV types: يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن

The 10 most frequent ambiguous lemmas: _ (VERB 4247, NOUN 4246, AUX 501, PRON 479, PUNCT 396, ADJ 326, ADV 157, PART 119, NUM 77, CCONJ 75, X 64, ADP 56, INTJ 47, DET 28), شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17), قېتىم (ADV 28, DET 4), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), بولغاي (ADV 3, AUX 1), ئاخىر (NOUN 27, ADV 2), شۇنچىلىك (ADV 2, DET 1), بىللە (ADJ 19, ADV 1)

The 10 most frequent ambiguous types: شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17, PRON 7), قېتىم (ADV 28, DET 4), ئەمەس (AUX 30, ADV 10), كېيىن (NOUN 79, ADV 10), ئەمدى (NOUN 22, ADV 9), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), كىيىن (ADV 5, NOUN 2, VERB 1), بىردەمدىن (NOUN 6, ADV 4)

Morphology

The form / lemma ratio of ADV is 1.548077 (the average of all parts of speech is 4.182394).

The 1st highest number of forms (58) was observed with the lemma “_”: ئاخىرىغىچە, ئانچە-مۇنچە, ئوغرىلىقچە, ئەتىگەنلىكى, ئەجەپ, ئەمدى, بارا-بارا, بارىچە, بالدۇرلا, باياتىن, بىرئاز, بىردىنبىر, بىردەم, بىردەمدىن, بىرقەدەر, بۆلەكچىلا, بۇرۇن, بۇرۇننىڭ, بەكرەك, بەكلا, بەكمۇ, تاراملاپ, تولىمۇ, تېخىمۇ, تېخىچىلا, تەرلەپ, تەستىقلىنىشتا, خېلىلا, دۈم, دەس, راستتىنلا, زادىلا, شۇئان, قىشىچە, كىيىن, كۈنسايىن, كۈنسېرى, كېيىن, كېيىنلا, كېچىلەپ, كەچقۇرۇنلۇقى, لىققىدە, لەپىلدەپ, نېرى-بېرى, نېرىدا, پات-پات, پىيادىلەر, پېتى, ھازىردىن, ھازىرغىچە, ھازىرلا, ھازىرمۇ, ھۈپپىدە, ھېلىلا, ھېلىھەم, ھەدەپ, ھەرگىزمۇ, ھەقىقەتەنمۇ.

The 2nd highest number of forms (1) was observed with the lemma “ئاخىر”: ئاخىر.

The 3rd highest number of forms (1) was observed with the lemma “ئارىلاپ”: ئارىلاپ.

ADV occurs with 1 features: PronType (48; 5% instances)

ADV occurs with 1 feature-value pairs: PronType=Int

ADV occurs with 2 feature combinations. The most frequent feature combination is _ (933 tokens). Examples: يەنە، شۇنداق، بەك، شۇڭا، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن، ناھايىتى

Relations

ADV nodes are attached to their parents using 23 different relations: advmod (654; 67% instances), cc (64; 7% instances), nmod (41; 4% instances), amod (28; 3% instances), mark (25; 3% instances), obl (20; 2% instances), compound (19; 2% instances), discourse (19; 2% instances), compound:redup (18; 2% instances), advmod:emph (14; 1% instances), fixed (12; 1% instances), root (12; 1% instances), case (9; 1% instances), nsubj (8; 1% instances), parataxis (8; 1% instances), nmod:tmod (7; 1% instances), advcl (5; 1% instances), ccomp (5; 1% instances), conj (5; 1% instances), dep (3; 0% instances), compound:lvc (2; 0% instances), nummod (2; 0% instances), flat (1; 0% instances)

Parents of ADV nodes belong to 11 different parts of speech: VERB (619; 63% instances), NOUN (172; 18% instances), ADJ (111; 11% instances), NUM (24; 2% instances), ADV (19; 2% instances), PRON (13; 1% instances), (12; 1% instances), AUX (5; 1% instances), ADP (2; 0% instances), CCONJ (2; 0% instances), DET (2; 0% instances)

789 (80%) ADV nodes are leaves.

149 (15%) ADV nodes have one child.

25 (3%) ADV nodes have two children.

18 (2%) ADV nodes have three or more children.

The highest child degree of a ADV node is 6.

Children of ADV nodes are attached using 24 different relations: punct (122; 47% instances), nsubj (21; 8% instances), advmod (17; 7% instances), nummod (17; 7% instances), fixed (15; 6% instances), compound (11; 4% instances), compound:redup (8; 3% instances), nmod (8; 3% instances), cop (6; 2% instances), det (6; 2% instances), mark (5; 2% instances), nmod:poss (5; 2% instances), amod (3; 1% instances), cc (3; 1% instances), advmod:emph (2; 1% instances), aux (2; 1% instances), obj (2; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), case (1; 0% instances), compound:lvc (1; 0% instances), discourse (1; 0% instances), nmod:tmod (1; 0% instances), parataxis (1; 0% instances)

Children of ADV nodes belong to 13 different parts of speech: PUNCT (122; 47% instances), NOUN (39; 15% instances), ADV (19; 7% instances), PRON (18; 7% instances), NUM (17; 7% instances), VERB (17; 7% instances), AUX (7; 3% instances), ADJ (6; 2% instances), ADP (6; 2% instances), DET (6; 2% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)