home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sindhi-Isra: POS Tags: ADV

There are 226 ADV lemmas (4%), 267 ADV types (2%) and 3118 ADV tokens (3%). Out of 15 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: جڏهن, اتي, هاڻ, وري, گڏ, پوءِ, اڄ, جيئن, جاري, ڏانهن

The 10 most frequent ADV types: جڏهن, وري, اتي, هاڻي, پوءِ, اڄ, جيئن, جاري, ڏانهن, تمام

The 10 most frequent ambiguous lemmas: اتي (ADV 175, NOUN 7), وري (ADV 146, VERB 5), گڏ (ADV 113, VERB 9, NOUN 4), پوءِ (ADP 135, ADV 105), اڄ (ADV 86, NOUN 4), جيئن (ADV 79, PRON 1), جاري (ADV 74, VERB 4, ADJ 2), ڏانهن (ADV 74, ADP 14), مٿي (ADV 73, NOUN 11, ADJ 3, ADP 2), تمام (ADV 69, ADJ 12)

The 10 most frequent ambiguous types: وري (ADV 146, VERB 3), اتي (ADV 145, NOUN 5), پوءِ (ADP 135, ADV 105, SCONJ 3), اڄ (ADV 86, NOUN 4), جيئن (ADV 79, PRON 1), جاري (ADV 74, VERB 4, ADJ 2), ڏانهن (ADV 74, ADP 14), تمام (ADV 69, ADJ 12), گڏ (ADV 69, NOUN 4, VERB 1), اڳتي (ADV 65, ADJ 2)

Morphology

The form / lemma ratio of ADV is 1.181416 (the average of all parts of speech is 1.872520).

The 1st highest number of forms (23) was observed with the lemma “_”: ئ, اوهين, تنهن, توسان, جيتري, جيڏيون, جيڪي, شل, مهرباني, نافذ, هرطرف, هروڀرو, هوريان, هيءُ, هيٺيان, واقعي, وانگيان, وٽان, وڌيڪ, يعني, پاسي, پٺٽي, گڏو.

The 2nd highest number of forms (5) was observed with the lemma “مٿي”: مٿان, مٿس, مٿن, مٿي, مٿين.

The 3rd highest number of forms (4) was observed with the lemma “هيٺ”: هيٺ, هيٺئين, هيٺان, هيٺين.

ADV occurs with 4 features: Number (78; 3% instances), Case (61; 2% instances), Gender (53; 2% instances), Person (4; 0% instances)

ADV occurs with 7 feature-value pairs: Case=Acc, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3

ADV occurs with 19 feature combinations. The most frequent feature combination is _ (3032 tokens). Examples: جڏهن, وري, اتي, هاڻي, پوءِ, اڄ, جاري, جيئن, ڏانهن, تمام

Relations

ADV nodes are attached to their parents using 20 different relations: advmod (2733; 88% instances), amod (90; 3% instances), obl (80; 3% instances), case (68; 2% instances), compound (29; 1% instances), mark (27; 1% instances), root (22; 1% instances), xcomp (18; 1% instances), advcl (14; 0% instances), compound:redup (10; 0% instances), nsubj (6; 0% instances), dep (5; 0% instances), acl (4; 0% instances), obj (3; 0% instances), cc (2; 0% instances), conj (2; 0% instances), nmod (2; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances), parataxis (1; 0% instances)

Parents of ADV nodes belong to 12 different parts of speech: VERB (2453; 79% instances), NOUN (350; 11% instances), ADJ (158; 5% instances), ADV (50; 2% instances), AUX (27; 1% instances), (22; 1% instances), PROPN (18; 1% instances), DET (16; 1% instances), PRON (15; 0% instances), NUM (7; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

2538 (81%) ADV nodes are leaves.

513 (16%) ADV nodes have one child.

36 (1%) ADV nodes have two children.

31 (1%) ADV nodes have three or more children.

The highest child degree of a ADV node is 7.

Children of ADV nodes are attached using 23 different relations: advmod:emph (237; 33% instances), nmod (107; 15% instances), obl (97; 13% instances), case (58; 8% instances), punct (35; 5% instances), advmod (33; 5% instances), cop (25; 3% instances), amod (24; 3% instances), nsubj (22; 3% instances), mark (18; 3% instances), compound (15; 2% instances), compound:redup (10; 1% instances), advcl (9; 1% instances), dep (9; 1% instances), cc (4; 1% instances), obj (4; 1% instances), conj (3; 0% instances), det (3; 0% instances), aux (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PART (251; 35% instances), NOUN (159; 22% instances), ADP (62; 9% instances), ADV (50; 7% instances), PUNCT (35; 5% instances), AUX (30; 4% instances), DET (30; 4% instances), ADJ (25; 3% instances), PRON (21; 3% instances), PROPN (20; 3% instances), SCONJ (15; 2% instances), VERB (15; 2% instances), CCONJ (4; 1% instances), NUM (2; 0% instances), INTJ (1; 0% instances)