home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: ADV

There are 266 ADV lemmas (2%), 253 ADV types (2%) and 1386 ADV tokens (1%). Out of 16 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent ADV lemmas: پیش، بعد، پھر، پہلے، بہت، مزید، سے، دوبارہ، جلد، دوران

The 10 most frequent ADV types: پیش، بعد، پہلے، پھر، بہت، سے، مزید، دوبارہ، جلد، انتہائی

The 10 most frequent ambiguous lemmas: پیش (ADV 130, ADJ 31, ADP 9, NOUN 6), بعد (ADP 260, ADV 97, NOUN 1, PRON 1), پھر (ADV 41, PART 18, SCONJ 1), پہلے (ADV 41, ADJ 11, ADP 11), بہت (ADV 38, DET 13, ADJ 6), مزید (ADJ 43, ADV 36, DET 9), سے (ADP 2504, ADV 35, PART 31, PROPN 2, ADJ 1), دوبارہ (ADV 30, ADJ 3), جلد (ADV 28, NOUN 3, DET 1), دوران (ADP 119, ADV 26, NOUN 4)

The 10 most frequent ambiguous types: پیش (ADV 130, ADJ 31, ADP 9, NOUN 5, VERB 1), بعد (ADP 261, ADV 94), پہلے (ADV 46, ADJ 38, ADP 13), پھر (ADV 41, PART 18, SCONJ 1), بہت (ADV 38, DET 13, ADJ 6), سے (ADP 2510, ADV 36, PART 31, PROPN 2, ADJ 1), مزید (ADJ 43, ADV 36, DET 9), دوبارہ (ADV 30, ADJ 3), جلد (ADV 28, NOUN 3, DET 1), انتہائی (ADV 27, ADJ 9)

Morphology

The form / lemma ratio of ADV is 0.951128 (the average of all parts of speech is 1.103404).

The 1st highest number of forms (3) was observed with the lemma “بعد”: بعد, بعد_ازاں, بعدازاں.

The 2nd highest number of forms (3) was observed with the lemma “بعدازاں”: ازاں, بعد_ازاں, بعدازاں.

The 3rd highest number of forms (3) was observed with the lemma “قبل”: ازیں, قبل, قبل_ازیں.

ADV occurs with 9 features: Case (475; 34% instances), Number (444; 32% instances), Gender (441; 32% instances), Person (412; 30% instances), AdpType (389; 28% instances), AdvType (145; 10% instances), Echo (4; 0% instances), Aspect (1; 0% instances), VerbForm (1; 0% instances)

ADV occurs with 12 feature-value pairs: AdpType=Post, AdvType=Deg, Aspect=Perf, Case=Acc, Case=Nom, Echo=Rdp, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3, VerbForm=Part

ADV occurs with 27 feature combinations. The most frequent feature combination is _ (742 tokens). Examples: پھر، دوبارہ، جلد، مزید، ہمیشہ، سے، ہنوز، تقریباً، فی_الحال، پیش

Relations

ADV nodes are attached to their parents using 15 different relations: advmod (848; 61% instances), compound (181; 13% instances), mark (106; 8% instances), amod (81; 6% instances), obl (66; 5% instances), case (37; 3% instances), dislocated (16; 1% instances), nmod (16; 1% instances), conj (12; 1% instances), nsubj (8; 1% instances), obj (7; 1% instances), acl:relcl (2; 0% instances), cc (2; 0% instances), dep (2; 0% instances), root (2; 0% instances)

Parents of ADV nodes belong to 12 different parts of speech: VERB (943; 68% instances), ADJ (184; 13% instances), NOUN (143; 10% instances), ADV (56; 4% instances), PROPN (25; 2% instances), ADP (10; 1% instances), NUM (8; 1% instances), DET (7; 1% instances), PRON (6; 0% instances), (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)

1139 (82%) ADV nodes are leaves.

203 (15%) ADV nodes have one child.

31 (2%) ADV nodes have two children.

13 (1%) ADV nodes have three or more children.

The highest child degree of a ADV node is 8.

Children of ADV nodes are attached using 18 different relations: dep (47; 15% instances), case (44; 14% instances), nmod (34; 11% instances), amod (32; 10% instances), compound (31; 10% instances), obl (29; 9% instances), det (22; 7% instances), advmod (15; 5% instances), nsubj (12; 4% instances), cc (10; 3% instances), acl:relcl (9; 3% instances), conj (8; 3% instances), obj (8; 3% instances), cop (7; 2% instances), advcl (4; 1% instances), punct (4; 1% instances), mark (2; 1% instances), dislocated (1; 0% instances)

Children of ADV nodes belong to 13 different parts of speech: ADV (56; 18% instances), NOUN (56; 18% instances), PART (51; 16% instances), ADP (42; 13% instances), ADJ (34; 11% instances), DET (22; 7% instances), PROPN (15; 5% instances), VERB (13; 4% instances), PRON (11; 3% instances), AUX (7; 2% instances), CCONJ (7; 2% instances), PUNCT (4; 1% instances), SCONJ (1; 0% instances)