
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: ADV

There are 8 ADV lemmas (0%), 1 ADV type (6%) and 24067 ADV tokens (3%). Out of 16 observed tags, the rank of ADV is: 13 in number of lemmas, 3 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: _، lA، mA، ,، l، “، 6، f

The 10 most frequent ADV types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), lA (ADV 100, PROPN 6, ADP 1, PART 1), mA (SCONJ 667, PRON 320, ADV 5, VERB 1), , (PUNCT 254, CCONJ 68, NOUN 9, ADJ 8, ADP 7, NUM 5, SCONJ 4, VERB 4, PRON 3, PROPN 3, ADV 2, DET 1, PART 1), l (ADP 15628, PART 165, NOUN 29, SCONJ 28, ADV 2, VERB 2, ADJ 1, DET 1, NUM 1, PROPN 1, PUNCT 1, X 1), “ (PUNCT 207, ADP 6, PROPN 6, CCONJ 5, NOUN 5, ADJ 2, PART 2, ADV 1, VERB 1), 6 (NUM 8, ADV 1), f (CCONJ 1360, PART 814, SCONJ 18, PRON 4, VERB 3, NOUN 2, ADP 1, ADV 1, AUX 1, PUNCT 1)

The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91742, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49158, PRON 43493, ADV 24067, SCONJ 16614, NUM 15377, AUX 9163, DET 6363, PART 2519, X 927, INTJ 56)

Morphology

The form / lemma ratio of ADV is 0.125000 (the average of all parts of speech is 0.003041).
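The reported ratio is consistent with dividing the number of distinct ADV word forms (1) by the number of distinct ADV lemmas (8). The following is a minimal sketch of that calculation, assuming tokens are available as (form, lemma, upos) tuples. The function name and the toy data are hypothetical and are not part of the UD tooling.

```python
def form_lemma_ratio(tokens, upos):
    """Return distinct forms divided by distinct lemmas for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) tuples (an assumed layout).
    """
    forms = {form for form, _lemma, tag in tokens if tag == upos}
    lemmas = {lemma for _form, lemma, tag in tokens if tag == upos}
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Toy data mirroring this page: eight ADV lemmas, all with the form "_".
adv_lemmas = ["_", "lA", "mA", ",", "l", "“", "6", "f"]
toy_tokens = [("_", lemma, "ADV") for lemma in adv_lemmas]

ratio = form_lemma_ratio(toy_tokens, "ADV")
print(f"{ratio:.6f}")  # prints 0.125000
```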

The 1st highest number of forms (1) was observed with the lemma “””: _.

The 2nd highest number of forms (1) was observed with the lemma “,”: _.

The 3rd highest number of forms (1) was observed with the lemma “6”: _.

ADV occurs with 10 features: Gender (19509; 81% instances), Number (19509; 81% instances), Definite (19507; 81% instances), Case (16050; 67% instances), Polarity (4526; 19% instances), AdpType (26; 0% instances), Mood (2; 0% instances), Person (2; 0% instances), Voice (2; 0% instances), PronType (1; 0% instances)

ADV occurs with 16 feature-value pairs: AdpType=Prep, Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Number=Plur, Number=Sing, Person=3, Polarity=Neg, PronType=Rel, Voice=Act

ADV occurs with 22 feature combinations. The most frequent feature combination is Case=Acc|Definite=Com|Gender=Masc|Number=Sing (12933 tokens). Examples: _

Relations

ADV nodes are attached to their parents using 8 different relations: advmod (18736; 78% instances), obj (2738; 11% instances), iobj (1131; 5% instances), nmod (1095; 5% instances), root (348; 1% instances), nsubj (14; 0% instances), nmod:poss (3; 0% instances), mark (2; 0% instances)
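The percentages in the relation list can be checked against the raw counts. This is a small sketch using only the numbers printed above. Rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Counts of incoming relations for ADV nodes, copied from the list above.
relation_counts = {
    "advmod": 18736,
    "obj": 2738,
    "iobj": 1131,
    "nmod": 1095,
    "root": 348,
    "nsubj": 14,
    "nmod:poss": 3,
    "mark": 2,
}

# The counts should add up to the number of ADV tokens (24067).
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```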

Parents of ADV nodes belong to 13 different parts of speech: VERB (12713; 53% instances), NOUN (8161; 34% instances), ADJ (976; 4% instances), ADV (826; 3% instances), PRON (487; 2% instances), [root, no part of speech] (348; 1% instances), PROPN (342; 1% instances), NUM (90; 0% instances), CCONJ (56; 0% instances), DET (23; 0% instances), X (22; 0% instances), PART (14; 0% instances), AUX (9; 0% instances)

5790 (24%) ADV nodes are leaves.

13213 (55%) ADV nodes have one child.

2917 (12%) ADV nodes have two children.

2147 (9%) ADV nodes have three or more children.

The highest child degree of an ADV node is 16.
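The leaf and child-degree counts above can be derived from the HEAD column of the treebank. The following is a minimal sketch, assuming each sentence is a list of (id, head, upos) tuples read from a CoNLL-U file. The function name and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Count nodes tagged `upos` by number of children: 0, 1, 2 or 3+."""
    buckets = Counter()
    for sentence in sentences:
        # Number of children per node id, taken from the HEAD values.
        n_children = Counter(head for _id, head, _tag in sentence)
        for tok_id, _head, tag in sentence:
            if tag == upos:
                degree = n_children[tok_id]
                buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


# Toy sentence: token 3 (ADV) governs tokens 4 and 5; token 5 (ADV) is a leaf.
toy_sentence = [
    (1, 2, "NOUN"),
    (2, 0, "VERB"),
    (3, 2, "ADV"),
    (4, 3, "NOUN"),
    (5, 3, "ADV"),
]

buckets = child_degree_buckets([toy_sentence], "ADV")
print(dict(buckets))
```

On this page the four buckets sum to the ADV token count: 5790 + 13213 + 2917 + 2147 = 24067.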

Children of ADV nodes are attached using 21 different relations: nmod:poss (14233; 51% instances), case (2471; 9% instances), nmod (1752; 6% instances), punct (1698; 6% instances), cc (1343; 5% instances), ccomp (1269; 5% instances), obj (1144; 4% instances), nummod (926; 3% instances), amod (674; 2% instances), xcomp (490; 2% instances), mark (456; 2% instances), advmod (420; 2% instances), nsubj (347; 1% instances), cop (264; 1% instances), appos (107; 0% instances), iobj (50; 0% instances), aux (35; 0% instances), acl (13; 0% instances), dep (2; 0% instances), compound (1; 0% instances), discourse (1; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: NOUN (12835; 46% instances), ADP (2471; 9% instances), PROPN (2025; 7% instances), VERB (1753; 6% instances), PUNCT (1698; 6% instances), PRON (1556; 6% instances), CCONJ (1354; 5% instances), ADJ (1160; 4% instances), NUM (962; 3% instances), ADV (826; 3% instances), SCONJ (315; 1% instances), AUX (303; 1% instances), DET (240; 1% instances), PART (147; 1% instances), X (50; 0% instances), INTJ (1; 0% instances)