
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: ADV

There are 98 ADV lemmas (2%), 1 ADV type (6%) and 26527 ADV tokens (4%). Out of 16 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: _، w، TBupdate، None، f، 6، l، “، 4، 5

The 10 most frequent ADV types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 216429, PUNCT 72574, ADJ 66760, ADP 62646, VERB 54473, PROPN 48965, ADV 26129, SCONJ 23987, NUM 15122, AUX 6581, DET 6330, PART 5856, CCONJ 5168, PRON 2460, INTJ 54, X 32), w (CCONJ 43321, NOUN 190, PUNCT 136, ADP 120, ADV 117, PROPN 78, VERB 71, SCONJ 69, ADJ 55, PRON 33, PART 10, DET 9, NUM 8, AUX 5, X 3), TBupdate (NOUN 401, ADJ 280, VERB 263, X 174, ADV 74, PROPN 69, ADP 4, SCONJ 2, CCONJ 1, DET 1, PART 1, PRON 1), None (NOUN 457, X 344, VERB 264, ADJ 125, PROPN 124, ADV 34, CCONJ 20, PRON 16, SCONJ 16, PART 14, ADP 8, DET 6, AUX 2), f (CCONJ 1247, AUX 459, PART 441, ADV 18, NOUN 12, SCONJ 8, PUNCT 7, VERB 4, ADP 3, ADJ 2, NUM 2, PRON 2), 6 (ADV 7, CCONJ 2), l (ADP 15449, PART 123, NOUN 98, AUX 67, CCONJ 33, ADJ 30, PUNCT 19, VERB 9, SCONJ 8, PROPN 7, ADV 6, PRON 5, DET 2, INTJ 2, NUM 1, X 1), “ (NOUN 112, ADP 34, CCONJ 20, PROPN 20, ADJ 12, VERB 8, PART 6, PRON 6, SCONJ 6, ADV 5, AUX 2, DET 2, X 2), 4 (ADV 5, ADJ 1, ADP 1), 5 (ADV 5, NOUN 1)

The 10 most frequent ambiguous types: _ (NOUN 218254, ADP 91694, PUNCT 75148, ADJ 67604, PROPN 58325, VERB 55215, CCONJ 50032, PRON 31239, ADV 26527, SCONJ 26034, NUM 15147, PART 8612, AUX 7723, DET 6362, X 917, INTJ 56)

Morphology

The form / lemma ratio of ADV is 0.010204 (the average of all parts of speech is 0.002933).
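The reported ratio is consistent with dividing the number of distinct ADV types (1) by the number of distinct ADV lemmas (98), both taken from the counts at the top of this page. The following is a minimal sketch under that assumption; the function name `form_lemma_ratio` is illustrative and not part of any UD tool.

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Distinct word forms per distinct lemma for one part of speech."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


# Counts for ADV as reported on this page: 1 type, 98 lemmas.
adv_ratio = form_lemma_ratio(1, 98)
print(f"{adv_ratio:.6f}")  # 0.010204
```

A ratio this far below 1 means there are more lemmas than distinct forms, which fits a treebank whose word forms are all replaced by the placeholder _.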

The 1st highest number of forms (1) was observed with the lemma “””: _.

The 2nd highest number of forms (1) was observed with the lemma “,”: _.

The 3rd highest number of forms (1) was observed with the lemma “.”: _.

ADV occurs with 8 features: Gender (24659; 93% instances), Number (24659; 93% instances), Definite (24343; 92% instances), Case (20740; 78% instances), Person (328; 1% instances), Mood (309; 1% instances), Voice (303; 1% instances), Polarity (2; 0% instances)

ADV occurs with 20 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Voice=Act, Voice=Pass

ADV occurs with 65 feature combinations. The most frequent feature combination is Case=Acc|Definite=Com|Gender=Masc|Number=Sing (13319 tokens). Examples: _

Relations

ADV nodes are attached to their parents using 14 different relations: advmod (23482; 89% instances), nmod (1194; 5% instances), dep (742; 3% instances), root (486; 2% instances), obj (292; 1% instances), conj (149; 1% instances), parataxis (89; 0% instances), nsubj (31; 0% instances), aux (26; 0% instances), nmod:poss (17; 0% instances), iobj (10; 0% instances), nsubj:pass (6; 0% instances), mark (2; 0% instances), ccomp (1; 0% instances)
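The percentages in the list above can be reproduced from the raw counts. The sketch below assumes each percentage is the relation count divided by the total number of ADV tokens, rounded to a whole number; the helper name `shares` is hypothetical.

```python
# Relation counts for ADV nodes, copied from the list above.
relation_counts = {
    "advmod": 23482, "nmod": 1194, "dep": 742, "root": 486, "obj": 292,
    "conj": 149, "parataxis": 89, "nsubj": 31, "aux": 26, "nmod:poss": 17,
    "iobj": 10, "nsubj:pass": 6, "mark": 2, "ccomp": 1,
}


def shares(counts):
    """Return each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


total_adv = sum(relation_counts.values())
relation_shares = shares(relation_counts)
print(total_adv)                 # 26527, the ADV token count
print(relation_shares["advmod"])  # 89
```

The counts sum to 26527, the number of ADV tokens, so every ADV node is covered by exactly one incoming relation.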

Parents of ADV nodes belong to 15 different parts of speech: VERB (13775; 52% instances), NOUN (8786; 33% instances), ADJ (946; 4% instances), CCONJ (918; 3% instances), ADV (699; 3% instances), [no tag; the artificial root node] (486; 2% instances), PROPN (354; 1% instances), PRON (212; 1% instances), PART (132; 0% instances), NUM (64; 0% instances), PUNCT (46; 0% instances), SCONJ (38; 0% instances), DET (31; 0% instances), X (28; 0% instances), AUX (12; 0% instances)

5629 (21%) ADV nodes are leaves.

15814 (60%) ADV nodes have one child.

3006 (11%) ADV nodes have two children.

2078 (8%) ADV nodes have three or more children.

The highest child degree of an ADV node is 19.
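The child-degree figures above group ADV nodes by how many dependents they have. The sketch below shows one way such buckets could be computed from CoNLL-U style (ID, HEAD, UPOS) triples; the function name and the four-token sentence are made up for illustration and are not taken from the treebank.

```python
from collections import Counter


def child_degree_buckets(tokens, upos="ADV"):
    """Bucket nodes with the given UPOS tag by their number of children.

    tokens: (id, head, upos) triples for one sentence; head == 0 marks the root.
    """
    # Count how many tokens point at each head id.
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, tag in tokens:
        if tag != upos:
            continue
        degree = n_children[tok_id]
        label = {0: "leaf", 1: "one", 2: "two"}.get(degree, "three_or_more")
        buckets[label] += 1
    return buckets


# Toy sentence (hypothetical): two ADV nodes attached to a VERB root.
toy = [
    (1, 2, "ADV"),   # no dependents: a leaf
    (2, 0, "VERB"),
    (3, 2, "ADV"),   # one dependent (token 4)
    (4, 3, "NOUN"),
]
buckets = child_degree_buckets(toy)
print(dict(buckets))
```

Summing the four reported buckets (5629 + 15814 + 3006 + 2078) gives 26527, the total number of ADV tokens, so the buckets partition all ADV nodes.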

Children of ADV nodes are attached using 25 different relations: nmod:poss (14926; 49% instances), nmod (3862; 13% instances), case (1810; 6% instances), punct (1803; 6% instances), cc (1537; 5% instances), ccomp (1145; 4% instances), mark (727; 2% instances), obj (642; 2% instances), conj (641; 2% instances), amod (567; 2% instances), advmod (557; 2% instances), dep (520; 2% instances), xcomp (447; 1% instances), nsubj (425; 1% instances), parataxis (388; 1% instances), cop (296; 1% instances), det (166; 1% instances), nummod (91; 0% instances), flat (22; 0% instances), aux (12; 0% instances), csubj (12; 0% instances), flat:name (6; 0% instances), iobj (5; 0% instances), nsubj:pass (4; 0% instances), appos (1; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: NOUN (16160; 53% instances), PRON (1888; 6% instances), VERB (1859; 6% instances), ADP (1814; 6% instances), PUNCT (1803; 6% instances), PROPN (1775; 6% instances), CCONJ (1543; 5% instances), ADJ (954; 3% instances), SCONJ (890; 3% instances), ADV (699; 2% instances), AUX (373; 1% instances), DET (289; 1% instances), PART (277; 1% instances), NUM (255; 1% instances), X (30; 0% instances), INTJ (3; 0% instances)