Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: ADV
There are 167 ADV
lemmas (3%), 359 ADV
types (4%) and 875 ADV
tokens (4%).
Out of 16 observed tags, the rank of ADV
is: 6 in number of lemmas, 6 in number of types and 10 in number of tokens.
The 10 most frequent ADV
lemmas: pas, toujours, quand, même, bien, plus, tous, beaucoup, pourquoi, alors
The 10 most frequent ADV
types: pas, ki, dima, bien, ga3, plus, meme, jamais, bark, vraiment
The 10 most frequent ambiguous lemmas: pas (PART 124, ADV 55, NOUN 2, VERB 1), toujours (ADV 54, VERB 1), quand (ADV 33, ADP 2, SCONJ 2, PRON 1), bien (ADV 34, ADJ 19, NOUN 18), plus (ADV 34, PUNCT 5, ADJ 2), tous (ADV 31, ADJ 27, PRON 7, DET 3, NOUN 3), pourquoi (ADV 25, ADP 2), alors (ADV 16, PART 3, ADP 1), comment (ADV 19, SCONJ 1), où (ADV 18, PRON 3)
The 10 most frequent ambiguous types: pas (ADV 48, NOUN 1), ki (ADV 34, ADP 15, PRON 4, SCONJ 4), bien (ADV 28, ADJ 3, INTJ 1), ga3 (ADV 28, ADJ 11), wach (PRON 20, ADV 12, DET 3), ni (ADV 11, CCONJ 4, VERB 1), sur (ADV 11, ADJ 5, ADP 4), hna (PRON 30, ADV 10), win (ADV 10, PRON 1), hata (ADP 12, ADV 9)
- pas
- ki
- bien
- ga3
- wach
- ni
- ADV 11: vive mouloudia ni 3anger ni mess3oudi c est le mouloudia ki reste
- CCONJ 4: ni amtar ni walou hadi darba ta3 l 3abd machi ta3 rabi
- VERB 1: le numero 4 la yougharid kharij e sserb rapelle toi seddam et l eglise a katar et ley et tu pense que madjliss el 3ar a raison et tu pense que el djazira ni pas fitna el kaddafi et son peuple el bar
- sur
- hna
- win
- hata
Morphology
The form / lemma ratio of ADV
is 2.149701 (the average of all parts of speech is 1.474223).
The 1st highest number of forms (17) was observed with the lemma “pourquoi”: 3alach, 3lach, 3lache, 3lah, 3lash, chhal, limada, liyah, malkoum, pourquoi, pq, prkoi, w3alh, w3lahe, wa3lah, wa3leche, wach.
The 2nd highest number of forms (14) was observed with the lemma “beaucoup”: bazaf, bazafe, bazaffffffffffffffffffffffffffffffffffff, bazef, bcp, beaucoup, bezaf, bezaf;, beze, bezzaf, bq, bzf, jalma, katir.
The 3rd highest number of forms (13) was observed with the lemma “même”: 7ta, ;mem, hat, hata, hatan, hath, hatta, hetta, hta, htta, meme, méme, même.
ADV
occurs with 3 features: Polarity (76; 9% instances), Typo (14; 2% instances), AdpType (7; 1% instances)
ADV
occurs with 3 feature-value pairs: AdpType=Prep
, Polarity=Neg
, Typo=Yes
ADV
occurs with 5 feature combinations.
The most frequent feature combination is _
(779 tokens).
Examples: ki, dima, bien, ga3, plus, meme, jamais, bark, vraiment, wach
Relations
ADV
nodes are attached to their parents using 14 different relations: advmod (787; 90% instances), mark (23; 3% instances), fixed (22; 3% instances), parataxis (10; 1% instances), conj (7; 1% instances), case (5; 1% instances), dep (5; 1% instances), obj (5; 1% instances), amod (2; 0% instances), nmod (2; 0% instances), nsubj (2; 0% instances), obl (2; 0% instances), root (2; 0% instances), discourse (1; 0% instances)
Parents of ADV
nodes belong to 14 different parts of speech: VERB (433; 49% instances), NOUN (193; 22% instances), ADJ (77; 9% instances), PRON (63; 7% instances), PROPN (54; 6% instances), ADV (30; 3% instances), ADP (13; 1% instances), AUX (3; 0% instances), NUM (3; 0% instances), (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
788 (90%) ADV
nodes are leaves.
68 (8%) ADV
nodes have one child.
12 (1%) ADV
nodes have two children.
7 (1%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 6.
Children of ADV
nodes are attached using 18 different relations: fixed (20; 17% instances), cc (19; 16% instances), case (17; 14% instances), advmod (14; 12% instances), goeswith (14; 12% instances), conj (6; 5% instances), parataxis (5; 4% instances), det (4; 3% instances), obl (3; 3% instances), amod (2; 2% instances), ccomp (2; 2% instances), discourse (2; 2% instances), nmod (2; 2% instances), nsubj (2; 2% instances), punct (2; 2% instances), vocative (2; 2% instances), advcl (1; 1% instances), mark (1; 1% instances)
Children of ADV
nodes belong to 14 different parts of speech: ADV (30; 25% instances), CCONJ (19; 16% instances), ADP (18; 15% instances), X (14; 12% instances), PRON (8; 7% instances), VERB (8; 7% instances), NOUN (7; 6% instances), ADJ (3; 3% instances), DET (3; 3% instances), INTJ (2; 2% instances), PUNCT (2; 2% instances), SCONJ (2; 2% instances), PART (1; 1% instances), PROPN (1; 1% instances)