Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: VERB
There are 1956 VERB lemmas (36%), 2350 VERB types (29%) and 3487 VERB tokens (18%).
Out of 16 observed tags, the rank of VERB is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
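The lemma, type and token counts and the per-tag ranks above can be reproduced from (form, lemma, tag) triples. The following is a minimal sketch, not the script that generated this page: the toy data and the helper names `tag_stats` and `rank` are hypothetical, and it assumes that types are distinct word forms compared case-sensitively.

```python
from collections import defaultdict

# Hypothetical toy data: (form, lemma, upos) triples, not taken from the treebank.
TOKENS = [
    ("vive", "vivre", "VERB"),
    ("viva", "vivre", "VERB"),
    ("est", "est", "VERB"),
    ("est", "est", "VERB"),
    ("algerie", "algerie", "PROPN"),
    ("les", "le", "DET"),
]

def tag_stats(tokens):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas, types, counts = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)   # distinct lemmas seen with this tag
        types[upos].add(form)     # distinct word forms (assumed case-sensitive)
        counts[upos] += 1         # running token count
    return {t: (len(lemmas[t]), len(types[t]), counts[t]) for t in counts}

def rank(stats, tag, index):
    """1-based rank of `tag` by the statistic at `index`, largest first."""
    ordered = sorted(stats, key=lambda t: stats[t][index], reverse=True)
    return ordered.index(tag) + 1

stats = tag_stats(TOKENS)
print(stats["VERB"], rank(stats, "VERB", 2))
```

On the toy data, VERB has 2 lemmas, 3 types and 4 tokens, and ranks first by token count.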
The 10 most frequent VERB lemmas: vivre, est, c’est, a, nous_sommes, fait, soit, tu_es, suis, va
The 10 most frequent VERB types: vive, est, rana, c, a, viva, rah, rak, suis, yal3ab
The 10 most frequent ambiguous lemmas: est (VERB 77, ADP 1, PRON 1), c’est (VERB 46, PRON 4), fait (VERB 21, ADJ 1), soit (VERB 21, CCONJ 2), suis (VERB 18, PRON 1), être (VERB 14, AUX 11), allez (VERB 13, INTJ 3), gagner (VERB 13, ADJ 1), je_suis (VERB 12, PRON 3), a_pas (VERB 12, NOUN 2, PART 1)
The 10 most frequent ambiguous types: est (VERB 62, CCONJ 10, AUX 2, PRON 1), c (VERB 38, PRON 10, ADV 2, AUX 1, SCONJ 1), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), rah (VERB 20, AUX 1), kayen (VERB 11, ADJ 1), koun (VERB 11, SCONJ 2), faut (VERB 11, ADJ 3, NOUN 1), sont (VERB 11, AUX 1, DET 1), tahya (VERB 11, NOUN 1), makan (VERB 9, NOUN 3)
- est
- c
  - VERB 38: rouhi ya bladi rouhi b e slama c honteux les responssables nt3a albackss
  - PRON 10: bon c po grave douk nab3at equipe yraybouha w tasktoo 3lina ga3
  - ADV 2: star rabbi c non l ka3bét le mleh li 3anda izidou iroho !!!!
  - AUX 1: l essentiel on esper qu’ il va apporter qq et laisser ziani howa li ye3alam meghni c quoi l’ islame
  - SCONJ 1: rabi m3ak bi tawfik inchalah j c pas c j ai le froit de voter
- a
  - ADP 46: M3a boumediene jusqu’ a la mort
  - DET 41: allahoma la tohasibna bima yaf3aloho a tafihin
  - VERB 29: 3andou el hak makache football en algerie alors il a bien fait
  - AUX 17: حجج حجج حجج حجج حجج حجج comme il a dit n
  - INTJ 5: a 3ayeniya vive alg
  - PROPN 1: salam 3likoum khawti les algeriens nchallah ya rabi les verts yfarhouna f l a frique du sud w nroho le 2ème tour b rabi nchallah ad3ou m3ana ya r abi amine
  - X 1: ana dhad roj3 lamouchaia y3atik saha y a kader 02
- rah
- kayen
- koun
- faut
- sont
  - VERB 11: ellah yen3el hed ness ce sont des vrais animaux wallah ma3and’homch 9alb
  - AUX 1: Moutamaniyatona lakom bi Al Fawaz incha Allah tous les marocains Da3awatohom ma3akom ds ce mois de Ramadan al Karim ya Rab demain la victoire pour nos freres Algeriens de meme pour le Maroc contre le Togo meme si les chances pour se qualifier sont minimes de toute facon on sera tres contents au moins de voir l Algerie en coupe du monde
  - DET 1: on ne sait plus koi penser nous les Algeriens qui soutenir un dictateur qui tue sont peuple sans scupule ou bien des revolutionnaires qui travaillent pr les europeens et les americains ?? allah ansor al hak atna al kadir 3ala koul chay
- tahya
- makan
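The ambiguity lists above pair each word form with the tags it receives and how often. A minimal sketch of that tabulation follows. The toy data and the function name `ambiguous_types` are hypothetical, and ordering by the frequency of the tag of interest is an assumption that merely matches the order shown above.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: (form, upos) pairs, not taken from the treebank.
TOKENS = [
    ("a", "VERB"), ("a", "ADP"), ("a", "ADP"),
    ("c", "VERB"), ("c", "PRON"), ("c", "VERB"),
    ("rah", "VERB"),
]

def ambiguous_types(tokens, tag):
    """Forms seen with `tag` and at least one other tag, most `tag`-frequent first."""
    by_form = defaultdict(Counter)
    for form, upos in tokens:
        by_form[form][upos] += 1
    # Keep only forms that carry the tag of interest plus at least one other tag.
    ambiguous = {f: c for f, c in by_form.items() if tag in c and len(c) > 1}
    return sorted(ambiguous.items(), key=lambda item: item[1][tag], reverse=True)

result = ambiguous_types(TOKENS, "VERB")
for form, tags in result:
    summary = ", ".join(f"{t} {n}" for t, n in tags.most_common())
    print(f"{form} ({summary})")
```

Here "rah" is dropped because it only ever occurs as VERB, while "c" precedes "a" because it has more VERB occurrences.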
Morphology
The form / lemma ratio of VERB is 1.201431 (the average of all parts of speech is 1.474223).
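The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of this page. This is an inference from the figures, not a documented definition, and the variable names are illustrative:

```python
# Figures quoted on this page for VERB.
n_types = 2350
n_lemmas = 1956

# Assumption: "form / lemma ratio" means distinct forms (types) per lemma.
ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # prints 1.201431
```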
The highest number of forms (13) was observed with the lemma “est”: Est-, es, est, et, is, koun, rah, rahi, rahou, yekoun, è, èt, é.
The 2nd highest number of forms (12) was observed with the lemma “fait”: dar, daret, darou, dayer, dayra, dir, diri, fair, fais, fait, fi, khod.
The 3rd highest number of forms (12) was observed with the lemma “vivre”: 3ichi, ViVeeeeeeeeeeeeeeee, n3aichou, n3icho, tahia, tahiati, tahya, viiiiiiiiiiiiiive, viiiiiiiva, viva, vive, vivre.
VERB occurs with 9 features: Number (1715; 49% instances), Person (1715; 49% instances), Gender (1708; 49% instances), Mood (413; 12% instances), Polarity (223; 6% instances), VerbForm (143; 4% instances), AdpType (26; 1% instances), Typo (11; 0% instances), Tense (6; 0% instances)
VERB occurs with 17 feature-value pairs: AdpType=Prep, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Inf, VerbForm=Part
VERB occurs with 40 feature combinations.
The most frequent feature combination is _, i.e. no features (1037 tokens).
Examples: est, rana, c, a, rah, rak, suis, kayen, va, kan
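Features, feature-value pairs and feature combinations can all be counted from the FEATS column of a CoNLL-U file, where pairs are separated by `|` and `_` marks an empty feature set. The sketch below uses hypothetical toy values and a hypothetical helper name, `feature_stats`. It is not the code behind this page.

```python
from collections import Counter

# Hypothetical FEATS values for a handful of VERB tokens.
FEATS = [
    "Number=Sing|Person=3",
    "_",
    "Mood=Imp|Number=Sing|Person=2",
    "_",
    "Number=Sing|Person=3",
    "_",
]

def feature_stats(feats_column):
    """Count feature names, feature=value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1          # the whole string is one combination
        if feats == "_":            # "_" means the token has no features
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos

features, pairs, combos = feature_stats(FEATS)
print(len(features), len(pairs), len(combos), combos.most_common(1))
```

On the toy data this yields 3 features, 4 feature-value pairs and 3 combinations, with `_` as the most frequent combination.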
Relations
VERB nodes are attached to their parents using 20 different relations: parataxis (1281; 37% instances), root (1049; 30% instances), xcomp (518; 15% instances), ccomp (256; 7% instances), conj (185; 5% instances), acl:relcl (98; 3% instances), advcl (62; 2% instances), discourse (5; 0% instances), acl (4; 0% instances), fixed (4; 0% instances), nsubj (4; 0% instances), obj (4; 0% instances), obl (4; 0% instances), csubj (3; 0% instances), nmod (3; 0% instances), vocative (3; 0% instances), amod (1; 0% instances), appos (1; 0% instances), dep (1; 0% instances), dislocated (1; 0% instances)
Parents of VERB nodes belong to 10 different parts of speech: VERB (2157; 62% instances), no part of speech, i.e. the artificial root (1049; 30% instances), NOUN (120; 3% instances), PRON (87; 2% instances), PROPN (32; 1% instances), ADJ (26; 1% instances), ADV (8; 0% instances), INTJ (6; 0% instances), ADP (1; 0% instances), NUM (1; 0% instances)
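The incoming-relation and parent-tag counts can be tabulated together from the ID, UPOS, HEAD and DEPREL columns. In the sketch below the toy sentence and the name `parent_profile` are hypothetical. A head of 0 has no part of speech, so it is counted under an empty tag, which is presumably why the parent entry with 1049 instances matches the count of the root relation.

```python
from collections import Counter

# Hypothetical toy sentence as (id, upos, head, deprel); head 0 is the artificial root.
SENT = [
    (1, "PRON", 2, "nsubj"),
    (2, "VERB", 0, "root"),
    (3, "VERB", 2, "xcomp"),
    (4, "NOUN", 3, "obj"),
    (5, "VERB", 2, "parataxis"),
]

def parent_profile(sentence, tag):
    """Count incoming relations and parent tags for all nodes tagged `tag`."""
    upos_of = {node_id: upos for node_id, upos, _, _ in sentence}
    relations, parent_tags = Counter(), Counter()
    for _, upos, head, deprel in sentence:
        if upos == tag:
            relations[deprel] += 1
            # The artificial root (head 0) is not a token, hence the empty tag.
            parent_tags[upos_of.get(head, "")] += 1
    return relations, parent_tags

relations, parent_tags = parent_profile(SENT, "VERB")
print(relations, parent_tags)
```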
136 (4%) VERB nodes are leaves.
778 (22%) VERB nodes have one child.
1003 (29%) VERB nodes have two children.
1570 (45%) VERB nodes have three or more children.
The highest child degree of a VERB node is 15.
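The four child-degree buckets add up to the VERB token count (136 + 778 + 1003 + 1570 = 3487). A minimal sketch of how such buckets can be derived from the HEAD column follows. The toy sentence and the function name `child_degree_buckets` are hypothetical.

```python
from collections import Counter

# Hypothetical toy sentence as (id, upos, head) triples; head 0 marks the root.
SENT = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "NOUN", 2),
    (4, "VERB", 2),
    (5, "ADV", 4),
    (6, "VERB", 4),
]

def child_degree_buckets(sentence, tag):
    """Bucket nodes tagged `tag` by number of children: 0, 1, 2, or 3 (= three or more)."""
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, upos, _ in sentence:
        if upos == tag:
            buckets[min(n_children[node_id], 3)] += 1
    return buckets

buckets = child_degree_buckets(SENT, "VERB")
print(buckets[0], buckets[1], buckets[2], buckets[3])
```

In the toy sentence, node 2 has three children, node 4 has two and node 6 is a leaf.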
Children of VERB nodes are attached using 30 different relations: parataxis (1779; 19% instances), obj (1561; 17% instances), nsubj (1351; 15% instances), obl (1108; 12% instances), advmod (518; 6% instances), xcomp (517; 6% instances), cc (437; 5% instances), discourse (419; 5% instances), ccomp (259; 3% instances), vocative (194; 2% instances), mark (193; 2% instances), conj (192; 2% instances), punct (139; 2% instances), case (102; 1% instances), nmod (83; 1% instances), expl (59; 1% instances), advcl (58; 1% instances), iobj (38; 0% instances), aux (28; 0% instances), dislocated (28; 0% instances), dep (25; 0% instances), amod (20; 0% instances), expl:pv (12; 0% instances), goeswith (10; 0% instances), acl:relcl (8; 0% instances), csubj (4; 0% instances), det (4; 0% instances), compound (2; 0% instances), nummod (2; 0% instances), flat (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (2286; 25% instances), VERB (2157; 24% instances), PRON (1318; 14% instances), PROPN (1136; 12% instances), CCONJ (439; 5% instances), ADV (433; 5% instances), INTJ (404; 4% instances), ADJ (339; 4% instances), SCONJ (146; 2% instances), ADP (140; 2% instances), PUNCT (139; 2% instances), PART (120; 1% instances), NUM (41; 0% instances), AUX (29; 0% instances), X (14; 0% instances), DET (10; 0% instances)