Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: VERB
There are 1956 VERB lemmas (36%), 2350 VERB types (29%) and 3487 VERB tokens (18%).
Out of 16 observed tags, the rank of VERB is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
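The lemma, type and token counts above can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the two-sentence `SAMPLE` are hypothetical, and it assumes that a "type" is a distinct case-sensitive word form, which may differ from the convention used here.

```python
def pos_counts(conllu_text, upos="VERB"):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:          # column 4 is UPOS
            tokens += 1
            types.add(cols[1])       # column 2 is FORM
            lemmas.add(cols[2])      # column 3 is LEMMA
    return len(lemmas), len(types), tokens


def row(*cols):
    """Build one tab-separated CoNLL-U line (helper for the toy sample)."""
    return "\t".join(cols)


# Hypothetical toy data, not taken from the treebank files.
SAMPLE = "\n".join([
    row("1", "vive", "vivre", "VERB", "_", "_", "0", "root", "_", "_"),
    row("2", "algerie", "Algérie", "PROPN", "_", "_", "1", "nsubj", "_", "_"),
    "",
    row("1", "viva", "vivre", "VERB", "_", "_", "0", "root", "_", "_"),
    row("2", "algerie", "Algérie", "PROPN", "_", "_", "1", "nsubj", "_", "_"),
])

print(pos_counts(SAMPLE))  # (1, 2, 2): one lemma, two forms, two tokens
```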
The 10 most frequent VERB lemmas: vivre, est, c’est, a, nous_sommes, fait, soit, tu_es, suis, va
The 10 most frequent VERB types: vive, est, rana, c, a, viva, rah, rak, suis, yal3ab
The 10 most frequent ambiguous lemmas: est (VERB 77, ADP 1, PRON 1), c’est (VERB 46, PRON 4), fait (VERB 21, ADJ 1), soit (VERB 21, CCONJ 2), suis (VERB 18, PRON 1), être (VERB 14, AUX 11), allez (VERB 13, INTJ 3), gagner (VERB 13, ADJ 1), je_suis (VERB 12, PRON 3), a_pas (VERB 12, NOUN 2, PART 1)
The 10 most frequent ambiguous types: est (VERB 62, CCONJ 10, AUX 2, PRON 1), c (VERB 38, PRON 10, ADV 2, AUX 1, SCONJ 1), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), rah (VERB 20, AUX 1), kayen (VERB 11, ADJ 1), koun (VERB 11, SCONJ 2), faut (VERB 11, ADJ 3, NOUN 1), sont (VERB 11, AUX 1, DET 1), tahya (VERB 11, NOUN 1), makan (VERB 9, NOUN 3)
- est
- c
  - VERB 38: rouhi ya bladi rouhi b e slama c honteux les responssables nt3a albackss
  - PRON 10: bon c po grave douk nab3at equipe yraybouha w tasktoo 3lina ga3
  - ADV 2: star rabbi c non l ka3bét le mleh li 3anda izidou iroho !!!!
  - AUX 1: l essentiel on esper qu’ il va apporter qq et laisser ziani howa li ye3alam meghni c quoi l’ islame
  - SCONJ 1: rabi m3ak bi tawfik inchalah j c pas c j ai le froit de voter
- a
  - ADP 46: M3a boumediene jusqu’ a la mort
  - DET 41: allahoma la tohasibna bima yaf3aloho a tafihin
  - VERB 29: 3andou el hak makache football en algerie alors il a bien fait
  - AUX 17: حجج حجج حجج حجج حجج حجج comme il a dit n
  - INTJ 5: a 3ayeniya vive alg
  - PROPN 1: salam 3likoum khawti les algeriens nchallah ya rabi les verts yfarhouna f l a frique du sud w nroho le 2ème tour b rabi nchallah ad3ou m3ana ya r abi amine
  - X 1: ana dhad roj3 lamouchaia y3atik saha y a kader 02
- rah
- kayen
- koun
- faut
- sont
  - VERB 11: ellah yen3el hed ness ce sont des vrais animaux wallah ma3and’homch 9alb
  - AUX 1: Moutamaniyatona lakom bi Al Fawaz incha Allah tous les marocains Da3awatohom ma3akom ds ce mois de Ramadan al Karim ya Rab demain la victoire pour nos freres Algeriens de meme pour le Maroc contre le Togo meme si les chances pour se qualifier sont minimes de toute facon on sera tres contents au moins de voir l Algerie en coupe du monde
  - DET 1: on ne sait plus koi penser nous les Algeriens qui soutenir un dictateur qui tue sont peuple sans scupule ou bien des revolutionnaires qui travaillent pr les europeens et les americains ?? allah ansor al hak atna al kadir 3ala koul chay
- tahya
- makan
Morphology
The form / lemma ratio of VERB is 1.201431 (the average of all parts of speech is 1.474223).
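The ratio for VERB is consistent with dividing the number of VERB types by the number of VERB lemmas reported at the top of this page. A quick check, using only those two figures (the variable names are illustrative):

```python
verb_lemmas = 1956  # VERB lemmas reported on this page
verb_types = 2350   # VERB types (distinct forms) reported on this page

# Forms per lemma: how many distinct surface forms an average VERB lemma has.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 1.201431
```

A value close to 1 means that most VERB lemmas appear in only one surface form in this treebank.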
The highest number of forms (13) was observed with the lemma “est”: Est-, es, est, et, is, koun, rah, rahi, rahou, yekoun, è, èt, é.
The second highest number of forms (12) was observed with the lemma “fait”: dar, daret, darou, dayer, dayra, dir, diri, fair, fais, fait, fi, khod.
The third highest number of forms (12) was observed with the lemma “vivre”: 3ichi, ViVeeeeeeeeeeeeeeee, n3aichou, n3icho, tahia, tahiati, tahya, viiiiiiiiiiiiiive, viiiiiiiva, viva, vive, vivre.
VERB occurs with 9 features: Number (1715; 49% instances), Person (1715; 49% instances), Gender (1708; 49% instances), Mood (413; 12% instances), Polarity (223; 6% instances), VerbForm (143; 4% instances), AdpType (26; 1% instances), Typo (11; 0% instances), Tense (6; 0% instances)
VERB occurs with 17 feature-value pairs: AdpType=Prep, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Inf, VerbForm=Part
VERB occurs with 40 feature combinations.
The most frequent feature combination is _ (1037 tokens), i.e. no morphological features at all.
Examples: est, rana, c, a, rah, rak, suis, kayen, va, kan
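Feature and feature-combination counts like those above can be derived from the FEATS column of a CoNLL-U file. The sketch below is an illustration under assumptions, not the generator of this page: `verb_features` and `FEATS_SAMPLE` are hypothetical names, and the sample annotation is invented.

```python
from collections import Counter


def verb_features(conllu_text, upos="VERB"):
    """Count feature names and whole feature combinations for one UPOS tag."""
    names, combos = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip anything that is not a regular word line with the requested tag.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = cols[5]              # column 6 is FEATS, "_" when empty
        combos[feats] += 1
        if feats != "_":
            for pair in feats.split("|"):
                names[pair.split("=", 1)[0]] += 1
    return names, combos


# Hypothetical toy sentence; the annotation is invented for illustration.
FEATS_SAMPLE = "\n".join("\t".join(cols) for cols in [
    ("1", "il", "il", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "a", "avoir", "VERB", "_", "Mood=Ind|Number=Sing|Person=3", "0", "root", "_", "_"),
    ("3", "bien", "bien", "ADV", "_", "_", "4", "advmod", "_", "_"),
    ("4", "fait", "faire", "VERB", "_", "_", "2", "xcomp", "_", "_"),
])

feature_names, feature_combos = verb_features(FEATS_SAMPLE)
print(feature_names)   # one count each for Mood, Number and Person
print(feature_combos)  # one token with three features, one with "_"
```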
Relations
VERB nodes are attached to their parents using 20 different relations: parataxis (1281; 37% instances), root (1049; 30% instances), xcomp (518; 15% instances), ccomp (256; 7% instances), conj (185; 5% instances), acl:relcl (98; 3% instances), advcl (62; 2% instances), discourse (5; 0% instances), acl (4; 0% instances), fixed (4; 0% instances), nsubj (4; 0% instances), obj (4; 0% instances), obl (4; 0% instances), csubj (3; 0% instances), nmod (3; 0% instances), vocative (3; 0% instances), amod (1; 0% instances), appos (1; 0% instances), dep (1; 0% instances), dislocated (1; 0% instances)
Parents of VERB nodes belong to 10 different parts of speech: VERB (2157; 62% instances), [root] (1049; 30% instances), NOUN (120; 3% instances), PRON (87; 2% instances), PROPN (32; 1% instances), ADJ (26; 1% instances), ADV (8; 0% instances), INTJ (6; 0% instances), ADP (1; 0% instances), NUM (1; 0% instances). The [root] entry is the artificial root node, which has no tag; its count matches the 1049 root relations above.
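The two distributions above (incoming relation and parent part of speech) can be tallied from the HEAD and DEPREL columns. This is a hedged sketch: `verb_attachments` and `ATTACH_SAMPLE` are hypothetical, the sample annotation is invented, and sentences are assumed to be separated by exactly one empty line.

```python
from collections import Counter


def verb_attachments(conllu_text, upos="VERB"):
    """Count incoming relations and parent UPOS tags for nodes with the given tag."""
    relations, parent_pos = Counter(), Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in sentence.splitlines()]
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        pos_by_id = {r[0]: r[3] for r in rows}   # token ID -> UPOS
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1             # column 8 is DEPREL
                # HEAD "0" is the artificial root, which has no UPOS of its own.
                parent_pos[pos_by_id.get(r[6], "ROOT")] += 1
    return relations, parent_pos


# Hypothetical toy sentence; the annotation is invented for illustration.
ATTACH_SAMPLE = "\n".join("\t".join(cols) for cols in [
    ("1", "il", "il", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "a", "avoir", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "bien", "bien", "ADV", "_", "_", "4", "advmod", "_", "_"),
    ("4", "fait", "faire", "VERB", "_", "_", "2", "xcomp", "_", "_"),
])

rels, parents = verb_attachments(ATTACH_SAMPLE)
print(rels)     # one root and one xcomp
print(parents)  # one ROOT parent and one VERB parent
```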
136 (4%) VERB nodes are leaves.
778 (22%) VERB nodes have one child.
1003 (29%) VERB nodes have two children.
1570 (45%) VERB nodes have three or more children.
The highest child degree of a VERB node is 15.
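The four child-degree figures above sum to the 3487 VERB tokens (136 + 778 + 1003 + 1570). A sketch of how such buckets could be computed from the HEAD column follows; `verb_child_degrees` and `DEGREE_SAMPLE` are hypothetical names, the sample annotation is invented, and sentences are assumed to be separated by exactly one empty line.

```python
from collections import Counter


def verb_child_degrees(conllu_text, upos="VERB"):
    """Bucket nodes with the given tag by number of children: "0", "1", "2" or "3+"."""
    buckets = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in sentence.splitlines()]
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # Each HEAD value names a parent, so counting HEADs gives children per node.
        children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                n = children[r[0]]
                buckets["3+" if n >= 3 else str(n)] += 1
    return dict(buckets)


# Hypothetical toy sentence; the annotation is invented for illustration.
DEGREE_SAMPLE = "\n".join("\t".join(cols) for cols in [
    ("1", "il", "il", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "a", "avoir", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "bien", "bien", "ADV", "_", "_", "4", "advmod", "_", "_"),
    ("4", "fait", "faire", "VERB", "_", "_", "2", "xcomp", "_", "_"),
])

print(verb_child_degrees(DEGREE_SAMPLE))  # {'2': 1, '1': 1}

# Consistency check on the figures reported on this page.
reported = {"0": 136, "1": 778, "2": 1003, "3+": 1570}
print(sum(reported.values()))  # 3487
```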
Children of VERB nodes are attached using 30 different relations: parataxis (1779; 19% instances), obj (1561; 17% instances), nsubj (1351; 15% instances), obl (1108; 12% instances), advmod (518; 6% instances), xcomp (517; 6% instances), cc (437; 5% instances), discourse (419; 5% instances), ccomp (259; 3% instances), vocative (194; 2% instances), mark (193; 2% instances), conj (192; 2% instances), punct (139; 2% instances), case (102; 1% instances), nmod (83; 1% instances), expl (59; 1% instances), advcl (58; 1% instances), iobj (38; 0% instances), aux (28; 0% instances), dislocated (28; 0% instances), dep (25; 0% instances), amod (20; 0% instances), expl:pv (12; 0% instances), goeswith (10; 0% instances), acl:relcl (8; 0% instances), csubj (4; 0% instances), det (4; 0% instances), compound (2; 0% instances), nummod (2; 0% instances), flat (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (2286; 25% instances), VERB (2157; 24% instances), PRON (1318; 14% instances), PROPN (1136; 12% instances), CCONJ (439; 5% instances), ADV (433; 5% instances), INTJ (404; 4% instances), ADJ (339; 4% instances), SCONJ (146; 2% instances), ADP (140; 2% instances), PUNCT (139; 2% instances), PART (120; 1% instances), NUM (41; 0% instances), AUX (29; 0% instances), X (14; 0% instances), DET (10; 0% instances)