home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PUD: POS Tags: VERB

There are 585 VERB lemmas (12%), 1015 VERB types (14%) and 1783 VERB tokens (9%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: kAn-u_1، qAl-u_1، >amokan_1، bada>-a_1، EAd-u_1، Zahar-a_1، tam~-i_1، Earaf-i_1، $ak~al_1، Eamil-a_1

The 10 most frequent VERB types: كان، كانت، يمكن، قال، يكون، تم، تكون، يكن، بدأت، قالت

The 10 most frequent ambiguous lemmas: kAn-u_1 (VERB 185, AUX 86, PROPN 1), EAd-u_1 (VERB 21, AUX 3), $ak~al_1 (VERB 13, NOUN 2, AUX 1), >aEolan_1 (VERB 10, AUX 1), ra>aY-a_1 (VERB 10, PROPN 3), Hamal-i_1 (VERB 9, NOUN 3), badA-u_1 (VERB 9, AUX 1), bAt-i_1 (AUX 7, VERB 7), katab-u_1 (VERB 6, NOUN 1), qad~am_1 (VERB 6, NOUN 1)

The 10 most frequent ambiguous types: كان (VERB 77, AUX 36), كانت (VERB 48, AUX 19), يكون (VERB 16, AUX 15), تكون (VERB 12, AUX 5), يكن (VERB 12, AUX 1), كانوا (VERB 11, AUX 2), بات (VERB 7, AUX 2), تشكل (VERB 5, NOUN 2), تكن (AUX 4, VERB 4), ظهر (VERB 4, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.735043 (the average of all parts of speech is 1.409765).

The 1st highest number of forms (12) was observed with the lemma “kAn-u_1”: أكن, تكن, تكون, كان, كانت, كانتا, كانوا, كنت, نكون, يكن, يكون, يكونون.

The 2nd highest number of forms (7) was observed with the lemma “EAd-u_1”: تعد, تعود, عاد, عادت, عدنا, نعود, يعود.

The 3rd highest number of forms (7) was observed with the lemma “qAl-u_1”: تقول, قال, قالت, قيل, نقل, يقال, يقول.

VERB occurs with 9 features: Number (1782; 100% instances), Person (1781; 100% instances), Voice (1781; 100% instances), Aspect (1780; 100% instances), Tense (1780; 100% instances), Gender (1707; 96% instances), Mood (884; 50% instances), Case (2; 0% instances), Definite (2; 0% instances)

VERB occurs with 22 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Nom, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, Voice=Act, Voice=Pass

VERB occurs with 60 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|Person=3|Tense=Past|Voice=Act (437 tokens). Examples: كان، قال، تم، بدأ، أدى، بات، حدث، التقى، قرر، استمر

Relations

VERB nodes are attached to their parents using 17 different relations: root (776; 44% instances), acl:relcl (283; 16% instances), advcl (205; 11% instances), conj (166; 9% instances), ccomp (104; 6% instances), aux (94; 5% instances), xcomp (69; 4% instances), csubj (49; 3% instances), parataxis (18; 1% instances), fixed (4; 0% instances), cop (3; 0% instances), csubj:pass (3; 0% instances), nmod:gmod (3; 0% instances), appos (2; 0% instances), obj (2; 0% instances), dep (1; 0% instances), obl (1; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: (776; 44% instances), VERB (564; 32% instances), NOUN (300; 17% instances), PRON (41; 2% instances), ADJ (37; 2% instances), PROPN (34; 2% instances), ADP (25; 1% instances), PART (4; 0% instances), ADV (2; 0% instances)

99 (6%) VERB nodes are leaves.

90 (5%) VERB nodes have one child.

239 (13%) VERB nodes have two children.

1355 (76%) VERB nodes have three or more children.

The highest child degree of a VERB node is 9.

Children of VERB nodes are attached using 29 different relations: obl (1364; 22% instances), punct (1148; 18% instances), nsubj (1003; 16% instances), obj (576; 9% instances), mark (367; 6% instances), advmod (276; 4% instances), compound:prt (274; 4% instances), advcl (212; 3% instances), conj (175; 3% instances), cc (165; 3% instances), ccomp (162; 3% instances), nsubj:pass (125; 2% instances), aux (90; 1% instances), obl:tmod (75; 1% instances), xcomp (68; 1% instances), case (56; 1% instances), dep (36; 1% instances), csubj (35; 1% instances), parataxis (18; 0% instances), acl (13; 0% instances), iobj (8; 0% instances), csubj:pass (7; 0% instances), expl (4; 0% instances), dislocated (2; 0% instances), orphan (2; 0% instances), amod (1; 0% instances), cc:preconj (1; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (2305; 37% instances), PUNCT (1148; 18% instances), VERB (564; 9% instances), PRON (456; 7% instances), PROPN (395; 6% instances), PART (390; 6% instances), ADP (321; 5% instances), ADV (237; 4% instances), CCONJ (165; 3% instances), ADJ (130; 2% instances), SCONJ (112; 2% instances), NUM (36; 1% instances), SYM (5; 0% instances), X (1; 0% instances)