home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: VERB

There are 1570 VERB lemmas (10%), 4085 VERB types (15%) and 21296 VERB tokens (8%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: قَال، كَان، تَمّ، أَكَّد، أَعلَن، أَضَاف، أَشَار، وَصَل، ذَكَر، بَلَغ

The 10 most frequent VERB types: قال، كان، أضاف، كانت، تم، أكد، يتم، يمكن، أشار، أوضح

The 10 most frequent ambiguous lemmas: كَان (VERB 911, AUX 386), ذَكَر (VERB 252, NOUN 3), أَوضَح (VERB 211, ADJ 1), ضَمّ (VERB 111, NOUN 18), عَاد (VERB 95, AUX 1), طَلَب (NOUN 174, VERB 86), لَيس (AUX 118, VERB 74), عَدّ (VERB 60, NOUN 1), حَدَث (VERB 52, NOUN 48), هَدَف (NOUN 153, VERB 52)

The 10 most frequent ambiguous types: كان (VERB 378, AUX 118), كانت (VERB 251, AUX 61), يكون (VERB 99, AUX 89), ذكر (VERB 98, NOUN 13), تكون (VERB 77, AUX 75), تقدم (VERB 70, NOUN 12), عقد (NOUN 100, VERB 69, X 1), كشف (VERB 62, NOUN 15), تقوم (VERB 59, X 3), ليس (AUX 83, VERB 59)

Morphology

The form / lemma ratio of VERB is 2.601911 (the average of all parts of speech is 1.761966).

The 1st highest number of forms (16) was observed with the lemma “أَصَاب”: أصاب, أصابت, أصيب, أصيبا, أصيبت, أصيبوا, اصاب, اصبن, اصيب, اصيبت, اصيبوا, تصاب, تصيب, يصابوا, يصب, يصيب.

The 2nd highest number of forms (16) was observed with the lemma “أَكَّد”: أكد, أكدا, أكدت, أكدتا, أكدنا, أكدوا, أكّد, اكد, اكدت, اكدوا, اكـــدت, تؤكد, يؤكد, يؤكدون, يؤكّد, يــؤكد.

The 3rd highest number of forms (15) was observed with the lemma “كَان”: أكون, اكن, تكـــون, تكن, تكون, كان, كانا, كانت, كانتا, كانوا, كنا, كنت, يكن, يكون, يكونوا.

VERB occurs with 7 features: Gender (21296; 100% instances), Number (21296; 100% instances), Aspect (21244; 100% instances), Person (21244; 100% instances), Voice (21244; 100% instances), Mood (10081; 47% instances), VerbForm (10081; 47% instances)

VERB occurs with 17 feature-value pairs: Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, VerbForm=Fin, Voice=Act, Voice=Pass

VERB occurs with 65 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|Person=3|Voice=Act (6575 tokens). Examples: قال، كان، أضاف، تم، أكد، أشار، أوضح، أعلن، جاء، ذكر

Relations

VERB nodes are attached to their parents using 14 different relations: parataxis (5078; 24% instances), conj (3124; 15% instances), ccomp (3093; 15% instances), acl (3077; 14% instances), acl:relcl (2178; 10% instances), root (1904; 9% instances), advcl (1611; 8% instances), xcomp (676; 3% instances), csubj (515; 2% instances), dep (18; 0% instances), appos (13; 0% instances), fixed (5; 0% instances), orphan (3; 0% instances), csubj:pass (1; 0% instances)

Parents of VERB nodes belong to 17 different parts of speech: VERB (7604; 36% instances), NOUN (5158; 24% instances), CCONJ (4368; 21% instances), (1904; 9% instances), ADJ (738; 3% instances), X (632; 3% instances), DET (553; 3% instances), NUM (165; 1% instances), PRON (50; 0% instances), ADV (39; 0% instances), PART (35; 0% instances), SCONJ (33; 0% instances), ADP (9; 0% instances), PROPN (4; 0% instances), AUX (2; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)

114 (1%) VERB nodes are leaves.

2061 (10%) VERB nodes have one child.

5567 (26%) VERB nodes have two children.

13554 (64%) VERB nodes have three or more children.

The highest child degree of a VERB node is 23.

Children of VERB nodes are attached using 32 different relations: nsubj (14672; 21% instances), obl (11066; 16% instances), obj (8029; 12% instances), obl:arg (6555; 10% instances), mark (6193; 9% instances), punct (3824; 6% instances), cc (3453; 5% instances), conj (3026; 4% instances), ccomp (2722; 4% instances), advmod (1855; 3% instances), xcomp (1681; 2% instances), aux (1444; 2% instances), advcl (1180; 2% instances), nsubj:pass (775; 1% instances), advmod:emph (516; 1% instances), csubj (349; 1% instances), parataxis (306; 0% instances), nmod (260; 0% instances), dep (199; 0% instances), dislocated (167; 0% instances), aux:pass (110; 0% instances), iobj (107; 0% instances), appos (39; 0% instances), acl (35; 0% instances), nummod (23; 0% instances), amod (19; 0% instances), det (12; 0% instances), case (5; 0% instances), orphan (3; 0% instances), acl:relcl (1; 0% instances), csubj:pass (1; 0% instances), discourse (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (29281; 43% instances), VERB (7604; 11% instances), CCONJ (4729; 7% instances), SCONJ (4112; 6% instances), PUNCT (3824; 6% instances), X (3362; 5% instances), PRON (3283; 5% instances), DET (3123; 5% instances), ADJ (2635; 4% instances), NUM (1728; 3% instances), PART (1664; 2% instances), AUX (1554; 2% instances), ADP (1168; 2% instances), ADV (532; 1% instances), PROPN (19; 0% instances), SYM (6; 0% instances), INTJ (4; 0% instances)