
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: CCONJ

There are 47 CCONJ lemmas (0%), 102 CCONJ types (0%) and 19784 CCONJ tokens (7%). Out of 17 observed tags, the rank of CCONJ is: 8 in number of lemmas, 7 in number of types and 6 in number of tokens.
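As a rough illustration of how the three counts above relate, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag from CoNLL-U text. It assumes the standard 10-column CoNLL-U layout and compares forms case-sensitively; the function name `tag_counts` and the toy sample are hypothetical, and this is not the script that generated this page.

```python
def tag_counts(conllu_text, upos="CCONJ"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(forms), tokens


# Toy sample (hypothetical, not taken from the treebank).
rows = [
    ["1", "و", "وَ", "CCONJ", "_", "_", "2", "cc", "_", "_"],
    ["2", "قال", "قَال", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "أو", "أَو", "CCONJ", "_", "_", "2", "cc", "_", "_"],
    ["4", "او", "أَو", "CCONJ", "_", "_", "2", "cc", "_", "_"],
]
sample = "# text = toy\n" + "\n".join("\t".join(r) for r in rows) + "\n"

n_lemmas, n_forms, n_tokens = tag_counts(sample)
print(n_lemmas, n_forms, n_tokens)
```

In the toy sample the two spellings أو and او share one lemma, which is the same effect that makes the type count on this page larger than the lemma count.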

The 10 most frequent CCONJ lemmas: وَ، فَ، أَو، كَمَا، حَيثُ، لٰكِنَّ، لِ، إِذَا، لِأَنَّ، مِمَّا

The 10 most frequent CCONJ types: و، ف، كما، أو، حيث، ل، لٰكن، او، إذا، لأن

The 10 most frequent ambiguous lemmas: لِ (ADP 6946, CCONJ 210, PART 1), حَتَّى (ADP 176, ADV 65, CCONJ 50), أَي (CCONJ 40, X 13)

The 10 most frequent ambiguous types: و (CCONJ 16175, X 3), ف (CCONJ 637, X 48), كما (CCONJ 385, PRON 2), أو (CCONJ 300, X 3), ل (ADP 6805, CCONJ 210, PART 24, X 2), او (CCONJ 157, X 1), إذا (CCONJ 122, ADV 1), لكن (CCONJ 104, X 23, ADV 2), ثم (CCONJ 86, ADV 12), اذا (CCONJ 76, ADV 1)

Morphology

The form / lemma ratio of CCONJ is 2.170213 (the average of all parts of speech is 1.761966).
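The reported ratio agrees with dividing the number of CCONJ types by the number of CCONJ lemmas given at the top of the page. A minimal check, assuming that this is how the figure is derived:

```python
cconj_types = 102   # distinct word forms tagged CCONJ (from this page)
cconj_lemmas = 47   # distinct lemmas tagged CCONJ (from this page)

ratio = cconj_types / cconj_lemmas
print(f"{ratio:.6f}")
```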

The 1st highest number of forms (42) was observed with the lemma “وَ”: و, وأسلم, وأفريقيا, وأوروبا, وإسرائيل, وإيطاليا, واسرائيل, واعتدال, والأردن, والاستخبارات, والامارات, والاميركية, والبرازيل, والبورصة, والتجارة, والتضامن, والتوجيه, والجودة, والسعودية, والصحة, والعمل, والغاز, والفاحشة, واللحوم, والمتوسط, والمتوسطة, والمجر, والمحلي, والنحاس, والنسيج, والهند, والهوية, وبوش, وجونز, وسامراء, وغربه, وقرغيزستان, ولبنان, ومصر, ومنوعة, ونيجيريا, وهي.

The 2nd highest number of forms (3) was observed with the lemma “أَي”: أي, اى, اي.

The 3rd highest number of forms (2) was observed with the lemma “أَم”: أم, ام.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 13 different relations: cc (13693; 69% instances), root (4145; 21% instances), mark (1251; 6% instances), advmod (349; 2% instances), case (154; 1% instances), fixed (87; 0% instances), advmod:emph (68; 0% instances), dep (12; 0% instances), nmod (11; 0% instances), conj (6; 0% instances), orphan (4; 0% instances), nsubj (3; 0% instances), obl:arg (1; 0% instances)
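The percentages in the list above appear to be each relation's count divided by the total number of CCONJ tokens, rounded to a whole percent. The sketch below recomputes them from the counts on this page; the variable names are illustrative only.

```python
# Counts of incoming relations for CCONJ nodes, copied from this page.
relation_counts = {
    "cc": 13693, "root": 4145, "mark": 1251, "advmod": 349, "case": 154,
    "fixed": 87, "advmod:emph": 68, "dep": 12, "nmod": 11, "conj": 6,
    "orphan": 4, "nsubj": 3, "obl:arg": 1,
}

total = sum(relation_counts.values())  # should equal the CCONJ token count
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

The counts sum to 19784, the CCONJ token total, so every CCONJ node is covered by exactly one incoming relation.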

Parents of CCONJ nodes belong to 15 different parts of speech (counting the artificial root, which has no tag, as one): NOUN (7207; 36% instances), VERB (4729; 24% instances), [root] (4145; 21% instances), ADJ (1320; 7% instances), X (765; 4% instances), NUM (517; 3% instances), CCONJ (496; 3% instances), DET (205; 1% instances), PART (113; 1% instances), PRON (112; 1% instances), ADV (70; 0% instances), ADP (57; 0% instances), PROPN (29; 0% instances), SCONJ (17; 0% instances), INTJ (2; 0% instances)

15379 (78%) CCONJ nodes are leaves.

222 (1%) CCONJ nodes have one child.

3609 (18%) CCONJ nodes have two children.

574 (3%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 25.
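The leaf, one-child, two-children and three-or-more figures above can be thought of as a histogram over the number of dependents per CCONJ node. A minimal sketch of that bucketing, assuming each sentence is given as `(id, upos, head)` tuples (a simplified stand-in for CoNLL-U rows; the function name and the toy tree are hypothetical):

```python
from collections import Counter


def child_degree_buckets(sentence, upos="CCONJ"):
    """Bucket nodes with the given tag by their number of children."""
    # Count how many nodes name each id as their head.
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, tag, _ in sentence:
        if tag == upos:
            degree = n_children[node_id]
            buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


# Toy tree: node 1 is a CCONJ root with two children, node 3 is a CCONJ leaf.
toy = [
    (1, "CCONJ", 0),
    (2, "VERB", 1),
    (3, "CCONJ", 4),
    (4, "NOUN", 2),
    (5, "PUNCT", 1),
]
buckets = child_degree_buckets(toy)
print(dict(buckets))
```

On this page the four buckets (15379, 222, 3609 and 574) add up to 19784, the total number of CCONJ tokens.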

Children of CCONJ nodes are attached using 23 different relations: punct (4830; 47% instances), parataxis (4514; 44% instances), cc (476; 5% instances), fixed (234; 2% instances), advcl (97; 1% instances), dep (58; 1% instances), nsubj (26; 0% instances), obl (26; 0% instances), nmod (17; 0% instances), obl:arg (16; 0% instances), case (11; 0% instances), acl (9; 0% instances), obj (9; 0% instances), appos (4; 0% instances), ccomp (4; 0% instances), conj (4; 0% instances), orphan (3; 0% instances), csubj (2; 0% instances), mark (2; 0% instances), advmod (1; 0% instances), advmod:emph (1; 0% instances), amod (1; 0% instances), det (1; 0% instances)

Children of CCONJ nodes belong to 15 different parts of speech: PUNCT (4830; 47% instances), VERB (4368; 42% instances), CCONJ (496; 5% instances), NOUN (168; 2% instances), ADP (135; 1% instances), ADJ (122; 1% instances), DET (58; 1% instances), X (48; 0% instances), SCONJ (39; 0% instances), PART (21; 0% instances), PRON (20; 0% instances), ADV (19; 0% instances), NUM (18; 0% instances), PROPN (3; 0% instances), INTJ (1; 0% instances)