This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

CONJ: coordinating conjunction

This document is a placeholder for the language-specific documentation for CONJ.


Treebank Statistics (UD_Arabic)

There are 48 CONJ lemmas (0%), 102 CONJ types (0%) and 23968 CONJ tokens (8%). Out of 16 observed tags, CONJ ranks 7th in number of lemmas, 7th in number of types and 4th in number of tokens.
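
As an illustration of how counts like these can be obtained, here is a minimal sketch that tallies distinct lemmas, distinct word forms (types) and tokens for one tag. It assumes tab-separated CoNLL-U input with the form, lemma and universal POS tag in columns 2 to 4. The function name `pos_counts` and the toy rows in `SAMPLE` are invented for the example and are not taken from the treebank or from the tool that generated this page.

```python
def pos_counts(conllu_text, tag):
    """Return (number of lemmas, number of types, number of tokens) for one POS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) < 4 or not cols[0].isdigit():
            continue  # skip malformed lines and multiword-token ranges such as "1-2"
        if cols[3] == tag:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy data (hypothetical): ID, FORM, LEMMA, UPOSTAG, padded to 10 columns.
rows = [
    ("1", "w", "wa", "CONJ"),
    ("2", "ktb", "kataba", "VERB"),
    ("3", "wa", "wa", "CONJ"),
    ("4", "w", "wa", "CONJ"),
]
SAMPLE = "\n".join("\t".join(r + ("_",) * 6) for r in rows)

print(pos_counts(SAMPLE, "CONJ"))
```

Run over the full treebank file, a function of this kind would be expected to reproduce the three counts above, provided the tokenization and tag column match the assumptions.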

The 10 most frequent CONJ lemmas: وَ، أَنَّ، أَن، إِنَّ، فَ، أَو، كَمَا، حَيثُ، لٰكِنَّ، لِ

The 10 most frequent CONJ types: و، أن، ان، ف، إن، كما، أو، حيث، ل، او

The most frequent ambiguous lemmas: إِنَّ (CONJ 934, PART 200), لِ (ADP 6661, CONJ 202, PART 1), حَتَّى (ADP 176, ADV 65, CONJ 50), أَي (CONJ 38, X 13), إِن (CONJ 20, X 13)

The 10 most frequent ambiguous types: و (CONJ 15052, X 3), أن (CONJ 2896, X 3, VERB 1), ان (CONJ 1956, PART 30, X 11, VERB 1), ف (CONJ 580, X 48), إن (CONJ 571, PART 170, X 3), أو (CONJ 300, X 3), ل (ADP 6520, CONJ 202, PART 23, X 2), او (CONJ 157, X 1), إذا (CONJ 122, ADV 1), لكن (CONJ 104, X 25)

Morphology

The form / lemma ratio of CONJ is 2.125000 (the average of all parts of speech is 1.685612).
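
The reported ratio is consistent with dividing the number of CONJ types by the number of CONJ lemmas given in the statistics above (102 and 48). The short check below makes that arithmetic explicit; the helper name `form_lemma_ratio` is chosen for illustration only, and the reading of the ratio as types divided by lemmas is an inference from the numbers.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# 102 CONJ types and 48 CONJ lemmas, as reported for UD_Arabic.
ratio = form_lemma_ratio(102, 48)
print(f"{ratio:.6f}")  # prints 2.125000
```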

The highest number of forms (42) was observed with the lemma “وَ”: و, وأسلم, وأفريقيا, وأوروبا, وإسرائيل, وإيطاليا, واسرائيل, واعتدال, والأردن, والاستخبارات, والامارات, والاميركية, والبرازيل, والبورصة, والتجارة, والتضامن, والتوجيه, والجودة, والسعودية, والصحة, والعمل, والغاز, والفاحشة, واللحوم, والمتوسط, والمتوسطة, والمجر, والمحلي, والنحاس, والنسيج, والهند, والهوية, وبوش, وجونز, وسامراء, وغربه, وقرغيزستان, ولبنان, ومصر, ومنوعة, ونيجيريا, وهي.

The 2nd highest number of forms (3) was observed with the lemma “أَي”: أي, اى, اي.

The 3rd highest number of forms (3) was observed with the lemma “إِنَّ”: أن, إن, ان.

CONJ does not occur with any features.

Relations

CONJ nodes are attached to their parents using 17 different relations: cc (12859; 54% instances), mark (6174; 26% instances), root (4117; 17% instances), advmod (317; 1% instances), advmod:emph (244; 1% instances), mwe (155; 1% instances), case (24; 0% instances), conj (18; 0% instances), dep (15; 0% instances), cop (12; 0% instances), dobj (10; 0% instances), nmod (8; 0% instances), aux (6; 0% instances), iobj (4; 0% instances), nsubj (2; 0% instances), parataxis (2; 0% instances), punct (1; 0% instances)
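
The percentages in this list can be re-derived from the raw counts. The sketch below copies the 17 relation counts from the sentence above and assumes that each percentage is the count divided by the total number of CONJ tokens, rounded to the nearest integer. The names `CONJ_RELATIONS` and `shares` are illustrative.

```python
# Relation counts for CONJ nodes, copied from the list above.
CONJ_RELATIONS = {
    "cc": 12859, "mark": 6174, "root": 4117, "advmod": 317,
    "advmod:emph": 244, "mwe": 155, "case": 24, "conj": 18,
    "dep": 15, "cop": 12, "dobj": 10, "nmod": 8, "aux": 6,
    "iobj": 4, "nsubj": 2, "parataxis": 2, "punct": 1,
}


def shares(counts):
    """Map each key to its rounded percentage of the overall total."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


total = sum(CONJ_RELATIONS.values())
pct = shares(CONJ_RELATIONS)
print(total, pct["cc"], pct["mark"], pct["root"])
```

The counts sum to 23968, the number of CONJ tokens, which is why the very rare relations round down to 0%.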

Parents of CONJ nodes belong to 13 different parts of speech: VERB (8501; 35% instances), NOUN (6946; 29% instances), ROOT (4117; 17% instances), ADJ (1666; 7% instances), X (1062; 4% instances), CONJ (549; 2% instances), NUM (527; 2% instances), PRON (237; 1% instances), ADV (141; 1% instances), PART (135; 1% instances), ADP (64; 0% instances), DET (12; 0% instances), AUX (11; 0% instances)

19165 (80%) CONJ nodes are leaves.

547 (2%) CONJ nodes have one child.

3693 (15%) CONJ nodes have two children.

563 (2%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 26.
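
The child-degree figures above (leaves, one child, two children, three or more) amount to counting how often each node is named as a head. A minimal sketch for a single sentence follows, assuming the heads are given as a list of 1-based indices with 0 marking the root, as in the HEAD column of CoNLL-U. The function `degree_buckets` and the toy sentence are hypothetical.

```python
from collections import Counter


def degree_buckets(heads):
    """Bucket the nodes of one sentence by their number of children."""
    # Number of children per node: how often each index occurs as a head.
    degree = Counter(h for h in heads if h != 0)
    buckets = Counter()
    for node in range(1, len(heads) + 1):
        d = degree[node]  # 0 for nodes that never occur as a head
        if d == 0:
            buckets["leaf"] += 1
        elif d == 1:
            buckets["one"] += 1
        elif d == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets


# Toy sentence: node 2 is the root with children 1, 3 and 5; node 5 has child 4.
toy = degree_buckets([2, 0, 2, 5, 2])
print(dict(toy))
```

To obtain per-tag figures like those on this page, the same bucketing would be restricted to nodes tagged CONJ and summed over all sentences.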

Children of CONJ nodes are attached using 19 different relations: punct (4809; 44% instances), parataxis (4524; 42% instances), cc (595; 6% instances), mwe (421; 4% instances), nsubj (185; 2% instances), advcl (58; 1% instances), dep (58; 1% instances), ccomp (36; 0% instances), nmod (28; 0% instances), dobj (23; 0% instances), case (16; 0% instances), appos (15; 0% instances), advmod:emph (11; 0% instances), conj (9; 0% instances), acl (6; 0% instances), advmod (5; 0% instances), mark (5; 0% instances), csubj (2; 0% instances), aux (1; 0% instances)

Children of CONJ nodes belong to 13 different parts of speech: PUNCT (4808; 44% instances), VERB (4311; 40% instances), CONJ (549; 5% instances), PRON (443; 4% instances), NOUN (288; 3% instances), ADJ (162; 1% instances), X (116; 1% instances), PART (62; 1% instances), ADV (24; 0% instances), ADP (22; 0% instances), NUM (20; 0% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)

