
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: SCONJ

There are 5 SCONJ lemmas (0%), 5 SCONJ types (0%) and 5475 SCONJ tokens (2%). Out of 17 observed tags, the rank of SCONJ is: 14 in number of lemmas, 16 in number of types and 11 in number of tokens.
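As a rough illustration of how counts like these could be reproduced, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The helper names (`row`, `read_tokens`, `upos_summary`) and the toy sentence are assumptions made for this example. The script that generated this page may differ, for instance in how it normalizes forms.

```python
from collections import Counter

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns are '_')."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

# Toy sentence invented for illustration; w1..w3 are placeholder words.
SAMPLE = "\n".join([
    "# text = toy example",
    row(1, "w1", "w1", "VERB", 0, "root"),
    row(2, "إن", "إِنَّ", "SCONJ", 3, "mark"),
    row(3, "w2", "w2", "VERB", 1, "ccomp"),
    row(4, "أن", "أَنَّ", "SCONJ", 5, "mark"),
    row(5, "w3", "w3", "VERB", 3, "ccomp"),
    "",
])

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]

def upos_summary(conllu_text, upos):
    """Count distinct lemmas, distinct forms and tokens carrying `upos`."""
    forms, lemmas = Counter(), Counter()
    all_tokens = 0
    for form, lemma, tag in read_tokens(conllu_text):
        all_tokens += 1
        if tag == upos:
            forms[form] += 1
            lemmas[lemma] += 1
    return {
        "lemmas": len(lemmas),
        "types": len(forms),
        "tokens": sum(forms.values()),
        "all_tokens": all_tokens,
    }

print(upos_summary(SAMPLE, "SCONJ"))
```

Dividing `tokens` by `all_tokens` would give the token share reported in parentheses.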

The most frequent SCONJ lemmas (all 5): أَنَّ، أَن، إِنَّ، إِن، إِذَا

The most frequent SCONJ types (all 5): أن، ان، إن، اذا، إذا

The 10 most frequent ambiguous lemmas: إِنَّ (SCONJ 945, PART 212), إِن (SCONJ 20, X 13), إِذَا (CCONJ 180, SCONJ 18)

The 10 most frequent ambiguous types: أن (SCONJ 2919, CCONJ 4, X 3, VERB 1), ان (SCONJ 1956, PART 30, X 11, VERB 1), إن (SCONJ 582, PART 182, X 3), اذا (CCONJ 66, SCONJ 10, ADV 1), إذا (CCONJ 114, SCONJ 8, ADV 1)
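The ambiguity lists above group each form (or lemma) by the tags it was observed with. A minimal sketch of that grouping, assuming the input is a flat list of (form, UPOS) observations; the function name `ambiguous_types` and the toy data are hypothetical:

```python
from collections import Counter, defaultdict

def ambiguous_types(pairs, upos):
    """Return {form: Counter of tags} for forms seen with `upos` and at least one other tag."""
    by_form = defaultdict(Counter)
    for form, tag in pairs:
        by_form[form][tag] += 1
    return {f: c for f, c in by_form.items() if upos in c and len(c) > 1}

# Toy observations invented for illustration; "w1" is a placeholder word.
PAIRS = [
    ("إن", "SCONJ"), ("إن", "SCONJ"), ("إن", "PART"),
    ("أن", "SCONJ"),
    ("w1", "ADP"),
]

result = ambiguous_types(PAIRS, "SCONJ")
for form, tags in result.items():
    # most_common() orders tags by frequency, as in the lists above
    print(form, tags.most_common())
```

Only the first form is reported here, because the second occurs with a single tag and the third never occurs as SCONJ.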

Morphology

The form / lemma ratio of SCONJ is 1.000000 (the average of all parts of speech is 1.761981).

The highest number of forms (3) was observed with the lemma “إِنَّ”: أن, إن, ان.

The 2nd highest number of forms (2) was observed with the lemma “أَن”: أن, ان.

The 3rd highest number of forms (2) was observed with the lemma “أَنَّ”: أن, ان.
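A ratio of 1.0 can coexist with lemmas that have two or three forms if the ratio is computed as distinct types divided by distinct lemmas, because the same form is shared by several lemmas. That formula is an assumption; it is consistent with the 5 types and 5 lemmas reported above, but the generating script is not shown here. The sketch uses only the three lemma/form sets listed above:

```python
from collections import defaultdict

# (lemma, form) pairs taken from the three lemmas listed above.
PAIRS = [
    ("إِنَّ", "أن"), ("إِنَّ", "إن"), ("إِنَّ", "ان"),
    ("أَن", "أن"), ("أَن", "ان"),
    ("أَنَّ", "أن"), ("أَنَّ", "ان"),
]

def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    table = defaultdict(set)
    for lemma, form in pairs:
        table[lemma].add(form)
    return table

def type_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition of the ratio)."""
    types = {form for _, form in pairs}
    lemmas = {lemma for lemma, _ in pairs}
    return len(types) / len(lemmas)

table = forms_per_lemma(PAIRS)
# Rank lemmas by number of forms, ties broken by lemma string for determinism.
ranked = sorted(((lemma, len(forms)) for lemma, forms in table.items()),
                key=lambda item: (-item[1], item[0]))
form_counts = [n for _, n in ranked]

print(form_counts)               # [3, 2, 2]
print(type_lemma_ratio(PAIRS))   # 1.0
```

Three lemmas share three distinct forms, so the ratio is 1.0 even though no lemma has fewer than two forms.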

SCONJ occurs with 1 feature: ExtPos (334; 6% instances)

SCONJ occurs with 1 feature-value pair: ExtPos=SCONJ

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (5141 tokens). Examples: أن، ان، إن، اذا، إذا

Relations

SCONJ nodes are attached to their parents using 11 different relations: mark (5179; 95% instances), cc (133; 2% instances), fixed (116; 2% instances), case (11; 0% instances), conj (11; 0% instances), obj (10; 0% instances), dep (5; 0% instances), nmod (3; 0% instances), root (3; 0% instances), obl:arg (2; 0% instances), parataxis (2; 0% instances)
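The percentages above appear to be each relation's share of all SCONJ tokens, rounded to the nearest whole percent. The following sketch checks that reading against the listed counts; the rounding rule is an assumption.

```python
# Relation counts copied from the list above.
RELATIONS = {
    "mark": 5179, "cc": 133, "fixed": 116, "case": 11, "conj": 11, "obj": 10,
    "dep": 5, "nmod": 3, "root": 3, "obl:arg": 2, "parataxis": 2,
}

total = sum(RELATIONS.values())
# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in RELATIONS.items()}

print(total)                                           # 5475
print(shares["mark"], shares["cc"], shares["fixed"])   # 95 2 2
```

The total matches the 5475 SCONJ tokens reported at the top of the page, so every SCONJ token is covered by exactly one of the 11 relations.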

Parents of SCONJ nodes belong to 13 different parts of speech: VERB (4112; 75% instances), NOUN (478; 9% instances), ADJ (381; 7% instances), X (195; 4% instances), ADV (85; 2% instances), ADP (44; 1% instances), PRON (43; 1% instances), DET (41; 1% instances), CCONJ (39; 1% instances), PART (31; 1% instances), NUM (15; 0% instances), SCONJ (8; 0% instances), no parent tag, i.e. attached to the artificial root (3; 0% instances)
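The 3 instances in the last entry match the 3 `root` attachments listed under relations, which suggests they are nodes attached to the artificial root and therefore have no parent tag. A minimal sketch of how such a tally could be collected, assuming each sentence is given as (id, UPOS, head) triples; the function name and toy tree are hypothetical:

```python
from collections import Counter

def parent_upos_counts(sentence, upos):
    """sentence: list of (id, upos, head) triples for one tree; head 0 means root."""
    tag_of = {idx: tag for idx, tag, _ in sentence}
    counts = Counter()
    for _, tag, head in sentence:
        if tag == upos:
            # A node attached to the artificial root (head 0) has no parent token,
            # so its parent tag is recorded as the empty string.
            counts[tag_of.get(head, "")] += 1
    return counts

# Toy tree invented for illustration: node 1 is the root, node 3 depends on node 2.
TOY = [(1, "SCONJ", 0), (2, "VERB", 1), (3, "SCONJ", 2)]

print(parent_upos_counts(TOY, "SCONJ"))
```

Counting the empty string as its own category would also explain why 13 parts of speech are reported although only 12 tags are named.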

5105 (93%) SCONJ nodes are leaves.

339 (6%) SCONJ nodes have one child.

14 (0%) SCONJ nodes have two children.

17 (0%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 6.
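The child-degree figures above partition the SCONJ tokens into four buckets (0, 1, 2, and 3 or more children). A sketch of that bucketing under the same assumed (id, UPOS, head) representation; names and toy data are hypothetical:

```python
from collections import Counter

def child_degree_buckets(sentence, upos):
    """sentence: list of (id, upos, head). Returns counts for '0', '1', '2' and '3+' children."""
    # Number of dependents per head id.
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for idx, tag, _ in sentence:
        if tag == upos:
            k = n_children[idx]
            buckets["3+" if k >= 3 else str(k)] += 1
    return buckets

# Toy tree invented for illustration: node 2 is a leaf, node 4 has one child.
TOY = [(1, "VERB", 0), (2, "SCONJ", 3), (3, "VERB", 1), (4, "SCONJ", 1), (5, "PRON", 4)]

print(child_degree_buckets(TOY, "SCONJ"))

# Consistency check on the figures reported above: the four buckets
# add up to the 5475 SCONJ tokens.
reported_total = 5105 + 339 + 14 + 17
print(reported_total)  # 5475
```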

Children of SCONJ nodes are attached using 14 different relations: fixed (334; 76% instances), ccomp (32; 7% instances), nsubj (18; 4% instances), cc (14; 3% instances), punct (13; 3% instances), obl (9; 2% instances), conj (5; 1% instances), mark (5; 1% instances), obj (3; 1% instances), case (2; 0% instances), advmod (1; 0% instances), advmod:emph (1; 0% instances), dep (1; 0% instances), dislocated (1; 0% instances)

Children of SCONJ nodes belong to 12 different parts of speech: PRON (340; 77% instances), VERB (33; 8% instances), CCONJ (17; 4% instances), NOUN (16; 4% instances), PUNCT (13; 3% instances), SCONJ (8; 2% instances), ADJ (4; 1% instances), ADP (2; 0% instances), ADV (2; 0% instances), DET (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)