home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew: POS Tags: CCONJ

There are 41 CCONJ lemmas (0%), 45 CCONJ types (0%) and 5413 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 9 in number of lemmas, 11 in number of types and 11 in number of tokens.

The 10 most frequent CCONJ lemmas: ו, או, אבל, אך, _, אלא, אף, אולם, אפילו, לכן

The 10 most frequent CCONJ types: ו, או, אבל, אך, אלא, אף, אולם, אפילו, לכן, אילו

The 10 most frequent ambiguous lemmas: ו (CCONJ 4085, X 3), או (CCONJ 240, PROPN 1), אבל (CCONJ 204, NOUN 3), _ (VERB 420, NOUN 368, ADJ 231, ADP 190, ADV 174, PRON 130, CCONJ 113, AUX 99, X 86, SCONJ 47, PART 34, DET 33), אף (CCONJ 99, DET 20, NOUN 13), אולם (CCONJ 53, NOUN 19), אפילו (CCONJ 43, ADV 8), הרי (CCONJ 27, PROPN 1), באשר (CCONJ 16, PROPN 4), כאילו (CCONJ 15, ADV 1)

The 10 most frequent ambiguous types: ו (CCONJ 4157, X 3), או (CCONJ 240, PROPN 1), אבל (CCONJ 204, NOUN 2), אף (CCONJ 99, DET 20, ADV 14, NOUN 12), אולם (CCONJ 53, NOUN 12), אפילו (CCONJ 43, ADV 8), אילו (CCONJ 34, DET 1, PRON 1), הרי (CCONJ 27, NOUN 4, PROPN 1, SCONJ 1), עקב (CCONJ 27, VERB 3), מאשר (CCONJ 19, VERB 1)

Morphology

The form / lemma ratio of CCONJ is 1.097561 (the average of all parts of speech is 1.709692).

The 1st highest number of forms (6) was observed with the lemma “_”: אחר, ו, חרף, למעט, עקב, פלוס.

The 2nd highest number of forms (1) was observed with the lemma “אבל”: אבל.

The 3rd highest number of forms (1) was observed with the lemma “אגב”: אגב.

CCONJ occurs with 2 features: HebSource (85; 2% instances), Abbr (1; 0% instances)

CCONJ occurs with 3 feature-value pairs: Abbr=Yes, HebSource=ConvUncertainHead, HebSource=ConvUncertainLabel

CCONJ occurs with 4 feature combinations. The most frequent feature combination is _ (5327 tokens). Examples: ו, או, אבל, אך, אף, אלא, אולם, אפילו, לכן, אילו

Relations

CCONJ nodes are attached to their parents using 19 different relations: cc (4760; 88% instances), advmod (189; 3% instances), case (129; 2% instances), dep (103; 2% instances), mark (52; 1% instances), fixed (51; 1% instances), det (33; 1% instances), root (16; 0% instances), advcl (14; 0% instances), appos (13; 0% instances), parataxis (13; 0% instances), compound:smixut (11; 0% instances), conj (8; 0% instances), flat:name (8; 0% instances), advmod:phrase (4; 0% instances), nmod (4; 0% instances), acl (2; 0% instances), obl (2; 0% instances), aux:q (1; 0% instances)

Parents of CCONJ nodes belong to 15 different parts of speech: VERB (2121; 39% instances), NOUN (2058; 38% instances), ADJ (422; 8% instances), PROPN (269; 5% instances), AUX (148; 3% instances), ADV (126; 2% instances), CCONJ (94; 2% instances), PRON (71; 1% instances), ADP (43; 1% instances), NUM (33; 1% instances), (16; 0% instances), PUNCT (7; 0% instances), DET (2; 0% instances), SCONJ (2; 0% instances), X (1; 0% instances)

5210 (96%) CCONJ nodes are leaves.

137 (3%) CCONJ nodes have one child.

30 (1%) CCONJ nodes have two children.

36 (1%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 9 different relations: cc (106; 33% instances), dep (90; 28% instances), punct (70; 22% instances), conj (21; 7% instances), fixed (18; 6% instances), advmod (7; 2% instances), advcl (3; 1% instances), case (2; 1% instances), obl (1; 0% instances)

Children of CCONJ nodes belong to 12 different parts of speech: CCONJ (94; 30% instances), PUNCT (94; 30% instances), ADV (46; 14% instances), VERB (27; 8% instances), NOUN (24; 8% instances), SCONJ (13; 4% instances), PRON (6; 2% instances), PROPN (5; 2% instances), AUX (4; 1% instances), ADJ (3; 1% instances), ADP (1; 0% instances), INTJ (1; 0% instances)