home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-IcePaHC: POS Tags: CCONJ

There are 33 CCONJ lemmas (0%), 35 CCONJ types (0%) and 57253 CCONJ tokens (6%). Out of 16 observed tags, the rank of CCONJ is: 12 in number of lemmas, 14 in number of types and 7 in number of tokens.

The 10 most frequent CCONJ lemmas: og, en, eða, bæði, eður, né, hvorki, enda, ýmist, annaðhvort

The 10 most frequent CCONJ types: og, en, eða, eður, bæði, né, hvorki, enda, hvörki, ýmist

The 10 most frequent ambiguous lemmas: og (CCONJ 44084, ADV 1457, ADP 1291, SCONJ 17, INTJ 3, X 1), en (CCONJ 9452, SCONJ 2271, ADP 21, ADV 9, NOUN 1, X 1), eða (CCONJ 1794, ADP 1), bæði (CCONJ 597, DET 78, ADV 1), eður (CCONJ 593, SCONJ 6), (CCONJ 422, DET 3, ADV 2), hvorki (CCONJ 171, DET 6, NOUN 1, SCONJ 1), enda (ADV 162, VERB 94, CCONJ 51, NOUN 2, ADP 1), annaðhvort (PRON 51, CCONJ 12, DET 2), heldur (ADV 1375, ADJ 15, CCONJ 7, VERB 2)

The 10 most frequent ambiguous types: og (CCONJ 41028, ADV 1456, ADP 1291, SCONJ 17, X 1), en (CCONJ 5464, SCONJ 2268, ADV 25, ADP 21, DET 5, AUX 1, NOUN 1, X 1), eða (CCONJ 1684, ADP 1), eður (CCONJ 586, SCONJ 6), bæði (CCONJ 594, DET 102, VERB 7, ADV 2), (CCONJ 421, DET 3, ADV 2), hvorki (CCONJ 134, DET 5, ADV 1, SCONJ 1), enda (ADV 150, NOUN 86, CCONJ 50, VERB 14, ADP 1), ýmist (CCONJ 14, ADV 9, DET 2), hverki (CCONJ 15, DET 1)

Morphology

The form / lemma ratio of CCONJ is 1.060606 (the average of all parts of speech is 1.842490).

The 1st highest number of forms (4) was observed with the lemma “hvorki”: hverki, hvorki, hvortki, hvörki.

The 2nd highest number of forms (4) was observed with the lemma “og”: &, oc, og, óg.

The 3rd highest number of forms (3) was observed with the lemma “annaðhvort”: annaðhvert, annaðhvort, annaðhvurt.

CCONJ occurs with 12 features: Number (129; 0% instances), Case (114; 0% instances), Gender (114; 0% instances), Definite (56; 0% instances), PronType (55; 0% instances), VerbForm (18; 0% instances), Voice (18; 0% instances), Mood (15; 0% instances), Person (15; 0% instances), Tense (15; 0% instances), Foreign (10; 0% instances), Degree (4; 0% instances)

CCONJ occurs with 24 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Ind, Tense=Pres, VerbForm=Fin, VerbForm=Part, Voice=Act

CCONJ occurs with 33 feature combinations. The most frequent feature combination is _ (57114 tokens). Examples: og, en, eða, eður, bæði, né, hvorki, enda, ýmist, annaðhvort

Relations

CCONJ nodes are attached to their parents using 16 different relations: cc (56540; 99% instances), conj (262; 0% instances), amod (196; 0% instances), obl (92; 0% instances), parataxis (57; 0% instances), root (38; 0% instances), ccomp (16; 0% instances), acl (10; 0% instances), advcl (8; 0% instances), obj (7; 0% instances), acl:relcl (6; 0% instances), dep (6; 0% instances), compound:prt (5; 0% instances), appos (4; 0% instances), xcomp (4; 0% instances), iobj (2; 0% instances)

Parents of CCONJ nodes belong to 16 different parts of speech: VERB (34482; 60% instances), NOUN (12064; 21% instances), ADJ (4117; 7% instances), PROPN (2156; 4% instances), PRON (1379; 2% instances), ADV (1322; 2% instances), DET (850; 1% instances), AUX (323; 1% instances), NUM (213; 0% instances), ADP (161; 0% instances), X (82; 0% instances), CCONJ (39; 0% instances), (38; 0% instances), PART (14; 0% instances), SCONJ (10; 0% instances), INTJ (3; 0% instances)

56696 (99%) CCONJ nodes are leaves.

245 (0%) CCONJ nodes have one child.

182 (0%) CCONJ nodes have two children.

130 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 11.

Children of CCONJ nodes are attached using 28 different relations: advmod (150; 13% instances), obl (134; 12% instances), cop (103; 9% instances), conj (91; 8% instances), punct (87; 8% instances), nsubj (62; 5% instances), advcl (57; 5% instances), amod (49; 4% instances), ccomp (48; 4% instances), obj (44; 4% instances), dep (43; 4% instances), acl (40; 4% instances), cc (34; 3% instances), aux (32; 3% instances), xcomp (32; 3% instances), acl:relcl (31; 3% instances), compound:prt (26; 2% instances), case (20; 2% instances), mark (15; 1% instances), flat:foreign (10; 1% instances), nmod (6; 1% instances), det (5; 0% instances), discourse (5; 0% instances), iobj (5; 0% instances), vocative (3; 0% instances), appos (1; 0% instances), expl (1; 0% instances), nummod (1; 0% instances)

Children of CCONJ nodes belong to 15 different parts of speech: VERB (227; 20% instances), NOUN (194; 17% instances), ADV (175; 15% instances), AUX (143; 13% instances), PRON (88; 8% instances), PUNCT (87; 8% instances), ADJ (57; 5% instances), ADP (49; 4% instances), CCONJ (39; 3% instances), DET (23; 2% instances), PROPN (18; 2% instances), SCONJ (15; 1% instances), X (11; 1% instances), INTJ (5; 0% instances), NUM (4; 0% instances)