home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: CCONJ

There are 35 CCONJ lemmas (0%), 39 CCONJ types (0%) and 10738 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 14 in number of lemmas, 15 in number of types and 9 in number of tokens.

The 10 most frequent CCONJ lemmas: un, bet, vai, gan, taču, arī, ne, jo, kā, tomēr

The 10 most frequent CCONJ types: un, bet, vai, gan, taču, arī, ne, jo, kā, tomēr

The 10 most frequent ambiguous lemmas: un (CCONJ 6999, X 1), bet (CCONJ 1281, SCONJ 1), vai (SCONJ 445, CCONJ 423, PART 250, INTJ 2), gan (CCONJ 399, PART 164, SCONJ 57), taču (CCONJ 361, PART 51), arī (PART 1385, CCONJ 265, SCONJ 30), ne (PART 237, CCONJ 200, SCONJ 1), jo (SCONJ 375, CCONJ 187, PART 25), (SCONJ 715, ADV 503, CCONJ 185, PART 80, PRON 3), tomēr (CCONJ 175, PART 77, SCONJ 1)

The 10 most frequent ambiguous types: un (CCONJ 6649, ADP 1, X 1), bet (CCONJ 1016, SCONJ 1), vai (SCONJ 442, CCONJ 398, PART 99, INTJ 1), gan (CCONJ 383, PART 161, SCONJ 57), taču (CCONJ 228, PART 49), arī (PART 1259, CCONJ 264, SCONJ 30), ne (PART 217, CCONJ 194, DET 1, SCONJ 1), jo (SCONJ 345, CCONJ 180, PART 23), (SCONJ 667, ADV 359, CCONJ 177, PART 80, PRON 22, DET 6), tomēr (CCONJ 88, PART 71)

Morphology

The form / lemma ratio of CCONJ is 1.114286 (the average of all parts of speech is 2.233228).

The 1st highest number of forms (2) was observed with the lemma “arī”: ari, arī.

The 2nd highest number of forms (2) was observed with the lemma “kā”: ka, kā.

The 3rd highest number of forms (2) was observed with the lemma “nevis”: nevis, nevīs.

CCONJ occurs with 2 features: Polarity (200; 2% instances), Typo (5; 0% instances)

CCONJ occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (10533 tokens). Examples: un, bet, vai, gan, taču, arī, jo, kā, tomēr, nevis

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (10123; 94% instances), fixed (309; 3% instances), mark (248; 2% instances), case (33; 0% instances), conj (6; 0% instances), discourse (6; 0% instances), root (4; 0% instances), dep (3; 0% instances), nsubj (3; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (5340; 50% instances), NOUN (3364; 31% instances), ADJ (644; 6% instances), PROPN (465; 4% instances), ADV (314; 3% instances), CCONJ (264; 2% instances), PRON (154; 1% instances), NUM (63; 1% instances), SCONJ (36; 0% instances), X (28; 0% instances), SYM (21; 0% instances), PART (19; 0% instances), AUX (12; 0% instances), ADP (5; 0% instances), INTJ (4; 0% instances), (4; 0% instances), DET (1; 0% instances)

10298 (96%) CCONJ nodes are leaves.

431 (4%) CCONJ nodes have one child.

5 (0%) CCONJ nodes have two children.

4 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 9 different relations: fixed (413; 91% instances), punct (25; 5% instances), conj (7; 2% instances), discourse (3; 1% instances), flat:name (3; 1% instances), nmod (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 9 different parts of speech: CCONJ (264; 58% instances), PART (155; 34% instances), PUNCT (25; 5% instances), NOUN (4; 1% instances), SCONJ (4; 1% instances), ADV (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances)