home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SCONJ

There are 21 SCONJ lemmas (0%), 25 SCONJ types (0%) and 7588 SCONJ tokens (2%). Out of 17 observed tags, the rank of SCONJ is: 16 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent SCONJ lemmas: ka, lai, kā, ja, jo, vai, nekā, kamēr, līdz, tā

The 10 most frequent SCONJ types: ka, lai, kā, ja, jo, vai, nekā, kamēr, līdz, tā

The 10 most frequent ambiguous lemmas: ka (SCONJ 3070, PART 1), lai (SCONJ 1010, PART 62), (SCONJ 958, ADV 603, CCONJ 247, PART 94), ja (SCONJ 811, PART 2), jo (SCONJ 720, PART 33), vai (CCONJ 598, SCONJ 500, PART 301, INTJ 2), nekā (SCONJ 199, PART 5, ADV 3), kamēr (SCONJ 103, CCONJ 2, ADV 1), līdz (ADP 489, ADV 92, SCONJ 54, CCONJ 7), (DET 1384, ADV 434, SCONJ 48, PART 35, CCONJ 11, PRON 2)

The 10 most frequent ambiguous types: ka (SCONJ 3059, CCONJ 2, PRON 2, PART 1), lai (SCONJ 821, PART 42), (SCONJ 889, ADV 431, CCONJ 239, PART 94, PRON 34), ja (SCONJ 462, PART 2), jo (SCONJ 658, PART 31), vai (CCONJ 566, SCONJ 496, PART 119, INTJ 1), nekā (SCONJ 198, PRON 17, PART 6, ADV 3), kamēr (SCONJ 86, CCONJ 2), līdz (ADP 444, ADV 52, SCONJ 46, CCONJ 6, VERB 1), (DET 585, ADV 352, PART 23, SCONJ 13, CCONJ 10, PRON 1)

Morphology

The form / lemma ratio of SCONJ is 1.190476 (the average of all parts of speech is 2.339090).

The 1st highest number of forms (3) was observed with the lemma “ka”: ka, kad, kā.

The 2nd highest number of forms (3) was observed with the lemma “kā”: ka, kaa, kā.

The 3rd highest number of forms (2) was observed with the lemma “ja”: ja, ka.

SCONJ occurs with 1 features: Typo (9; 0% instances)

SCONJ occurs with 1 feature-value pairs: Typo=Yes

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (7579 tokens). Examples: ka, lai, kā, ja, jo, vai, nekā, kamēr, līdz, tā

Relations

SCONJ nodes are attached to their parents using 11 different relations: mark (6763; 89% instances), cc (497; 7% instances), fixed (210; 3% instances), discourse (47; 1% instances), dep (46; 1% instances), case (10; 0% instances), conj (9; 0% instances), advcl (2; 0% instances), root (2; 0% instances), ccomp (1; 0% instances), nmod (1; 0% instances)

Parents of SCONJ nodes belong to 17 different parts of speech: VERB (5435; 72% instances), NOUN (1144; 15% instances), ADJ (438; 6% instances), ADV (290; 4% instances), DET (65; 1% instances), SCONJ (48; 1% instances), PART (44; 1% instances), PROPN (42; 1% instances), NUM (35; 0% instances), PRON (23; 0% instances), AUX (8; 0% instances), X (7; 0% instances), CCONJ (4; 0% instances), (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)

7289 (96%) SCONJ nodes are leaves.

286 (4%) SCONJ nodes have one child.

8 (0%) SCONJ nodes have two children.

5 (0%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 9.

Children of SCONJ nodes are attached using 16 different relations: fixed (271; 82% instances), punct (23; 7% instances), iobj (12; 4% instances), cop (3; 1% instances), flat (3; 1% instances), goeswith (3; 1% instances), nsubj (3; 1% instances), obl (3; 1% instances), advcl (2; 1% instances), conj (2; 1% instances), nmod (2; 1% instances), aux:pass (1; 0% instances), cc (1; 0% instances), dep (1; 0% instances), discourse (1; 0% instances), nsubj:pass (1; 0% instances)

Children of SCONJ nodes belong to 12 different parts of speech: PART (175; 53% instances), SCONJ (48; 14% instances), CCONJ (45; 14% instances), PUNCT (23; 7% instances), DET (16; 5% instances), NOUN (10; 3% instances), AUX (4; 1% instances), PRON (3; 1% instances), X (3; 1% instances), ADV (2; 1% instances), VERB (2; 1% instances), ADP (1; 0% instances)