Treebank Statistics: UD_Finnish-FTB: POS Tags: CCONJ
There are 22 CCONJ
lemmas (0%), 25 CCONJ
types (0%) and 4779 CCONJ
tokens (3%).
Out of 17 observed tags, the rank of CCONJ
is: 13 in number of lemmas, 15 in number of types and 10 in number of tokens.
The 10 most frequent CCONJ
lemmas: ja, mutta, tai, vai, vaan, sekä, eli, että, joko, kuin
The 10 most frequent CCONJ
types: ja, mutta, tai, vai, vaan, sekä, eli, että, mut, joko
The 10 most frequent ambiguous lemmas: ja (CCONJ 3230, PART 2), mutta (CCONJ 604, SCONJ 1), vaan (CCONJ 126, PART 86), eli (CCONJ 55, PART 12), että (SCONJ 1772, CCONJ 38, PART 3), kuin (SCONJ 646, CCONJ 24), elikkä (CCONJ 4, PART 4), joskaan (CCONJ 4, SCONJ 1), vaikka (SCONJ 204, PART 73, CCONJ 4), joskin (SCONJ 5, CCONJ 2)
The 10 most frequent ambiguous types: ja (CCONJ 3080, PART 2), vaan (CCONJ 126, PART 73), eli (CCONJ 55, VERB 5), että (SCONJ 1404, CCONJ 38, AUX 2), mut (CCONJ 21, PRON 3), kuin (SCONJ 614, CCONJ 24), elikkä (CCONJ 4, PART 1), joskaan (CCONJ 4, SCONJ 1), vaikka (SCONJ 134, PART 65, CCONJ 3), joskin (SCONJ 5, CCONJ 2)
- ja
- vaan
- eli
- että
- mut
- kuin
- elikkä
- CCONJ 4: just kohta ool lähössä elikkä viide minuutin päästä .
- PART 1: elikkä tuota Täs on nyt mä kattelin ihan , ihan noiden teiän kooditusten : kooditusten perusteella ja , ja tota Totesin sitte et ku siin nyt mitään ihmeellistä hommaa hommaa tosiaan ei olis niin , niin tota - kyllä ne samat ehdot ihan ihan sitte ku Rakestaki että tota , että enempäähän me ei ei pystytä siinä tarjoomaan koska .
- joskaan
- vaikka
- joskin
Morphology
The form / lemma ratio of CCONJ
is 1.136364 (the average of all parts of speech is 2.048736).
The 1st highest number of forms (3) was observed with the lemma “mutta”: mut, mutt, mutta.
The 2nd highest number of forms (2) was observed with the lemma “vaikka”: vaikk, vaikka.
The 3rd highest number of forms (1) was observed with the lemma “eli”: eli.
CCONJ
occurs with 1 features: Style (49; 1% instances)
CCONJ
occurs with 1 feature-value pairs: Style=Coll
CCONJ
occurs with 2 feature combinations.
The most frequent feature combination is _
(4730 tokens).
Examples: ja, mutta, tai, vai, vaan, sekä, eli, että, joko, kuin
Relations
CCONJ
nodes are attached to their parents using 5 different relations: cc (4726; 99% instances), advmod (23; 0% instances), fixed (14; 0% instances), conj (13; 0% instances), cc:preconj (3; 0% instances)
Parents of CCONJ
nodes belong to 14 different parts of speech: VERB (2096; 44% instances), NOUN (1519; 32% instances), ADJ (502; 11% instances), PROPN (306; 6% instances), ADV (136; 3% instances), PRON (118; 2% instances), NUM (48; 1% instances), PART (15; 0% instances), CCONJ (12; 0% instances), ADP (10; 0% instances), SCONJ (10; 0% instances), X (4; 0% instances), INTJ (2; 0% instances), DET (1; 0% instances)
4683 (98%) CCONJ
nodes are leaves.
93 (2%) CCONJ
nodes have one child.
2 (0%) CCONJ
nodes have two children.
1 (0%) CCONJ
nodes have three or more children.
The highest child degree of a CCONJ
node is 3.
Children of CCONJ
nodes are attached using 2 different relations: punct (88; 88% instances), conj (12; 12% instances)
Children of CCONJ
nodes belong to 2 different parts of speech: PUNCT (88; 88% instances), CCONJ (12; 12% instances)