
Clitic: clitic

(Please note: this part of the documentation is not yet completed.)

Language-specific feature identifying clitics attached to the word.

Finnish has a number of particle clitics used to express questions, politeness, or focus. UD Finnish marks the presence of these clitics with the Clitic feature, which takes one or more of the following values. Multiple values express combinations of clitics, for example Clitic=Ko,S for -kos (-ko + -s), as in voikos.
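As an illustration of how a combined value such as Clitic=Ko,S can be read back out of a token's features, here is a minimal sketch. It assumes the usual CoNLL-U conventions (features separated by `|`, multiple values of one feature separated by `,`, and `_` for an empty feature column). The function name `clitic_values` and the sample feature string are hypothetical and not taken from the treebank.

```python
def clitic_values(feats: str) -> list[str]:
    """Return the Clitic values found in a CoNLL-U FEATS string."""
    if feats == "_":  # "_" marks an empty FEATS column
        return []
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Clitic":
            # Combined clitics are listed as comma-separated values.
            return value.split(",")
    return []


# Hypothetical feature string for a form such as "voikos" (-ko + -s).
print(clitic_values("Clitic=Ko,S|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin"))
```

A token without the feature simply yields an empty list, so the same call can be used to filter tokens.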

Kin

Expresses focus. Can often be translated into English as "also". Forms a contrasting pair with -kaan.

Examples

Kaan

Expresses focus in negative contexts. Realized as -kaan or -kään. Forms a contrasting pair with -kin.

Examples

Ko

Expresses a question. Realized as -ko or -kö.

Examples

Han

Realized as -han or -hän.

Examples

Pa

Realized as -pa or -pä.

Examples

S

TODO

Examples

Ka

Realized as -ka or -kä. Attaches to the negative verb ei and also serves as a conjunction.

Examples
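The paired realizations listed for the values above (-kaan/-kään, -ko/-kö, -han/-hän, -pa/-pä, -ka/-kä) follow Finnish vowel harmony. The sketch below picks a variant with a simple heuristic: it looks at the last back vowel (a, o, u) or front vowel (ä, ö, y) in the host word and falls back to the front variant when only neutral vowels (e, i) are present. The names `REALIZATIONS` and `attach` are hypothetical, and the heuristic is an assumption for illustration; it is not how the treebanks were annotated and it may fail on loanwords.

```python
BACK_VOWELS = set("aou")
FRONT_VOWELS = set("äöy")

# (back-vowel form, front-vowel form) for each Clitic value.
# Kin and S have a single form.
REALIZATIONS = {
    "Kin": ("kin", "kin"),
    "Kaan": ("kaan", "kään"),
    "Ko": ("ko", "kö"),
    "Han": ("han", "hän"),
    "Pa": ("pa", "pä"),
    "S": ("s", "s"),
    "Ka": ("ka", "kä"),
}


def attach(word: str, value: str) -> str:
    """Append the clitic for `value` to `word`, choosing the harmonic variant."""
    back, front = REALIZATIONS[value]
    for ch in reversed(word.lower()):
        if ch in BACK_VOWELS:
            return word + back
        if ch in FRONT_VOWELS:
            return word + front
    # Only neutral vowels (e, i) in the word: use the front variant.
    return word + front


print(attach("olisi", "Kaan"))
print(attach("se", "Kaan"))
print(attach("ei", "Ka"))
print(attach(attach("voi", "Ko"), "S"))
```

Combined clitics are attached one at a time in surface order, which the caller has to supply, for example -ko before -s in voikos.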


Treebank Statistics (UD_Finnish)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 4 combinations have been observed: Han|Ko, Han|Pa, Ko|S, Pa|S.

1661 tokens (1%) have a non-empty value of Clitic. 977 types (2%) occur at least once with a non-empty value of Clitic. 531 lemmas (2%) occur at least once with a non-empty value of Clitic. The feature is used with 11 part-of-speech tags: fi-pos/VERB (778; 0% instances), fi-pos/ADV (242; 0% instances), fi-pos/NOUN (221; 0% instances), fi-pos/PRON (191; 0% instances), fi-pos/AUX (106; 0% instances), fi-pos/ADJ (69; 0% instances), fi-pos/PROPN (22; 0% instances), fi-pos/SCONJ (12; 0% instances), fi-pos/ADP (10; 0% instances), fi-pos/NUM (9; 0% instances), fi-pos/CONJ (1; 0% instances).
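Per-tag counts of this kind could be recomputed from a treebank file roughly as follows. This is a minimal sketch that assumes the standard ten-column, tab-separated CoNLL-U layout; it is not the script that produced the statistics above. The function name `clitic_counts_by_upos` is hypothetical, and the sample sentence, including its heads and dependency relations, is invented for illustration.

```python
from collections import Counter


def clitic_counts_by_upos(conllu_text: str) -> Counter:
    """Count tokens carrying a Clitic feature, grouped by UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        upos, feats = cols[3], cols[5]
        if any(f.startswith("Clitic=") for f in feats.split("|")):
            counts[upos] += 1
    return counts


# Invented sample: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
ROWS = [
    ("1", "Onko", "olla", "AUX", "_",
     "Clitic=Ko|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act",
     "3", "cop", "_", "_"),
    ("2", "sekin", "se", "PRON", "_", "Case=Nom|Clitic=Kin|Number=Sing",
     "3", "nsubj", "_", "_"),
    ("3", "hyvä", "hyvä", "ADJ", "_", "Case=Nom|Degree=Pos|Number=Sing",
     "0", "root", "_", "_"),
    ("4", "?", "?", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(clitic_counts_by_upos(SAMPLE))
```

Dividing each count by the total number of tokens with that tag would give percentages comparable to those reported in the per-tag sections.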

VERB

778 fi-pos/VERB tokens (2% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: InfForm=EMPTY (756; 97%), Degree=EMPTY (748; 96%), PartForm=EMPTY (748; 96%), Case=EMPTY (739; 95%), VerbForm=Fin (726; 93%), Voice=Act (724; 93%), Number=Sing (613; 79%), Person=3 (508; 65%), Mood=Ind (411; 53%).

VERB tokens may have the following values of Clitic:

Paradigm olla, Clitic values: Han; Han,Ko; Kaan; Kin; Ko; Ko,S; Pa; Pa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: ollutkaan
Connegative=Yes|Mood=Cnd|VerbForm=Fin: olisikaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin: olekkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin: olekaan; olekin
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=Act: Olisihan; Olisikohan; olisikin; olisiko; Olisipa
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Act: olinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act: olenkin; olenko
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act: oletko; oletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: onki; onks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olihan; olikaan; olikin; oliko; Olikos; olipa; olipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: Onhan; onkaan; onkin; onko; onkos; onpa; Onpas
Mood=Ind|Number=Plur|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Act: olihan
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olivathan; olivatkin; olivatko; olivatpa
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: ovathan; ovatkaan; ovatkin, olemmekin; Ovatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Pass: Ollaas

ADV

242 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

ADV tokens may have the following values of Clitic:

Paradigm niin, Clitic values: Han; Kaan; Kin; Pa
niinhän; niinkään; niinkin; Niinpä

NOUN

221 fi-pos/NOUN tokens (0% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (153; 69%).

NOUN tokens may have the following values of Clitic:

Paradigm mies, Clitic values: Han; Kaan; Kin
Case=Gen|Number=Plur: miestenkin
Case=Nom|Number=Sing: mieshän; Mieskin
Case=Nom|Number=Sing|Number[psor]=Sing|Person[psor]=1: miehenikin
Case=Nom|Number=Sing|Person[psor]=3: miehensäkään

Clitic seems to be a lexical feature of NOUN. 95% of lemmas (184) occur with only one value of Clitic.
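The share of lemmas that occur with only one Clitic value could be computed along the following lines. This is a minimal sketch under the assumption that the statistic is taken over lemmas with at least one non-empty Clitic value; the exact procedure behind the figure above is not documented here. The function name `single_value_share` and the toy data are hypothetical.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (count, rounded percent) of lemmas seen with only one Clitic value.

    `pairs` is an iterable of (lemma, clitic_value) tuples and must not be empty.
    """
    values_by_lemma = defaultdict(set)
    for lemma, clitic in pairs:
        values_by_lemma[lemma].add(clitic)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, round(100 * single / len(values_by_lemma))


# Toy observations (invented): only "mies" occurs with two different values.
observations = [
    ("mies", "Han"), ("mies", "Kin"),
    ("lapsi", "Kin"), ("lapsi", "Kin"),
    ("talo", "Kaan"),
    ("koira", "Kin"),
]
print(single_value_share(observations))
```

A high share mostly reflects that many lemmas are seen with a clitic only once or twice, so it is weak evidence that the feature is lexical.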

PRON

191 fi-pos/PRON tokens (2% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Person=EMPTY (151; 79%), Number=Sing (146; 76%).

PRON tokens may have the following values of Clitic:

Paradigm se, Clitic values: Han; Kaan; Kin; Pa; S
Case=Ade|Number=Sing: silläkin
Case=Ade|Number=Plur: niilläkin
Case=Ade|Number=Plur|Style=Coll: niilki
Case=Ela|Number=Sing: siitähän; siitäkin; siitäs
Case=Ela|Number=Plur: niistäkin
Case=Gen|Number=Sing: Senhän; senkään; senkin
Case=Gen|Number=Plur: niidenkin
Case=Ill|Number=Sing: siihenkin
Case=Ine|Number=Sing: Siinäpä
Case=Nom|Number=Sing: sehän; sekään; sekin
Case=Nom|Number=Plur: nekin
Case=Par|Number=Sing: Sitähän; sitäkään; sitäkin

AUX

106 fi-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Clitic.

The most frequent other feature values with which AUX and Clitic co-occurred: VerbForm=Fin (102; 96%), Voice=Act (98; 92%), Number=Sing (87; 82%), Mood=Ind (76; 72%), Person=3 (72; 68%), Tense=Pres (63; 59%).

AUX tokens may have the following values of Clitic:

Paradigm olla, Clitic values: Han; Kaan; Kin; Ko; Ko,S; Pa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: ollutkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin: olekaan
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=Act: olisikin; olisiko
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Act: olinkin; olinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act: olenkaan; Olenkin; olenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: oot
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: onks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olikaan; olikin; oliko
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: Onhan; onkin; onko; onpas
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: ootteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olivatkin
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: ovatkin

ADJ

69 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (52; 75%), Degree=Pos (50; 72%).

ADJ tokens may have the following values of Clitic:

Paradigm hyvä, Clitic values: Kaan; Kin
Degree=Pos|Number=Sing: hyvääkin
Degree=Cmp|Number=Sing: parempaakaan; parempaakin
Degree=Cmp|Number=Plur: parempiakaan

PROPN

22 fi-pos/PROPN tokens (0% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (21; 95%).

PROPN tokens may have the following values of Clitic:

Paradigm Suomi, Clitic values: Kaan; Kin
Case=Gen: Suomenkaan
Case=Ine: Suomessakin
Case=Nom: Suomikin

Clitic seems to be a lexical feature of PROPN. 94% of lemmas (17) occur with only one value of Clitic.

SCONJ

12 fi-pos/SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Clitic.

SCONJ tokens may have the following values of Clitic:

Paradigm jos, Clitic values: Kin; Ko
joskin; josko

ADP

10 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADP and Clitic co-occurred: AdpType=Post (6; 60%).

ADP tokens may have the following values of Clitic:

Paradigm jälkeen, Clitic values: Kaan; Kin
jälkeenkään; jälkeenkin

NUM

9 fi-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: Number=Sing (9; 100%), NumType=Card (9; 100%).

NUM tokens may have the following values of Clitic:

Paradigm yksi, Clitic values: Kaan; Kin
Case=Abl: yhdeltäkään
Case=Ess: yhtenäkin
Case=Nom: yksikin
Case=Par: yhtäkään

CONJ

1 fi-pos/CONJ token (0% of all CONJ tokens) has a non-empty value of Clitic.

CONJ tokens may have the following values of Clitic:


Treebank Statistics (UD_Finnish-FTB)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 9 combinations have been observed: Han|Ka, Han|Kin, Han|Ko, Han|Pa, Ka|S, Kaan|Ko, Kin|Ko, Ko|S, Pa|S.

2966 tokens (2%) have a non-empty value of Clitic. 1730 types (4%) occur at least once with a non-empty value of Clitic. 763 lemmas (4%) occur at least once with a non-empty value of Clitic. The feature is used with 9 part-of-speech tags: fi-pos/VERB (1684; 1% instances), fi-pos/NOUN (357; 0% instances), fi-pos/ADV (340; 0% instances), fi-pos/PRON (311; 0% instances), fi-pos/ADJ (114; 0% instances), fi-pos/DET (71; 0% instances), fi-pos/PROPN (57; 0% instances), fi-pos/NUM (24; 0% instances), fi-pos/ADP (8; 0% instances).
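The two treebanks can be compared directly on the inventories reported in their statistics. The sketch below copies the observed value combinations and part-of-speech tags from the two statistics sections on this page and computes the differences with plain set operations. The variable names are hypothetical.

```python
# Observed value combinations, as listed in the two statistics sections.
ud_finnish_combos = {"Han|Ko", "Han|Pa", "Ko|S", "Pa|S"}
ftb_combos = {
    "Han|Ka", "Han|Kin", "Han|Ko", "Han|Pa", "Ka|S",
    "Kaan|Ko", "Kin|Ko", "Ko|S", "Pa|S",
}

# Part-of-speech tags that occur with a non-empty Clitic value.
ud_finnish_tags = {
    "VERB", "ADV", "NOUN", "PRON", "AUX", "ADJ",
    "PROPN", "SCONJ", "ADP", "NUM", "CONJ",
}
ftb_tags = {"VERB", "NOUN", "ADV", "PRON", "ADJ", "DET", "PROPN", "NUM", "ADP"}

only_in_ftb_combos = ftb_combos - ud_finnish_combos
only_in_ud_finnish_combos = ud_finnish_combos - ftb_combos
only_in_ud_finnish_tags = ud_finnish_tags - ftb_tags
only_in_ftb_tags = ftb_tags - ud_finnish_tags

print("Combinations only in UD_Finnish-FTB:", only_in_ftb_combos)
print("Combinations only in UD_Finnish:", only_in_ud_finnish_combos)
print("Tags only in UD_Finnish:", only_in_ud_finnish_tags)
print("Tags only in UD_Finnish-FTB:", only_in_ftb_tags)
```

On these lists, every combination observed in UD_Finnish also appears in UD_Finnish-FTB, while AUX, SCONJ, and CONJ carry Clitic only in UD_Finnish and DET only in UD_Finnish-FTB.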

VERB

1684 fi-pos/VERB tokens (4% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: PartForm=EMPTY (1640; 97%), InfForm=EMPTY (1625; 96%), Voice=Act (1595; 95%), Case=EMPTY (1581; 94%), VerbForm=EMPTY (1580; 94%), Number=Sing (1320; 78%), Mood=Ind (992; 59%), Person=3 (954; 57%).

VERB tokens may have the following values of Clitic:

Paradigm olla, Clitic values: Han; Han,Ko; Kaan; Kin; Ko; Ko,S; Pa; Pa,S; S
Case=Gen|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: olleenkaan
Case=Gen|Number=Sing|PartForm=Pres|VerbForm=Part|Voice=Act: olevankaan
Case=Ine|InfForm=2|VerbForm=Inf|Voice=Act: ollessakaan
Case=Lat|InfForm=1|VerbForm=Inf|Voice=Act: ollakaan; Ollako; ollapa
Case=Nom|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: ollutkaan, ollukkaan; ollukki; ollukko
Case=Nom|Number=Plur|PartForm=Past|VerbForm=Part|Voice=Act: olleetkaan; olleetkin
Conneg=Yes|Mood=Ind|Number=Sing|Tense=Past|Voice=Act: ollutkaan
Conneg=Yes|Mood=Ind|Tense=Pres|Voice=Act: olekaan; olekin, ookin
Mood=Cnd|Number=Sing|Person=1|Voice=Act: Olisinko
Mood=Cnd|Number=Sing|Person=2|Voice=Act: Olisitpa
Mood=Cnd|Number=Sing|Person=3|Voice=Act: Olisihan; Olisikohan, Oiskohan; olisikaan; olisikin, oliskin; olisiko, oisko, olisko; Olisipa
Mood=Cnd|Number=Plur|Person=2|Voice=Act: Olisitteko
Mood=Cnd|Number=Plur|Person=3|Voice=Act: olisivatko
Mood=Imp|Number=Sing|Person=2|Voice=Act: olekin; olepa
Mood=Imp|Number=Sing|Person=3|Voice=Act: olkoonkin; olkoonpa
Mood=Ind|Number=Sing|Person=1|Tense=Past|Voice=Act: olinkin; olinko; Olinpa
Mood=Ind|Number=Sing|Person=1|Tense=Pres|Voice=Act: Olenhan; olenkin; olenko, oonko, olenk, Oonksmä, ooks
Mood=Ind|Number=Sing|Person=2|Tense=Past|Voice=Act: Olithan; Olitko; Olitkos
Mood=Ind|Number=Sing|Person=2|Tense=Pres|Voice=Act: Oletkohan; oletkaan; oletkin; oletko, ootko, ootsä, Ookkonää, Ooksää, oleks, ook, oleksä; Oletkos; oletpa
Mood=Ind|Number=Sing|Person=3|Tense=Past|Voice=Act: olihan; olikohan; olikaan; olikin, olikii; oliko, oliks, olik; Olikos; olipa; Olipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|Voice=Act: onhan; Onkohan, onkoha; onkaan; onkin, onki; onko, onks, onk; onkos; onpa; Onpas
Mood=Ind|Number=Plur|Person=1|Tense=Pres|Voice=Act: olemmeko
Mood=Ind|Number=Plur|Person=2|Tense=Past|Voice=Act: Olitteks
Mood=Ind|Number=Plur|Person=2|Tense=Pres|Voice=Act: oletteko, Oottekste, ootteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|Voice=Act: olivatkaan
Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act: ovathan; ovatkaan; ovatkin; ovatko
Mood=Ind|Tense=Past|Voice=Pass: Oltiinhan; Oltiinkin
Mood=Ind|Tense=Pres|Voice=Pass: Ollaanhan; ollaanpas; Ollaas
Mood=Pot|Number=Sing|Person=3|Voice=Act: lieneekään; Liekö, lieneekö

NOUN

357 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (266; 75%).

NOUN tokens may have the following values of Clitic:

Paradigm lapsi, Clitic values: Han; Han,Kin; Kaan; Kin
Case=Ade|Number=Plur: lapsillakin
Case=Ela|Number=Sing: Lapsestakin
Case=Ill|Number=Plur: Lapsiinhan
Case=Nom|Number=Sing: lapsikaan; lapsikin
Case=Nom|Number=Plur: lapsetkin
Case=Par|Number=Plur: lapsijakkiihan

Clitic seems to be a lexical feature of NOUN. 92% of lemmas (249) occur with only one value of Clitic.

ADV

340 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADV and Clitic co-occurred: PronType=EMPTY (275; 81%).

ADV tokens may have the following values of Clitic:

Paradigm kyllä, Clitic values: Han; Kaan; Kin; Pa; Pa,S
kyllähän, kylhän; kylläkään; kylläkin; Kylläpä; kylläpäs

PRON

311 fi-pos/PRON tokens (3% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Number=Sing (239; 77%), Person=EMPTY (204; 66%), Case=Nom (178; 57%).

PRON tokens may have the following values of Clitic:

Paradigm se, Clitic values: Han; Kaan; Kaan,Ko; Kin; Ko; Pa; Pa,S
Case=Ade: silläkin
Case=Ela: siitähän; siitäkään; Siitäkin; Siitäpä
Case=Gen: senhän; senkään
Case=Ill: siihenkin; Siihenkö
Case=Ine: siinähä; siinäkin; siinäpä
Case=Nom: sehän; sekään; sekin, seki; sekö; Sepä; Sepäs
Case=Par: Sitähän; sitäkään; sitäkäänkö; sitäkin; Sitäkö

ADJ

114 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (71; 62%).

ADJ tokens may have the following values of Clitic:

Paradigm oma, Clitic values: Kin; Pa
Case=Ela|Number=Plur: omistaki
Case=Nom|Number=Sing: Omapa
Case=Nom|Number=Plur: omatkin

Clitic seems to be a lexical feature of ADJ. 93% of lemmas (70) occur with only one value of Clitic.

DET

71 fi-pos/DET tokens (2% of all DET tokens) have a non-empty value of Clitic.

The most frequent other feature values with which DET and Clitic co-occurred: Person=EMPTY (66; 93%), Number=Sing (43; 61%).

DET tokens may have the following values of Clitic:

Paradigm tämä, Clitic values: Han; Kaan; Kin; Ko
Case=Ess: Tänäkään; tänäkin
Case=Gen: tämänkään; tämänkin
Case=Ine: tässäkin
Case=Nom: Tämähän; Tämäkään; Tämäkö
Case=Par: tätäkä

PROPN

57 fi-pos/PROPN tokens (1% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (56; 98%).

PROPN tokens may have the following values of Clitic:

Paradigm suomi, Clitic values: Kin; Ko
Case=Ela: Suomestakin
Case=Gen: Suomenkin
Case=Ine: Suomessakin; Suomessako
Case=Nom: Suomikin

Clitic seems to be a lexical feature of PROPN. 98% of lemmas (44) occur with only one value of Clitic.

NUM

24 fi-pos/NUM tokens (1% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: Number=Sing (23; 96%), NumType=Card (21; 88%), Case=Nom (15; 63%).

NUM tokens may have the following values of Clitic:

Paradigm yksi, Clitic values: Kaan; Kin
Case=Ess: yhtenäkään
Case=Gen: yhdenkin
Case=Nom: yksikään; yksikin

ADP

8 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

ADP tokens may have the following values of Clitic: