home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Karelian-KKPP: POS Tags: ADJ

There are 104 ADJ lemmas (11%), 165 ADJ types (12%) and 216 ADJ tokens (7%). Out of 14 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: oma, toini, kanšallini, šuuri, hyvä, semmoni, enšimmäini, eeppini, uuši, 2.

The 10 most frequent ADJ types: omie, kanšallisien, omua, šemmosie, 2., toisie, erilaisie, toini, 25., 28.

The 10 most frequent ambiguous lemmas: toini (ADJ 14, PRON 2), eri (ADV 2, ADJ 1)

The 10 most frequent ambiguous types: toisen (ADJ 2, PRON 1), eri (ADV 2, ADJ 1), pitän (ADJ 1, VERB 1)

Morphology

The form / lemma ratio of ADJ is 1.586538 (the average of all parts of speech is 1.495298).

The 1st highest number of forms (8) was observed with the lemma “oma”: Oma, omalla, oman, omat, omie, omien, omilla, omua.

The 2nd highest number of forms (7) was observed with the lemma “šuuri”: šuurella, šuuremman, šuurena, šuurie, šuurimmakši, šuurimmista, šuurimpie.

The 3rd highest number of forms (6) was observed with the lemma “hyvä”: Parahat, hyvyä, parahakši, paraš, paremmat, parempie.

ADJ occurs with 5 features: Case (213; 99% instances), Number (213; 99% instances), NumType (13; 6% instances), Degree (12; 6% instances), Typo (1; 0% instances)

ADJ occurs with 15 feature-value pairs: Case=Ade, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Tra, Degree=Cmp, Degree=Sup, NumType=Ord, Number=Plur, Number=Sing, Typo=Yes

ADJ occurs with 27 feature combinations. The most frequent feature combination is Case=Par|Number=Sing (35 tokens). Examples: omua, omie, kypšie, Viimesie, epävirallista, erimoisie, hyvyä, karjalaisie, keinotekoista, keltasie

Relations

ADJ nodes are attached to their parents using 16 different relations: amod (153; 71% instances), conj (23; 11% instances), obl (12; 6% instances), root (8; 4% instances), obj (5; 2% instances), nsubj (3; 1% instances), advmod (2; 1% instances), nmod:poss (2; 1% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), compound:prt (1; 0% instances), fixed (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of ADJ nodes belong to 8 different parts of speech: NOUN (163; 75% instances), VERB (26; 12% instances), ADJ (13; 6% instances), (8; 4% instances), AUX (2; 1% instances), PRON (2; 1% instances), NUM (1; 0% instances), PROPN (1; 0% instances)

147 (68%) ADJ nodes are leaves.

42 (19%) ADJ nodes have one child.

13 (6%) ADJ nodes have two children.

14 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 5.

Children of ADJ nodes are attached using 24 different relations: punct (23; 18% instances), cc (19; 15% instances), cop (14; 11% instances), conj (13; 10% instances), advmod (10; 8% instances), nsubj:cop (10; 8% instances), nmod (5; 4% instances), obl (5; 4% instances), nmod:poss (4; 3% instances), acl:relcl (3; 2% instances), fixed (3; 2% instances), orphan (2; 2% instances), parataxis (2; 2% instances), xcomp (2; 2% instances), appos (1; 1% instances), aux (1; 1% instances), case (1; 1% instances), ccomp (1; 1% instances), compound (1; 1% instances), det (1; 1% instances), flat:name (1; 1% instances), mark (1; 1% instances), nummod (1; 1% instances), obj (1; 1% instances)

Children of ADJ nodes belong to 12 different parts of speech: PUNCT (23; 18% instances), NOUN (21; 17% instances), CCONJ (19; 15% instances), AUX (16; 13% instances), ADJ (13; 10% instances), ADV (10; 8% instances), PRON (8; 6% instances), VERB (6; 5% instances), PROPN (5; 4% instances), NUM (2; 2% instances), ADP (1; 1% instances), SCONJ (1; 1% instances)