home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Livvi-KKPP: POS Tags: NOUN

There are 227 NOUN lemmas (39%), 305 NOUN types (39%) and 431 NOUN tokens (26%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: saari, arbaitus, vojennoi, virsta, päivy, tverinkarjalaine, festivuali, kieli, kodi, sarvi

The 10 most frequent NOUN types: vojennoit, saari, virstaa, bobuli-briha, briha, piduhuttu, saaraa, taatto, arbaituksii, festivualin

The 10 most frequent ambiguous lemmas: tverinkarjalaine (NOUN 2, ADJ 1), piduhus (NOUN 5, X 1), karjalaine (ADJ 4, NOUN 4), nuori (NOUN 4, ADJ 1), bohattu (NOUN 3, ADJ 2), Karjal (PROPN 6, NOUN 2), pučči (NOUN 2, X 1), piirde (ADJ 1, NOUN 1), semmoine (ADJ 3, NOUN 1)

The 10 most frequent ambiguous types: piduhuttu (NOUN 5, X 1), tverinkarjalazien (ADJ 1, NOUN 1), bohattu (NOUN 3, ADJ 2), puččii (NOUN 2, X 1), d’engaa (NOUN 1, X 1), piirdehii (ADJ 1, NOUN 1), semmostu (ADJ 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.343612 (the average of all parts of speech is 1.335034).

The 1st highest number of forms (7) was observed with the lemma “arbaitus”: Arbaituksen, arbaitukses, arbaitukset, arbaituksien, arbaituksii, arbaituksis, arbaitustu.

The 2nd highest number of forms (6) was observed with the lemma “kieli”: kieldy, kieleh, kielen, kieles, kieli, kielil.

The 3rd highest number of forms (6) was observed with the lemma “saari”: saaraa, saaran, saari, saaril, saarilloo, saaris.

NOUN occurs with 10 features: Case (429; 100% instances), Number (429; 100% instances), Mood (1; 0% instances), Number[psor] (1; 0% instances), Person (1; 0% instances), Person[psor] (1; 0% instances), Tense (1; 0% instances), Typo (1; 0% instances), VerbForm (1; 0% instances), Voice (1; 0% instances)

NOUN occurs with 25 feature-value pairs: Case=Abl, Case=Acc, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Par, Case=Ter, Case=Tra, Mood=Ind, Number=Plur, Number=Sing, Number[psor]=Sing, Person=3, Person[psor]=3, Tense=Pres, Typo=Yes, VerbForm=Fin, Voice=Act

NOUN occurs with 29 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (81 tokens). Examples: saari, briha, taatto, bobuli-briha, bohattu, häkki, kodi, paimoi, -liitto, akku

Relations

NOUN nodes are attached to their parents using 21 different relations: obl (120; 28% instances), nsubj (77; 18% instances), obj (76; 18% instances), nmod:poss (47; 11% instances), conj (35; 8% instances), nsubj:cop (20; 5% instances), root (12; 3% instances), orphan (9; 2% instances), nmod (8; 2% instances), parataxis (6; 1% instances), appos (4; 1% instances), advcl (3; 1% instances), acl (2; 0% instances), amod (2; 0% instances), compound (2; 0% instances), compound:nn (2; 0% instances), flat:name (2; 0% instances), acl:relcl (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances)

Parents of NOUN nodes belong to 9 different parts of speech: VERB (262; 61% instances), NOUN (119; 28% instances), ADJ (20; 5% instances), (12; 3% instances), PROPN (8; 2% instances), PRON (3; 1% instances), X (3; 1% instances), ADV (2; 0% instances), NUM (2; 0% instances)

173 (40%) NOUN nodes are leaves.

174 (40%) NOUN nodes have one child.

43 (10%) NOUN nodes have two children.

41 (10%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 7.

Children of NOUN nodes are attached using 29 different relations: nmod:poss (83; 19% instances), amod (74; 17% instances), punct (67; 15% instances), conj (41; 9% instances), cc (21; 5% instances), cop (18; 4% instances), nummod (17; 4% instances), nsubj:cop (15; 3% instances), case (13; 3% instances), det (12; 3% instances), obl (11; 3% instances), parataxis (10; 2% instances), acl:relcl (7; 2% instances), obj (7; 2% instances), nmod (6; 1% instances), appos (5; 1% instances), mark (5; 1% instances), orphan (5; 1% instances), advmod (4; 1% instances), compound:nn (3; 1% instances), goeswith (3; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), cop:own (1; 0% instances), csubj:cop (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (119; 27% instances), ADJ (76; 17% instances), PUNCT (67; 15% instances), PRON (32; 7% instances), PROPN (31; 7% instances), VERB (26; 6% instances), CCONJ (21; 5% instances), AUX (19; 4% instances), NUM (17; 4% instances), ADP (11; 3% instances), ADV (6; 1% instances), SCONJ (5; 1% instances), X (4; 1% instances), INTJ (1; 0% instances)