home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-OOD: POS Tags: X

There are 64 X lemmas (1%), 66 X types (1%) and 90 X tokens (0%). Out of 15 observed tags, the rank of X is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: LIST, All, Inclusive, _, author, baimbai, quote, time, #cmoref1, #nature

The 10 most frequent X types: LIST, All, Inclusive, author, baimbai, quote, time, #cmoref1, #nature, Nix

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: on (AUX 277, VERB 1, X 1)

Morphology

The form / lemma ratio of X is 1.031250 (the average of all parts of speech is 1.565977).

The 1st highest number of forms (3) was observed with the lemma “_”: pap, stä, ’n.

The 2nd highest number of forms (1) was observed with the lemma “#CmoreF1”: #CmoreF1.

The 3rd highest number of forms (1) was observed with the lemma “#ESLOneCologne”: #ESLOneCologne.

X occurs with 3 features: Foreign (70; 78% instances), Case (1; 1% instances), Number (1; 1% instances)

X occurs with 3 feature-value pairs: Case=Nom, Foreign=Yes, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (70 tokens). Examples: LIST, All, Inclusive, author, baimbai, quote, time, #nature, Nix, pekato

Relations

X nodes are attached to their parents using 11 different relations: flat:foreign (29; 32% instances), discourse (24; 27% instances), root (19; 21% instances), compound (5; 6% instances), goeswith (3; 3% instances), obl (3; 3% instances), nsubj (2; 2% instances), parataxis (2; 2% instances), appos (1; 1% instances), conj (1; 1% instances), dep (1; 1% instances)

Parents of X nodes belong to 9 different parts of speech: X (30; 33% instances), (19; 21% instances), NOUN (15; 17% instances), VERB (13; 14% instances), PROPN (8; 9% instances), NUM (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), PRON (1; 1% instances)

69 (77%) X nodes are leaves.

8 (9%) X nodes have one child.

5 (6%) X nodes have two children.

8 (9%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 13 different relations: flat:foreign (29; 35% instances), punct (28; 34% instances), obl (9; 11% instances), discourse (4; 5% instances), cop (2; 2% instances), nsubj:cop (2; 2% instances), nummod (2; 2% instances), advmod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), nmod (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: X (30; 37% instances), PUNCT (28; 34% instances), NOUN (6; 7% instances), NUM (6; 7% instances), SYM (4; 5% instances), AUX (2; 2% instances), INTJ (2; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)