Treebank Statistics: UD_Finnish-OOD: POS Tags: X
There are 64 X
lemmas (1%), 66 X
types (1%) and 90 X
tokens (0%).
Out of 15 observed tags, the rank of X
is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.
The 10 most frequent X
lemmas: LIST, All, Inclusive, _, author, baimbai, quote, time, #cmoref1, #nature
The 10 most frequent X
types: LIST, All, Inclusive, author, baimbai, quote, time, #cmoref1, #nature, Nix
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: on (AUX 277, VERB 1, X 1)
- on
Morphology
The form / lemma ratio of X
is 1.031250 (the average of all parts of speech is 1.565977).
The 1st highest number of forms (3) was observed with the lemma “_”: pap, stä, ’n.
The 2nd highest number of forms (1) was observed with the lemma “#CmoreF1”: #CmoreF1.
The 3rd highest number of forms (1) was observed with the lemma “#ESLOneCologne”: #ESLOneCologne.
X
occurs with 3 features: Foreign (70; 78% instances), Case (1; 1% instances), Number (1; 1% instances)
X
occurs with 3 feature-value pairs: Case=Nom
, Foreign=Yes
, Number=Sing
X
occurs with 3 feature combinations.
The most frequent feature combination is Foreign=Yes
(70 tokens).
Examples: LIST, All, Inclusive, author, baimbai, quote, time, #nature, Nix, pekato
Relations
X
nodes are attached to their parents using 11 different relations: flat:foreign (29; 32% instances), discourse (24; 27% instances), root (19; 21% instances), compound (5; 6% instances), goeswith (3; 3% instances), obl (3; 3% instances), nsubj (2; 2% instances), parataxis (2; 2% instances), appos (1; 1% instances), conj (1; 1% instances), dep (1; 1% instances)
Parents of X
nodes belong to 9 different parts of speech: X (30; 33% instances), (19; 21% instances), NOUN (15; 17% instances), VERB (13; 14% instances), PROPN (8; 9% instances), NUM (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), PRON (1; 1% instances)
69 (77%) X
nodes are leaves.
8 (9%) X
nodes have one child.
5 (6%) X
nodes have two children.
8 (9%) X
nodes have three or more children.
The highest child degree of a X
node is 12.
Children of X
nodes are attached using 13 different relations: flat:foreign (29; 35% instances), punct (28; 34% instances), obl (9; 11% instances), discourse (4; 5% instances), cop (2; 2% instances), nsubj:cop (2; 2% instances), nummod (2; 2% instances), advmod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), nmod (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)
Children of X
nodes belong to 11 different parts of speech: X (30; 37% instances), PUNCT (28; 34% instances), NOUN (6; 7% instances), NUM (6; 7% instances), SYM (4; 5% instances), AUX (2; 2% instances), INTJ (2; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)