Treebank Statistics: UD_Finnish-OOD: POS Tags: X
There are 63 X lemmas (1%), 65 X types (1%) and 89 X tokens (0%).
Out of 15 observed tags, the rank of X is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.
The 10 most frequent X lemmas: LIST, All, Inclusive, _, author, baimbai, quote, time, #cmoref1, #nature
The 10 most frequent X types: LIST, All, Inclusive, author, baimbai, quote, time, #cmoref1, #nature, Nix
The 10 most frequent ambiguous lemmas: rausim (VERB 1, X 1)
The 10 most frequent ambiguous types: on (AUX 277, VERB 1, X 1)
Morphology
The form / lemma ratio of X is 1.031746 (the average of all parts of speech is 1.566190).
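The reported ratio is consistent with dividing the number of X types by the number of X lemmas given at the top of this page. The sketch below reproduces it under that assumption. The variable names are illustrative and are not taken from the tool that generated these statistics.

```python
# Counts copied from the summary above: 65 distinct X types, 63 distinct X lemmas.
x_types = 65
x_lemmas = 63

# Assumption: "form / lemma ratio" means distinct word forms per distinct lemma.
form_lemma_ratio = x_types / x_lemmas

print(f"{form_lemma_ratio:.6f}")  # 1.031746
```

A ratio this close to 1 means that almost every X lemma occurs with a single word form, which fits a tag used mostly for uninflected foreign material.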
The 1st highest number of forms (3) was observed with the lemma “_”: pap, stä, ’n.
The 2nd highest number of forms (1) was observed with the lemma “#CmoreF1”: #CmoreF1.
The 3rd highest number of forms (1) was observed with the lemma “#ESLOneCologne”: #ESLOneCologne.
X occurs with 3 features: Foreign (67; 75% instances), Case (4; 4% instances), Number (4; 4% instances)
X occurs with 3 feature-value pairs: Case=Nom, Foreign=Yes, Number=Sing
X occurs with 3 feature combinations.
The most frequent feature combination is Foreign=Yes (67 tokens).
Examples: LIST, All, Inclusive, author, baimbai, quote, time, #nature, Nix, pekato
Relations
X nodes are attached to their parents using 12 different relations: flat:foreign (28; 31% instances), discourse (24; 27% instances), root (19; 21% instances), compound:nn (5; 6% instances), goeswith (3; 3% instances), nsubj (2; 2% instances), obl (2; 2% instances), parataxis (2; 2% instances), appos (1; 1% instances), conj (1; 1% instances), dep (1; 1% instances), flat:name (1; 1% instances)
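As a sanity check, the relation counts above should add up to the 89 X tokens, and each percentage should be the count divided by that total. The sketch below assumes the percentages are rounded to the nearest whole number. The dictionary is simply a transcription of the list above.

```python
# Incoming relation counts for X nodes, transcribed from the list above.
relation_counts = {
    "flat:foreign": 28, "discourse": 24, "root": 19, "compound:nn": 5,
    "goeswith": 3, "nsubj": 2, "obl": 2, "parataxis": 2,
    "appos": 1, "conj": 1, "dep": 1, "flat:name": 1,
}

total = sum(relation_counts.values())

# Assumption: shares are percentages of all X tokens, rounded to integers.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                   # 89
print(shares["flat:foreign"])  # 31
print(shares["discourse"])     # 27
```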
Parents of X nodes belong to 9 different parts of speech: X (29; 33% instances), no tag, i.e. the artificial root (19; 21% instances), NOUN (15; 17% instances), VERB (13; 15% instances), PROPN (8; 9% instances), NUM (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), PRON (1; 1% instances)
68 (76%) X nodes are leaves.
9 (10%) X nodes have one child.
4 (4%) X nodes have two children.
8 (9%) X nodes have three or more children.
The highest child degree of an X node is 12.
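The four degree classes above add up to 68 + 9 + 4 + 8 = 89, the number of X tokens. The sketch below shows one way such classes could be derived from the head indices of a sentence. The function name and the five-token head list are made up for illustration and do not come from the treebank. Unlike the figures above, it counts every node; restricting it to X nodes would need an extra filter on the tag.

```python
from collections import Counter

def degree_buckets(heads):
    """Classify nodes by number of children.

    heads[i] is the 1-based index of the head of token i+1; 0 marks the root.
    Returns the class counts and the highest child degree.
    """
    # Number of children per node: how often each index is named as a head.
    child_counts = Counter(h for h in heads if h != 0)
    buckets = Counter()
    for node in range(1, len(heads) + 1):
        n = child_counts[node]
        if n == 0:
            buckets["leaf"] += 1
        elif n == 1:
            buckets["one child"] += 1
        elif n == 2:
            buckets["two children"] += 1
        else:
            buckets["three or more"] += 1
    return buckets, max(child_counts.values(), default=0)

# Hypothetical toy sentence: token 2 is the root and governs tokens 1, 3 and 5;
# token 5 governs token 4.
toy_heads = [2, 0, 2, 5, 2]
buckets, highest = degree_buckets(toy_heads)
print(dict(buckets), highest)
```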
Children of X nodes are attached using 12 different relations: flat:foreign (33; 41% instances), punct (28; 35% instances), obl (6; 7% instances), discourse (4; 5% instances), cop (2; 2% instances), nsubj:cop (2; 2% instances), advmod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), flat:name (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)
Children of X nodes belong to 11 different parts of speech: X (29; 36% instances), PUNCT (28; 35% instances), NOUN (6; 7% instances), NUM (6; 7% instances), SYM (4; 5% instances), AUX (2; 2% instances), INTJ (2; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)