Treebank Statistics: UD_Finnish-TDT: POS Tags: X
There are 227 X lemmas (1%), 237 X types (0%) and 299 X tokens (0%).
Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.
The 10 most frequent X lemmas: _, metal, common, death, a, be, fun, it, pic, DIY
The 10 most frequent X types: n, metal, common, death, a, be, fun, it, pic, DIY
The 10 most frequent ambiguous lemmas: metal (X 8, NOUN 1), a (NOUN 33, X 4, PROPN 1), I (ADJ 37, PROPN 6, X 2), and (PROPN 10, X 2, CCONJ 1), in (PROPN 6, X 2), stop (X 2, NOUN 1), you (X 2, PROPN 1), Diàoyútái (PROPN 1, X 1), Do (PROPN 5, X 1), Don’t (PROPN 1, X 1)
The 10 most frequent ambiguous types: a (NOUN 33, X 4, PROPN 1), I (ADJ 29, PROPN 4, X 2), and (PROPN 10, X 2, CCONJ 1), in (PROPN 5, X 2), you (X 2, PROPN 1), Do (PROPN 5, X 1), Don’t (PROPN 1, X 1), Finnish (PROPN 2, X 1), Life (PROPN 1, X 1), On (AUX 88, VERB 28, PROPN 5, X 1)
- a
- I
- and
- PROPN 10: Wilsonin sooloalbumi Love and Youth julkaistiin Ruotsissa 2005 .
- X 2: Illan pääesiintyjä oli jouluna paluunsa ilmoittanut death and roll -yhtye Deuteronomium .
- CCONJ 1: Ensimmäiset koodirivit kirjoitettiin 1983 Commodore 64 tietokoneelle suomalaisten veljesten Juha and Vesa Meskanen toimesta .
- in
- you
- Do
- Don’t
- Finnish
- Life
- On
Morphology
The form / lemma ratio of X is 1.044053 (the average of all parts of speech is 2.067894).
The 1st highest number of forms (10) was observed with the lemma “_”: 135/02, cen, hon, htiön, iin, lla, lle, n, ovat, ´.
The 2nd highest number of forms (1) was observed with the lemma “#hashtag”: #hashtag.
The 3rd highest number of forms (1) was observed with the lemma “Ağrı”: Ağrı.
X occurs with 1 features: Foreign (267; 89% instances)
X occurs with 1 feature-value pairs: Foreign=Yes
X occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes (267 tokens).
Examples: metal, common, death, a, be, fun, it, pic, DIY, I
Relations
X nodes are attached to their parents using 14 different relations: flat:foreign (113; 38% instances), compound (54; 18% instances), appos (53; 18% instances), goeswith (23; 8% instances), conj (12; 4% instances), flat:name (11; 4% instances), root (11; 4% instances), discourse (10; 3% instances), nmod (6; 2% instances), parataxis (2; 1% instances), cc (1; 0% instances), csubj:cop (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances)
Parents of X nodes belong to 7 different parts of speech: X (134; 45% instances), NOUN (97; 32% instances), PROPN (36; 12% instances), VERB (13; 4% instances), (11; 4% instances), ADJ (6; 2% instances), NUM (2; 1% instances)
191 (64%) X nodes are leaves.
28 (9%) X nodes have one child.
23 (8%) X nodes have two children.
57 (19%) X nodes have three or more children.
The highest child degree of a X node is 15.
Children of X nodes are attached using 19 different relations: punct (127; 39% instances), flat:foreign (113; 35% instances), conj (22; 7% instances), nmod (21; 7% instances), appos (10; 3% instances), cc (6; 2% instances), flat:name (6; 2% instances), amod (3; 1% instances), acl:relcl (2; 1% instances), orphan (2; 1% instances), parataxis (2; 1% instances), advmod (1; 0% instances), cc:preconj (1; 0% instances), compound (1; 0% instances), compound:nn (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), nmod:poss (1; 0% instances), nsubj (1; 0% instances)
Children of X nodes belong to 10 different parts of speech: X (134; 42% instances), PUNCT (127; 39% instances), NOUN (34; 11% instances), CCONJ (7; 2% instances), PROPN (6; 2% instances), VERB (5; 2% instances), SYM (3; 1% instances), ADJ (2; 1% instances), ADV (2; 1% instances), NUM (2; 1% instances)