home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-FTB: POS Tags: X

There are 270 X lemmas (1%), 269 X types (1%) and 304 X tokens (0%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 11 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: 70-, in, sosiaali-, the, ala-, kauppa-, keng-, maa-, 50-, aquis

The 10 most frequent X types: 70-, in, sosiaali-, the, Kauppa-, ala-, keng-, maa-, 50-, Lilla

The 10 most frequent ambiguous lemmas: the (X 4, PROPN 1), out (NOUN 2, X 2), home (NOUN 3, X 1), is (PROPN 2, X 1), made (NOUN 1, X 1), me (PRON 483, DET 74, X 1), new (PROPN 8, X 1), partners (PROPN 1, X 1), queen (PROPN 1, X 1), ride (PROPN 1, X 1)

The 10 most frequent ambiguous types: out (NOUN 1, X 1), New (PROPN 8, X 1), Ride (PROPN 1, X 1), m- (PRON 1, X 1), me (PRON 123, VERB 1, X 1), se- (PRON 1, X 1), termi (NOUN 2, X 1)

Morphology

The form / lemma ratio of X is 0.996296 (the average of all parts of speech is 2.048675).

The 1st highest number of forms (1) was observed with the lemma “10-”: 10-.

The 2nd highest number of forms (1) was observed with the lemma “100-”: 100-.

The 3rd highest number of forms (1) was observed with the lemma “150-”: 150-.

X occurs with 1 features: Foreign (129; 42% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (175 tokens). Examples: 70-, sosiaali-, Kauppa-, ala-, keng-, maa-, 50-, Vesi-, e-, kieli-

Relations

X nodes are attached to their parents using 22 different relations: nmod (67; 22% instances), amod (37; 12% instances), conj (35; 12% instances), root (34; 11% instances), nsubj (31; 10% instances), dep (24; 8% instances), obj (17; 6% instances), reparandum (17; 6% instances), flat (9; 3% instances), advmod (7; 2% instances), compound:nn (5; 2% instances), nsubj:cop (5; 2% instances), ccomp (4; 1% instances), advcl (3; 1% instances), case (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), cc (1; 0% instances), compound:prt (1; 0% instances), csubj:cop (1; 0% instances), nmod:gobj (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: VERB (83; 27% instances), X (83; 27% instances), NOUN (75; 25% instances), (34; 11% instances), PROPN (18; 6% instances), ADJ (7; 2% instances), PRON (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances)

111 (37%) X nodes are leaves.

142 (47%) X nodes have one child.

26 (9%) X nodes have two children.

25 (8%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 21 different relations: conj (144; 49% instances), punct (34; 12% instances), amod (27; 9% instances), nmod (19; 6% instances), advmod (10; 3% instances), dep (8; 3% instances), flat (8; 3% instances), cop (7; 2% instances), acl (6; 2% instances), nsubj:cop (6; 2% instances), cc (5; 2% instances), nsubj (5; 2% instances), aux (4; 1% instances), case (3; 1% instances), vocative (2; 1% instances), compound:nn (1; 0% instances), compound:prt (1; 0% instances), csubj:cop (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: NOUN (104; 35% instances), X (83; 28% instances), PUNCT (34; 12% instances), ADJ (18; 6% instances), VERB (17; 6% instances), PROPN (13; 4% instances), AUX (7; 2% instances), ADV (5; 2% instances), PRON (5; 2% instances), CCONJ (4; 1% instances), ADP (1; 0% instances), DET (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)