
This page pertains to UD version 2.

Treebank Statistics: UD_Maltese: POS Tags: X

There is 1 X lemma (6%), and there are 8 X types (1%) and 11 X tokens (0%). Out of 16 observed tags, X ranks 16th in number of lemmas, 12th in number of types and 15th in number of tokens.
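Counts of this kind can be derived from a CoNLL-U file by collecting the distinct lemmas, distinct word forms and total tokens of every row whose UPOS column is X. The following is a minimal sketch, not the script that generated this page. The sample rows and the function name `count_upos` are hypothetical, and treating types as case-sensitive word forms is an assumption.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment (10 tab-separated columns per token line).
# It is NOT taken from UD_Maltese; it only illustrates the counting.
SAMPLE = "\n".join([
    "# text = motor neuron disease",
    "1\tmotor\t_\tX\t_\tForeign=Yes\t0\troot\t_\t_",
    "2\tneuron\t_\tX\t_\tForeign=Yes\t1\tflat\t_\t_",
    "3\tdisease\t_\tX\t_\tForeign=Yes\t1\tflat\t_\t_",
    "",
    "1\tmotor\t_\tX\t_\tForeign=Yes\t0\troot\t_\t_",
    "",
])


def count_upos(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    lemmas, types = set(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types[form] += 1
    return len(lemmas), len(types), sum(types.values())


n_lemmas, n_types, n_tokens = count_upos(SAMPLE, "X")
print(n_lemmas, n_types, n_tokens)
```

In the sample, every lemma is the placeholder `_`, so the lemma count collapses to 1 regardless of how many forms occur, which mirrors the single X lemma reported here.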

The 10 most frequent X lemmas: _

The 10 most frequent X types: disease, motor, neuron, @, AĊĊESS, frhospice, mushrooms, onvol

The 10 most frequent ambiguous lemmas: _ (NOUN 549, ADP 355, DET 266, PUNCT 262, VERB 239, ADJ 171, CCONJ 106, PROPN 103, SCONJ 93, ADV 74, NUM 42, PART 25, PRON 19, AUX 16, X 11, INTJ 1)

The 10 most frequent ambiguous types: @ (NOUN 1, X 1)

Morphology

The form / lemma ratio of X is 8.000000 (the average of all parts of speech is 58.062500).
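The ratio above is the number of distinct forms divided by the number of distinct lemmas for the tag. As a worked check using only the forms listed on this page (the variable names are illustrative, and the six-decimal formatting simply mimics the page):

```python
# Forms listed on this page for the single X lemma "_".
forms_by_lemma = {
    "_": {"@", "AĊĊESS", "disease", "frhospice",
          "motor", "mushrooms", "neuron", "onvol"},
}

# Distinct forms across all lemmas of the tag, and distinct lemmas.
n_forms = len(set().union(*forms_by_lemma.values()))
n_lemmas = len(forms_by_lemma)

ratio = n_forms / n_lemmas
print(f"{ratio:.6f}")  # 8.000000
```

Because the only lemma is the placeholder `_`, the ratio equals the number of X types.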

The highest number of forms (8) was observed with the lemma “_”: @, AĊĊESS, disease, frhospice, motor, mushrooms, neuron, onvol.

X occurs with 1 feature: Foreign (9; 82% of instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (9 tokens). Examples: disease, motor, neuron, @, frhospice, onvol

Relations

X nodes are attached to their parents using 5 different relations: flat (4; 36% of instances), nmod (3; 27% of instances), compound (2; 18% of instances), appos (1; 9% of instances), obl (1; 9% of instances)
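Each percentage in the relation breakdown is the relation's count divided by the 11 X tokens. A short sketch of that arithmetic follows; rounding to the nearest integer is an assumption that happens to reproduce the reported figures.

```python
from collections import Counter

# Counts as reported on this page: relation -> number of X nodes.
relations = Counter({"flat": 4, "nmod": 3, "compound": 2, "appos": 1, "obl": 1})
total = sum(relations.values())  # 11 X tokens

for rel, n in relations.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% of instances)")
```

Since each share is rounded independently, the percentages need not sum to exactly 100.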

Parents of X nodes belong to 3 different parts of speech: X (6; 55% of instances), NOUN (4; 36% of instances), VERB (1; 9% of instances)

7 (64%) X nodes are leaves.

0 (0%) X nodes have one child.

1 (9%) X node has two children.

3 (27%) X nodes have three or more children.

The highest child degree of an X node is 5.
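The leaf and child-degree figures above can be obtained by counting, for each X node, how many tokens name it as their head. The sketch below uses a hypothetical five-token sentence, not data from UD_Maltese, and the bucket labels are illustrative.

```python
from collections import Counter

# Hypothetical sentence: (id, upos, head) triples, head 0 = root.
# Not taken from UD_Maltese.
tokens = [
    (1, "X", 0),
    (2, "X", 1),
    (3, "X", 1),
    (4, "PUNCT", 1),
    (5, "NOUN", 3),
]

# Number of children per token id (tokens attached to the root are skipped).
children = Counter(head for _, _, head in tokens if head != 0)

# Child degree of every X node; a Counter returns 0 for leaves.
degrees = [children[tok_id] for tok_id, upos, _ in tokens if upos == "X"]

# Group degrees the way this page does: 0, 1, 2, and three or more.
buckets = Counter("3+" if d >= 3 else str(d) for d in degrees)
print(dict(buckets), "max degree:", max(degrees))
```

For a whole treebank, the same counting would be done sentence by sentence, since token ids restart in every sentence.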

Children of X nodes are attached using 4 different relations: flat (4; 31% of instances), case (3; 23% of instances), compound (3; 23% of instances), punct (3; 23% of instances)

Children of X nodes belong to 4 different parts of speech: X (6; 46% of instances), ADP (3; 23% of instances), PUNCT (3; 23% of instances), NOUN (1; 8% of instances)