
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-TWITTIRO: POS Tags: X

There are 104 X lemmas (2%), 104 X types (2%) and 110 X tokens (0%). Out of 16 observed tags, the rank of X is: 8 in number of lemmas, 9 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: i, o, partes, super, zan, #labuonascuola, #tassadopotassare, 10cent, 13.mo, AAA

The 10 most frequent X types: e, i, o, partes, super, zan, #labuonascuola, #tassadopotassa, 10cent, 13.mo

The 10 most frequent ambiguous lemmas: i (X 3, DET 2), o (CCONJ 48, X 1), super (ADJ 3, X 2), #labuonascuola (SYM 347, NOUN 4, X 1), _ (PUNCT 3, X 1), andare (VERB 63, AUX 3, X 1), by (ADP 2, X 1), dare (VERB 33, X 1), design (NOUN 1, X 1), e (CCONJ 368, VERB 2, SYM 1, X 1)

The 10 most frequent ambiguous types: e (CCONJ 313, AUX 4, SYM 1, VERB 1, X 1), i (DET 298, X 1), o (CCONJ 45, X 1), super (ADJ 2, X 2), #labuonascuola (SYM 347, NOUN 4, X 1), No (INTJ 7, ADV 1, PROPN 1, X 1), by (ADP 2, X 1), design (NOUN 1, X 1), forma (NOUN 2, X 1), mal (NOUN 1, X 1)
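As a rough illustration of what "ambiguous" means here, the sketch below groups (form, tag) pairs by form and keeps the forms that occur with X and with at least one other tag. The function name `ambiguous_with_tag` is hypothetical and this is not the script that generated the page. The counts for "super" and "by" are taken from the list above. The single "zan" token is only an illustrative unambiguous entry, since its real count is not reported.

```python
from collections import Counter, defaultdict

def ambiguous_with_tag(pairs, tag="X"):
    """Map each form seen with `tag` and at least one other tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    return {form: tags for form, tags in by_form.items()
            if tag in tags and len(tags) > 1}

# Toy input: "super" (ADJ 2, X 2) and "by" (ADP 2, X 1) as reported above;
# "zan" appears once as an assumed X-only type.
pairs = (
    [("super", "ADJ")] * 2 + [("super", "X")] * 2
    + [("by", "ADP")] * 2 + [("by", "X")]
    + [("zan", "X")]
)
result = ambiguous_with_tag(pairs)
for form, tags in result.items():
    print(form, dict(tags))
```

Only "super" and "by" are reported, because "zan" carries a single tag in this toy input.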

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.274961).
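One plausible reading of this figure, assumed here and not confirmed by the page, is the number of distinct forms divided by the number of distinct lemmas. With the 104 X types and 104 X lemmas reported above this gives exactly 1.000000. The helper `form_lemma_ratio` is a hypothetical name; the sample pairs are the forms and lemmas listed in the Morphology section.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# (form, lemma) pairs mentioned on this page
sample = [
    ("i", "i"),
    ("moltissimi", "i"),
    ("#labuonascuola", "#labuonascuola"),
    ("#tassadopotassa", "#tassadopotassare"),
]
sample_ratio = form_lemma_ratio(sample)   # 4 forms over 3 lemmas

# Ratio from the aggregate counts reported for X: 104 types, 104 lemmas
page_ratio = 104 / 104
print(f"{page_ratio:.6f}")
```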

The 1st highest number of forms (2) was observed with the lemma “i”: i, moltissimi.

The 2nd highest number of forms (1) was observed with the lemma “#labuonascuola”: #labuonascuola.

The 3rd highest number of forms (1) was observed with the lemma “#tassadopotassare”: #tassadopotassa.

X occurs with 3 features: Foreign (1; 1% instances), Gender (1; 1% instances), Number (1; 1% instances)

X occurs with 3 feature-value pairs: Foreign=Yes, Gender=Masc, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is _ (108 tokens). Examples: e, i, o, partes, super, zan, #labuonascuola, #tassadopotassa, 10cent, 13.mo

Relations

X nodes are attached to their parents using 22 different relations: flat:foreign (29; 26% instances), parataxis (17; 15% instances), nmod (8; 7% instances), dep (7; 6% instances), discourse (6; 5% instances), root (6; 5% instances), conj (5; 5% instances), nsubj (5; 5% instances), obj (5; 5% instances), flat (4; 4% instances), appos (3; 3% instances), flat:name (3; 3% instances), compound (2; 2% instances), obl (2; 2% instances), advcl (1; 1% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), goeswith (1; 1% instances), nsubj:pass (1; 1% instances), parataxis:hashtag (1; 1% instances), parataxis:obj (1; 1% instances)
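The percentages above appear to be each relation's count divided by the 110 X tokens, rounded to the nearest whole percent. The following sketch checks that reading against the reported counts. The function name `percent_shares` and the round-half-up rule are assumptions, not taken from the generating script.

```python
def percent_shares(counts):
    """Whole-percent share of each key, rounding half up (assumed rounding rule)."""
    total = sum(counts.values())
    return {key: int(100 * value / total + 0.5) for key, value in counts.items()}

# Counts copied from the relation list above
relation_counts = {
    "flat:foreign": 29, "parataxis": 17, "nmod": 8, "dep": 7,
    "discourse": 6, "root": 6, "conj": 5, "nsubj": 5, "obj": 5,
    "flat": 4, "appos": 3, "flat:name": 3, "compound": 2, "obl": 2,
    "advcl": 1, "amod": 1, "cc": 1, "ccomp": 1, "goeswith": 1,
    "nsubj:pass": 1, "parataxis:hashtag": 1, "parataxis:obj": 1,
}
total_tokens = sum(relation_counts.values())
shares = percent_shares(relation_counts)
print(total_tokens, shares["flat:foreign"], shares["parataxis"])
```

The counts sum to 110, matching the number of X tokens, and the computed shares agree with the percentages listed.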

Parents of X nodes belong to 9 different parts of speech: VERB (33; 30% instances), X (28; 25% instances), NOUN (23; 21% instances), PROPN (8; 7% instances), [no tag; presumably the artificial root, matching the 6 root attachments above] (6; 5% instances), SYM (6; 5% instances), ADJ (3; 3% instances), ADV (2; 2% instances), INTJ (1; 1% instances)

55 (50%) X nodes are leaves.

16 (15%) X nodes have one child.

16 (15%) X nodes have two children.

23 (21%) X nodes have three or more children.

The highest child degree of an X node is 8.
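The four child-degree groups above (55 + 16 + 16 + 23) add up to the 110 X tokens. A minimal sketch of how such a distribution could be derived from CoNLL-U data follows. It uses a hand-rolled reader, not the tool that produced this page; the function name `child_degree_buckets` is hypothetical, and the three-token sentence is an invented toy example whose annotation is not taken from the treebank.

```python
from collections import Counter

def child_degree_buckets(conllu_text, upos="X"):
    """Count nodes tagged `upos` by number of children: 0, 1, 2, or 3 (= three or more)."""
    buckets = Counter()
    sentence = []  # (id, head, upos) for each regular token of the current sentence
    for line in conllu_text.splitlines() + [""]:
        line = line.strip()
        if line.startswith("#"):          # sentence-level comment line
            continue
        if not line:                      # blank line closes a sentence
            children = Counter(head for _, head, _ in sentence)
            for tok_id, _, tag in sentence:
                if tag == upos:
                    buckets[min(children[tok_id], 3)] += 1
            sentence = []
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():         # skip multiword ranges (1-2) and empty nodes (1.1)
            continue
        sentence.append((cols[0], cols[6], cols[3]))
    return buckets

# Toy sentence with assumed annotation: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
rows = [
    ("1", "super", "super", "X", "_", "_", "0", "root", "_", "_"),
    ("2", "partes", "partes", "X", "_", "_", "1", "flat:foreign", "_", "_"),
    ("3", "!", "!", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows) + "\n"
buckets = child_degree_buckets(sample)
print(dict(buckets))
```

In the toy sentence, "super" has two children and "partes" is a leaf, so one X node lands in the two-children group and one in the leaf group.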

Children of X nodes are attached using 28 different relations: punct (35; 23% instances), flat:foreign (27; 18% instances), det (12; 8% instances), parataxis (8; 5% instances), nmod (7; 5% instances), advmod (6; 4% instances), case (6; 4% instances), nsubj (6; 4% instances), cop (5; 3% instances), cc (4; 3% instances), conj (4; 3% instances), discourse (4; 3% instances), obl (4; 3% instances), obj (3; 2% instances), vocative:mention (3; 2% instances), amod (2; 1% instances), aux (2; 1% instances), dep (2; 1% instances), mark (2; 1% instances), nummod (2; 1% instances), advcl (1; 1% instances), det:poss (1; 1% instances), dislocated (1; 1% instances), expl (1; 1% instances), flat (1; 1% instances), flat:name (1; 1% instances), iobj (1; 1% instances), parataxis:hashtag (1; 1% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (35; 23% instances), X (28; 18% instances), DET (13; 9% instances), PRON (9; 6% instances), PROPN (9; 6% instances), SYM (9; 6% instances), VERB (9; 6% instances), ADP (7; 5% instances), AUX (7; 5% instances), NOUN (7; 5% instances), ADV (6; 4% instances), CCONJ (5; 3% instances), ADJ (3; 2% instances), INTJ (2; 1% instances), NUM (2; 1% instances), SCONJ (1; 1% instances)