Treebank Statistics: UD_Italian-TWITTIRO: POS Tags: X
There are 104 X
lemmas (2%), 104 X
types (2%) and 110 X
tokens (0%).
Out of 16 observed tags, the rank of X
is: 8 in number of lemmas, 9 in number of types and 16 in number of tokens.
The 10 most frequent X
lemmas: i, o, partes, super, zan, #labuonascuola, #tassadopotassare, 10cent, 13.mo, AAA
The 10 most frequent X
types: e, i, o, partes, super, zan, #labuonascuola, #tassadopotassa, 10cent, 13.mo
The 10 most frequent ambiguous lemmas: i (X 3, DET 2), o (CCONJ 48, X 1), super (ADJ 3, X 2), #labuonascuola (SYM 347, NOUN 4, X 1), _ (PUNCT 3, X 1), andare (VERB 63, AUX 3, X 1), by (ADP 2, X 1), dare (VERB 33, X 1), design (NOUN 1, X 1), e (CCONJ 368, VERB 2, SYM 1, X 1)
The 10 most frequent ambiguous types: e (CCONJ 313, AUX 4, SYM 1, VERB 1, X 1), i (DET 298, X 1), o (CCONJ 45, X 1), super (ADJ 2, X 2), #labuonascuola (SYM 347, NOUN 4, X 1), No (INTJ 7, ADV 1, PROPN 1, X 1), by (ADP 2, X 1), design (NOUN 1, X 1), forma (NOUN 2, X 1), mal (NOUN 1, X 1)
- e
- CCONJ 313: Vado in bagno . Evacuo il PDL e il Governo Monti che è in me .
- AUX 4: “ Vincere non e importante ma e l’ unica cosa che conta “ - Gianpiero Boniperti
- SYM 1: di i privati che finanziano le scuole statali non frega niente a nessuno / a ? tutti / e a ripetere a pappagallo merito , merito … #labuonascuola
- VERB 1: Ma tra tutti i cattolici di il Governo Monti non c’ e nessuno che possa consigliare il Papa di cacciare don Verze’ da la chiesa ?
- X 1: Governo Monti , Università , Natale … ij k mal e cap…
- i
- o
- super
- #labuonascuola
- No
- INTJ 7: NOI TIREREMO DIRITTO .. chi lo ha detto ? Mussolini ? No . Il governo Monti .
- ADV 1: Salvini a l’ Europarlamento con la maglietta “ No Euro “ . Glie lo scriverei in la prossima busta paga . [ @user ]
- PROPN 1: Unioni civili , Pd diviso tra il “ No “ e il “ Sia chiaro che ho un sacco di amici gay “ . [ guli1979 ]
- X 1: Roma sarà No fly zone per tutto il Giubileo . Se poi arriva Gesù si attacca a il cazzo . [ CONTINUA su https://t.co/oDPUtx2DvV ]
- by
- design
- forma
- mal
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.274961).
The 1st highest number of forms (2) was observed with the lemma “i”: i, moltissimi.
The 2nd highest number of forms (1) was observed with the lemma “#labuonascuola”: #labuonascuola.
The 3rd highest number of forms (1) was observed with the lemma “#tassadopotassare”: #tassadopotassa.
X
occurs with 3 features: Foreign (1; 1% instances), Gender (1; 1% instances), Number (1; 1% instances)
X
occurs with 3 feature-value pairs: Foreign=Yes
, Gender=Masc
, Number=Sing
X
occurs with 3 feature combinations.
The most frequent feature combination is _
(108 tokens).
Examples: e, i, o, partes, super, zan, #labuonascuola, #tassadopotassa, 10cent, 13.mo
Relations
X
nodes are attached to their parents using 22 different relations: flat:foreign (29; 26% instances), parataxis (17; 15% instances), nmod (8; 7% instances), dep (7; 6% instances), discourse (6; 5% instances), root (6; 5% instances), conj (5; 5% instances), nsubj (5; 5% instances), obj (5; 5% instances), flat (4; 4% instances), appos (3; 3% instances), flat:name (3; 3% instances), compound (2; 2% instances), obl (2; 2% instances), advcl (1; 1% instances), amod (1; 1% instances), cc (1; 1% instances), ccomp (1; 1% instances), goeswith (1; 1% instances), nsubj:pass (1; 1% instances), parataxis:hashtag (1; 1% instances), parataxis:obj (1; 1% instances)
Parents of X
nodes belong to 9 different parts of speech: VERB (33; 30% instances), X (28; 25% instances), NOUN (23; 21% instances), PROPN (8; 7% instances), (6; 5% instances), SYM (6; 5% instances), ADJ (3; 3% instances), ADV (2; 2% instances), INTJ (1; 1% instances)
55 (50%) X
nodes are leaves.
16 (15%) X
nodes have one child.
16 (15%) X
nodes have two children.
23 (21%) X
nodes have three or more children.
The highest child degree of a X
node is 8.
Children of X
nodes are attached using 28 different relations: punct (35; 23% instances), flat:foreign (27; 18% instances), det (12; 8% instances), parataxis (8; 5% instances), nmod (7; 5% instances), advmod (6; 4% instances), case (6; 4% instances), nsubj (6; 4% instances), cop (5; 3% instances), cc (4; 3% instances), conj (4; 3% instances), discourse (4; 3% instances), obl (4; 3% instances), obj (3; 2% instances), vocative:mention (3; 2% instances), amod (2; 1% instances), aux (2; 1% instances), dep (2; 1% instances), mark (2; 1% instances), nummod (2; 1% instances), advcl (1; 1% instances), det:poss (1; 1% instances), dislocated (1; 1% instances), expl (1; 1% instances), flat (1; 1% instances), flat:name (1; 1% instances), iobj (1; 1% instances), parataxis:hashtag (1; 1% instances)
Children of X
nodes belong to 16 different parts of speech: PUNCT (35; 23% instances), X (28; 18% instances), DET (13; 9% instances), PRON (9; 6% instances), PROPN (9; 6% instances), SYM (9; 6% instances), VERB (9; 6% instances), ADP (7; 5% instances), AUX (7; 5% instances), NOUN (7; 5% instances), ADV (6; 4% instances), CCONJ (5; 3% instances), ADJ (3; 2% instances), INTJ (2; 1% instances), NUM (2; 1% instances), SCONJ (1; 1% instances)