Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: SYM
There are 3916 SYM
lemmas (27%), 3916 SYM
types (21%) and 11625 SYM
tokens (9%).
Out of 16 observed tags, the rank of SYM
is: 1 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent SYM
lemmas: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita
The 10 most frequent SYM
types: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita
The 10 most frequent ambiguous lemmas: + (SYM 45, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), � (SYM 3, X 2), C (PROPN 2, SYM 2), Università_it (SYM 2, PROPN 1), a (ADP 2901, PROPN 17, X 9, INTJ 3, SYM 1), x9 (SYM 2, NUM 1)
The 10 most frequent ambiguous types: + (SYM 45, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (ADP 60, SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), � (SYM 3, X 2), A (ADP 216, PROPN 5, SYM 2, INTJ 1), C (PROPN 2, SYM 2, PRON 1), Università_it (SYM 2, PROPN 1), x9 (SYM 2, NUM 1)
- +
- &
- >
- x
- #D’Alema
- SYM 3: #D’Alema dice che #Grillo è un misto tra #Bossi e il #Gabibbo . Se ci va anche solo vicino a il giorno d’ oggi potrebbe voler dire un bel 20 %
- PROPN 1: E devo stare davvero male , perché sono d’ accordo anche con Massimo #D’Alema in il giudizio negativissimo su Beppe #Grillo http://t.co/Y3IWAMJQ
- �
- A
- ADP 216: A le 4 devo cominciare a studiare T__T E devo anche fare la spesa D:
- PROPN 5: http://t.co/iOq8T2QS PORTA A PORTA , OSPITE MARIO MONTI
- SYM 2: Fiorentina che Passione • Re : [ Serie A ] Roma - Fiorentina : cioè ne prendiamo 2 ???? non facciamo proc… http://t.co/LF8x9kb2 #fiorentina
- INTJ 1: A Marò .. Stime di - 2,2 per colpa di la manovra di il governo Monti ?! E daje su , finite la di dire cazzate … #Ballarò
- C
- PROPN 2: RT @user : A soli 0 euro a il mese Twitter ti offre : - Serie A , B , C - Champions League - Meteo - Programmi televisivi - Commenti in dirett…
- SYM 2: @user Anche in le 4000 #Parafarmacie E il delitti #farmaci C senza ricetta ? Intanto #federfarma fa’ lobby su #balduzzi . Man !
- PRON 1: @user va bene di carta di giornale ?? C e crisi
- Università_it
- x9
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.304759).
The 1st highest number of forms (2) was observed with the lemma “*”: , **.
The 2nd highest number of forms (1) was observed with the lemma “#”: #.
The 3rd highest number of forms (1) was observed with the lemma “#1”: #1.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 37 different relations: vocative:mention (2578; 22% instances), dep (2442; 21% instances), parataxis:hashtag (2248; 19% instances), nmod (1174; 10% instances), nsubj (822; 7% instances), discourse:emo (727; 6% instances), obl (406; 3% instances), obj (234; 2% instances), flat:name (209; 2% instances), conj (178; 2% instances), root (129; 1% instances), list (62; 1% instances), appos (50; 0% instances), flat:foreign (45; 0% instances), vocative (43; 0% instances), flat (42; 0% instances), parataxis (37; 0% instances), amod (31; 0% instances), nsubj:pass (22; 0% instances), dislocated (20; 0% instances), xcomp (17; 0% instances), compound (16; 0% instances), obl:agent (14; 0% instances), cc (11; 0% instances), advcl (8; 0% instances), mark (8; 0% instances), parataxis:appos (8; 0% instances), ccomp (7; 0% instances), fixed (7; 0% instances), advmod (6; 0% instances), case (6; 0% instances), orphan (5; 0% instances), parataxis:obj (4; 0% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), parataxis:nsubj (2; 0% instances), csubj (1; 0% instances)
Parents of SYM
nodes belong to 15 different parts of speech: VERB (5224; 45% instances), NOUN (3270; 28% instances), SYM (772; 7% instances), PROPN (764; 7% instances), ADJ (579; 5% instances), INTJ (333; 3% instances), PRON (238; 2% instances), ADV (173; 1% instances), (129; 1% instances), X (94; 1% instances), NUM (34; 0% instances), AUX (6; 0% instances), DET (6; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)
8633 (74%) SYM
nodes are leaves.
1878 (16%) SYM
nodes have one child.
603 (5%) SYM
nodes have two children.
511 (4%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 13.
Children of SYM
nodes are attached using 40 different relations: case (1187; 23% instances), punct (1089; 21% instances), det (718; 14% instances), nmod (336; 6% instances), conj (214; 4% instances), list (197; 4% instances), nummod (187; 4% instances), amod (130; 3% instances), cc (117; 2% instances), flat:name (116; 2% instances), advmod (99; 2% instances), dep (95; 2% instances), parataxis (90; 2% instances), obl (86; 2% instances), parataxis:hashtag (69; 1% instances), nsubj (63; 1% instances), flat:foreign (61; 1% instances), cop (51; 1% instances), appos (47; 1% instances), acl:relcl (38; 1% instances), vocative:mention (38; 1% instances), discourse (31; 1% instances), acl (18; 0% instances), mark (17; 0% instances), discourse:emo (13; 0% instances), compound (9; 0% instances), flat (9; 0% instances), advcl (8; 0% instances), det:predet (7; 0% instances), orphan (7; 0% instances), parataxis:appos (7; 0% instances), vocative (7; 0% instances), obj (6; 0% instances), aux (4; 0% instances), det:poss (4; 0% instances), goeswith (2; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), parataxis:insert (1; 0% instances)
Children of SYM
nodes belong to 16 different parts of speech: ADP (1179; 23% instances), PUNCT (1089; 21% instances), SYM (772; 15% instances), DET (730; 14% instances), NOUN (274; 5% instances), PROPN (262; 5% instances), NUM (194; 4% instances), ADJ (131; 3% instances), VERB (118; 2% instances), CCONJ (117; 2% instances), ADV (113; 2% instances), X (64; 1% instances), AUX (56; 1% instances), PRON (41; 1% instances), INTJ (26; 1% instances), SCONJ (16; 0% instances)