Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: SYM
There are 3915 SYM
lemmas (27%), 3915 SYM
types (21%) and 11623 SYM
tokens (9%).
Out of 16 observed tags, the rank of SYM
is: 1 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent SYM
lemmas: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita
The 10 most frequent SYM
types: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita
The 10 most frequent ambiguous lemmas: + (SYM 44, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), *** (SYM 3, X 1), � (SYM 3, X 2), 😍 (SYM 3, X 1), C (PROPN 2, SYM 2), Università_it (SYM 2, PROPN 1)
The 10 most frequent ambiguous types: + (SYM 44, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (ADP 60, SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), *** (SYM 3, X 1), � (SYM 3, X 2), 😍 (SYM 3, X 1), A (ADP 215, PROPN 5, SYM 2, INTJ 1), C (PROPN 2, SYM 2)
- +
- &
- >
- x
- #D’Alema
- SYM 3: #D’Alema dice che #Grillo è un misto tra #Bossi e il #Gabibbo . Se ci va anche solo vicino a il giorno d’ oggi potrebbe voler dire un bel 20 %
- PROPN 1: E devo stare davvero male , perché sono d’ accordo anche con Massimo #D’Alema in il giudizio negativissimo su Beppe #Grillo http://t.co/Y3IWAMJQ
- ***
- �
- 😍
- A
- ADP 215: A le 4 devo cominciare a studiare T__T E devo anche fare la spesa D:
- PROPN 5: http://t.co/iOq8T2QS PORTA A PORTA , OSPITE MARIO MONTI
- SYM 2: Fiorentina che Passione • Re : [ Serie A ] Roma - Fiorentina : cioè ne prendiamo 2 ???? non facciamo proc… http://t.co/LF8x9kb2 #fiorentina
- INTJ 1: A Marò .. Stime di - 2,2 per colpa di la manovra di il governo Monti ?! E daje su , finite la di dire cazzate … #Ballarò
- C
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.303101).
The 1st highest number of forms (1) was observed with the lemma “#”: #.
The 2nd highest number of forms (1) was observed with the lemma “#1”: #1.
The 3rd highest number of forms (1) was observed with the lemma “#10”: #10.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 37 different relations: vocative:mention (2576; 22% instances), dep (2438; 21% instances), parataxis:hashtag (2253; 19% instances), nmod (1173; 10% instances), nsubj (823; 7% instances), discourse:emo (725; 6% instances), obl (406; 3% instances), obj (233; 2% instances), flat:name (209; 2% instances), conj (178; 2% instances), root (130; 1% instances), list (64; 1% instances), appos (50; 0% instances), flat:foreign (46; 0% instances), vocative (42; 0% instances), flat (41; 0% instances), parataxis (37; 0% instances), amod (31; 0% instances), nsubj:pass (21; 0% instances), dislocated (20; 0% instances), xcomp (18; 0% instances), compound (16; 0% instances), obl:agent (14; 0% instances), cc (11; 0% instances), advcl (8; 0% instances), case (8; 0% instances), mark (8; 0% instances), parataxis:appos (8; 0% instances), ccomp (7; 0% instances), fixed (7; 0% instances), advmod (6; 0% instances), orphan (5; 0% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), parataxis:nsubj (2; 0% instances), parataxis:obj (2; 0% instances), csubj (1; 0% instances)
Parents of SYM
nodes belong to 15 different parts of speech: VERB (5225; 45% instances), NOUN (3258; 28% instances), PROPN (780; 7% instances), SYM (768; 7% instances), ADJ (577; 5% instances), INTJ (383; 3% instances), PRON (236; 2% instances), (130; 1% instances), ADV (115; 1% instances), X (102; 1% instances), NUM (34; 0% instances), AUX (6; 0% instances), DET (6; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)
8635 (74%) SYM
nodes are leaves.
1876 (16%) SYM
nodes have one child.
601 (5%) SYM
nodes have two children.
511 (4%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 13.
Children of SYM
nodes are attached using 40 different relations: case (1185; 23% instances), punct (1088; 21% instances), det (715; 14% instances), nmod (334; 6% instances), conj (216; 4% instances), list (204; 4% instances), nummod (190; 4% instances), amod (129; 2% instances), cc (118; 2% instances), flat:name (116; 2% instances), advmod (98; 2% instances), dep (94; 2% instances), parataxis (90; 2% instances), obl (86; 2% instances), parataxis:hashtag (69; 1% instances), flat:foreign (63; 1% instances), nsubj (63; 1% instances), cop (51; 1% instances), appos (46; 1% instances), acl:relcl (38; 1% instances), vocative:mention (35; 1% instances), discourse (32; 1% instances), acl (18; 0% instances), mark (17; 0% instances), discourse:emo (13; 0% instances), compound (9; 0% instances), flat (9; 0% instances), advcl (8; 0% instances), det:predet (7; 0% instances), parataxis:appos (7; 0% instances), vocative (7; 0% instances), obj (6; 0% instances), orphan (6; 0% instances), aux (4; 0% instances), det:poss (4; 0% instances), goeswith (2; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), parataxis:insert (1; 0% instances)
Children of SYM
nodes belong to 16 different parts of speech: ADP (1177; 23% instances), PUNCT (1088; 21% instances), SYM (768; 15% instances), DET (727; 14% instances), NOUN (273; 5% instances), PROPN (266; 5% instances), NUM (197; 4% instances), ADJ (130; 3% instances), VERB (119; 2% instances), CCONJ (118; 2% instances), ADV (112; 2% instances), X (68; 1% instances), AUX (56; 1% instances), PRON (40; 1% instances), INTJ (27; 1% instances), SCONJ (16; 0% instances)