Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: SYM
There are 3902 SYM lemmas (27%), 3902 SYM types (20%) and 11745 SYM tokens (9%).
Out of 16 observed tags, the rank of SYM is: 1 in number of lemmas, 2 in number of types and 5 in number of tokens.
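Counts and ranks of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: `SAMPLE` is a hand-made fragment rather than treebank data, `upos_stats` and `rank` are hypothetical helper names, and it assumes that types are counted as case-sensitive word forms.

```python
from collections import defaultdict

# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
1\t@user\t@user\tSYM\t_\t_\t2\tvocative\t_\t_
2\tguarda\tguardare\tVERB\t_\t_\t0\troot\t_\t_
3\t#monti\t#monti\tSYM\t_\t_\t2\tparataxis:hashtag\t_\t_
4\t:)\t:)\tSYM\t_\t_\t2\tdiscourse\t_\t_

1\t@user\t@user\tSYM\t_\t_\t2\tvocative\t_\t_
2\tgrazie\tgrazie\tINTJ\t_\t_\t0\troot\t_\t_
"""

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]

def upos_stats(conllu_text):
    """Map each UPOS tag to (distinct lemmas, distinct types, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(stats, upos, index):
    """1-based rank of `upos` by column `index` (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(stats, key=lambda u: stats[u][index], reverse=True)
    return ordered.index(upos) + 1

stats = upos_stats(SAMPLE)
print(stats["SYM"])           # (3, 3, 4)
print(rank(stats, "SYM", 2))  # 1
```

The percentages reported above would then follow by dividing each SYM count by the corresponding total over all tags.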
The 10 most frequent SYM lemmas: @user, #grillo, #monti, @user1, @user2, RT, #governo, #serviziopubblico, :), @user3
The 10 most frequent SYM types: @user, #grillo, #monti, @user1, @user2, RT, #governo, #serviziopubblico, :), @user3
The 10 most frequent ambiguous lemmas: @user (SYM 1886, X 2), + (SYM 29, PUNCT 15, ADV 1), & (SYM 15, PROPN 1), x (SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), «/em> (PUNCT 3, SYM 3), � (SYM 3, X 2), > (PUNCT 12, SYM 2), C (PROPN 2, SYM 2), Università_it (SYM 2, PROPN 1)
The 10 most frequent ambiguous types: @user (SYM 1886, X 2), RT (SYM 338, NOUN 3, VERB 2), + (SYM 29, PUNCT 15, ADV 2), & (SYM 15, PROPN 1), x (ADP 60, SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), «/em> (PUNCT 3, SYM 3), � (SYM 3, X 2), > (PUNCT 12, SYM 2), A (ADP 216, PROPN 5, SYM 2, INTJ 1)
- @user
- RT
- +
  - SYM 29: BORSA MILANO : POSITIVA ( + 0,5 % ) DOPO AVVIO GOVERNO MONTI
  - PUNCT 15: @user :: nemmeno un governo Monti ‘ di scopo ‘ ? : legge elett . + rientro in UE + elez. primavera ?
  - ADV 2: Governo #Monti : la fiducia + ampia di la storia di la Repubblica Italiana da il peggior Parlamento di la storia di la Repubblica Italiana . 2+2
- &
- x
- #D’Alema
  - SYM 3: #D’Alema dice che #Grillo è un misto tra #Bossi e il #Gabibbo . Se ci va anche solo vicino a il giorno d’ oggi potrebbe voler dire un bel 20 %
  - PROPN 1: E devo stare davvero male , perché sono d’ accordo anche con Massimo #D’Alema in il giudizio negativissimo su Beppe #Grillo http://t.co/Y3IWAMJQ
- «/em>
- �
- >
- A
  - ADP 216: A le 4 devo cominciare a studiare T__T E devo anche fare la spesa D:
  - PROPN 5: http://t.co/iOq8T2QS PORTA A PORTA , OSPITE MARIO MONTI
  - SYM 2: Fiorentina che Passione • Re : [ Serie A ] Roma - Fiorentina : cioè ne prendiamo 2 ???? non facciamo proc… http://t.co/LF8x9kb2 #fiorentina
  - INTJ 1: A Marò .. Stime di - 2,2 per colpa di la manovra di il governo Monti ?! E daje su , finite la di dire cazzate … #Ballarò
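The ambiguity lists above pair each word form with every tag it was observed under. A minimal sketch of how such a list could be derived is shown here; the `TOKENS` data is made up for illustration, and `ambiguous_types` and `for_tag` are hypothetical helper names. The ordering by frequency under the tag of interest is an assumption inferred from the SYM counts in the list above.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) observations; the counts are made up.
TOKENS = [
    ("+", "SYM"), ("+", "SYM"), ("+", "PUNCT"),
    ("A", "ADP"), ("A", "ADP"), ("A", "ADP"), ("A", "SYM"),
    ("RT", "SYM"),
]

def ambiguous_types(tokens):
    """Return {form: [(upos, count), ...]} for forms seen with more than one tag."""
    by_form = defaultdict(Counter)
    for form, upos in tokens:
        by_form[form][upos] += 1
    return {form: tags.most_common()
            for form, tags in by_form.items() if len(tags) > 1}

def for_tag(ambiguous, upos):
    """Keep forms tagged `upos` at least once, most frequent under `upos` first."""
    hits = [(form, tags) for form, tags in ambiguous.items() if upos in dict(tags)]
    return sorted(hits, key=lambda item: dict(item[1])[upos], reverse=True)

for form, tags in for_tag(ambiguous_types(TOKENS), "SYM"):
    print(form, ", ".join(f"{tag} {count}" for tag, count in tags))
```

Running the same function over (lemma, UPOS) pairs instead of (form, UPOS) pairs would give the ambiguous-lemma variant.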
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.310684).
The 1st highest number of forms (1) was observed with the lemma “#”: #.
The 2nd highest number of forms (1) was observed with the lemma “#1”: #1.
The 3rd highest number of forms (1) was observed with the lemma “#10”: #10.
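A ratio of exactly 1 means that every SYM lemma was observed with a single word form. The sketch below shows one way to compute such a figure, assuming the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas; the page does not state its exact formula. The `ROWS` data and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical (form, lemma, UPOS) rows, not taken from the treebank.
ROWS = [
    ("#monti", "#monti", "SYM"),
    ("@user", "@user", "SYM"),
    ("@user", "@user", "SYM"),
    ("governo", "governo", "NOUN"),
    ("governi", "governo", "NOUN"),
]

def forms_per_lemma(rows, upos):
    """Map each lemma of the given tag to the set of forms observed for it."""
    forms = defaultdict(set)
    for form, lemma, tag in rows:
        if tag == upos:
            forms[lemma].add(form)
    return forms

def form_lemma_ratio(rows, upos):
    """Assumed definition: distinct (lemma, form) pairs / distinct lemmas."""
    forms = forms_per_lemma(rows, upos)
    if not forms:
        raise ValueError(f"no tokens tagged {upos}")
    return sum(len(f) for f in forms.values()) / len(forms)

print(form_lemma_ratio(ROWS, "SYM"))   # 1.0
print(form_lemma_ratio(ROWS, "NOUN"))  # 2.0
```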
SYM does not occur with any features.
Relations
SYM nodes are attached to their parents using 37 different relations: vocative (2621; 22% instances), parataxis (2544; 22% instances), parataxis:hashtag (2259; 19% instances), nmod (1184; 10% instances), nsubj (819; 7% instances), discourse (723; 6% instances), obl (411; 3% instances), obj (235; 2% instances), flat:name (214; 2% instances), conj (180; 2% instances), root (129; 1% instances), dep (66; 1% instances), list (62; 1% instances), flat (52; 0% instances), appos (48; 0% instances), amod (32; 0% instances), nsubj:pass (23; 0% instances), dislocated (20; 0% instances), xcomp (15; 0% instances), compound (14; 0% instances), obl:agent (14; 0% instances), cc (11; 0% instances), ccomp (9; 0% instances), mark (8; 0% instances), parataxis:appos (8; 0% instances), advcl (7; 0% instances), fixed (7; 0% instances), advmod (5; 0% instances), flat:foreign (5; 0% instances), orphan (5; 0% instances), case (4; 0% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), parataxis:nsubj (2; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances), parataxis:obj (1; 0% instances)
Parents of SYM nodes belong to 15 different parts of speech: VERB (5315; 45% instances), NOUN (3307; 28% instances), SYM (782; 7% instances), PROPN (733; 6% instances), ADJ (603; 5% instances), INTJ (333; 3% instances), PRON (238; 2% instances), ADV (174; 1% instances), (129; 1% instances), X (88; 1% instances), NUM (30; 0% instances), AUX (7; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)
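The two distributions above can be tallied in a single pass over each dependency tree. The following sketch uses a hypothetical sentence and a hypothetical helper, `attachment_stats`. It records an empty string as the parent tag of a node attached to the artificial root; the unlabeled entry above has the same count (129) as the `root` relation, which suggests, but does not confirm, that it corresponds to such nodes.

```python
from collections import Counter

# Hypothetical sentence as (id, form, upos, head, deprel) rows; head 0 is the root.
SENTENCE = [
    (1, "@user", "SYM", 2, "vocative"),
    (2, "guarda", "VERB", 0, "root"),
    (3, "#monti", "SYM", 2, "parataxis:hashtag"),
    (4, ":)", "SYM", 2, "discourse"),
]

def attachment_stats(sentence, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    upos_by_id = {tok_id: tag for tok_id, _, tag, _, _ in sentence}
    relations, parent_tags = Counter(), Counter()
    for _, _, tag, head, deprel in sentence:
        if tag != upos:
            continue
        relations[deprel] += 1
        # The artificial root (head 0) has no tag, so it is recorded as "".
        parent_tags[upos_by_id.get(head, "")] += 1
    return relations, parent_tags

relations, parent_tags = attachment_stats(SENTENCE, "SYM")
total = sum(relations.values())
for deprel, count in relations.most_common():
    print(f"{deprel} ({count}; {round(100 * count / total)}% instances)")
```

Summing the counters returned for every sentence in the treebank would give corpus-level figures of the kind listed above.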
8757 (75%) SYM nodes are leaves.
1876 (16%) SYM nodes have one child.
598 (5%) SYM nodes have two children.
514 (4%) SYM nodes have three or more children.
The highest child degree of a SYM node is 13.
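The child-degree figures above amount to a histogram of how many dependents each SYM node has. A minimal sketch under the same hypothetical row layout, (id, form, upos, head, deprel), follows; the sentence is invented and `child_degree_histogram` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical sentence as (id, form, upos, head, deprel) rows; head 0 is the root.
SENTENCE = [
    (1, "in", "ADP", 3, "case"),
    (2, "il", "DET", 3, "det"),
    (3, "#governo", "SYM", 4, "obl"),
    (4, "entra", "VERB", 0, "root"),
    (5, "@user", "SYM", 4, "vocative"),
]

def child_degree_histogram(sentence, upos):
    """Map number of children -> number of `upos` nodes with that many children."""
    n_children = Counter(head for _, _, _, head, _ in sentence)
    # A Counter returns 0 for ids that never appear as a head, i.e. leaves.
    return Counter(n_children[tok_id]
                   for tok_id, _, tag, _, _ in sentence if tag == upos)

hist = child_degree_histogram(SENTENCE, "SYM")
print(hist[0])    # leaves: 1
print(max(hist))  # highest child degree: 2
```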
Children of SYM nodes are attached using 38 different relations: case (1190; 23% instances), punct (1099; 21% instances), det (721; 14% instances), nmod (345; 7% instances), conj (216; 4% instances), list (197; 4% instances), nummod (189; 4% instances), parataxis (158; 3% instances), amod (131; 3% instances), flat:name (121; 2% instances), cc (117; 2% instances), advmod (98; 2% instances), obl (87; 2% instances), parataxis:hashtag (75; 1% instances), nsubj (62; 1% instances), cop (51; 1% instances), appos (47; 1% instances), discourse (47; 1% instances), vocative (46; 1% instances), acl:relcl (38; 1% instances), flat:foreign (20; 0% instances), acl (18; 0% instances), flat (18; 0% instances), mark (16; 0% instances), dep (15; 0% instances), compound (9; 0% instances), advcl (8; 0% instances), det:predet (7; 0% instances), obj (7; 0% instances), orphan (7; 0% instances), parataxis:appos (7; 0% instances), aux (5; 0% instances), det:poss (4; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), parataxis:insert (1; 0% instances)
Children of SYM nodes belong to 16 different parts of speech: ADP (1182; 23% instances), PUNCT (1099; 21% instances), SYM (782; 15% instances), DET (733; 14% instances), PROPN (260; 5% instances), NOUN (253; 5% instances), NUM (196; 4% instances), ADJ (130; 3% instances), VERB (120; 2% instances), CCONJ (117; 2% instances), ADV (112; 2% instances), X (59; 1% instances), AUX (57; 1% instances), PRON (41; 1% instances), INTJ (26; 1% instances), SCONJ (15; 0% instances)