home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-TWITTIRO: POS Tags: SYM

There are 603 SYM lemmas (12%), 603 SYM types (9%) and 2146 SYM tokens (7%). Out of 16 observed tags, the rank of SYM is: 4 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent SYM lemmas: @user, #labuonascuola, #monti, @user1, @user2, #renzi, #scuola, @user3, http://t.co/oDPUtx2DvV, #Grillo

The 10 most frequent SYM types: @user, #labuonascuola, #monti, @user1, @user2, #renzi, #scuola, @user3, http://t.co/oDPUtx2DvV, #Grillo

The 10 most frequent ambiguous lemmas: @user (SYM 513, INTJ 8, ADV 2, VERB 2, ADP 1, NOUN 1), #labuonascuola (SYM 347, NOUN 4, X 1), @user1 (SYM 76, NUM 1), @user2 (SYM 75, PROPN 1, VERB 1), #passodopopasso (SYM 6, NOUN 1), a (ADP 722, PROPN 4, INTJ 1, SYM 1), e (CCONJ 368, VERB 2, SYM 1, X 1), guli1979 (NUM 2, SYM 1), spot (NOUN 3, SYM 1)

The 10 most frequent ambiguous types: @user (SYM 513, INTJ 8, ADV 2, VERB 2, ADP 1, NOUN 1), #labuonascuola (SYM 347, NOUN 4, X 1), @user1 (SYM 76, NUM 1), @user2 (SYM 75, PROPN 1, VERB 1), #passodopopasso (SYM 6, NOUN 1), .@matteorenzi (NOUN 1, SYM 1), a (ADP 668, PROPN 4, SYM 1), e (CCONJ 313, AUX 4, SYM 1, VERB 1, X 1), guli1979 (NUM 2, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.274961).

The 1st highest number of forms (1) was observed with the lemma “#1000giorni”: #1000giorni.

The 2nd highest number of forms (1) was observed with the lemma “#10_ottobre”: #10_ottobre.

The 3rd highest number of forms (1) was observed with the lemma “#10o”: #10o.

SYM occurs with 2 features: Gender (1; 0% instances), Number (1; 0% instances)

SYM occurs with 2 feature-value pairs: Gender=Masc, Number=Sing

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (2145 tokens). Examples: @user, #labuonascuola, #monti, @user1, @user2, #renzi, #scuola, @user3, http://t.co/oDPUtx2DvV, #Grillo

Relations

SYM nodes are attached to their parents using 31 different relations: vocative:mention (662; 31% instances), parataxis:hashtag (540; 25% instances), nmod (232; 11% instances), dep (208; 10% instances), obl (127; 6% instances), nsubj (126; 6% instances), obj (58; 3% instances), discourse:emo (42; 2% instances), flat:name (28; 1% instances), conj (27; 1% instances), parataxis (23; 1% instances), root (23; 1% instances), flat (9; 0% instances), vocative (7; 0% instances), appos (5; 0% instances), compound (5; 0% instances), advmod (3; 0% instances), dislocated (3; 0% instances), ccomp (2; 0% instances), fixed (2; 0% instances), flat:foreign (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), iobj (1; 0% instances), list (1; 0% instances), parataxis:appos (1; 0% instances), parataxis:obj (1; 0% instances)

Parents of SYM nodes belong to 14 different parts of speech: VERB (1159; 54% instances), NOUN (598; 28% instances), ADJ (116; 5% instances), SYM (95; 4% instances), PROPN (63; 3% instances), PRON (51; 2% instances), (23; 1% instances), ADV (19; 1% instances), X (9; 0% instances), INTJ (4; 0% instances), ADP (3; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), NUM (2; 0% instances)

1328 (62%) SYM nodes are leaves.

370 (17%) SYM nodes have one child.

377 (18%) SYM nodes have two children.

71 (3%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 32 different relations: punct (697; 49% instances), case (286; 20% instances), det (116; 8% instances), nmod (67; 5% instances), advmod (27; 2% instances), conj (26; 2% instances), cop (19; 1% instances), amod (18; 1% instances), parataxis (18; 1% instances), cc (17; 1% instances), nsubj (15; 1% instances), parataxis:hashtag (14; 1% instances), nummod (13; 1% instances), dep (11; 1% instances), flat:name (11; 1% instances), discourse (7; 0% instances), flat:foreign (7; 0% instances), vocative:mention (7; 0% instances), acl:relcl (5; 0% instances), obl (5; 0% instances), acl (4; 0% instances), appos (4; 0% instances), mark (4; 0% instances), advcl (2; 0% instances), obj (2; 0% instances), parataxis:appos (2; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), det:poss (1; 0% instances), discourse:emo (1; 0% instances), vocative (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: PUNCT (697; 49% instances), ADP (282; 20% instances), DET (117; 8% instances), SYM (95; 7% instances), NOUN (56; 4% instances), ADV (27; 2% instances), ADJ (21; 1% instances), VERB (21; 1% instances), AUX (20; 1% instances), PROPN (19; 1% instances), CCONJ (17; 1% instances), NUM (13; 1% instances), PRON (9; 1% instances), INTJ (6; 0% instances), X (6; 0% instances), SCONJ (4; 0% instances)