
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: SYM

There are 3915 SYM lemmas (27%), 3915 SYM types (21%) and 11623 SYM tokens (9%). Out of 16 observed tags, the rank of SYM is: 1 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent SYM lemmas: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita

The 10 most frequent SYM types: @user, #grillo, #monti, @user1, @user2, #governo, #serviziopubblico, :), @user3, #piazzapulita

The 10 most frequent ambiguous lemmas: + (SYM 44, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), *** (SYM 3, X 1), (SYM 3, X 2), 😍 (SYM 3, X 1), C (PROPN 2, SYM 2), Università_it (SYM 2, PROPN 1)

The 10 most frequent ambiguous types: + (SYM 44, ADV 1), & (SYM 15, PROPN 1), > (SYM 13, PROPN 1), x (ADP 60, SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), *** (SYM 3, X 1), (SYM 3, X 2), 😍 (SYM 3, X 1), A (ADP 215, PROPN 5, SYM 2, INTJ 1), C (PROPN 2, SYM 2)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.303101).
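The ratio above appears to be the number of distinct SYM forms divided by the number of distinct SYM lemmas (3915 / 3915 = 1.0 with the counts reported earlier on this page). A minimal sketch of that computation follows. It assumes forms are compared case-sensitively, and the function name `form_lemma_ratio` and the toy tokens are hypothetical, not taken from the treebank tooling.

```python
def form_lemma_ratio(tokens, upos):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples.
    Assumption: forms and lemmas are compared as exact, case-sensitive strings.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    # Avoid dividing by zero when the tag never occurs.
    return len(forms) / len(lemmas) if lemmas else 0.0


# Hypothetical toy data, not real treebank counts.
toy_tokens = [
    ("@user", "@user", "SYM"),
    ("#grillo", "#grillo", "SYM"),
    ("@user", "@user", "SYM"),
    ("va", "andare", "VERB"),
    ("andiamo", "andare", "VERB"),
]

sym_ratio = form_lemma_ratio(toy_tokens, "SYM")
verb_ratio = form_lemma_ratio(toy_tokens, "VERB")
```

In the toy data every SYM form is its own lemma, so the SYM ratio is 1.0, while the two inflected forms of one verb lemma give the VERB tag a ratio of 2.0.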

The 1st highest number of forms (1) was observed with the lemma “#”: #.

The 2nd highest number of forms (1) was observed with the lemma “#1”: #1.

The 3rd highest number of forms (1) was observed with the lemma “#10”: #10.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 37 different relations: vocative:mention (2576; 22% instances), dep (2438; 21% instances), parataxis:hashtag (2253; 19% instances), nmod (1173; 10% instances), nsubj (823; 7% instances), discourse:emo (725; 6% instances), obl (406; 3% instances), obj (233; 2% instances), flat:name (209; 2% instances), conj (178; 2% instances), root (130; 1% instances), list (64; 1% instances), appos (50; 0% instances), flat:foreign (46; 0% instances), vocative (42; 0% instances), flat (41; 0% instances), parataxis (37; 0% instances), amod (31; 0% instances), nsubj:pass (21; 0% instances), dislocated (20; 0% instances), xcomp (18; 0% instances), compound (16; 0% instances), obl:agent (14; 0% instances), cc (11; 0% instances), advcl (8; 0% instances), case (8; 0% instances), mark (8; 0% instances), parataxis:appos (8; 0% instances), ccomp (7; 0% instances), fixed (7; 0% instances), advmod (6; 0% instances), orphan (5; 0% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), parataxis:nsubj (2; 0% instances), parataxis:obj (2; 0% instances), csubj (1; 0% instances)

Parents of SYM nodes belong to 15 different parts of speech: VERB (5225; 45% instances), NOUN (3258; 28% instances), PROPN (780; 7% instances), SYM (768; 7% instances), ADJ (577; 5% instances), INTJ (383; 3% instances), PRON (236; 2% instances), [no part of speech: the artificial root] (130; 1% instances), ADV (115; 1% instances), X (102; 1% instances), NUM (34; 0% instances), AUX (6; 0% instances), DET (6; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)
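Counts like the two lists above could be gathered roughly as follows. This is only a sketch under stated assumptions: each sentence is a list of hypothetical `(id, upos, head, deprel)` tuples, head 0 marks the root, and a root-attached node is counted under an empty parent tag, which would explain an entry without a part-of-speech label whose count matches the `root` relation (130).

```python
from collections import Counter


def attachment_stats(sentences, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`.

    sentences: list of sentences, each a list of (id, upos, head, deprel).
    Assumption: head == 0 means the node hangs from the artificial root,
    which has no part of speech and is recorded here as "".
    """
    relations = Counter()
    parent_pos = Counter()
    for sentence in sentences:
        pos_by_id = {tok_id: tag for tok_id, tag, _, _ in sentence}
        for tok_id, tag, head, deprel in sentence:
            if tag != upos:
                continue
            relations[deprel] += 1
            parent_pos[pos_by_id.get(head, "")] += 1
    return relations, parent_pos


# Hypothetical toy sentences: "@user grazie :)" and "per #governo".
toy_sentences = [
    [(1, "SYM", 2, "vocative:mention"), (2, "INTJ", 0, "root"), (3, "SYM", 2, "discourse:emo")],
    [(1, "ADP", 2, "case"), (2, "SYM", 0, "root")],
]

relations, parent_pos = attachment_stats(toy_sentences, "SYM")
```

Dividing each count by the total number of SYM tokens and rounding gives percentages of the kind shown in parentheses.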

8635 (74%) SYM nodes are leaves.

1876 (16%) SYM nodes have one child.

601 (5%) SYM nodes have two children.

511 (4%) SYM nodes have three or more children.

The highest child degree of a SYM node is 13.
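The leaf and child-degree figures above amount to a histogram of how many dependents each SYM node has. A minimal sketch of that count over CoNLL-U text is given below. The function name `child_degree_histogram` and the two-sentence sample are hypothetical, and the sketch assumes the standard ten tab-separated columns, skipping multiword-token ranges and empty nodes.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes tagged `upos` with that many."""
    hist = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = []
        for line in sentence.splitlines():
            if not line or line.startswith("#"):
                continue  # blank line or sentence-level comment
            cols = line.split("\t")
            if "-" in cols[0] or "." in cols[0]:
                continue  # multiword-token range or empty node
            rows.append(cols)
        # Column 7 (index 6) is HEAD; count how often each ID is a head.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:  # column 4 (index 3) is UPOS
                hist[n_children[cols[0]]] += 1
    return hist


def _row(*cols):
    # Helper for the sample only: join columns with tabs.
    return "\t".join(cols)


# Hypothetical sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = @user grazie :)",
    _row("1", "@user", "@user", "SYM", "SYM", "_", "2", "vocative:mention", "_", "_"),
    _row("2", "grazie", "grazie", "INTJ", "I", "_", "0", "root", "_", "_"),
    _row("3", ":)", ":)", "SYM", "SYM", "_", "2", "discourse:emo", "_", "_"),
    "",
    "# text = per #governo",
    _row("1", "per", "per", "ADP", "E", "_", "2", "case", "_", "_"),
    _row("2", "#governo", "#governo", "SYM", "SYM", "_", "0", "root", "_", "_"),
])

hist = child_degree_histogram(SAMPLE, "SYM")
```

For the sample, `sorted(hist.items())` is `[(0, 2), (1, 1)]`: two SYM leaves and one SYM node with a single child. The largest key of the histogram corresponds to the highest child degree.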

Children of SYM nodes are attached using 40 different relations: case (1185; 23% instances), punct (1088; 21% instances), det (715; 14% instances), nmod (334; 6% instances), conj (216; 4% instances), list (204; 4% instances), nummod (190; 4% instances), amod (129; 2% instances), cc (118; 2% instances), flat:name (116; 2% instances), advmod (98; 2% instances), dep (94; 2% instances), parataxis (90; 2% instances), obl (86; 2% instances), parataxis:hashtag (69; 1% instances), flat:foreign (63; 1% instances), nsubj (63; 1% instances), cop (51; 1% instances), appos (46; 1% instances), acl:relcl (38; 1% instances), vocative:mention (35; 1% instances), discourse (32; 1% instances), acl (18; 0% instances), mark (17; 0% instances), discourse:emo (13; 0% instances), compound (9; 0% instances), flat (9; 0% instances), advcl (8; 0% instances), det:predet (7; 0% instances), parataxis:appos (7; 0% instances), vocative (7; 0% instances), obj (6; 0% instances), orphan (6; 0% instances), aux (4; 0% instances), det:poss (4; 0% instances), goeswith (2; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), parataxis:insert (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: ADP (1177; 23% instances), PUNCT (1088; 21% instances), SYM (768; 15% instances), DET (727; 14% instances), NOUN (273; 5% instances), PROPN (266; 5% instances), NUM (197; 4% instances), ADJ (130; 3% instances), VERB (119; 2% instances), CCONJ (118; 2% instances), ADV (112; 2% instances), X (68; 1% instances), AUX (56; 1% instances), PRON (40; 1% instances), INTJ (27; 1% instances), SCONJ (16; 0% instances)