home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: SYM

There are 3903 SYM lemmas (27%), 3903 SYM types (21%) and 11749 SYM tokens (9%). Out of 16 observed tags, the rank of SYM is: 1 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent SYM lemmas: @user, #grillo, #monti, @user1, @user2, RT, #governo, #serviziopubblico, :), @user3

The 10 most frequent SYM types: @user, #grillo, #monti, @user1, @user2, RT, #governo, #serviziopubblico, :), @user3

The 10 most frequent ambiguous lemmas: + (SYM 29, PUNCT 15, ADV 1), & (SYM 15, PROPN 1), x (SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), «/em> (PUNCT 3, SYM 3), (SYM 3, X 2), > (PUNCT 12, SYM 2), C (PROPN 2, SYM 2), Università_it (SYM 2, PROPN 1), a (ADP 2901, PROPN 16, X 10, INTJ 3, SYM 1)

The 10 most frequent ambiguous types: RT (SYM 338, NOUN 3, VERB 2), + (SYM 29, PUNCT 15, ADV 2), & (SYM 15, PROPN 1), x (ADP 60, SYM 4, X 3), #D’Alema (SYM 3, PROPN 1), «/em> (PUNCT 3, SYM 3), (SYM 3, X 2), > (PUNCT 12, SYM 2), A (ADP 216, PROPN 5, SYM 2, INTJ 1), C (PROPN 2, SYM 2, PRON 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.310882).

The 1st highest number of forms (1) was observed with the lemma “#”: #.

The 2nd highest number of forms (1) was observed with the lemma “#1”: #1.

The 3rd highest number of forms (1) was observed with the lemma “#10”: #10.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 37 different relations: vocative (2621; 22% instances), parataxis (2544; 22% instances), parataxis:hashtag (2252; 19% instances), nmod (1173; 10% instances), nsubj (819; 7% instances), discourse (721; 6% instances), obl (411; 3% instances), obj (235; 2% instances), flat:name (210; 2% instances), conj (180; 2% instances), root (129; 1% instances), dep (65; 1% instances), list (62; 1% instances), appos (48; 0% instances), flat:foreign (43; 0% instances), flat (42; 0% instances), amod (31; 0% instances), nsubj:pass (23; 0% instances), dislocated (20; 0% instances), compound (16; 0% instances), xcomp (15; 0% instances), obl:agent (14; 0% instances), cc (11; 0% instances), ccomp (9; 0% instances), mark (8; 0% instances), parataxis:appos (8; 0% instances), advcl (7; 0% instances), fixed (7; 0% instances), advmod (5; 0% instances), orphan (5; 0% instances), case (4; 0% instances), acl (3; 0% instances), acl:relcl (3; 0% instances), parataxis:nsubj (2; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances), parataxis:obj (1; 0% instances)

Parents of SYM nodes belong to 15 different parts of speech: VERB (5314; 45% instances), NOUN (3309; 28% instances), SYM (783; 7% instances), PROPN (733; 6% instances), ADJ (603; 5% instances), INTJ (333; 3% instances), PRON (238; 2% instances), ADV (174; 1% instances), (129; 1% instances), X (89; 1% instances), NUM (30; 0% instances), AUX (7; 0% instances), DET (4; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)

8761 (75%) SYM nodes are leaves.

1877 (16%) SYM nodes have one child.

598 (5%) SYM nodes have two children.

513 (4%) SYM nodes have three or more children.

The highest child degree of a SYM node is 13.

Children of SYM nodes are attached using 38 different relations: case (1189; 23% instances), punct (1099; 21% instances), det (720; 14% instances), nmod (336; 6% instances), conj (216; 4% instances), list (197; 4% instances), nummod (189; 4% instances), parataxis (157; 3% instances), amod (129; 2% instances), cc (117; 2% instances), flat:name (117; 2% instances), advmod (98; 2% instances), obl (88; 2% instances), parataxis:hashtag (69; 1% instances), nsubj (62; 1% instances), flat:foreign (56; 1% instances), cop (51; 1% instances), appos (47; 1% instances), vocative (46; 1% instances), discourse (45; 1% instances), acl:relcl (38; 1% instances), acl (18; 0% instances), mark (16; 0% instances), dep (13; 0% instances), compound (9; 0% instances), flat (9; 0% instances), advcl (8; 0% instances), det:predet (7; 0% instances), obj (7; 0% instances), orphan (7; 0% instances), parataxis:appos (7; 0% instances), aux (5; 0% instances), det:poss (4; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), parataxis:insert (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: ADP (1181; 23% instances), PUNCT (1099; 21% instances), SYM (783; 15% instances), DET (732; 14% instances), PROPN (261; 5% instances), NOUN (253; 5% instances), NUM (196; 4% instances), ADJ (130; 3% instances), VERB (120; 2% instances), CCONJ (117; 2% instances), ADV (112; 2% instances), X (59; 1% instances), AUX (57; 1% instances), PRON (41; 1% instances), INTJ (26; 1% instances), SCONJ (15; 0% instances)