home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: SYM

There are 491 SYM lemmas (2%), 491 SYM types (1%) and 1395 SYM tokens (0%). Out of 16 observed tags, the rank of SYM is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: ,, -, =, %, (, ), /, !, mm, “

The 10 most frequent SYM types: ,, -, =, %, (, ), /, !, mm, “

The 10 most frequent ambiguous lemmas: , (PUNCT 10378, SYM 91), - (PUNCT 913, SYM 75), ( (PUNCT 2587, SYM 45), ) (PUNCT 2587, SYM 44), / (SYM 42, PUNCT 19), ! (PUNCT 48, SYM 36), (PUNCT 1380, SYM 26), # (SYM 25, X 1), : (PUNCT 1163, SYM 22), (PUNCT 725, SYM 17, X 2)

The 10 most frequent ambiguous types: , (PUNCT 10378, SYM 91), - (PUNCT 913, SYM 75), ( (PUNCT 2587, SYM 45), ) (PUNCT 2587, SYM 44), / (SYM 42, PUNCT 19), ! (PUNCT 48, SYM 36), (PUNCT 1380, SYM 26), # (SYM 25, X 1), : (PUNCT 1163, SYM 22), m (SYM 13, PRON 2)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.223065).

The 1st highest number of forms (1) was observed with the lemma “!”: !.

The 2nd highest number of forms (1) was observed with the lemma “””: .

The 3rd highest number of forms (1) was observed with the lemma “#”: #.

SYM occurs with 1 features: ExtPos (3; 0% instances)

SYM occurs with 1 feature-value pairs: ExtPos=PROPN

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (1392 tokens). Examples: ,, -, =, %, (, ), /, !, mm, “

Relations

SYM nodes are attached to their parents using 20 different relations: flat (308; 22% instances), nmod (272; 19% instances), parataxis (195; 14% instances), fixed (156; 11% instances), obl (146; 10% instances), conj (97; 7% instances), root (69; 5% instances), appos (61; 4% instances), obj (27; 2% instances), nsubj (22; 2% instances), cc (7; 1% instances), advcl (6; 0% instances), orphan (6; 0% instances), acl (5; 0% instances), obl:arg (5; 0% instances), xcomp (5; 0% instances), nsubj:pass (4; 0% instances), ccomp (2; 0% instances), amod (1; 0% instances), case (1; 0% instances)

Parents of SYM nodes belong to 13 different parts of speech: PROPN (314; 23% instances), NOUN (281; 20% instances), VERB (232; 17% instances), SYM (203; 15% instances), X (131; 9% instances), NUM (73; 5% instances), (69; 5% instances), ADJ (46; 3% instances), DET (25; 2% instances), ADV (10; 1% instances), PRON (7; 1% instances), CCONJ (3; 0% instances), ADP (1; 0% instances)

651 (47%) SYM nodes are leaves.

285 (20%) SYM nodes have one child.

314 (23%) SYM nodes have two children.

145 (10%) SYM nodes have three or more children.

The highest child degree of a SYM node is 57.

Children of SYM nodes are attached using 20 different relations: punct (533; 35% instances), case (221; 14% instances), nummod (172; 11% instances), fixed (126; 8% instances), nmod (105; 7% instances), flat (86; 6% instances), conj (81; 5% instances), cc (45; 3% instances), appos (32; 2% instances), parataxis (30; 2% instances), det (25; 2% instances), amod (24; 2% instances), mark (14; 1% instances), cop (9; 1% instances), nsubj (9; 1% instances), acl (5; 0% instances), acl:relcl (5; 0% instances), orphan (3; 0% instances), advcl (1; 0% instances), obl (1; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: PUNCT (533; 35% instances), NUM (228; 15% instances), ADP (219; 14% instances), SYM (203; 13% instances), NOUN (106; 7% instances), X (83; 5% instances), CCONJ (47; 3% instances), ADV (19; 1% instances), PROPN (19; 1% instances), DET (17; 1% instances), VERB (14; 1% instances), ADJ (12; 1% instances), SCONJ (12; 1% instances), AUX (9; 1% instances), PRON (6; 0% instances)