
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: SYM

There are 491 SYM lemmas (2%), 491 SYM types (1%) and 1397 SYM tokens (0%). Out of 16 observed tags, the rank of SYM is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
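As a rough illustration of how counts like these could be reproduced, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. It assumes that "types" means distinct word forms and that multiword-token ranges and empty nodes are not counted. The helper names `conllu_line` and `upos_counts` and the toy sentences are invented for this example and are not taken from the treebank or from the script that generated this page.

```python
def conllu_line(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Invented toy data, NOT taken from UD_Dutch-LassySmall.
SAMPLE = "\n".join([
    "# text = 5 % en 10 mm",
    conllu_line(1, "5", "5", "NUM", 2, "nummod"),
    conllu_line(2, "%", "%", "SYM", 0, "root"),
    conllu_line(3, "en", "en", "CCONJ", 5, "cc"),
    conllu_line(4, "10", "10", "NUM", 5, "nummod"),
    conllu_line(5, "mm", "mm", "SYM", 2, "conj"),
    "",
    "# text = 3 %",
    conllu_line(1, "3", "3", "NUM", 2, "nummod"),
    conllu_line(2, "%", "%", "SYM", 0, "root"),
    "",
])


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "SYM"))
```

On the toy data the SYM tokens are %, mm and %, so the function reports two lemmas, two types and three tokens.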

The 10 most frequent SYM lemmas: ,, -, =, %, (, ), /, !, mm, “

The 10 most frequent SYM types: ,, -, =, %, (, ), /, !, mm, “

The 10 most frequent ambiguous lemmas: , (PUNCT 10378, SYM 91), - (PUNCT 911, SYM 77), ( (PUNCT 2587, SYM 45), ) (PUNCT 2587, SYM 44), / (SYM 42, PUNCT 19), ! (PUNCT 48, SYM 36), (PUNCT 1380, SYM 26), # (SYM 25, X 1), : (PUNCT 1163, SYM 22), (PUNCT 725, SYM 17, X 2)

The 10 most frequent ambiguous types: , (PUNCT 10378, SYM 91), - (PUNCT 911, SYM 77), ( (PUNCT 2587, SYM 45), ) (PUNCT 2587, SYM 44), / (SYM 42, PUNCT 19), ! (PUNCT 48, SYM 36), (PUNCT 1380, SYM 26), # (SYM 25, X 1), : (PUNCT 1163, SYM 22), m (SYM 13, PRON 2)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.223065).
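The ratio above can be read as the average number of distinct word forms per lemma. The sketch below computes it under that assumption, since the exact definition used by the generating script is not stated on this page. The function name `form_lemma_ratio` and the toy tokens are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(tokens, upos=None):
    """Average number of distinct forms per (lemma, tag) pair.

    tokens: iterable of (form, lemma, upos) triples.
    upos:   restrict to one tag, or None for all parts of speech.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if upos is None or tag == upos:
            # Key by (lemma, tag) so homographs across tags stay separate.
            forms_by_lemma[(lemma, tag)].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Invented toy data, NOT taken from UD_Dutch-LassySmall.
TOKENS = [
    ("%", "%", "SYM"),
    ("mm", "mm", "SYM"),
    ("%", "%", "SYM"),
    ("huizen", "huis", "NOUN"),
    ("huis", "huis", "NOUN"),
    ("loopt", "lopen", "VERB"),
]

print(form_lemma_ratio(TOKENS, "SYM"))
print(form_lemma_ratio(TOKENS))
```

A ratio of exactly 1 for a tag, as reported for SYM, means that every lemma of that tag occurs with a single form.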

The 1st highest number of forms (1) was observed with the lemma “!”: !.

The 2nd highest number of forms (1) was observed with the lemma “””: ”.

The 3rd highest number of forms (1) was observed with the lemma “#”: #.

SYM occurs with 1 feature: ExtPos (1; 0% instances)

SYM occurs with 1 feature-value pair: ExtPos=PROPN

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (1396 tokens). Examples: ,, -, =, %, (, ), /, !, mm, “

Relations

SYM nodes are attached to their parents using 19 different relations: flat (318; 23% instances), nmod (272; 19% instances), parataxis (196; 14% instances), obl (146; 10% instances), fixed (144; 10% instances), conj (98; 7% instances), root (69; 5% instances), appos (61; 4% instances), obj (28; 2% instances), nsubj (22; 2% instances), cc (10; 1% instances), orphan (6; 0% instances), acl (5; 0% instances), advcl (5; 0% instances), obl:arg (5; 0% instances), xcomp (5; 0% instances), nsubj:pass (4; 0% instances), ccomp (2; 0% instances), case (1; 0% instances)

Parents of SYM nodes belong to 13 different parts of speech: PROPN (315; 23% instances), NOUN (280; 20% instances), VERB (232; 17% instances), SYM (202; 14% instances), X (131; 9% instances), NUM (74; 5% instances), [no parent: root nodes] (69; 5% instances), ADJ (47; 3% instances), DET (25; 2% instances), ADV (11; 1% instances), PRON (7; 1% instances), CCONJ (3; 0% instances), ADP (1; 0% instances)

653 (47%) SYM nodes are leaves.

286 (20%) SYM nodes have one child.

314 (22%) SYM nodes have two children.

144 (10%) SYM nodes have three or more children.

The highest child degree of a SYM node is 57.
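The child-degree figures above could be derived along the following lines. This is only a sketch: it assumes each sentence is available as a list of (id, head, upos) triples with head 0 marking the root, and it groups degrees of three or more into one bucket, mirroring the breakdown above. The function name `child_degree_buckets` and the toy sentences are invented for illustration.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Count nodes of one UPOS tag by number of children (0, 1, 2, 3+).

    sentences: list of sentences, each a list of (id, head, upos) triples.
    Returns (buckets, max_degree); bucket key 3 means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        # Number of children per node = how often its id occurs as a head.
        n_children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag != upos:
                continue
            degree = n_children[node_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return dict(buckets), max_degree


# Invented toy data, NOT taken from UD_Dutch-LassySmall.
SENTENCES = [
    [(1, 2, "NUM"), (2, 0, "SYM"), (3, 5, "CCONJ"), (4, 5, "NUM"), (5, 2, "SYM")],
    [(1, 0, "VERB"), (2, 1, "SYM")],
    [(1, 3, "NUM"), (2, 3, "ADP"), (3, 0, "SYM"), (4, 3, "PUNCT"), (5, 3, "NOUN")],
]

buckets, max_degree = child_degree_buckets(SENTENCES, "SYM")
print(buckets, max_degree)
```

In the toy data one SYM node is a leaf, two have two children each, and one has four children, which falls into the "three or more" bucket.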

Children of SYM nodes are attached using 21 different relations: punct (532; 35% instances), case (221; 15% instances), nummod (172; 11% instances), fixed (112; 7% instances), nmod (105; 7% instances), flat (99; 6% instances), conj (81; 5% instances), cc (45; 3% instances), appos (32; 2% instances), parataxis (30; 2% instances), det (25; 2% instances), advmod (17; 1% instances), mark (13; 1% instances), cop (9; 1% instances), nsubj (9; 1% instances), amod (7; 0% instances), acl (5; 0% instances), acl:relcl (5; 0% instances), orphan (3; 0% instances), advcl (1; 0% instances), obl (1; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: PUNCT (532; 35% instances), NUM (228; 15% instances), ADP (219; 14% instances), SYM (202; 13% instances), NOUN (106; 7% instances), X (83; 5% instances), CCONJ (47; 3% instances), ADV (19; 1% instances), PROPN (19; 1% instances), DET (17; 1% instances), VERB (14; 1% instances), ADJ (12; 1% instances), SCONJ (11; 1% instances), AUX (9; 1% instances), PRON (6; 0% instances)