Treebank Statistics: UD_Finnish: POS Tags: SYM
There are 200 SYM lemmas (1%), 202 SYM types (0%) and 479 SYM tokens (0%).
Out of 15 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 13 in number of tokens.
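Counts like the lemma, type and token totals above can be recomputed from the CoNLL-U files of the treebank. The following is a minimal sketch, assuming that a "type" is a distinct word form as written and that multiword token ranges and empty nodes are skipped. The `SAMPLE` fragment and the `count_upos` helper are hypothetical and are not part of the treebank or its tooling.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U fragment (not taken from the treebank).
SAMPLE = """\
1\tHinta\thinta\tNOUN\t_\t_\t0\troot\t_\t_
2\t5\t5\tNUM\t_\t_\t3\tnummod\t_\t_
3\t%\t%\tSYM\t_\t_\t1\tnmod\t_\t_
4\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_

1\tHyvä\thyvä\tADJ\t_\t_\t0\troot\t_\t_
2\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_
"""


def count_upos(conllu_text, upos):
    """Return (lemma counts, type counts) for all tokens tagged `upos`."""
    lemmas, types = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            types[cols[1]] += 1   # FORM column
            lemmas[cols[2]] += 1  # LEMMA column
    return lemmas, types


lemmas, types = count_upos(SAMPLE, "SYM")
# (distinct lemmas, distinct types, tokens)
summary = (len(lemmas), len(types), sum(types.values()))
print(summary)
print(lemmas.most_common(10))
```

Ranking a tag among all observed tags would then amount to running the same count for each tag and sorting by the number of lemmas, types or tokens.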
The 10 most frequent SYM lemmas: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
The 10 most frequent SYM types: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
The 10 most frequent ambiguous lemmas: :) (SYM 64, PUNCT 1), % (SYM 42, NOUN 9), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 22, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), K (PROPN 1, SYM 1), V (ADJ 10, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
The 10 most frequent ambiguous types: :) (SYM 64, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), A (NOUN 10, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), V (ADJ 7, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
- :)
- &
- +
  - SYM 20: - Ruisleipä + oivariini + oltermanni maistuu vaan niin hyvältä .
  - PROPN 2: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
- A
- B
- V
  - ADJ 7: Hänen isänsä oli kuningas Mithridates V Euergetes .
  - NOUN 1: Siitä kehittyivät kreikkalaisen kirjaimiston digamma ja ypsilon , myöhemmin kehittyivät latinalaisen kirjaimiston F , V ja Y sekä edelleen kehittyivät U ja W .
  - SYM 1: Gliese 581 eli HO Librae on Vaa’an tähdistössä sijaitseva punainen kääpiötähti , jonka spektriluokka on M2,5 V .
- ×
  - PROPN 4: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
  - SYM 1: Tämä sopii hyvin yhteen sen kanssa , että tähti on vanha , ikä 7 - 11 × 109 vuotta .
Morphology
The form / lemma ratio of SYM is 1.010000 (the average of all parts of speech is 2.060960).
The 1st highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.
The 2nd highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.
The 3rd highest number of forms (1) was observed with the lemma “#”: #.
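The reported ratio is consistent with dividing the 202 SYM types by the 200 SYM lemmas (202 / 200 = 1.01). A minimal sketch of that computation follows, assuming the ratio is indeed distinct forms over distinct lemmas. The `pairs` list is hypothetical: only the two °C forms come from the listing above, the rest are made up for illustration.

```python
from collections import defaultdict

# Hypothetical (form, lemma) pairs for SYM tokens; only the two °C forms
# are taken from the listing above, the rest are made up for illustration.
pairs = [("°C", "°C"), ("°C:ta", "°C"), ("%", "%"), ("%", "%"), (":)", ":)")]

# Collect the set of distinct forms observed for each lemma.
forms_by_lemma = defaultdict(set)
for form, lemma in pairs:
    forms_by_lemma[lemma].add(form)

n_types = len({form for form, _ in pairs})
n_lemmas = len(forms_by_lemma)
ratio = n_types / n_lemmas

# Lemmas ranked by number of distinct forms, highest first (ties by lemma).
ranking = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))

print(f"{ratio:.6f}")
print(ranking[0][0], sorted(ranking[0][1]))

# Same division applied to the totals quoted on this page.
print(f"{202 / 200:.6f}")
```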
SYM occurs with 1 feature: Case (2; 0% instances)
SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par
SYM occurs with 3 feature combinations.
The most frequent feature combination is _ (477 tokens).
Examples: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
Relations
SYM nodes are attached to their parents using 24 different relations: discourse (121; 25% instances), flat:name (95; 20% instances), nmod (50; 10% instances), punct (36; 8% instances), obj (29; 6% instances), appos (27; 6% instances), nsubj (20; 4% instances), obl (17; 4% instances), conj (16; 3% instances), root (11; 2% instances), compound:nn (10; 2% instances), cc (9; 2% instances), nsubj:cop (8; 2% instances), advcl (6; 1% instances), compound (6; 1% instances), nummod (4; 1% instances), dep (3; 1% instances), parataxis (3; 1% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), advmod (1; 0% instances), case (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)
Parents of SYM nodes belong to 11 different parts of speech: NOUN (154; 32% instances), VERB (150; 31% instances), SYM (83; 17% instances), ADJ (33; 7% instances), PROPN (26; 5% instances), [no tag; apparently the root attachments] (11; 2% instances), NUM (9; 2% instances), ADV (5; 1% instances), PRON (4; 1% instances), X (3; 1% instances), PUNCT (1; 0% instances)
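The relation and parent-tag tallies above can be sketched as a lookup over the HEAD and DEPREL columns. This is a minimal illustration, assuming that a node attached to the artificial root (head 0) is counted with an empty parent tag, which would explain the untagged entry with 11 instances matching the 11 root attachments. The `sentences` data and the `parent_stats` helper are hypothetical.

```python
from collections import Counter

# Hypothetical sentences as (id, upos, head, deprel) tuples; head 0 is the root.
sentences = [
    [(1, "NOUN", 0, "root"), (2, "NUM", 3, "nummod"),
     (3, "SYM", 1, "nmod"), (4, "SYM", 1, "discourse")],
    [(1, "SYM", 0, "root"), (2, "PUNCT", 1, "punct")],
]


def parent_stats(sents, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sents:
        pos_by_id = {tok_id: tag for tok_id, tag, _, _ in sent}
        for _, tag, head, deprel in sent:
            if tag != upos:
                continue
            relations[deprel] += 1
            # Head 0 has no tag, so root-attached nodes get an empty string.
            parent_pos[pos_by_id.get(head, "")] += 1
    return relations, parent_pos


relations, parent_pos = parent_stats(sentences, "SYM")
print(relations.most_common())
print(parent_pos.most_common())
```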
313 (65%) SYM nodes are leaves.
43 (9%) SYM nodes have one child.
66 (14%) SYM nodes have two children.
57 (12%) SYM nodes have three or more children.
The highest child degree of a SYM node is 14.
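The four child-degree groups above add up to the 479 SYM tokens (313 + 43 + 66 + 57). A minimal sketch of how such a breakdown can be derived from the HEAD column follows. The `sentence` data and the `child_degree_buckets` helper are hypothetical names chosen for illustration.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
sentence = [(1, "NUM", 2), (2, "SYM", 0), (3, "PUNCT", 2),
            (4, "SYM", 2), (5, "NOUN", 4)]


def child_degree_buckets(tokens, upos):
    """Group nodes tagged `upos` by how many children they have."""
    # Number of children per node id = how often the id occurs as a head.
    n_children = Counter(head for _, _, head in tokens)
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    buckets = Counter()
    max_degree = 0
    for tok_id, tag, _ in tokens:
        if tag != upos:
            continue
        degree = n_children[tok_id]
        buckets[labels.get(degree, "three or more children")] += 1
        max_degree = max(max_degree, degree)
    return buckets, max_degree


buckets, max_degree = child_degree_buckets(sentence, "SYM")
print(dict(buckets), max_degree)

# Consistency check on the figures quoted on this page.
print(313 + 43 + 66 + 57)
```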
Children of SYM nodes are attached using 21 different relations: punct (152; 35% instances), flat:name (90; 21% instances), nummod (54; 12% instances), nmod (22; 5% instances), nsubj:cop (18; 4% instances), conj (17; 4% instances), cop (15; 3% instances), advmod (12; 3% instances), cc (12; 3% instances), compound:nn (12; 3% instances), acl:relcl (5; 1% instances), appos (5; 1% instances), compound (4; 1% instances), obl (4; 1% instances), mark (3; 1% instances), acl (2; 0% instances), amod (2; 0% instances), advcl (1; 0% instances), case (1; 0% instances), nmod:poss (1; 0% instances), nsubj (1; 0% instances)
Children of SYM nodes belong to 13 different parts of speech: PUNCT (152; 35% instances), SYM (82; 19% instances), NUM (73; 17% instances), NOUN (63; 15% instances), AUX (15; 3% instances), ADV (13; 3% instances), CCONJ (12; 3% instances), VERB (9; 2% instances), ADJ (5; 1% instances), PRON (3; 1% instances), PROPN (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)