This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: PUNCT

There are 34 PUNCT lemmas (1%), 42 PUNCT types (0%) and 3964 PUNCT tokens (15%). Out of 17 observed tags, the rank of PUNCT is: 10 in number of lemmas, 15 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ·, …, :, (…), ⁞, |, ·:·, ·:, [·], ·

The 10 most frequent PUNCT types: ·, …, :, (…), ⁞, |, ·:·, :], ·:, [·]

The 10 most frequent ambiguous lemmas: (PUNCT 1217, X 5), ·-· (PUNCT 3, NUM 2), (-) (PUNCT 2, X 2), :-: (PUNCT 2, NUM 1), :-҃: (PUNCT 2, NUM 1), —– (X 11, PUNCT 1)

The 10 most frequent ambiguous types: (PUNCT 1208, X 10), (-) (PUNCT 2, X 2), :-҃: (PUNCT 2, NUM 1), —– (X 11, PUNCT 1)

Morphology

The form / lemma ratio of PUNCT is 1.235294 (the average of all parts of speech is 2.412613).
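The reported ratio is consistent with dividing the number of distinct PUNCT types (42) by the number of distinct PUNCT lemmas (34), both given above. The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and reading "forms" as "types" is an assumption that happens to reproduce the figure on this page.

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    if n_lemmas == 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


# Counts taken from this page: 42 PUNCT types, 34 PUNCT lemmas.
ratio = form_lemma_ratio(42, 34)
print(f"{ratio:.6f}")  # 1.235294
```

A ratio this close to 1 means that most PUNCT lemmas occur in a single form; the lemmas listed below with 3 to 5 forms account for most of the surplus.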

The highest number of forms (5) was observed with the lemma “:”: :, :), :], [:, ·.

The 2nd highest number of forms (4) was observed with the lemma “·”: (·, [·, ·, ·].

The 3rd highest number of forms (3) was observed with the lemma “…”: (…, …, …).

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 relation: punct (3964; 100% of instances)

Parents of PUNCT nodes belong to 15 different parts of speech: NOUN (1193; 30% of instances), VERB (1157; 29% of instances), PROPN (811; 20% of instances), X (258; 7% of instances), ADJ (176; 4% of instances), PRON (121; 3% of instances), NUM (100; 3% of instances), DET (78; 2% of instances), ADV (27; 1% of instances), ADP (17; 0% of instances), PART (11; 0% of instances), CCONJ (8; 0% of instances), SCONJ (5; 0% of instances), AUX (1; 0% of instances), SYM (1; 0% of instances)
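Counts like the relation and parent-tag distributions above can be recomputed from a CoNLL-U file. The sketch below is not the script that generated this page; it assumes the standard ten tab-separated CoNLL-U columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC) and sentences separated by a single empty line. The function name `punct_attachments` and the two-sentence `SAMPLE` are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def punct_attachments(conllu, upos="PUNCT"):
    """Count DEPREL values and parent UPOS tags for nodes tagged `upos`."""
    relations, parents = Counter(), Counter()
    # Sentences are separated by blank lines in CoNLL-U.
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.split("\n")
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        upos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1
                # HEAD 0 is the artificial root, which has no UPOS tag.
                parents[upos_by_id.get(r[6], "ROOT")] += 1
    return relations, parents


# Hypothetical sample with placeholder word forms (not treebank data).
SAMPLE = "\n".join([
    "# hypothetical sentence 1",
    "1\ta\ta\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\tb\tb\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t·\t·\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# hypothetical sentence 2",
    "1\tc\tc\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t:\t:\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

relations, parents = punct_attachments(SAMPLE)
print(dict(relations))  # {'punct': 2}
print(dict(parents))    # {'VERB': 1, 'NOUN': 1}
```

Dividing each count by the total number of PUNCT tokens and rounding to whole percent would give figures in the style shown above, which is why small counts appear as 0%.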

3964 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.
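The leaf count and the highest child degree can be checked in the same way: a node's child degree is the number of nodes whose HEAD column points to it, and a leaf has degree 0. This is a minimal sketch under the same assumptions as standard CoNLL-U (ten tab-separated columns, sentences separated by one empty line); `child_degrees` and `SAMPLE` are hypothetical names and data, not part of the treebank tooling.

```python
from collections import Counter


def child_degrees(conllu, upos="PUNCT"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.split("\n")
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        # Each HEAD value contributes one child to the node with that ID.
        n_children = Counter(r[6] for r in rows)
        degrees.extend(n_children[r[0]] for r in rows if r[3] == upos)
    return degrees


# Hypothetical sample with placeholder word forms (not treebank data).
SAMPLE = "\n".join([
    "1\ta\ta\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\tb\tb\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t·\t·\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "1\tc\tc\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t:\t:\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

degrees = child_degrees(SAMPLE)
leaves = sum(1 for d in degrees if d == 0)
print(leaves, max(degrees))  # 2 0
```

A result where every PUNCT node is a leaf matches the UD convention that punctuation does not take dependents.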