home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: PUNCT

There are 7 PUNCT lemmas (2%), 6 PUNCT types (1%) and 200 PUNCT tokens (10%). Out of 16 observed tags, the rank of PUNCT is: 11 in number of lemmas, 12 in number of types and 4 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ?, _, !, ., :, ;

The 10 most frequent PUNCT types: ,, ?, ., !, :, ;

The 10 most frequent ambiguous lemmas: _ (VERB 46, NOUN 41, ADV 24, PRON 18, PROPN 17, X 16, ADP 14, PUNCT 13, PART 3, DET 2)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PUNCT is 0.857143 (the average of all parts of speech is 1.661638).

The 1st highest number of forms (4) was observed with the lemma “_”: !, ., :, ?.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “,”: ,.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (200; 100% instances)

Parents of PUNCT nodes belong to 9 different parts of speech: VERB (115; 57% instances), NOUN (46; 23% instances), PROPN (13; 7% instances), X (6; 3% instances), PRON (5; 3% instances), ADP (4; 2% instances), ADV (4; 2% instances), PART (4; 2% instances), INTJ (3; 2% instances)

200 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.