home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_French-PROFITEROLE: POS Tags: PUNCT

There are 17 PUNCT lemmas (0%), 23 PUNCT types (0%) and 31393 PUNCT tokens (13%). Out of 15 observed tags, the rank of PUNCT is: 12 in number of lemmas, 13 in number of types and 3 in number of tokens.

The 10 most frequent PUNCT lemmas: _, ,, ., ;, :, “, !, -, «, »

The 10 most frequent PUNCT types: ,, ., ;, :, «, », !, -, ?, .´

The 10 most frequent ambiguous lemmas: _ (VERB 13207, NOUN 12804, PUNCT 12347, DET 11020, PRON 10386, ADP 9918, ADV 9278, CCONJ 4482, PROPN 3203, AUX 2986, SCONJ 2870, ADJ 2419, NUM 300, INTJ 47, X 8)

The 10 most frequent ambiguous types: ) (PUNCT 35, DET 1)

Morphology

The form / lemma ratio of PUNCT is 1.352941 (the average of all parts of speech is 3.463337).

The 1st highest number of forms (16) was observed with the lemma “_”: !, (, ), ,, -, -,, ., :, ;, ?, «, », ’”, “, “‘, ”.

The 2nd highest number of forms (6) was observed with the lemma “””: ”, «, », ’”, “, ”.

The 3rd highest number of forms (2) was observed with the lemma “’”: ’, ’.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (31393; 100% instances)

Parents of PUNCT nodes belong to 14 different parts of speech: VERB (24754; 79% instances), NOUN (3484; 11% instances), ADJ (1352; 4% instances), PRON (602; 2% instances), PROPN (563; 2% instances), ADV (506; 2% instances), INTJ (41; 0% instances), AUX (20; 0% instances), ADP (16; 0% instances), DET (15; 0% instances), CCONJ (14; 0% instances), NUM (14; 0% instances), SCONJ (11; 0% instances), X (1; 0% instances)

31393 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.