
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: PUNCT

There are 15 PUNCT lemmas (0%), 53 PUNCT types (0%) and 41908 PUNCT tokens (13%). Out of 16 observed tags, the rank of PUNCT is: 6 in number of lemmas, 13 in number of types and 4 in number of tokens.
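As a rough illustration of how the three counts above can be obtained, the sketch below counts distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. The function `pos_counts`, the helper `row` and the sample sentence are hypothetical and not part of the treebank tooling; the sketch also assumes that forms are compared case-sensitively, which may differ from how the official statistics were computed.

```python
def pos_counts(conllu_text, upos="PUNCT"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:  # column 4 is UPOS
            lemmas.add(cols[2])  # column 3 is LEMMA
            types.add(cols[1])   # column 2 is FORM
            tokens += 1
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, tag, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for the sample)."""
    return "\t".join([str(idx), form, lemma, tag, "_", "_", str(head), deprel, "_", "_"])


# Made-up sentence, not taken from UD_Portuguese-GSD.
SAMPLE = "\n".join([
    "# text = Sim, sim, sim.",
    row(1, "Sim", "sim", "INTJ", 0, "root"),
    row(2, ",", ",", "PUNCT", 3, "punct"),
    row(3, "sim", "sim", "INTJ", 1, "conj"),
    row(4, ",", ",", "PUNCT", 5, "punct"),
    row(5, "sim", "sim", "INTJ", 1, "conj"),
    row(6, ".", ".", "PUNCT", 1, "punct"),
])

print(pos_counts(SAMPLE))  # (2, 2, 3)
```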

The 10 most frequent PUNCT lemmas: ,, ., “, -, ), (, _, :, /, ?

The 10 most frequent PUNCT types: ,, ., “, -, ), (, –, :, ‘, /

The 10 most frequent ambiguous lemmas: _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1)

The 10 most frequent ambiguous types: . (PUNCT 11427, NUM 1), (PUNCT 2896, NOUN 5), (PUNCT 217, NOUN 7), ? (PUNCT 131, X 1), º (PUNCT 113, NOUN 20), ² (PUNCT 63, NOUN 1), ] (PUNCT 34, X 1), x (PUNCT 16, X 6, NOUN 1), ° (PUNCT 12, NOUN 11), + (X 13, PUNCT 6, PROPN 1)

Morphology

The form / lemma ratio of PUNCT is 3.533333 (the average of all parts of speech is 3.372737).
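The reported ratio is consistent with dividing the number of PUNCT types by the number of PUNCT lemmas given earlier on this page. That the ratio is defined exactly this way is an assumption; the check below only shows that the figures agree.

```python
# Figures quoted on this page for PUNCT.
punct_types = 53
punct_lemmas = 15

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = punct_types / punct_lemmas
print(f"{ratio:.6f}")  # 3.533333
```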

The 1st highest number of forms (42) was observed with the lemma “_”: #, ‘, ‘’, ‘s, **, ,, –, —, .., …, …., ……, :/, ;, =, ==, ====, A.D., AC, I., [, \, ], ^, _, a.C., d.C., x, {, |, }, §, ª, «, °, ², ³, ¹, º, », ¿, •.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: .

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 4 different relations: punct (41905; 100% instances), cop (1; 0% instances), dep (1; 0% instances), det:poss (1; 0% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (18871; 45% instances), NOUN (10201; 24% instances), PROPN (7658; 18% instances), NUM (1632; 4% instances), ADV (1231; 3% instances), ADJ (849; 2% instances), PRON (658; 2% instances), PART (371; 1% instances), SYM (198; 0% instances), X (130; 0% instances), CCONJ (45; 0% instances), ADP (43; 0% instances), AUX (12; 0% instances), PUNCT (5; 0% instances), DET (4; 0% instances)
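The percentages above appear to be rounded to the nearest whole number, which is why small but non-zero counts are shown as 0%. The sketch below recomputes them from the parent counts listed on this page; the rounding rule is an assumption that happens to reproduce the published figures.

```python
# Parent part-of-speech counts for PUNCT nodes, copied from this page.
parent_counts = {
    "VERB": 18871, "NOUN": 10201, "PROPN": 7658, "NUM": 1632, "ADV": 1231,
    "ADJ": 849, "PRON": 658, "PART": 371, "SYM": 198, "X": 130,
    "CCONJ": 45, "ADP": 43, "AUX": 12, "PUNCT": 5, "DET": 4,
}

total = sum(parent_counts.values())

# Assumed rounding: nearest integer percentage.
percentages = {tag: round(100 * n / total) for tag, n in parent_counts.items()}

print(total)                # 41908, the number of PUNCT tokens
print(percentages["VERB"])  # 45
print(percentages["SYM"])   # 0
```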

41905 (100%) PUNCT nodes are leaves.

1 (0%) PUNCT node has one child.

2 (0%) PUNCT nodes have two children.

The highest child degree of a PUNCT node is 2.

Children of PUNCT nodes are attached using a single relation: punct (5; 100% instances)

Children of PUNCT nodes belong to a single part of speech: PUNCT (5; 100% instances)
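The child-degree figures in this section can be cross-checked against each other, and the same kind of histogram can be derived from the HEAD column of a tree. The sketch below does both; `child_degree_histogram` and the sample tree are hypothetical illustrations, not the script that generated this page.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="PUNCT"):
    """sentence: list of (id, head, upos) tuples for one tree; head 0 marks the root.

    Returns a Counter mapping number of children -> number of nodes with that tag.
    """
    # Count how many nodes name each id as their head.
    n_children = Counter(head for _, head, _ in sentence)
    return Counter(n_children[i] for i, _, tag in sentence if tag == upos)


# Made-up tree: node 4 is a PUNCT node that governs another PUNCT node (5).
tree = [
    (1, 2, "PUNCT"),
    (2, 0, "NOUN"),
    (3, 2, "PUNCT"),
    (4, 2, "PUNCT"),
    (5, 4, "PUNCT"),
]
hist = child_degree_histogram(tree)
print(hist[0], hist[1])  # 3 1

# Consistency check on the figures quoted on this page.
leaves, one_child, two_children = 41905, 1, 2
assert leaves + one_child + two_children == 41908  # all PUNCT tokens
assert one_child * 1 + two_children * 2 == 5       # children of PUNCT nodes
```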