home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-AnCora: POS Tags: PUNCT

There are 17 PUNCT lemmas (0%), 18 PUNCT types (0%) and 65625 PUNCT tokens (12%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 13 in number of types and 4 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., “, -, (, ), :, ?, ;, ¿

The 10 most frequent PUNCT types: ,, ., “, -, (, ), :, ?, ;, ¿

The 10 most frequent ambiguous lemmas: etcétera (PUNCT 15, NOUN 1)

The 10 most frequent ambiguous types: etcétera (PUNCT 13, NOUN 1)

Morphology

The form / lemma ratio of PUNCT is 1.058824 (the average of all parts of speech is 1.505808).

The 1st highest number of forms (2) was observed with the lemma “etcétera”: etc, etcétera.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: .

PUNCT occurs with 2 features: PunctType (65610; 100% instances), PunctSide (4431; 7% instances)

PUNCT occurs with 12 feature-value pairs: PunctSide=Fin, PunctSide=Ini, PunctType=Brck, PunctType=Colo, PunctType=Comm, PunctType=Dash, PunctType=Elip, PunctType=Excl, PunctType=Peri, PunctType=Qest, PunctType=Quot, PunctType=Semi

PUNCT occurs with 14 feature combinations. The most frequent feature combination is PunctType=Comm (30187 tokens). Examples: ,

Relations

PUNCT nodes are attached to their parents using 2 different relations: punct (65613; 100% instances), root (12; 0% instances)

Parents of PUNCT nodes belong to 18 different parts of speech: VERB (30122; 46% instances), NOUN (16366; 25% instances), PROPN (7213; 11% instances), ADJ (5152; 8% instances), ADV (1854; 3% instances), ADP (1193; 2% instances), PRON (1009; 2% instances), NUM (987; 2% instances), AUX (762; 1% instances), CCONJ (306; 0% instances), DET (211; 0% instances), SYM (159; 0% instances), INTJ (152; 0% instances), PART (79; 0% instances), SCONJ (24; 0% instances), PUNCT (23; 0% instances), (12; 0% instances), X (1; 0% instances)

65614 (100%) PUNCT nodes are leaves.

0 (0%) PUNCT nodes have one child.

3 (0%) PUNCT nodes have two children.

8 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 7.

Children of PUNCT nodes are attached using 2 different relations: punct (23; 68% instances), conj (11; 32% instances)

Children of PUNCT nodes belong to 6 different parts of speech: PUNCT (23; 68% instances), VERB (4; 12% instances), NOUN (3; 9% instances), PROPN (2; 6% instances), ADJ (1; 3% instances), PRON (1; 3% instances)