home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-AnCora: POS Tags: PUNCT

There are 17 PUNCT lemmas (0%), 18 PUNCT types (0%) and 65625 PUNCT tokens (12%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 13 in number of types and 4 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., “, -, (, ), :, ?, ;, ¿

The 10 most frequent PUNCT types: ,, ., “, -, (, ), :, ?, ;, ¿

The 10 most frequent ambiguous lemmas: etcétera (PUNCT 15, NOUN 1)

The 10 most frequent ambiguous types: etcétera (PUNCT 13, NOUN 1)

Morphology

The form / lemma ratio of PUNCT is 1.058824 (the average of all parts of speech is 1.505634).

The 1st highest number of forms (2) was observed with the lemma “etcétera”: etc, etcétera.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: .

PUNCT occurs with 2 features: PunctType (65543; 100% instances), PunctSide (4424; 7% instances)

PUNCT occurs with 11 feature-value pairs: PunctSide=Fin, PunctSide=Ini, PunctType=Brck, PunctType=Colo, PunctType=Comm, PunctType=Dash, PunctType=Excl, PunctType=Peri, PunctType=Qest, PunctType=Quot, PunctType=Semi

PUNCT occurs with 13 feature combinations. The most frequent feature combination is PunctType=Comm (30261 tokens). Examples: ,, …, etcétera, etc

Relations

PUNCT nodes are attached to their parents using 2 different relations: punct (65613; 100% instances), root (12; 0% instances)

Parents of PUNCT nodes belong to 18 different parts of speech: VERB (30122; 46% instances), NOUN (16361; 25% instances), PROPN (7214; 11% instances), ADJ (5152; 8% instances), ADV (1854; 3% instances), ADP (1193; 2% instances), PRON (1022; 2% instances), NUM (987; 2% instances), AUX (762; 1% instances), CCONJ (306; 0% instances), DET (201; 0% instances), SYM (159; 0% instances), INTJ (152; 0% instances), PART (79; 0% instances), SCONJ (25; 0% instances), PUNCT (23; 0% instances), (12; 0% instances), X (1; 0% instances)

65614 (100%) PUNCT nodes are leaves.

0 (0%) PUNCT nodes have one child.

3 (0%) PUNCT nodes have two children.

8 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 7.

Children of PUNCT nodes are attached using 2 different relations: punct (23; 68% instances), conj (11; 32% instances)

Children of PUNCT nodes belong to 6 different parts of speech: PUNCT (23; 68% instances), VERB (4; 12% instances), NOUN (3; 9% instances), PROPN (2; 6% instances), ADJ (1; 3% instances), PRON (1; 3% instances)