Treebank Statistics: UD_Spanish-AnCora: Features: PunctType
This feature is language-specific.
It occurs with 9 different values: Brck, Colo, Comm, Dash, Excl, Peri, Qest, Quot, Semi.
65544 tokens (12%) have a non-empty value of PunctType.
18 types (0%) occur at least once with a non-empty value of PunctType.
17 lemmas (0%) occur at least once with a non-empty value of PunctType.
The feature is used with 2 part-of-speech tags: PUNCT (65543; 12% instances), NOUN (1; 0% instances).
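Counts like the ones above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout, the function name `punct_type_stats` is hypothetical, the sample sentence is invented for illustration, and treating word forms case-sensitively as "types" is an assumption.

```python
from collections import Counter


def punct_type_stats(conllu_text, feature="PunctType"):
    """Count tokens, word forms, lemmas, UPOS tags and values carrying `feature`."""
    tokens = 0
    forms, lemmas = set(), set()
    by_upos, by_value = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        # FEATS is column 6: "Name=Value|Name=Value" or "_" when empty
        feats = dict(
            pair.split("=", 1) for pair in cols[5].split("|") if "=" in pair
        )
        if feature not in feats:
            continue
        tokens += 1
        forms.add(cols[1])    # FORM (case-sensitive: an assumption)
        lemmas.add(cols[2])   # LEMMA
        by_upos[cols[3]] += 1  # UPOS
        by_value[feats[feature]] += 1
    return {
        "tokens": tokens,
        "types": len(forms),
        "lemmas": len(lemmas),
        "by_upos": dict(by_upos),
        "by_value": dict(by_value),
    }


# Invented sample sentence, not taken from the treebank.
sample = "\n".join([
    "# text = Sí, etcétera.",
    "1\tSí\tsí\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\tPunctType=Comm\t1\tpunct\t_\t_",
    "3\tetcétera\tetcétera\tNOUN\t_\tPunctType=Comm\t1\tconj\t_\t_",
    "4\t.\t.\tPUNCT\t_\tPunctType=Peri\t1\tpunct\t_\t_",
])
stats = punct_type_stats(sample)
print(stats)
```

Run over the full treebank files instead of the sample, the `by_upos` entry would correspond to the per-tag breakdown reported above, provided the counting conventions match.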
PUNCT
65543 PUNCT tokens (100% of all PUNCT tokens) have a non-empty value of PunctType.
PUNCT tokens may have the following values of PunctType:
- Brck (3647; 6% of non-empty PunctType): (, )
- Colo (768; 1% of non-empty PunctType): :, /
- Comm (30261; 46% of non-empty PunctType): ,, …, etcétera, etc
- Dash (2870; 4% of non-empty PunctType): -, “, .
- Excl (137; 0% of non-empty PunctType): !, ¡
- Peri (17511; 27% of non-empty PunctType): .
- Qest (640; 1% of non-empty PunctType): ?, ¿
- Quot (9404; 14% of non-empty PunctType): ”, ‘, `, .
- Semi (305; 0% of non-empty PunctType): ;
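As a consistency check, the percentages for PUNCT can be recomputed from the raw counts. The sketch below assumes that each share is the count divided by the total of non-empty PunctType values on PUNCT tokens, rounded to a whole percent; the rounding rule is an assumption, though it agrees with every figure listed.

```python
# Counts copied from the PUNCT value list on this page.
counts = {
    "Brck": 3647,
    "Colo": 768,
    "Comm": 30261,
    "Dash": 2870,
    "Excl": 137,
    "Peri": 17511,
    "Qest": 640,
    "Quot": 9404,
    "Semi": 305,
}

# The nine counts add up to the number of PUNCT tokens with PunctType.
total = sum(counts.values())

# Share of each value, rounded to a whole percent (assumed rounding rule).
shares = {value: round(100 * n / total) for value, n in counts.items()}

for value, pct in shares.items():
    print(f"{value}: {counts[value]} ({pct}%)")
```

Values such as Excl and Semi show 0% only because of rounding; both shares are below half a percent.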
| Paradigm . | Dash | Peri | Quot |
|---|---|---|---|
| | . | . | . |
NOUN
1 NOUN token (0% of all NOUN tokens) has a non-empty value of PunctType.
The most frequent other feature values with which NOUN and PunctType co-occurred: Gender=EMPTY (1; 100%), Number=EMPTY (1; 100%).
NOUN tokens may have the following values of PunctType:
- Comm (1; 100% of non-empty PunctType): etcétera