Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PUNCT
There are 95 PUNCT lemmas (1%), 103 PUNCT types (1%) and 15246 PUNCT tokens (12%).
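Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy input are hypothetical, and it assumes that types are distinct word forms compared case-sensitively and that multiword-token ranges and empty nodes are not counted.

```python
def upos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "3-4") and empty nodes (e.g. "5.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            forms.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(forms), tokens


# Toy input, invented for illustration (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = Ciao !! Ciao !",
    "1\tCiao\tciao\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t!!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "3\tCiao\tciao\tINTJ\t_\t_\t1\tparataxis\t_\t_",
    "4\t!\t!\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])

print(upos_counts(SAMPLE, "PUNCT"))
```

In the toy input the two PUNCT tokens share the lemma "!" but have two distinct forms, so the function reports one lemma, two types and two tokens.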
Out of 16 observed tags, PUNCT ranks 10th in number of lemmas, 13th in number of types and 2nd in number of tokens.
The 10 most frequent PUNCT lemmas: ., ,, :, ?, !, …, “, ), (, -
The 10 most frequent PUNCT types: ., ,, :, ?, !, …, “, ), (, -
The 10 most frequent ambiguous lemmas: - (PUNCT 325, PROPN 4, SYM 1), ’ (PUNCT 215, X 1), …. (PUNCT 98, PROPN 1), / (PUNCT 71, SYM 1), = (PUNCT 16, SYM 1), + (SYM 29, PUNCT 15, ADV 1), > (PUNCT 12, SYM 2), – (PUNCT 6, X 1), _ (X 9, PUNCT 6), < (PUNCT 3, SYM 3)
The 10 most frequent ambiguous types: - (PUNCT 325, PROPN 4, SYM 1), ’ (PUNCT 215, X 1), …. (PUNCT 99, PROPN 1), / (PUNCT 71, SYM 1), = (PUNCT 16, SYM 1), + (SYM 29, PUNCT 15, ADV 2), > (PUNCT 12, SYM 2), – (PUNCT 6, X 1), < (PUNCT 3, SYM 3), ): (PUNCT 1, SYM 1)
- -
- ’
- ….
- /
- =
- +
  - SYM 29: BORSA MILANO : POSITIVA ( + 0,5 % ) DOPO AVVIO GOVERNO MONTI
  - PUNCT 15: @user :: nemmeno un governo Monti ‘ di scopo ‘ ? : legge elett . + rientro in UE + elez. primavera ?
  - ADV 2: Governo #Monti : la fiducia + ampia di la storia di la Repubblica Italiana da il peggior Parlamento di la storia di la Repubblica Italiana . 2+2
- >
- –
- <
- ):
Morphology
The form / lemma ratio of PUNCT is 1.084211 (the average of all parts of speech is 1.310882).
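The reported ratio is consistent with dividing the number of PUNCT types by the number of PUNCT lemmas given at the top of this page. The sketch below assumes that this is how the ratio is defined; the variable names are illustrative.

```python
# Figures from this page: 103 PUNCT types (forms) and 95 PUNCT lemmas.
n_types = 103
n_lemmas = 95

# Assumed definition: distinct forms per distinct lemma.
ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 1.084211
```

A ratio this close to 1 means that most PUNCT lemmas have a single form; the excess comes from a few lemmas with repeated-character variants.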
The highest number of forms (10) was observed with the lemma “!”: !, !!, !!!, !!!!, !!!!!, !!!!!!, !!!!!!!, !!!!!!!!, !!!!!!!!!, !!!!!!!!!!!!!!.
The 2nd highest number of forms (3) was observed with the lemma “…”: …, …., ………...
The 3rd highest number of forms (3) was observed with the lemma “?”: ?, ??, ???.
PUNCT does not occur with any features.
Relations
PUNCT nodes are attached to their parents using 1 relation: punct (15246; 100% instances)
Parents of PUNCT nodes belong to 15 different parts of speech: VERB (5394; 35% instances), NOUN (4070; 27% instances), PROPN (2025; 13% instances), SYM (1099; 7% instances), ADJ (832; 5% instances), INTJ (457; 3% instances), PRON (388; 3% instances), NUM (379; 2% instances), ADV (308; 2% instances), X (192; 1% instances), PUNCT (37; 0% instances), DET (33; 0% instances), AUX (15; 0% instances), ADP (11; 0% instances), CCONJ (6; 0% instances)
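The percentages above can be recomputed from the raw counts. This is a small sketch that assumes each share is the count divided by the total number of PUNCT tokens and rounded to the nearest whole percent; the dictionary name `parent_counts` is illustrative.

```python
# Parent part-of-speech counts for PUNCT nodes, copied from this page.
parent_counts = {
    "VERB": 5394, "NOUN": 4070, "PROPN": 2025, "SYM": 1099, "ADJ": 832,
    "INTJ": 457, "PRON": 388, "NUM": 379, "ADV": 308, "X": 192,
    "PUNCT": 37, "DET": 33, "AUX": 15, "ADP": 11, "CCONJ": 6,
}

total = sum(parent_counts.values())  # should equal the PUNCT token count

# Assumed rounding: nearest whole percent.
percent = {pos: round(100 * n / total) for pos, n in parent_counts.items()}

print(total)
for pos, n in parent_counts.items():
    print(f"{pos}: {n} ({percent[pos]}%)")
```

The counts sum to 15246, the number of PUNCT tokens, so every PUNCT node has exactly one recorded parent category.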
15227 (100%) PUNCT nodes are leaves.
1 (0%) PUNCT nodes have one child.
18 (0%) PUNCT nodes have two children.
The highest child degree of a PUNCT node is 2.
Children of PUNCT nodes are attached using 1 relation: punct (37; 100% instances)
Children of PUNCT nodes belong to 1 part of speech: PUNCT (37; 100% instances)
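The child-degree figures and the child counts in this section can be cross-checked against each other. The sketch below only does arithmetic on numbers quoted above; the name `degree_counts` is illustrative.

```python
# Number of PUNCT nodes by child degree, from this page:
# 15227 leaves, 1 node with one child, 18 nodes with two children.
degree_counts = {0: 15227, 1: 1, 2: 18}

# Every PUNCT token falls into exactly one degree class.
total_nodes = sum(degree_counts.values())

# Total children = sum over classes of (degree * number of nodes).
total_children = sum(degree * n for degree, n in degree_counts.items())

print(total_nodes)     # 15246
print(total_children)  # 37
```

Both results agree with the page: 15246 PUNCT tokens in total, and 37 children of PUNCT nodes, all of them attached as punct.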