Treebank Statistics: UD_Hindi-HDTB: POS Tags: PUNCT
There are 19 PUNCT
lemmas (0%), 19 PUNCT
types (0%) and 23455 PUNCT
tokens (7%).
Out of 16 observed tags, the rank of PUNCT
is: 13 in number of lemmas, 14 in number of types and 6 in number of tokens.
The 10 most frequent PUNCT
lemmas: ।, ,, -, ., ‘, (, ), -JOIN, ?, ‘1
The 10 most frequent PUNCT
types: ।, ,, -, ., ‘, (, ), -JOIN, ?, ‘1
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PUNCT
is 1.000000 (the average of all parts of speech is 1.200484).
The 1st highest number of forms (1) was observed with the lemma “!”: !.
The 2nd highest number of forms (1) was observed with the lemma “””: “.
The 3rd highest number of forms (1) was observed with the lemma “’”: ‘.
PUNCT
occurs with 5 features: Case (1; 0% instances), Gender (1; 0% instances), Mood (1; 0% instances), Number (1; 0% instances), VerbForm (1; 0% instances)
PUNCT
occurs with 5 feature-value pairs: Case=Nom
, Gender=Fem
, Mood=Sub
, Number=Sing
, VerbForm=Fin
PUNCT
occurs with 3 feature combinations.
The most frequent feature combination is _
(23453 tokens).
Examples: ।, ,, -, ., ‘, ), (, -JOIN, ?, ‘1
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (23455; 100% instances)
Parents of PUNCT
nodes belong to 15 different parts of speech: VERB (15739; 67% instances), PROPN (3218; 14% instances), NOUN (2951; 13% instances), ADJ (1007; 4% instances), PRON (199; 1% instances), NUM (155; 1% instances), ADV (124; 1% instances), DET (38; 0% instances), PART (10; 0% instances), ADP (4; 0% instances), CCONJ (4; 0% instances), AUX (2; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)
23455 (100%) PUNCT
nodes are leaves.
The highest child degree of a PUNCT
node is 0.