home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hindi-PUD: POS Tags: X

There are 1 X lemmas (6%), 10 X types (0%) and 11 X tokens (0%). Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 14 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: वीं, आनन, ए, कि, थ़ॉट, न्यू, पहला, भरकम, यह, या

The 10 most frequent ambiguous lemmas: _ (NOUN 5597, ADP 4849, PUNCT 2297, VERB 2058, ADJ 1995, AUX 1776, PROPN 1358, PRON 1128, DET 876, CCONJ 545, NUM 452, SCONJ 382, PART 316, ADV 159, SYM 30, X 11)

The 10 most frequent ambiguous types: ए (DET 4, NOUN 1, X 1), कि (SCONJ 205, ADP 4, VERB 1, X 1), न्यू (ADJ 3, X 1), पहला (ADJ 4, X 1), यह (PRON 74, DET 29, X 1), या (CCONJ 38, SCONJ 1, X 1)

Morphology

The form / lemma ratio of X is 10.000000 (the average of all parts of speech is 345.375000).

The 1st highest number of forms (10) was observed with the lemma “_”: आनन, ए, कि, थ़ॉट, न्यू, पहला, भरकम, यह, या, वीं.

X occurs with 1 features: Foreign (3; 27% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (8 tokens). Examples: वीं, आनन, ए, कि, पहला, भरकम, यह

Relations

X nodes are attached to their parents using 6 different relations: dep (3; 27% instances), fixed (3; 27% instances), flat (2; 18% instances), advmod (1; 9% instances), nsubj (1; 9% instances), root (1; 9% instances)

Parents of X nodes belong to 5 different parts of speech: VERB (4; 36% instances), ADJ (2; 18% instances), NUM (2; 18% instances), X (2; 18% instances), (1; 9% instances)

6 (55%) X nodes are leaves.

2 (18%) X nodes have one child.

1 (9%) X nodes have two children.

2 (18%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 11 different relations: punct (3; 21% instances), flat (2; 14% instances), acl (1; 7% instances), aux (1; 7% instances), case (1; 7% instances), ccomp (1; 7% instances), cop (1; 7% instances), fixed (1; 7% instances), iobj (1; 7% instances), list (1; 7% instances), nsubj (1; 7% instances)

Children of X nodes belong to 8 different parts of speech: PUNCT (3; 21% instances), AUX (2; 14% instances), NOUN (2; 14% instances), VERB (2; 14% instances), X (2; 14% instances), ADJ (1; 7% instances), ADP (1; 7% instances), PRON (1; 7% instances)