
This page pertains to UD version 2.

Treebank Statistics: UD_Nepali-BK: POS Tags: X

There are 2 X lemmas (1%), 5 X types (1%) and 5 X tokens (1%). Out of 15 observed tags, the rank of X is: 15 in number of lemmas, 12 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: -, कर्मण्येवाधिकारस्ते

The 10 most frequent X types: कर्तव्यम्, कर्मण्येवाधिकारस्ते, द्रव्यम्, न्याययतेन, पारलौकिकम्

The 10 most frequent ambiguous lemmas: - (X 4, PUNCT 3)

The 10 most frequent ambiguous types: (none)

Morphology

The form / lemma ratio of X is 2.500000 (the average of all parts of speech is 1.329630).
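The ratio above can be reproduced with a short sketch. It assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, which matches 5 / 2 = 2.5 for X but may not be exactly how the statistics generator computes it. The function name `form_lemma_ratio`, the helper `row`, and the hand-built sample rows are hypothetical; the sample reuses only the forms and lemmas listed on this page and is not taken from the treebank file.

```python
from collections import defaultdict


def form_lemma_ratio(conllu_text, upos):
    """Return (ratio, forms_by_lemma) for tokens tagged `upos`.

    Assumption: ratio = distinct forms / distinct lemmas.
    """
    forms_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip malformed rows, multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    ratio = n_forms / len(forms_by_lemma) if forms_by_lemma else 0.0
    return ratio, dict(forms_by_lemma)


def row(idx, form, lemma, upos):
    """Build one toy CoNLL-U row; unused columns are left as "_"."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Hypothetical sample built from the forms and lemmas listed on this page.
pairs = [
    ("कर्तव्यम्", "-"),
    ("द्रव्यम्", "-"),
    ("न्याययतेन", "-"),
    ("पारलौकिकम्", "-"),
    ("कर्मण्येवाधिकारस्ते", "कर्मण्येवाधिकारस्ते"),
]
sample = "\n".join(row(i + 1, form, lemma, "X") for i, (form, lemma) in enumerate(pairs))

ratio, forms = form_lemma_ratio(sample, "X")
print(ratio)  # 2.5
```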

The 1st highest number of forms (4) was observed with the lemma “-”: कर्तव्यम्, द्रव्यम्, न्याययतेन, पारलौकिकम्.

The 2nd highest number of forms (1) was observed with the lemma “कर्मण्येवाधिकारस्ते”: कर्मण्येवाधिकारस्ते.

X occurs with 1 feature: Foreign (2; 40% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (3 tokens). Examples: द्रव्यम्, न्याययतेन, पारलौकिकम्

Relations

X nodes are attached to their parents using 3 different relations: dep (3; 60% instances), ccomp (1; 20% instances), parataxis (1; 20% instances)

Parents of X nodes belong to 2 different parts of speech: X (3; 60% instances), VERB (2; 40% instances)
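The "count; percentage" pairs in these lines follow from simple counting. The sketch below is an illustration, not the generator's actual code: the function name `distribution` is hypothetical, and rounding to the nearest whole percent is an assumption. The input list restates the relation counts given above.

```python
from collections import Counter


def distribution(labels):
    """Return (label, count, rounded percent) tuples, most frequent first."""
    counts = Counter(labels)
    total = sum(counts.values())
    return [(label, n, round(100 * n / total)) for label, n in counts.most_common()]


# Relation labels of the 5 X tokens, restated from the counts above.
relations = ["dep"] * 3 + ["ccomp"] + ["parataxis"]
for label, n, pct in distribution(relations):
    print(f"{label} ({n}; {pct}% instances)")
```

The same function applies to parent parts of speech, for example `distribution(["X"] * 3 + ["VERB"] * 2)`.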

3 (60%) X nodes are leaves.

0 (0%) X nodes have one child.

0 (0%) X nodes have two children.

2 (40%) X nodes have three or more children.

The highest child degree of an X node is 6.
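Leaf counts and child degrees like those above can be derived from the HEAD column of a CoNLL-U file. The following is a minimal sketch under stated assumptions: the function name `child_degrees` is hypothetical, and the toy sentence is invented for illustration and does not come from UD_Nepali-BK.

```python
from collections import Counter


def child_degrees(conllu_text, upos):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    sentence = []  # (id, head, upos) triples of the current sentence

    def flush():
        # Count how often each node id appears as the HEAD of another node.
        children = Counter(head for _, head, _ in sentence)
        degrees.extend(children[tid] for tid, _, tag in sentence if tag == upos)
        sentence.clear()

    # The extra "" makes sure the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            flush()  # a blank line ends a sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed rows, multiword-token ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append((cols[0], cols[6], cols[3]))
    return degrees


# Invented toy sentence: (ID, FORM, UPOS, HEAD, DEPREL).
toy_rows = [
    ("1", "a", "X", "0", "root"),
    ("2", "b", "X", "1", "dep"),
    ("3", ",", "PUNCT", "1", "punct"),
    ("4", "c", "X", "1", "dep"),
    ("5", ".", "PUNCT", "1", "punct"),
]
toy = "\n".join(
    "\t".join([tid, form, "_", tag, "_", "_", head, rel, "_", "_"])
    for tid, form, tag, head, rel in toy_rows
)

degrees = child_degrees(toy, "X")
leaves = sum(1 for d in degrees if d == 0)
print(degrees, leaves, max(degrees))
```

In the toy sentence, the first X node has four children and the other two X nodes are leaves.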

Children of X nodes are attached using 2 different relations: punct (6; 67% instances), dep (3; 33% instances)

Children of X nodes belong to 2 different parts of speech: PUNCT (6; 67% instances), X (3; 33% instances)