
This page pertains to UD version 2.

Treebank Statistics: UD_Bengali-BRU: POS Tags: NUM

There are 2 NUM lemmas (2%), 2 NUM types (1%) and 2 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 12 in number of lemmas, 13 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: একজন, চার

The 10 most frequent NUM types: একজন, চার

The 10 most frequent ambiguous lemmas: (none)

The 10 most frequent ambiguous types: (none)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.290598).
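The ratio above can be reproduced with a short sketch. It assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas for the tag; the exact definition used by the statistics script is not stated on this page. The function name `form_lemma_ratio` and the `(form, lemma, upos)` tuple layout are illustrative choices, not part of any UD tool.

```python
from collections import defaultdict

def form_lemma_ratio(tokens, upos):
    """Distinct forms per lemma for one UPOS tag (assumed definition)."""
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)

# The two NUM tokens listed on this page, as (form, lemma, upos).
tokens = [("একজন", "একজন", "NUM"), ("চার", "চার", "NUM")]
ratio = form_lemma_ratio(tokens, "NUM")
print(f"{ratio:.6f}")
```

With two lemmas that each have a single form, the sketch gives a ratio of 1, which matches the figure reported for NUM.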

The 1st highest number of forms (1) was observed with the lemma “একজন”: একজন.

The 2nd highest number of forms (1) was observed with the lemma “চার”: চার.

NUM occurs with 1 feature: NumType (2; 100% instances)

NUM occurs with 1 feature-value pair: NumType=Card

NUM occurs with 1 feature combination. The most frequent feature combination is NumType=Card (2 tokens). Examples: একজন, চার

Relations

NUM nodes are attached to their parents using 2 different relations: nsubj (1; 50% instances), root (1; 50% instances)

Parents of NUM nodes belong to 2 different parts of speech: [no tag; the artificial root node] (1; 50% instances), VERB (1; 50% instances)

1 (50%) NUM node is a leaf.

0 (0%) NUM nodes have one child.

1 (50%) NUM node has two children.

The highest child degree of a NUM node is 2.
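The leaf and child-degree counts above can be derived from the head column of a treebank. The following is a minimal sketch, assuming each sentence is a list of `(id, head, upos)` tuples with head `0` marking the root. The function name `child_degree_histogram` and the two toy sentences are hypothetical: the toy trees only mirror the counts reported here and are not the actual sentences of the treebank.

```python
from collections import Counter

def child_degree_histogram(sentences, upos):
    """Map number of children -> how many nodes with the given UPOS have it."""
    histogram = Counter()
    for sentence in sentences:
        # Count how many tokens name each node as their head.
        n_children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag == upos:
                histogram[n_children[node_id]] += 1
    return histogram

# Hypothetical toy trees as (id, head, upos); head 0 is the artificial root.
toy_sentences = [
    [(1, 2, "NOUN"), (2, 0, "NUM"), (3, 2, "PUNCT")],  # NUM root with two children
    [(1, 2, "NUM"), (2, 0, "VERB")],                   # NUM leaf under a VERB
]
hist = child_degree_histogram(toy_sentences, "NUM")
print(dict(hist))
print(max(hist))  # highest child degree among NUM nodes
```

On the toy input, one NUM node has no children and one has two, so the highest child degree is 2.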

Children of NUM nodes are attached using 2 different relations: compound (1; 50% instances), punct (1; 50% instances)

Children of NUM nodes belong to 2 different parts of speech: NOUN (1; 50% instances), PUNCT (1; 50% instances)