home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Assamese-AiW: POS Tags: NUM

There are 7 NUM lemmas (2%), 9 NUM types (2%) and 16 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent NUM lemmas: এটা, এক, চাৰি, দুই, হাজাৰ, ১০০, ২৪x৭

The 10 most frequent NUM types: এটা, এক, একোটা, এশাৰী, চাৰি, দুটা, হাজাৰ, ১০০, ২৪x৭

The 10 most frequent ambiguous lemmas: এটা (NUM 6, DET 1)

The 10 most frequent ambiguous types: এটা (NUM 7, DET 1)

Morphology

The form / lemma ratio of NUM is 1.285714 (the average of all parts of speech is 1.317618).

The 1st highest number of forms (4) was observed with the lemma “এক”: এক, একোটা, এটা, এশাৰী.

The 2nd highest number of forms (1) was observed with the lemma “এটা”: এটা.

The 3rd highest number of forms (1) was observed with the lemma “চাৰি”: চাৰি.

NUM occurs with 1 features: NumType (7; 44% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (9 tokens). Examples: এটা, চাৰি, দুটা, হাজাৰ

Relations

NUM nodes are attached to their parents using 2 different relations: nummod (14; 88% instances), compound (2; 13% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (15; 94% instances), NUM (1; 6% instances)

15 (94%) NUM nodes are leaves.

1 (6%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 1 different relations: compound (1; 100% instances)

Children of NUM nodes belong to 1 different parts of speech: NUM (1; 100% instances)