Treebank Statistics: UD_Assamese-AiW: POS Tags: NUM
There are 7 NUM lemmas (2%), 9 NUM types (2%) and 16 NUM tokens (2%).
Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 8 in number of types and 9 in number of tokens.
The 10 most frequent NUM lemmas: এটা, এক, চাৰি, দুই, হাজাৰ, ১০০, ২৪x৭
The 10 most frequent NUM types: এটা, এক, একোটা, এশাৰী, চাৰি, দুটা, হাজাৰ, ১০০, ২৪x৭
The 10 most frequent ambiguous lemmas: এটা (NUM 6, DET 1)
The 10 most frequent ambiguous types: এটা (NUM 7, DET 1)
- এটা
Morphology
The form / lemma ratio of NUM is 1.285714 (the average of all parts of speech is 1.317618).
The 1st highest number of forms (4) was observed with the lemma “এক”: এক, একোটা, এটা, এশাৰী.
The 2nd highest number of forms (1) was observed with the lemma “এটা”: এটা.
The 3rd highest number of forms (1) was observed with the lemma “চাৰি”: চাৰি.
NUM occurs with 1 features: NumType (7; 44% instances)
NUM occurs with 1 feature-value pairs: NumType=Card
NUM occurs with 2 feature combinations.
The most frequent feature combination is _ (9 tokens).
Examples: এটা, চাৰি, দুটা, হাজাৰ
Relations
NUM nodes are attached to their parents using 2 different relations: nummod (14; 88% instances), compound (2; 13% instances)
Parents of NUM nodes belong to 2 different parts of speech: NOUN (15; 94% instances), NUM (1; 6% instances)
15 (94%) NUM nodes are leaves.
1 (6%) NUM nodes have one child.
The highest child degree of a NUM node is 1.
Children of NUM nodes are attached using 1 different relations: compound (1; 100% instances)
Children of NUM nodes belong to 1 different parts of speech: NUM (1; 100% instances)