home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Malayalam-UFAL: POS Tags: NUM

There are 38 NUM lemmas (3%), 39 NUM types (2%) and 42 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: മൂന്ന്, 1, രണ്ട്, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2

The 10 most frequent NUM types: 1, മൂന്ന്, രണ്ട്, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.026316 (the average of all parts of speech is 1.111893).

The 1st highest number of forms (2) was observed with the lemma “മൂന്ന്”: മൂന്നിന്, മൂന്ന്.

The 2nd highest number of forms (1) was observed with the lemma “0,7-0,8”: 0,7-0,8.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 3 features: NumForm (42; 100% instances), NumType (42; 100% instances), Case (18; 43% instances)

NUM occurs with 6 feature-value pairs: Case=Dat, Case=Nom, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Frac

NUM occurs with 4 feature combinations. The most frequent feature combination is NumForm=Digit|NumType=Card (24 tokens). Examples: 1, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2, 1950, 2004

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (31; 74% instances), flat (4; 10% instances), obl (4; 10% instances), root (2; 5% instances), acl:relcl (1; 2% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (31; 74% instances), PROPN (3; 7% instances), NUM (2; 5% instances), (2; 5% instances), VERB (2; 5% instances), ADJ (1; 2% instances), ADV (1; 2% instances)

32 (76%) NUM nodes are leaves.

6 (14%) NUM nodes have one child.

3 (7%) NUM nodes have two children.

1 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 8 different relations: case (5; 31% instances), amod (2; 13% instances), flat (2; 13% instances), nsubj (2; 13% instances), punct (2; 13% instances), cop (1; 6% instances), mark (1; 6% instances), nummod (1; 6% instances)

Children of NUM nodes belong to 7 different parts of speech: ADP (5; 31% instances), NOUN (3; 19% instances), ADJ (2; 13% instances), NUM (2; 13% instances), PUNCT (2; 13% instances), AUX (1; 6% instances), PART (1; 6% instances)