home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Marathi-UFAL: POS Tags: NUM

There are 11 NUM lemmas (1%), 13 NUM types (1%) and 34 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: दोन, एक, हजार, चार, पाच, पन्नास, चाळीस, दहा, दुसरा, लाख

The 10 most frequent NUM types: दोन, _, एक, हजार, चार, दुसरा, चाळीस, दहा, दुसऱ्या, दोन्ही

The 10 most frequent ambiguous lemmas: एक (DET 27, NUM 4), दुसरा (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: _ (ADP 290, NOUN 176, PRON 111, PART 47, VERB 20, PROPN 15, ADJ 6, DET 4, NUM 4, AUX 1), एक (DET 22, NUM 4)

Morphology

The form / lemma ratio of NUM is 1.181818 (the average of all parts of speech is 1.339869).

The 1st highest number of forms (3) was observed with the lemma “दोन”: दुसरा, दोन, दोन्ही.

The 2nd highest number of forms (2) was observed with the lemma “पाच”: _, पाच.

The 3rd highest number of forms (1) was observed with the lemma “एक”: एक.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (28; 82% instances), flat (3; 9% instances), amod (1; 3% instances), det (1; 3% instances), obj (1; 3% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (27; 79% instances), NUM (4; 12% instances), ADJ (1; 3% instances), ADV (1; 3% instances), VERB (1; 3% instances)

27 (79%) NUM nodes are leaves.

7 (21%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 4 different relations: flat (3; 43% instances), amod (2; 29% instances), nummod (1; 14% instances), punct (1; 14% instances)

Children of NUM nodes belong to 3 different parts of speech: NUM (4; 57% instances), ADJ (2; 29% instances), PUNCT (1; 14% instances)