
This page pertains to UD version 2.

Treebank Statistics: UD_Odia-ODTB: POS Tags: NUM

There are 29 NUM lemmas (3%), 121 NUM types (4%) and 185 NUM tokens (3%). Out of 15 observed tags, NUM ranks 7th in number of lemmas, 5th in number of types and 7th in number of tokens.
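Counts like these can be reproduced from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `count_upos` is hypothetical, and it assumes that types are compared case-sensitively and that multiword-token ranges and empty nodes are skipped.

```python
def count_upos(conllu_text, upos="NUM"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        tok_id, form, lemma, tag = cols[0], cols[1], cols[2], cols[3]
        # Assumption: ignore multiword-token ranges (1-2) and empty nodes (1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens
```

Calling `count_upos(open(path, encoding="utf-8").read())` on a CoNLL-U file would return the three counts for NUM; `path` here stands for whichever treebank file is being inspected.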

The 10 most frequent NUM lemmas: _, ଶତ, ଏକ, ଗୋଟିଏ, ୧୨, ୧୮୬୩, ଦୁଇ, ଅନେକ, ଆଠ, ଉଭୟ

The 10 most frequent NUM types: ଏକ, ୧୯୩୪, ଗୋଟିଏ, ଦୁଇ, ଶତ, ୧୨, ୧୮, ୧୮୬୩, ୧୯୦୨, ୧୯୩୦

The 10 most frequent ambiguous lemmas: _ (NOUN 1308, PUNCT 581, VERB 563, PROPN 481, ADJ 299, PRON 276, NUM 145, DET 137, ADP 124, CCONJ 122, ADV 112, PART 51, SCONJ 31, AUX 10), ଏକ (NUM 3, DET 1), ଅନେକ (ADJ 1, DET 1, NUM 1)

The 10 most frequent ambiguous types: ଏକ (DET 14, NUM 12), ଗୋଟିଏ (NUM 4, DET 1, PART 1), ଅନେକ (DET 6, NUM 2, ADJ 1, PRON 1), ଜଣ (NUM 2, PART 2), ପ୍ରଥମ (ADJ 13, NUM 2), ଅଧିକ (ADJ 2, ADV 2, NUM 1), ଉଭୟ (PRON 2, DET 1, NUM 1), ଏତେ (DET 1, NUM 1), କିଛି (DET 10, PRON 3, NOUN 1, NUM 1), ବିଭିନ୍ନ (ADJ 4, DET 1, NUM 1)
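A form is listed as ambiguous here when it occurs with more than one POS tag. As a rough sketch of that bookkeeping (the helper name `ambiguous_forms` and its input format of `(form, tag)` pairs are assumptions made for illustration):

```python
from collections import Counter, defaultdict

def ambiguous_forms(tagged, upos="NUM"):
    """Map each form seen with `upos` and at least one other tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, tag in tagged:
        by_form[form][tag] += 1
    # Keep only forms that carry the target tag and at least one other tag.
    return {
        form: dict(tags)
        for form, tags in by_form.items()
        if upos in tags and len(tags) > 1
    }
```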

Morphology

The form / lemma ratio of NUM is 4.172414 (the average of all parts of speech is 3.185615).
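The reported ratio is consistent with dividing the number of NUM types (121) by the number of NUM lemmas (29) given above. A small check, assuming that "forms" in this ratio means distinct word types:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas

# Counts taken from the NUM summary on this page.
ratio = form_lemma_ratio(121, 29)
print(f"{ratio:.6f}")
```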

The 1st highest number of forms (103) was observed with the lemma “_”: 60, ଅଧିକ, ଅନେକ, ଆଠ, ଏକ, ଏତେ, କିଛି, କୋଟି, ଗୋଟିଏ, ଗୋଟେ, ଚାରି, ଚାରିଗୁଣ, ଜଣ, ତିନ, ତିନି, ଦୁଇ, ଦୁଇଗୋଟି, ନଅଶହ, ପ୍ରଥମ, ବିଭିନ୍ନ, ଷୋହଳ, ସାତ, ହଜାର, ୧.୫, ୧୦୦, ୧୧, ୧୨, ୧୩, ୧୪, ୧୬, ୧୮, ୧୮୧୭, ୧୮୩୦, ୧୮୭୩, ୧୮୭୫, ୧୮୭୭, ୧୮୭୮, ୧୮୮୦, ୧୮୮୩, ୧୮୮୮, ୧୮୯୦, ୧୮୯୧, ୧୮୯୨, ୧୮୯୫, ୧୮୯୮, ୧୮୯୯, ୧୯୦୧, ୧୯୦୧ରୁ, ୧୯୦୨, ୧୯୦୫, ୧୯୦୬, ୧୯୦୬ରେ, ୧୯୦୭, ୧୯୧୨, ୧୯୧୩, ୧୯୧୪, ୧୯୧୫, ୧୯୧୫ରେ, ୧୯୧୯, ୧୯୨୧, ୧୯୩୦, ୧୯୩୨, ୧୯୩୩, ୧୯୩୪, ୧୯୩୫, ୧୯୩୬, ୧୯୩୭, ୧୯୩୮, ୧୯୩୯, ୧୯୪୦, ୧୯୪୧, ୧୯୫୩, ୧୯୫୪, ୧୯୬୪, ୧୯୭୦, ୧୯୭୫, ୧୯୮୦, ୧୯୮୫, ୧୯୯୩, ୨, ୨ଟି, ୨୦୦, ୨୦୦ରୁ, ୨୦୦୦, ୨୦୨୨, ୨୧, ୨୨, ୨୪, ୨୫, ୨୮, ୩, ୩୦, ୩୬, ୩୮, ୫, ୫୦ଟି, ୫୨, ୭, ୮, ୮୦, ୮୪, ୯ରେ, ୯୬.

The 2nd highest number of forms (2) was observed with the lemma “ଦୁଇ”: ଦୁଇ, ଦୁଇଟି.

The 3rd highest number of forms (1) was observed with the lemma “ଅନେକ”: ଅନେକ.

NUM occurs with 2 features: NumType (150; 81% of instances), Number (7; 4% of instances)

NUM occurs with 2 feature-value pairs: NumType=Card, Number=Sing

NUM occurs with 4 feature combinations. The most frequent feature combination is NumType=Card (145 tokens). Examples: ୧୯୩୪, ଏକ, ଗୋଟିଏ, ଦୁଇ, ୧୨, ଶତ, ୧୮, ୧୯୦୨, ୮, ୧୮୬୩

Relations

NUM nodes are attached to their parents using 8 different relations: nummod (136; 74% of instances), appos (18; 10% of instances), nmod (10; 5% of instances), obl (9; 5% of instances), compound (6; 3% of instances), conj (4; 2% of instances), amod (1; 1% of instances), obj (1; 1% of instances)
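The percentages follow from the raw counts, which sum to the 185 NUM tokens. A short sketch of the arithmetic, assuming each share is rounded to the nearest whole percent:

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 136, "appos": 18, "nmod": 10, "obl": 9,
    "compound": 6, "conj": 4, "amod": 1, "obj": 1,
}

total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in percentages.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```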

Parents of NUM nodes belong to 7 different parts of speech: NOUN (124; 67% of instances), PROPN (30; 16% of instances), NUM (13; 7% of instances), VERB (9; 5% of instances), ADJ (5; 3% of instances), ADP (3; 2% of instances), PART (1; 1% of instances)

132 (71%) NUM nodes are leaves.

33 (18%) NUM nodes have one child.

17 (9%) NUM nodes have two children.

3 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.
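The leaf and child-degree figures can be derived from the HEAD column of each sentence. The sketch below is illustrative only: the function name `child_degree_histogram` and the `(id, head, upos)` tuple format are assumptions, not part of any UD tooling.

```python
from collections import Counter

def child_degree_histogram(sentence, upos="NUM"):
    """Count nodes tagged `upos` by their number of children.

    `sentence` is a list of (id, head, upos) tuples for one sentence,
    with 1-based integer ids and head 0 marking the root.
    """
    # Number of children per node id: one increment per token pointing at it.
    n_children = Counter(head for _, head, _ in sentence)
    hist = Counter()
    for tok_id, _, tag in sentence:
        if tag == upos:
            hist[n_children[tok_id]] += 1  # absent ids count as 0 (leaves)
    return hist
```

Summing the histograms over all sentences gives the distribution; degree 0 corresponds to leaves, and the largest key is the highest child degree.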

Children of NUM nodes are attached using 8 different relations: punct (41; 53% of instances), compound (9; 12% of instances), nummod (9; 12% of instances), case (7; 9% of instances), conj (4; 5% of instances), det (4; 5% of instances), nmod (2; 3% of instances), cc (1; 1% of instances)

Children of NUM nodes belong to 8 different parts of speech: PUNCT (41; 53% of instances), NUM (13; 17% of instances), ADP (7; 9% of instances), NOUN (7; 9% of instances), DET (4; 5% of instances), PROPN (3; 4% of instances), CCONJ (1; 1% of instances), PRON (1; 1% of instances)