home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Gujarati-GujTB: POS Tags: NUM

There are 28 NUM lemmas (3%), 29 NUM types (3%) and 40 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: એક, 80, ત્રણ, 03, 1, 1.5, 12GB, 15, 2016, 300

The 10 most frequent NUM types: એક, 80, ત્રણ, 03, 1, 1.5, 12GB, 15, 2016, 300

The 10 most frequent ambiguous lemmas: એક (NUM 10, DET 2), બીજા (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: એક (NUM 9, DET 2), _ (ADP 29, NOUN 18, PROPN 6, VERB 3, NUM 1, PRON 1), બીજા (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.035714 (the average of all parts of speech is 1.120000).

The 1st highest number of forms (2) was observed with the lemma “એક”: _, એક.

The 2nd highest number of forms (1) was observed with the lemma “03”: 03.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 1 features: Case (1; 3% instances)

NUM occurs with 1 feature-value pairs: Case=Ter

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (39 tokens). Examples: એક, 80, ત્રણ, 03, 1, 1.5, 12GB, 15, 2016, 300

Relations

NUM nodes are attached to their parents using 9 different relations: nummod (27; 68% instances), compound (5; 13% instances), obl (2; 5% instances), conj (1; 3% instances), dep (1; 3% instances), flat (1; 3% instances), nmod (1; 3% instances), nsubj:pass (1; 3% instances), root (1; 3% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (22; 55% instances), NUM (7; 18% instances), PROPN (3; 8% instances), SYM (3; 8% instances), VERB (3; 8% instances), ADJ (1; 3% instances), (1; 3% instances)

25 (63%) NUM nodes are leaves.

10 (25%) NUM nodes have one child.

1 (3%) NUM nodes have two children.

4 (10%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 13 different relations: punct (6; 24% instances), case (4; 16% instances), compound (3; 12% instances), nmod (2; 8% instances), nummod (2; 8% instances), amod (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), det (1; 4% instances), discourse (1; 4% instances), flat (1; 4% instances), nmod:tmod (1; 4% instances), nsubj (1; 4% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (7; 28% instances), PUNCT (6; 24% instances), ADP (4; 16% instances), NOUN (2; 8% instances), ADJ (1; 4% instances), AUX (1; 4% instances), DET (1; 4% instances), PART (1; 4% instances), PRON (1; 4% instances), PROPN (1; 4% instances)