
This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-STB: POS Tags: NUM

There are 4 NUM lemmas (1%), 4 NUM types (1%) and 4 NUM tokens (0%). Out of 13 observed tags, the rank of NUM is: 11 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas (only 4 occur): 1990, දෙවන, පළමු, හතර

The 10 most frequent NUM types (only 4 occur): 1990, දෙවැන්න, පළමු, හතර

The 10 most frequent ambiguous lemmas: පළමු (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: දෙවැන්න (NOUN 1, NUM 1)
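As a minimal sketch of how such an ambiguity list could be derived, the snippet below groups (lemma, tag) observations and keeps the lemmas seen with more than one tag. The observations are the ones reported on this page. The function name `ambiguous_items` is hypothetical, and this is not necessarily how the official statistics script works.

```python
from collections import Counter, defaultdict

# (lemma, UPOS) observations taken from the lists above; the ADJ reading of
# "පළමු" comes from the ambiguous-lemma line itself.
observations = [
    ("1990", "NUM"),
    ("දෙවන", "NUM"),
    ("පළමු", "NUM"),
    ("පළමු", "ADJ"),
    ("හතර", "NUM"),
]


def ambiguous_items(pairs):
    """Return {item: Counter of tags} for items seen with more than one tag."""
    tags_by_item = defaultdict(Counter)
    for item, tag in pairs:
        tags_by_item[item][tag] += 1
    return {item: tags for item, tags in tags_by_item.items() if len(tags) > 1}


ambiguous = ambiguous_items(observations)
for lemma, tags in ambiguous.items():
    summary = ", ".join(f"{tag} {count}" for tag, count in sorted(tags.items()))
    print(f"{lemma} ({summary})")
```

The same function applies to word forms (types) if the pairs are (form, tag) instead of (lemma, tag).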

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.145336).

The 1st highest number of forms (1) was observed with the lemma “1990”: 1990.

The 2nd highest number of forms (1) was observed with the lemma “දෙවන”: දෙවැන්න.

The 3rd highest number of forms (1) was observed with the lemma “පළමු”: පළමු.
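The form / lemma ratio reported above can be reproduced with a short sketch, assuming the ratio is the average number of distinct forms per lemma (an assumption about the statistics script, not something this page states). The lemma and form pairs come from the lists on this page. The pairing for හතර is inferred from the ratio of 1.0, and `form_lemma_ratio` is a hypothetical helper name.

```python
from collections import defaultdict

# (lemma, form) pairs for the NUM tokens listed on this page.
lemma_form_pairs = [
    ("1990", "1990"),
    ("දෙවන", "දෙවැන්න"),
    ("පළමු", "පළමු"),
    ("හතර", "හතර"),  # inferred pairing, see the note above
]


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms_by_lemma = defaultdict(set)
    for lemma, form in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


ratio = form_lemma_ratio(lemma_form_pairs)
print(f"{ratio:.6f}")
```

With four lemmas and one form each, the result is 4 / 4 = 1.0, which matches the reported value.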

NUM occurs with 1 feature: NumType (4; 100% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Ord (2 tokens), tied with NumType=Card, which covers the other 2 tokens. Examples of NumType=Ord: දෙවැන්න, පළමු
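The feature counts above can be tallied from the FEATS column of a CoNLL-U file. The following is a minimal sketch, assuming the four NUM tokens carry the FEATS values implied by this page (two `NumType=Card`, two `NumType=Ord`); `parse_feats` is a hypothetical helper, not part of any UD tool.

```python
from collections import Counter

# FEATS values of the four NUM tokens, as implied by the counts on this page.
feats_column = ["NumType=Card", "NumType=Card", "NumType=Ord", "NumType=Ord"]


def parse_feats(feats):
    """Split a CoNLL-U FEATS string such as 'A=x|B=y' into a dict."""
    if feats == "_":  # "_" marks an empty FEATS column in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


feature_counts = Counter()      # how many tokens carry each feature
pair_counts = Counter()         # how many tokens carry each feature=value pair
combination_counts = Counter()  # how many tokens carry each full FEATS string

for feats in feats_column:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{name}={value}" for name, value in parsed.items())
    combination_counts[feats] += 1

print(feature_counts)
print(pair_counts)
print(len(combination_counts), "feature combinations")
```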

Relations

NUM nodes are attached to their parents using 4 different relations: amod (1; 25% instances), nsubj (1; 25% instances), nummod (1; 25% instances), obl (1; 25% instances)

Parents of NUM nodes belong to 2 different parts of speech: NOUN (3; 75% instances), VERB (1; 25% instances)
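The percentages in the two lines above are simple relative frequencies. A minimal sketch follows, using the counts reported on this page. Because the page does not say which relation goes with which parent tag, the two lists are kept separate; `distribution` is a hypothetical helper name.

```python
from collections import Counter

# One entry per NUM token, taken from the counts reported above.
deprels = ["amod", "nsubj", "nummod", "obl"]
parent_upos = ["NOUN", "NOUN", "NOUN", "VERB"]


def distribution(labels):
    """Return (label, count, rounded percent), most frequent first."""
    counts = Counter(labels)
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(label, n, round(100 * n / total)) for label, n in ordered]


for name, labels in (("relations", deprels), ("parent tags", parent_upos)):
    parts = [f"{label} ({n}; {pct}% instances)" for label, n, pct in distribution(labels)]
    print(f"{name}: " + ", ".join(parts))
```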

3 (75%) NUM nodes are leaves.

1 (25%) NUM node has one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 1 relation: dep (1; 100% instances)

Children of NUM nodes belong to 1 part of speech: PART (1; 100% instances)