
This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-HSE: POS Tags: NUM

There are 16 NUM lemmas (1%), 19 NUM types (1%) and 24 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 12 in number of lemmas, 11 in number of types and 14 in number of tokens.
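As a rough illustration of how lemma, type and token counts like these could be derived from a CoNLL-U file, here is a minimal sketch. The function name `pos_counts` and the sample sentence are invented for this example (the sample is not taken from the treebank), and the sketch assumes that types are case-sensitive word forms, which may differ from how the official statistics were computed.

```python
# Toy CoNLL-U fragment, invented for illustration (not real treebank data).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = toy example",
    "1\tdu\tdu\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\tnamai\tnamas\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tdvi\tdu\tNUM\t_\t_\t4\tnummod\t_\t_",
    "4\tknygos\tknyga\tNOUN\t_\t_\t2\tconj\t_\t_",
    "5\ttrys\ttrys\tNUM\t_\t_\t4\tnummod\t_\t_",
])


def pos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges ("1-2") and empty nodes ("1.1")
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens


num_lemmas, num_types, num_tokens = pos_counts(SAMPLE, "NUM")
print(num_lemmas, num_types, num_tokens)
```

In the toy fragment, "du" and "dvi" share the lemma "du", so the number of types exceeds the number of lemmas, just as in the treebank figures above.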

The 10 most frequent NUM lemmas: trys, du, šimtas, penkiasdešimt, 1994, 30, 4151, 52, 7, 92

The 10 most frequent NUM types: du, penkiasdešimt, trijų, trys, šimtus, 1994, 30, 4151, 52, 7

The 10 most frequent ambiguous lemmas: vienas (ADJ 9, PRON 3, DET 1, NUM 1)

The 10 most frequent ambiguous types: none observed.

Morphology

The form / lemma ratio of NUM is 1.187500 (the average of all parts of speech is 1.442977).
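The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given earlier on this page. The short check below assumes that this is how the figure was computed; the variable names are chosen for illustration only.

```python
# Counts reported on this page for NUM.
num_types = 19
num_lemmas = 16

# Assumed definition: form / lemma ratio = distinct forms / distinct lemmas.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.187500
```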

The 1st highest number of forms (2) was observed with the lemma “du”: du, dvi.

The 2nd highest number of forms (2) was observed with the lemma “trys”: trijų, trys.

The 3rd highest number of forms (2) was observed with the lemma “šimtas”: šimtus, šimtą.

NUM occurs with 3 features: Case (14; 58% instances), Gender (14; 58% instances), Number (6; 25% instances)

NUM occurs with 7 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

NUM occurs with 9 feature combinations. The most frequent feature combination is _, meaning no features (10 tokens). Examples: penkiasdešimt, 1994, 30, 4151, 52, 7, 92, dešimt, tūkst.
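The three kinds of counts above (features, feature-value pairs, and whole feature combinations) can be sketched as follows. The helper `feature_stats` and the list `toy_feats` are hypothetical: the input values are made-up FEATS strings in CoNLL-U notation, not the actual annotations of the 24 NUM tokens.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count features, feature-value pairs and full combinations.

    feats_values: iterable of CoNLL-U FEATS strings, where "_" means
    that the token carries no features.
    """
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1  # the whole FEATS string is one combination
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name = pair.split("=", 1)[0]
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Made-up FEATS values for illustration only.
toy_feats = [
    "Case=Nom|Gender=Masc",
    "Case=Gen|Gender=Masc",
    "_",
    "_",
    "Case=Acc|Number=Plur",
]
features, pairs, combos = feature_stats(toy_feats)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```

Note that the empty combination "_" is counted as a combination in its own right, which matches how it is reported as the most frequent one above.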

Relations

NUM nodes are attached to their parents using 6 different relations: nummod (11; 46% instances), nummod:gov (5; 21% instances), compound (4; 17% instances), appos (2; 8% instances), amod (1; 4% instances), root (1; 4% instances)
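The percentages in this list follow from the raw counts: each count is divided by the total number of NUM tokens (24) and rounded to the nearest whole percent. The snippet below reproduces that arithmetic; the rounding rule is an assumption that happens to match every figure shown.

```python
# Relation counts for NUM nodes, as reported on this page.
relation_counts = {
    "nummod": 11,
    "nummod:gov": 5,
    "compound": 4,
    "appos": 2,
    "amod": 1,
    "root": 1,
}

total = sum(relation_counts.values())  # should equal the 24 NUM tokens

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

Because each share is rounded independently, the percentages need not add up to exactly 100.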

Parents of NUM nodes belong to 3 different parts of speech: NOUN (19; 79% instances), NUM (4; 17% instances), and the artificial root node, which has no part of speech (1; 4% instances)

18 (75%) NUM nodes are leaves.

4 (17%) NUM nodes have one child.

0 (0%) NUM nodes have two children.

2 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.
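The child-degree figures above group NUM nodes by how many dependents they have. A minimal sketch of that grouping for a single sentence is shown below. The function `child_degree_histogram` and the toy sentence are hypothetical, and the input is reduced to (ID, HEAD, UPOS) triples rather than full CoNLL-U lines.

```python
from collections import Counter


def child_degree_histogram(rows, upos):
    """Bucket nodes with the given UPOS by their number of children.

    rows: list of (id, head, upos) tuples for ONE sentence, where head 0
    stands for the artificial root.
    """
    # Number of children per node = how often its ID appears as a HEAD.
    n_children = Counter(head for _, head, _ in rows)
    buckets = Counter()
    for node_id, _, tag in rows:
        if tag != upos:
            continue
        degree = n_children[node_id]
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one"] += 1
        elif degree == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets


# Toy sentence, invented for illustration: node 4 (NUM) governs nodes 3 and 5.
toy_rows = [
    (1, 2, "NUM"),
    (2, 0, "NOUN"),
    (3, 4, "NUM"),
    (4, 2, "NUM"),
    (5, 4, "PUNCT"),
]
histogram = child_degree_histogram(toy_rows, "NUM")
print(dict(histogram))
```

For a whole treebank, the per-sentence histograms would be summed, since node IDs restart in every sentence. As a sanity check on the reported figures, the four buckets (18 + 4 + 0 + 2) add up to the 24 NUM tokens.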

Children of NUM nodes are attached using 8 different relations: compound (4; 36% instances), advmod (1; 9% instances), advmod:emph (1; 9% instances), cop (1; 9% instances), dislocated (1; 9% instances), nmod (1; 9% instances), obl (1; 9% instances), punct (1; 9% instances)

Children of NUM nodes belong to 7 different parts of speech: NUM (4; 36% instances), ADV (2; 18% instances), AUX (1; 9% instances), NOUN (1; 9% instances), PART (1; 9% instances), PRON (1; 9% instances), PUNCT (1; 9% instances)