home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Esperanto-Prago: POS Tags: NUM

There are 24 NUM lemmas (3%), 24 NUM types (2%) and 42 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: unu, 1, 2, 1913, 3, 4, 5, 6, 7, 8

The 10 most frequent NUM types: unu, 1, 2, 1913, 3, 4, 5, 6, 7, 8

The 10 most frequent ambiguous lemmas: unu (NUM 8, ADV 1, PRON 1)

The 10 most frequent ambiguous types: unu (NUM 8, PRON 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.222222).

The 1st highest number of forms (1) was observed with the lemma “1”: 1.

The 2nd highest number of forms (1) was observed with the lemma “10”: 10.

The 3rd highest number of forms (1) was observed with the lemma “1887”: 1887.

NUM occurs with 2 features: NumType (19; 45% instances), NumForm (2; 5% instances)

NUM occurs with 2 feature-value pairs: NumForm=Word, NumType=Card

NUM occurs with 3 feature combinations. The most frequent feature combination is _ (23 tokens). Examples: 1, 2, 3, 4, 5, 6, 7, 8, unu, 10

Relations

NUM nodes are attached to their parents using 6 different relations: nummod (15; 36% instances), root (12; 29% instances), nmod (10; 24% instances), discourse (2; 5% instances), obl (2; 5% instances), conj (1; 2% instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (16; 38% instances), VERB (14; 33% instances), (12; 29% instances)

17 (40%) NUM nodes are leaves.

22 (52%) NUM nodes have one child.

1 (2%) NUM nodes have two children.

2 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 9 different relations: punct (21; 64% instances), nmod (3; 9% instances), case (2; 6% instances), nsubj (2; 6% instances), advmod (1; 3% instances), aux (1; 3% instances), cc (1; 3% instances), cop (1; 3% instances), mark (1; 3% instances)

Children of NUM nodes belong to 7 different parts of speech: PUNCT (21; 64% instances), NOUN (5; 15% instances), ADP (2; 6% instances), AUX (2; 6% instances), ADV (1; 3% instances), CCONJ (1; 3% instances), SCONJ (1; 3% instances)