home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Teko-TuDeT: POS Tags: NUM

There are 6 NUM lemmas (1%), 11 NUM types (1%) and 18 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 12 in number of lemmas, 11 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: mokoɲ, bapɨr, mozepẽ, 12, cinq, momokonte

The 10 most frequent NUM types: bapɨr, mokoɲ, 12, cinq, dimokoɲi, mokoɲamãʔẽkom, mokoɲoŋ, mokoɲtaroŋ, momokonte, mozepẽ

The 10 most frequent ambiguous lemmas: mokoɲ (NUM 8, VERB 2), mozepẽ (NUM 2, PRON 2), cinq (ADV 1, NUM 1)

The 10 most frequent ambiguous types: mozepẽ (PRON 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.833333 (the average of all parts of speech is 2.035162).

The 1st highest number of forms (5) was observed with the lemma “mokoɲ”: dimokoɲi, mokoɲ, mokoɲamãʔẽkom, mokoɲoŋ, mokoɲtaroŋ.

The 2nd highest number of forms (2) was observed with the lemma “mozepẽ”: mozepẽ, mozepẽãhã.

The 3rd highest number of forms (1) was observed with the lemma “12”: 12.

NUM occurs with 6 features: Number[subj] (2; 11% instances), Case (1; 6% instances), Number (1; 6% instances), Polarity (1; 6% instances), Tense (1; 6% instances), VerbForm (1; 6% instances)

NUM occurs with 6 feature-value pairs: Case=Ref, Number=Plur, Number[subj]=Plur, Polarity=Neg, Tense=Fut, VerbForm=Ger

NUM occurs with 5 feature combinations. The most frequent feature combination is _ (14 tokens). Examples: bapɨr, mokoɲ, 12, cinq, momokonte, mozepẽ, mozepẽãhã

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (9; 50% instances), root (4; 22% instances), conj (2; 11% instances), obl (2; 11% instances), nsubj (1; 6% instances)

Parents of NUM nodes belong to 4 different parts of speech: NOUN (9; 50% instances), (4; 22% instances), VERB (3; 17% instances), NUM (2; 11% instances)

15 (83%) NUM nodes are leaves.

3 (17%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 2 different relations: conj (2; 67% instances), advmod (1; 33% instances)

Children of NUM nodes belong to 2 different parts of speech: NUM (2; 67% instances), ADV (1; 33% instances)