Statistics of NUM in UD


This page pertains to UD version 2.


Treebank Statistics: UD_Greek-GLCII: POS Tags: `NUM`

There are 39 NUM lemmas (2%), 43 NUM types (1%) and 80 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: ένας, δύο, τρεις, πέντε, 3, δέκα, ενας, σαράντα, 4, 5

The 10 most frequent NUM types: δύο, ένα, πέντε, μια, 3, δέκα, δυο, μία, σαράντα, τρία

The 10 most frequent ambiguous lemmas: ένας (DET 93, NUM 12), ενας (DET 16, NUM 3)

The 10 most frequent ambiguous types: ένα (DET 38, NUM 4), μια (DET 39, NUM 3), μία (DET 7, NUM 2), ενα (DET 10, NUM 2)

ένα
- DET 38: Βρήκα ένα διαμέρισμα για να μετακόμισα .
- NUM 4: Αυτά τα πράγματα δεν αξίζει να τα σκεφτόμστε ούτε ένα λεπτό .
μια
- DET 39: δεινει μια αισθητικη ομορφια .
- NUM 3: Κατά τη γώμη μου μια από τις κυριότερες είναι οι αλλαγές σ τον καιρό .
μία
- DET 7: Για την απόψη μου κατά την υιόθεση , δεν έχω μία καθαρή γμώμη .
- NUM 2: Η ζωή είναι μονο μία !
ενα
- DET 10: Τον Ιούνιο πρέπει να γραψω ενα τεστ για το Πανεπιστημιο .
- NUM 2: Αυτό είναι το πλάνο μου , να ρθω το καλοκαίρι για ενα μήνα , και να σπουδάζω τα ελληνικά σ την Θεσσαλονίκη !
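
The ambiguity counts above can be reproduced from any list of (form, UPOS) pairs. The following is a minimal sketch, not the script that generated this page: the function name `ambiguous_types` and the sample data are hypothetical, and lowercasing forms before counting is an assumption.

```python
from collections import Counter, defaultdict

def ambiguous_types(tagged_tokens, tag="NUM"):
    """Return word forms that occur with `tag` and at least one other UPOS tag."""
    counts = defaultdict(Counter)
    for form, upos in tagged_tokens:
        # Assumption: forms are compared case-insensitively.
        counts[form.lower()][upos] += 1
    return {form: dict(c) for form, c in counts.items()
            if tag in c and len(c) > 1}

# Hypothetical toy input, not taken from the treebank.
sample_tokens = [("ένα", "DET"), ("ένα", "NUM"), ("δύο", "NUM")]
result = ambiguous_types(sample_tokens)
print(result)
```

Here "δύο" is not reported because it only ever appears as NUM in the toy input.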

Morphology

The form / lemma ratio of NUM is 1.102564 (the average of all parts of speech is 1.387814).
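
The ratio appears to be the number of distinct NUM types divided by the number of distinct NUM lemmas. This reading is an assumption inferred from the counts reported earlier on this page (43 types, 39 lemmas), not a documented definition. A quick check:

```python
# Counts reported on this page for NUM in UD_Greek-GLCII.
num_types = 43
num_lemmas = 39

# Assumption: form / lemma ratio = distinct types / distinct lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 1.102564
```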

The lemma with the most forms (3) is “ένας”: ένα, μία, μια.

The lemma with the second most forms (2) is “δύο”: δυο, δύο.

The lemma with the third most forms (2) is “ενας”: ενα, ενασ.

NUM occurs with 5 features: NumType (74; 93% instances), Case (41; 51% instances), Gender (41; 51% instances), Number (41; 51% instances), Foreign (1; 1% instances)

NUM occurs with 10 feature-value pairs: Case=Acc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, NumType=Sets, Number=Plur, Number=Sing

NUM occurs with 13 feature combinations. The most frequent feature combination is NumType=Card (33 tokens). Examples: δύο, πέντε, 3, σαράντα, ένα, 4, 5, μια, δέκα, δυο

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (42; 53% instances), obl (11; 14% instances), discourse (7; 9% instances), root (7; 9% instances), compound (6; 8% instances), conj (3; 4% instances), nsubj (1; 1% instances), obj (1; 1% instances), parataxis (1; 1% instances), xcomp (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (42; 53% instances), VERB (20; 25% instances), NUM (7; 9% instances), no parent, i.e. the NUM node is the root (7; 9% instances), ADJ (2; 3% instances), DET (1; 1% instances), PROPN (1; 1% instances)
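
Relation and parent-tag counts of this kind can be computed directly from a CoNLL-U file. The sketch below is illustrative only: `num_attachment_stats` is a hypothetical helper, the two-token sample is invented, and root tokens (HEAD = 0) have no parent, so their parent tag is recorded as an empty string.

```python
from collections import Counter

def num_attachment_stats(conllu_text, upos="NUM"):
    """Count incoming relations and parent UPOS tags for tokens with the given UPOS."""
    relations, parent_tags = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep only basic tokens: skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        upos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1
                # HEAD "0" has no entry in upos_by_id, so root yields "".
                parent_tags[upos_by_id.get(r[6], "")] += 1
    return relations, parent_tags

# Hypothetical sentence in CoNLL-U column order (ID, FORM, LEMMA, UPOS, XPOS,
# FEATS, HEAD, DEPREL, DEPS, MISC); not taken from the treebank.
sample_rows = [
    ["1", "δύο", "δύο", "NUM", "_", "NumType=Card", "2", "nummod", "_", "_"],
    ["2", "μέρες", "μέρα", "NOUN", "_", "_", "0", "root", "_", "_"],
]
sample = "\n".join("\t".join(row) for row in sample_rows)
relations, parent_tags = num_attachment_stats(sample)
print(relations, parent_tags)
```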

39 (49%) NUM nodes are leaves.

19 (24%) NUM nodes have one child.

10 (13%) NUM nodes have two children.

12 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.
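
The four child-degree counts above add up to the 80 NUM tokens, and the percentages follow from them. The check below assumes percentages are rounded half up; that convention is inferred from 10 of 80 (12.5%) being reported as 13%, and is not stated on the page.

```python
# Child-degree counts for NUM nodes as reported on this page.
degree_counts = {"leaf": 39, "one child": 19, "two children": 10, "three or more": 12}
total = sum(degree_counts.values())

def percent_half_up(count, total):
    # Assumption: percentages are rounded half up (12.5% -> 13%).
    return int(count * 100 / total + 0.5)

percentages = {label: percent_half_up(n, total) for label, n in degree_counts.items()}
print(total, percentages)
```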

Children of NUM nodes are attached using 12 different relations: punct (20; 24% instances), det (12; 14% instances), obl (11; 13% instances), case (9; 11% instances), cop (8; 10% instances), advmod (6; 7% instances), compound (5; 6% instances), conj (5; 6% instances), nsubj (3; 4% instances), cc (2; 2% instances), csubj (2; 2% instances), nmod (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: PUNCT (20; 24% instances), NOUN (13; 15% instances), DET (12; 14% instances), ADP (9; 11% instances), AUX (8; 10% instances), NUM (7; 8% instances), ADV (6; 7% instances), ADJ (3; 4% instances), VERB (3; 4% instances), CCONJ (2; 2% instances), PROPN (1; 1% instances)
