home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Greek-GUD: POS Tags: VERB

There are 670 VERB lemmas (24%), 1794 VERB types (38%) and 4035 VERB tokens (16%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: λέω, είμαι, έχω, κάνω, μπορώ, παίρνω, βρίσκω, θέλω, ξέρω, βλέπω

The 10 most frequent VERB types: είναι, λέει, λέω, μπορεί, ήταν, απαντάει, πρέπει, έχει, κάνει, έχω

The 10 most frequent ambiguous lemmas: είμαι (AUX 225, VERB 180), έχω (VERB 154, AUX 98), πρέπει (VERB 45, AUX 2), ορίζω (VERB 2, ADJ 1), επερχόμενος (ADJ 1, VERB 1), προετοιμασμένος (ADJ 1, VERB 1), προηγούμενος (ADJ 3, VERB 1), προϊστάμενος (NOUN 4, VERB 1)

The 10 most frequent ambiguous types: είναι (AUX 137, VERB 93), ήταν (VERB 43, AUX 42), πρέπει (VERB 29, AUX 2), έχει (AUX 33, VERB 31), έχω (VERB 30, AUX 7), έχουμε (VERB 20, AUX 1), είχε (VERB 24, AUX 12), έχουν (AUX 13, VERB 12), είμαι (AUX 10, VERB 8), είμαστε (VERB 9, AUX 8)

Morphology

The form / lemma ratio of VERB is 2.677612 (the average of all parts of speech is 1.660999).

The 1st highest number of forms (28) was observed with the lemma “λέω”: ‘λεγα, ‘πε, Πέστε, Πες, έλεγα, έλεγαν, έλεγε, έλεγες, είπα, είπαμε, είπαν, είπατε, είπε, είπες, λέγαμε, λέγεται, λέει, λέμε, λένε, λέω, λες, πείς, πείτε, πει, πεις, πουν, πούμε, πω.

The 2nd highest number of forms (23) was observed with the lemma “βρίσκω”: έβρισκαν, βρέθηκαν, βρήκα, βρήκαμε, βρήκαν, βρήκατε, βρήκε, βρήκες, βρίσκει, βρίσκεις, βρίσκεται, βρίσκονται, βρίσκουμε, βρίσκω, βρείτε, βρεθεί, βρεθώ, βρει, βρεις, βρισκόταν, βρουν, βρούμε, βρω.

The 3rd highest number of forms (21) was observed with the lemma “βλέπω”: Έβλεπα, Βλέπεις, Είντα, έβλεπε, βλέπει, βλέπετε, βλέπουν, βλέπω, δείτε, δει, δεις, δουν, δούμε, δω, είδα, είδα-, είδαμε, είδαν, είδατε, είδε, είδες.

VERB occurs with 9 features: Voice (4018; 100% instances), VerbForm (4017; 100% instances), Number (3989; 99% instances), Aspect (3948; 98% instances), Mood (3910; 97% instances), Person (3909; 97% instances), Tense (3070; 76% instances), Case (79; 2% instances), Gender (78; 2% instances)

VERB occurs with 24 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 103 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (807 tokens). Examples: λέει, μπορεί, απαντάει, πρέπει, έχει, ρωτάει, υπάρχει, κοιτάζει, κάνει, εξηγεί

Relations

VERB nodes are attached to their parents using 19 different relations: root (1554; 39% instances), conj (648; 16% instances), advcl (483; 12% instances), ccomp (371; 9% instances), xcomp (348; 9% instances), acl:relcl (284; 7% instances), parataxis (207; 5% instances), acl (53; 1% instances), csubj (46; 1% instances), amod (23; 1% instances), flat (4; 0% instances), appos (3; 0% instances), dep (2; 0% instances), discourse (2; 0% instances), obl (2; 0% instances), vocative (2; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: VERB (1923; 48% instances), (1554; 39% instances), NOUN (278; 7% instances), AUX (116; 3% instances), DET (58; 1% instances), ADJ (47; 1% instances), PROPN (22; 1% instances), ADV (20; 0% instances), PRON (8; 0% instances), INTJ (5; 0% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), NUM (1; 0% instances), X (1; 0% instances)

65 (2%) VERB nodes are leaves.

298 (7%) VERB nodes have one child.

853 (21%) VERB nodes have two children.

2819 (70%) VERB nodes have three or more children.

The highest child degree of a VERB node is 10.

Children of VERB nodes are attached using 31 different relations: punct (3377; 25% instances), obj (1949; 14% instances), mark (1266; 9% instances), advmod (1261; 9% instances), obl (1135; 8% instances), nsubj (1075; 8% instances), cc (647; 5% instances), conj (647; 5% instances), advcl (522; 4% instances), xcomp (440; 3% instances), ccomp (398; 3% instances), iobj (342; 2% instances), aux (253; 2% instances), parataxis (177; 1% instances), nsubj:pass (85; 1% instances), csubj (42; 0% instances), vocative (37; 0% instances), discourse (25; 0% instances), expl (23; 0% instances), case (16; 0% instances), det (16; 0% instances), obl:agent (13; 0% instances), nmod (10; 0% instances), dep (7; 0% instances), dislocated (5; 0% instances), cop (4; 0% instances), nsubj:outer (4; 0% instances), compound (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances), fixed (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: PUNCT (3377; 25% instances), NOUN (2337; 17% instances), VERB (1923; 14% instances), PRON (1424; 10% instances), SCONJ (1118; 8% instances), ADV (912; 7% instances), CCONJ (650; 5% instances), PROPN (493; 4% instances), PART (417; 3% instances), AUX (387; 3% instances), DET (315; 2% instances), ADJ (210; 2% instances), ADP (149; 1% instances), INTJ (32; 0% instances), NUM (22; 0% instances), X (15; 0% instances)