This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-Nonstandard: POS Tags: VERB

There are 2405 VERB lemmas (17%), 11268 VERB types (33%) and 75123 VERB tokens (13%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: zice, face, da, fi, avea, veni, lua, vedea, merge, pune

The 10 most frequent VERB types: zise, făcut, face, da, dat, era, luat, zice, veni, avea

The 10 most frequent ambiguous lemmas: zice (VERB 3503, NOUN 6, ADV 5, ADJ 2, NUM 1, PROPN 1), face (VERB 3037, ADJ 2, NOUN 1), da (VERB 2434, ADV 3, CCONJ 3), fi (AUX 10056, VERB 2003, PRON 11, NOUN 9, ADV 2), avea (AUX 14569, VERB 1986, INTJ 23, PART 18, ADP 8, DET 7, CCONJ 5, NOUN 1), veni (VERB 1891, NOUN 5, ADJ 2), lua (VERB 1681, INTJ 3, NOUN 3, ADV 1), vedea (VERB 1620, ADV 5, NOUN 3), merge (VERB 1183, NOUN 6, ADJ 1, ADV 1), pune (VERB 1059, NOUN 6)

The 10 most frequent ambiguous types: da (VERB 647, CCONJ 4, ADV 3, NUM 1), dat (VERB 555, ADJ 2, NOUN 1), era (AUX 1221, VERB 400), veni (VERB 394, PUNCT 1), avea (VERB 403, AUX 17), dea (VERB 395, INTJ 1), are (VERB 379, AUX 63, NOUN 1), scris (VERB 370, NOUN 23), pus (VERB 361, NOUN 2, ADJ 1), ia (VERB 272, PRON 153, NOUN 7, INTJ 5)

Morphology

The form / lemma ratio of VERB is 4.685239 (the average of all parts of speech is 2.491875).
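The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of this page. The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and the assumption that the ratio is simply types over lemmas is inferred from the numbers rather than stated by the page.

```python
# Counts copied from the statistics at the top of this page.
VERB_TYPES = 11268   # distinct VERB word forms
VERB_LEMMAS = 2405   # distinct VERB lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("the number of lemmas must be positive")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(VERB_TYPES, VERB_LEMMAS)
print(f"{ratio:.6f}")  # 4.685239
```

A ratio well above the all-tags average reflects both rich verbal inflection and the spelling variation visible in the form lists below.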

The 1st highest number of forms (82) was observed with the lemma “trimite”: Tremete, Tremisem, Tremăs, Trimetu, Trimitu, treimisă, tremease, tremeasără, tremeate, tremes, tremesease, tremeș, tremeși, tremeșii, tremeți, tremețu, tremiase, tremis, tremise, tremite, tremiși, triimată, triimață, triimețind, triimețindu, triimis, triimisesa, triimisesă, triimisă, triimisără, triimite, triimitea, triimită, triimiș, triimiși, triimițind, triimițindu, triimiță, triimăs, triimăsu, trimease, trimeaseră, trimeaset, trimeasă, trimeasără, trimeate, trimeață, trimes, trimesu, trimet, trimete, trimetea, trimetem, trimeș, trimeși, trimeț, trimețu, trimețînd, trimis, trimise, trimiseră, trimisese, trimisesă, trimisu, trimisă, trimisără, trimit, trimite, trimitea, trimitem, trimită, trimişi, trimiș, trimiși, trimiț, trimiți, trimițind, trimițînd, trimițîndu, trimiță, trimăs, trămăs.

The 2nd highest number of forms (77) was observed with the lemma “zice”: -zîce, Dzi, Dzicu, Dzisu-, Zicere, Zisei, Zisе, Zîcînd, dzic, dzice, dzicea, dziceți, dzici, dzicând, dzicându, dzicînd, dzicîndu, dzică, dzis, dzise, dziseră, dzisese, dzisi, dzisu, dzisâră, dzisă, dzîc, dzîce, dzîcea, dzîceam, dzîcim, dzîcînd, dzîcîndu, dzîcă, gice, gicem, gicere, zi, zic, zice, zicea, ziceam, ziceare, ziceau, ziceați, zicem, ziceţi, ziceți, zici, zicu, zicând, zicându, zicînd, zicîndu, zică, zicănd, zis, zise, zisease, ziseră, ziseși, zisă, zisără, ziș, zâce, zâș, zî, zîc, zîce, zîcem, zîceț, zîci, zîcă, zîs, zîsă, zîsără, zîș.

The 3rd highest number of forms (64) was observed with the lemma “vedea”: Vezî, Videții, Vădz, Văzui, Văzutu, nevăzut, nevăzute, nevăzută, nevăzînd, nevăzîndu, vad-, vadzî, vadză, vadă, vazî, vază, veade, vede, vedea, vedea-, vedeai, vedeam, vedeare, vedeau, vedem, vedeț, vedeți, vedzi, vedzii, vez, vezi, vide, videa, videți, văd, vădzind, vădzindu, vădzindu-, vădzu, vădzuiu, vădzură, vădzusă, vădzut, vădzând, vădzînd, vădzîndu, vădzîndu-, văz, văzind, văzindu, văzu, văzuiu, văzum, văzură, văzuse, văzut, văzute, văzută, văzuși, văzând, văzându-, văzînd, văzîndu, văzîndu-.

VERB occurs with 9 features: VerbForm (75118; 100% instances), Mood (44294; 59% instances), Person (44294; 59% instances), Tense (41194; 55% instances), Number (41027; 55% instances), Polarity (11918; 16% instances), Gender (4616; 6% instances), Case (1145; 2% instances), Variant (175; 0% instances)

VERB occurs with 23 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, Variant=Long, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

VERB occurs with 56 feature combinations. The most frequent feature combination is VerbForm=Part (12701 tokens). Examples: făcut, dat, luat, pus, scris, dus, venit, lăsat, vinit, început

Relations

VERB nodes are attached to their parents using 27 different relations: root (22185; 30% instances), conj (17771; 24% instances), advcl (14890; 20% instances), acl (5418; 7% instances), ccomp (4691; 6% instances), parataxis (2631; 4% instances), advcl:tcl (2060; 3% instances), xcomp (1822; 2% instances), csubj (1785; 2% instances), amod (1388; 2% instances), ccomp:pmod (130; 0% instances), appos (103; 0% instances), obl:pmod (65; 0% instances), csubj:pass (58; 0% instances), nmod (54; 0% instances), obl (18; 0% instances), nsubj (16; 0% instances), obj (9; 0% instances), orphan (7; 0% instances), vocative (5; 0% instances), case (4; 0% instances), iobj (4; 0% instances), nsubj:pass (3; 0% instances), discourse (2; 0% instances), obl:agent (2; 0% instances), compound (1; 0% instances), nmod:tmod (1; 0% instances)
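The percentages above appear to be each relation's count divided by the total number of VERB tokens (75123), rounded to the nearest integer. Here is a small sketch of that calculation for the five most frequent relations; the helper name `percent` is hypothetical and the rounding rule is an assumption that happens to reproduce the listed values.

```python
VERB_TOKENS = 75123  # total VERB tokens, from the top of this page

# Counts for the five most frequent incoming relations, copied from above.
RELATION_COUNTS = {
    "root": 22185,
    "conj": 17771,
    "advcl": 14890,
    "acl": 5418,
    "ccomp": 4691,
}


def percent(count: int, total: int) -> int:
    """Share of `count` in `total` as a whole-number percentage."""
    return round(100 * count / total)


shares = {rel: percent(n, VERB_TOKENS) for rel, n in RELATION_COUNTS.items()}
for rel, share in shares.items():
    print(f"{rel}: {share}%")
```

With this rounding, any relation with fewer than about 376 instances is displayed as 0%, which is why the tail of the list shows 0% for non-zero counts.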

Parents of VERB nodes belong to 16 different parts of speech (counting the artificial root, which has no POS tag, as one category): VERB (41916; 56% instances), artificial root (22185; 30% instances), NOUN (6325; 8% instances), PRON (1827; 2% instances), ADJ (1183; 2% instances), ADV (801; 1% instances), PROPN (515; 1% instances), AUX (170; 0% instances), INTJ (101; 0% instances), NUM (49; 0% instances), DET (39; 0% instances), ADP (5; 0% instances), CCONJ (3; 0% instances), SCONJ (2; 0% instances), PUNCT (1; 0% instances), X (1; 0% instances)

1149 (2%) VERB nodes are leaves.

2704 (4%) VERB nodes have one child.

8803 (12%) VERB nodes have two children.

62467 (83%) VERB nodes have three or more children.

The highest child degree of a VERB node is 20.
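The child-degree figures above can be read as a histogram over the number of direct dependents of each VERB node. The sketch below shows one way such a histogram could be computed; the toy sentence, the `(id, head, upos)` triple layout, and the function names are all invented for illustration and are not taken from the treebank or its tooling.

```python
from collections import Counter

# Toy sentence as (id, head, upos) triples; head 0 marks the artificial root.
# Invented for illustration, not taken from the treebank.
TOKENS = [
    (1, 2, "PRON"),
    (2, 0, "VERB"),
    (3, 2, "NOUN"),
    (4, 2, "PUNCT"),
    (5, 2, "VERB"),
]


def verb_child_degrees(tokens):
    """Map each VERB token id to its number of direct children."""
    children_per_head = Counter(head for _, head, _ in tokens)
    return {tid: children_per_head[tid] for tid, _, upos in tokens if upos == "VERB"}


def bucket(degree):
    """Group degrees the way the page reports them: 0, 1, 2, or 3+."""
    return "3+" if degree >= 3 else str(degree)


degrees = verb_child_degrees(TOKENS)
histogram = Counter(bucket(d) for d in degrees.values())
print(degrees)
print(dict(histogram))
print(max(degrees.values()))

# The four buckets reported above partition all 75123 VERB tokens.
REPORTED = {"0": 1149, "1": 2704, "2": 8803, "3+": 62467}
assert sum(REPORTED.values()) == 75123
```

The final check confirms that the four reported buckets add up to the total VERB token count, so every VERB node falls into exactly one of them.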

Children of VERB nodes are attached using 43 different relations: punct (54246; 17% instances), obl (30107; 9% instances), nsubj (27031; 9% instances), obj (26846; 8% instances), cc (22048; 7% instances), mark (22040; 7% instances), aux (21150; 7% instances), advmod (18570; 6% instances), conj (17366; 5% instances), advcl (14519; 5% instances), expl:pv (12166; 4% instances), iobj (10999; 3% instances), obl:pmod (8237; 3% instances), ccomp (5230; 2% instances), advmod:tmod (4158; 1% instances), xcomp (3913; 1% instances), nmod:tmod (3413; 1% instances), parataxis (3121; 1% instances), vocative (2051; 1% instances), advcl:tcl (1999; 1% instances), expl (1604; 1% instances), csubj (1484; 0% instances), aux:pass (1315; 0% instances), nsubj:pass (810; 0% instances), cop (700; 0% instances), discourse (674; 0% instances), obl:agent (525; 0% instances), expl:pass (316; 0% instances), det (254; 0% instances), nmod (137; 0% instances), expl:impers (136; 0% instances), ccomp:pmod (134; 0% instances), appos (130; 0% instances), compound (130; 0% instances), expl:poss (107; 0% instances), nummod (100; 0% instances), csubj:pass (63; 0% instances), cc:preconj (54; 0% instances), acl (43; 0% instances), case (28; 0% instances), amod (19; 0% instances), orphan (5; 0% instances), fixed (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (62273; 20% instances), PRON (54428; 17% instances), PUNCT (54246; 17% instances), VERB (41916; 13% instances), ADV (23464; 7% instances), AUX (23322; 7% instances), CCONJ (22127; 7% instances), PART (10821; 3% instances), PROPN (10587; 3% instances), SCONJ (9062; 3% instances), ADP (2236; 1% instances), ADJ (1549; 0% instances), INTJ (739; 0% instances), NUM (734; 0% instances), DET (471; 0% instances), X (4; 0% instances)