home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: VERB

There are 1870 VERB lemmas (10%), 6830 VERB types (21%) and 22990 VERB tokens (11%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: putea, avea, trebui, face, fi, lua, prevedea, da, vedea, începe

The 10 most frequent VERB types: poate, trebuie, pot, putea, avea, are, face, avut, prevăzute, era

The 10 most frequent ambiguous lemmas: avea (AUX 3824, VERB 718), fi (AUX 4200, VERB 310), da (VERB 200, ADV 10, X 1), duce (VERB 104, NOUN 1), intra (VERB 92, ADV 1), vrea (AUX 549, VERB 45, PART 1), însuși (DET 51, VERB 5), clorprofa (PROPN 1, VERB 1), făr- (ADP 1, VERB 1), pui (NOUN 4, VERB 1)

The 10 most frequent ambiguous types: poate (VERB 400, ADV 31), avea (VERB 162, AUX 20), are (VERB 148, AUX 3), prevăzute (VERB 103, ADJ 3), era (AUX 292, VERB 67), au (AUX 821, VERB 72, X 1), este (AUX 1106, VERB 53), spus (VERB 50, ADJ 1), rupt (VERB 47, ADJ 1), dat (VERB 45, ADJ 4)

Morphology

The form / lemma ratio of VERB is 3.652406 (the average of all parts of speech is 1.819791).

The 1st highest number of forms (23) was observed with the lemma “face”: Făceți, fac, face, facem, faceți, faci, facă, făcea, făceam, făceau, făcu, făcui, făcură, făcuse, făcusem, făcuseră, făcuserăm, făcuseși, făcut, făcute, făcută, făcând, făcându.

The 2nd highest number of forms (21) was observed with the lemma “avea”: ai, aibe, aibă, am, are, au, avea, aveai, aveam, aveau, avem, aveți, avu, avusese, avuseseră, avuseserăm, avut, avute, avută, având, neavând.

The 3rd highest number of forms (19) was observed with the lemma “da”: da, dai, dat, date, dată, dau, dați, dea, dând, dându, dă, dădea, dădeai, dădeau, dădu, dădură, dăduse, dăm, nedându.

VERB occurs with 9 features: VerbForm (22989; 100% instances), Number (15209; 66% instances), Tense (14253; 62% instances), Person (12273; 53% instances), Mood (12272; 53% instances), Gender (7633; 33% instances), Variant (271; 1% instances), ExtPos (1; 0% instances), Foreign (1; 0% instances)

VERB occurs with 21 feature-value pairs: ExtPos=ADP, Foreign=Yes, Gender=Fem, Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, Variant=Short, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

VERB occurs with 55 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|VerbForm=Part (4456 tokens). Examples: avut, făcut, spus, putut, rupt, dat, murit, devenit, luat, rănit

Relations

VERB nodes are attached to their parents using 25 different relations: root (8129; 35% instances), acl (4567; 20% instances), advcl (3003; 13% instances), conj (2800; 12% instances), ccomp (2146; 9% instances), csubj (695; 3% instances), parataxis (375; 2% instances), amod (296; 1% instances), xcomp (273; 1% instances), ccomp:pmod (219; 1% instances), advcl:tcl (85; 0% instances), obj (68; 0% instances), fixed (65; 0% instances), obl (62; 0% instances), appos (61; 0% instances), csubj:pass (51; 0% instances), nsubj (41; 0% instances), nmod (23; 0% instances), flat (8; 0% instances), nsubj:pass (7; 0% instances), discourse (5; 0% instances), obl:pmod (4; 0% instances), case (3; 0% instances), obl:agent (2; 0% instances), orphan (2; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: VERB (8699; 38% instances), (8129; 35% instances), NOUN (4895; 21% instances), ADJ (442; 2% instances), ADV (331; 1% instances), PRON (211; 1% instances), PROPN (132; 1% instances), ADP (55; 0% instances), NUM (30; 0% instances), AUX (27; 0% instances), INTJ (18; 0% instances), PART (15; 0% instances), SCONJ (3; 0% instances), CCONJ (2; 0% instances), DET (1; 0% instances)

326 (1%) VERB nodes are leaves.

2144 (9%) VERB nodes have one child.

3490 (15%) VERB nodes have two children.

17030 (74%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 43 different relations: punct (13396; 16% instances), obl (10812; 13% instances), nsubj (8684; 10% instances), obj (7774; 9% instances), mark (5890; 7% instances), advmod (5642; 7% instances), aux (4392; 5% instances), advcl (2801; 3% instances), conj (2757; 3% instances), cc (2459; 3% instances), ccomp (2266; 3% instances), expl:pv (2246; 3% instances), nsubj:pass (2021; 2% instances), aux:pass (1728; 2% instances), iobj (1434; 2% instances), obl:pmod (1269; 2% instances), parataxis (1222; 1% instances), expl:pass (1134; 1% instances), xcomp (1097; 1% instances), obl:agent (797; 1% instances), expl:poss (602; 1% instances), csubj (592; 1% instances), expl (547; 1% instances), obl:tmod (484; 1% instances), nummod (351; 0% instances), cop (282; 0% instances), case (229; 0% instances), ccomp:pmod (183; 0% instances), expl:impers (130; 0% instances), advmod:tmod (112; 0% instances), advcl:tcl (84; 0% instances), csubj:pass (56; 0% instances), vocative (51; 0% instances), appos (50; 0% instances), cc:preconj (43; 0% instances), amod (33; 0% instances), nmod (11; 0% instances), acl (10; 0% instances), discourse (10; 0% instances), det (8; 0% instances), dep (7; 0% instances), flat (2; 0% instances), fixed (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (26876; 32% instances), PUNCT (13396; 16% instances), PRON (9999; 12% instances), VERB (8699; 10% instances), AUX (6407; 8% instances), PART (4516; 5% instances), ADV (4363; 5% instances), CCONJ (2408; 3% instances), ADP (1910; 2% instances), PROPN (1671; 2% instances), SCONJ (1463; 2% instances), NUM (1119; 1% instances), ADJ (817; 1% instances), DET (38; 0% instances), INTJ (12; 0% instances), X (5; 0% instances)