
This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-Nonstandard: POS Tags: AUX

There are 6 AUX lemmas (0%), 226 AUX types (1%) and 31926 AUX tokens (6%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 10 in number of types and 8 in number of tokens.

All 6 AUX lemmas, in order of frequency: avea, fi, vrea, putea, trebui, ști

The 10 most frequent AUX types: au, va, -i, -au, era, am, a, iaste, vor, fi

The most frequent ambiguous lemmas (all 6 AUX lemmas also occur with other tags): avea (AUX 14569, VERB 1986, INTJ 23, PART 18, ADP 8, DET 7, CCONJ 5, NOUN 1), fi (AUX 10056, VERB 2003, PRON 11, NOUN 9, ADV 2), vrea (AUX 6645, VERB 664, NOUN 3, DET 1, PART 1, PRON 1), putea (AUX 630, VERB 513), trebui (VERB 210, AUX 13), ști (VERB 862, AUX 13)

The 10 most frequent ambiguous types: au (AUX 8701, CCONJ 191, VERB 167, INTJ 72, DET 15, ADV 7, PRON 2), va (AUX 2546, VERB 72), -i (PRON 2449, AUX 1608, VERB 185), -au (AUX 1559, VERB 5), era (AUX 1221, VERB 400), am (AUX 1181, VERB 135), a (DET 1752, PART 1448, AUX 1105, ADP 166, VERB 2, NOUN 1, PRON 1), iaste (AUX 930, VERB 179), vor (AUX 882, VERB 33), fi (AUX 789, VERB 220)
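The ambiguity lists above can be reproduced by counting, for every word form, how often it carries each tag. The following is a minimal sketch, not the script that generated this page; the function names are hypothetical, and the input is assumed to be a sequence of (form, UPOS) pairs already extracted from the treebank. The ordering by AUX count is inferred from the figures listed above.

```python
from collections import Counter, defaultdict

def tag_counts_by_form(tokens):
    """Map each form to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for form, upos in tokens:
        counts[form][upos] += 1
    return counts

def ambiguous_forms(tokens, upos="AUX"):
    """Forms seen with `upos` and at least one other tag,
    sorted by how often they carry `upos` (most frequent first)."""
    counts = tag_counts_by_form(tokens)
    hits = [(form, c) for form, c in counts.items()
            if c[upos] > 0 and len(c) > 1]
    return sorted(hits, key=lambda item: -item[1][upos])
```

The same function works for lemmas if (lemma, UPOS) pairs are passed in instead of (form, UPOS) pairs.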

Morphology

The form / lemma ratio of AUX is 37.666667 (the average of all parts of speech is 2.492163).
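The reported ratio agrees with the counts given at the top of this page: 226 AUX types divided by 6 AUX lemmas. A minimal sketch of the computation follows; the function name is hypothetical, and it assumes tokens are available as (form, lemma, UPOS) triples and that forms are compared case-sensitively.

```python
def form_lemma_ratio(tokens, upos):
    """Number of distinct forms divided by number of distinct lemmas
    among the tokens tagged `upos`."""
    forms, lemmas = set(), set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    return len(forms) / len(lemmas) if lemmas else 0.0

# Check against the counts reported on this page.
print(round(226 / 6, 6))  # 37.666667
```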

The highest number of forms (98) was observed with the lemma “fi”: -e, -i, -s, -su, -ă, E-, Nefiindu, e, easte, era, erai, eram, erau, erați, erea, eream, este, esti, estă, eşti, ești, fi, fi-, fie, fie-, fii, fiie, fiind, fiindu, fim, fimu, fir-, fire, fire-, fireți, fiu, fiţi, fiț, fiți, fost, fostu, fu, fui, fuiu, fum, fură, fuse, fuses-, fusese, fusesem, fusesie, fusesă, fuset, fuseș, fuseși, fusăsă, fîindu, hi, hie, hii, hiind, him, hire, hiu, hiț, hiți, i, i-, iaste, iaște, iera, ieste, ii, nefiind, nefind, s, sem, simtem, simteți, sint, sintem, sinteți, sintu, sunt, suntem, sunteți, suntu, sânt, sântem, sânteți, sînt, sîntem, sînteț, sînteți, sîntu, îi, îs, еrа.

The 2nd highest number of forms (58) was observed with the lemma “vrea”: -a, -ei, -i, -ii, -o, -oi, -om, -or, -va, -vei, -veri, -veți, -viți, -voi, -voiu, -vom, -vor, -vа, -îi, -ţi, -ț, -ți, a, a-, carele, ei, ii, iț, o, o-, oi, oiu, om, omu, or, oru, oţi, v-, va, va-, vei, ver, veri, veț, veți, vii, viț, voi, voiu, vom, vor, vrea, vrem, vrut, vа, îi, ăi, ăţi.

The 3rd highest number of forms (39) was observed with the lemma “avea”: -a, -ai, -am, -ar, -au, -aş, -aţi, -aș, -aț, -ați, -i, -m, -o, -u, -ți, a, a-, ai, am, amu, ar, are, ari, aru, ară, arе, au, avea, avem, aş, aţi, aș, ași, aț, ați, o, or, r, u.

AUX occurs with 8 features: Mood (29944; 94% instances), Person (29944; 94% instances), Tense (29868; 94% instances), Number (29369; 92% instances), VerbForm (2167; 7% instances), Polarity (355; 1% instances), Variant (14; 0% instances), Gender (4; 0% instances)

AUX occurs with 20 feature-value pairs: Gender=Masc, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, Variant=Long, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 55 feature combinations. The most frequent feature combination is Mood=Ind|Number=Plur|Person=3|Tense=Pres (11662 tokens). Examples: au, vor, sînt, -s, or, ar, u, -or, -ar, s
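The three feature statistics above (features, feature-value pairs, and full combinations) can all be derived from the FEATS column of the AUX tokens. The sketch below assumes the standard CoNLL-U encoding, where pairs are joined by `|`, name and value are separated by `=`, and `_` marks an empty feature set. The function names are hypothetical, and whether the page counts the empty set as a combination is not stated, so counting it under `_` is an assumption.

```python
from collections import Counter

def parse_feats(feats):
    """Turn 'Mood=Ind|Number=Plur' into {'Mood': 'Ind', 'Number': 'Plur'}."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # assumption: '_' is counted as its own combination
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos
```

Calling `combos.most_common(1)` on the result yields the most frequent combination together with its token count.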

Relations

AUX nodes are attached to their parents using 17 different relations: aux (22093; 69% instances), cop (8196; 26% instances), aux:pass (1329; 4% instances), root (84; 0% instances), advcl (46; 0% instances), conj (40; 0% instances), acl (36; 0% instances), parataxis (30; 0% instances), ccomp (28; 0% instances), csubj (19; 0% instances), xcomp (11; 0% instances), advcl:tcl (8; 0% instances), nsubj (2; 0% instances), appos (1; 0% instances), discourse (1; 0% instances), obl (1; 0% instances), obl:pmod (1; 0% instances)

Parents of AUX nodes belong to 13 different categories (12 parts of speech, plus the artificial root, which has no tag): VERB (23322; 73% instances), NOUN (4310; 13% instances), ADJ (2066; 6% instances), ADV (924; 3% instances), PRON (638; 2% instances), PROPN (407; 1% instances), artificial root (84; 0% instances), NUM (70; 0% instances), AUX (48; 0% instances), DET (47; 0% instances), ADP (6; 0% instances), CCONJ (3; 0% instances), INTJ (1; 0% instances)
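The relation and parent statistics above can be recomputed from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below uses a small reader written for this example rather than any particular library; the function names and the two-sentence `SAMPLE` are illustrative only and are not taken from the treebank. Tokens whose head is 0 are reported under the label `(root)`, which is an assumed label for the untagged parent.

```python
from collections import Counter

# Toy CoNLL-U fragment (illustrative, not from UD_Romanian-Nonstandard).
SAMPLE = (
    "1\tau\tavea\tAUX\t_\t_\t2\taux\t_\t_\n"
    "2\tvenit\tveni\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\tera\tfi\tAUX\t_\t_\t2\tcop\t_\t_\n"
    "2\tbun\tbun\tADJ\t_\t_\t0\troot\t_\t_\n"
)

def read_sentences(text):
    """Yield each sentence as a list of 10-column token rows."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sentence.append(cols)
    if sentence:
        yield sentence

def attachment_stats(text, upos="AUX"):
    """Count DEPREL labels and parent UPOS tags for tokens tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in read_sentences(text):
        pos_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] == upos:
                relations[cols[7]] += 1
                parent_pos[pos_by_id.get(cols[6], "(root)")] += 1
    return relations, parent_pos
```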

31635 (99%) AUX nodes are leaves.

28 (0%) AUX nodes have one child.

40 (0%) AUX nodes have two children.

223 (1%) AUX nodes have three or more children.

The highest child degree of an AUX node is 9.
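The child-degree figures above partition the AUX tokens: 31635 + 28 + 40 + 223 = 31926, the total number of AUX tokens. A minimal sketch of how such a histogram can be computed follows. The function name is hypothetical, and each sentence is assumed to be given as a list of (id, UPOS, head) triples with integer ids.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="AUX"):
    """Map number of children -> number of `upos` nodes with that many children."""
    hist = Counter()
    for sent in sentences:
        # How many tokens in this sentence point at each head id.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1
    return hist

# The four reported buckets add up to the AUX token count.
assert 31635 + 28 + 40 + 223 == 31926
```

The highest child degree is then `max(hist)`, and the share of leaves is `hist[0]` divided by the sum of all values.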

Children of AUX nodes are attached using 31 different relations: punct (224; 21% instances), nsubj (140; 13% instances), advmod (95; 9% instances), obj (81; 7% instances), ccomp (79; 7% instances), obl (74; 7% instances), mark (69; 6% instances), cc (59; 5% instances), conj (55; 5% instances), aux (46; 4% instances), advcl (36; 3% instances), xcomp (36; 3% instances), iobj (16; 1% instances), advmod:tmod (13; 1% instances), csubj (10; 1% instances), nmod:tmod (8; 1% instances), obl:pmod (7; 1% instances), vocative (7; 1% instances), advcl:tcl (6; 1% instances), fixed (5; 0% instances), discourse (4; 0% instances), case (2; 0% instances), det (2; 0% instances), expl (2; 0% instances), expl:pv (2; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), cop (1; 0% instances), nmod (1; 0% instances), nsubj:pass (1; 0% instances), obl:agent (1; 0% instances)

Children of AUX nodes belong to 15 different parts of speech: PUNCT (224; 21% instances), NOUN (214; 20% instances), VERB (170; 16% instances), PRON (135; 12% instances), ADV (116; 11% instances), CCONJ (58; 5% instances), AUX (48; 4% instances), SCONJ (36; 3% instances), PROPN (23; 2% instances), ADP (21; 2% instances), PART (19; 2% instances), ADJ (13; 1% instances), INTJ (4; 0% instances), DET (2; 0% instances), NUM (2; 0% instances)