
This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: AUX

There are 3 AUX lemmas (0%), 13 AUX types (0%) and 428 AUX tokens (3%). Out of 15 observed tags, the rank of AUX is: 14 in number of lemmas, 12 in number of types and 9 in number of tokens.

The 10 most frequent AUX lemmas (only 3 occur): fi, avea, vrea

The 10 most frequent AUX types: este, sunt, a, fi, au, fost, fiind, ar, va, fie

The 10 most frequent ambiguous lemmas: fi (AUX 308, VERB 8), avea (AUX 108, VERB 47)

The 10 most frequent ambiguous types: este (AUX 148, VERB 2), sunt (AUX 62, VERB 1), a (DET 191, AUX 52, PART 19, ADP 1, NOUN 1, X 1), fi (AUX 44, VERB 2), au (AUX 39, VERB 13), fost (AUX 22, VERB 2), fie (AUX 4, CCONJ 4)

Morphology

The form / lemma ratio of AUX is 4.333333 (the average of all parts of speech is 1.477080).

The 1st highest number of forms (8) was observed with the lemma “fi”: e, era, este, fi, fie, fiind, fost, sunt.

The 2nd highest number of forms (3) was observed with the lemma “avea”: a, ar, au.

The 3rd highest number of forms (2) was observed with the lemma “vrea”: va, vor.
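The form / lemma ratio reported above can be checked by hand from the three form lists. The following is a minimal sketch, assuming the ratio is simply the number of distinct forms divided by the number of lemmas; the variable names are illustrative and not part of any UD tooling.

```python
# Forms per lemma, copied from the lists on this page.
forms_by_lemma = {
    "fi": ["e", "era", "este", "fi", "fie", "fiind", "fost", "sunt"],
    "avea": ["a", "ar", "au"],
    "vrea": ["va", "vor"],
}

# Assumption: the ratio is (distinct forms) / (lemmas).
n_forms = sum(len(set(forms)) for forms in forms_by_lemma.values())
n_lemmas = len(forms_by_lemma)
ratio = n_forms / n_lemmas

print(n_forms, n_lemmas, f"{ratio:.6f}")  # 13 3 4.333333
```

The 13 forms match the 13 AUX types reported at the top of the page.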

AUX occurs with 6 features: VerbForm (367; 86% instances), Number (344; 80% instances), Person (342; 80% instances), Tense (266; 62% instances), Mood (222; 52% instances), Gender (22; 5% instances)

AUX occurs with 12 feature-value pairs: Gender=Masc, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=3, Tense=Imp, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 10 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (153 tokens). Examples: este, e
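A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. A minimal sketch of splitting such a string into a dictionary follows; `parse_feats` is a hypothetical helper name, not a function from any particular library.

```python
def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a {name: value} dictionary."""
    if feats == "_":  # "_" marks an empty feature set in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

most_frequent = "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"
feats = parse_feats(most_frequent)
print(feats["Mood"], feats["Person"], len(feats))  # Ind 3 5
```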

Relations

AUX nodes are attached to their parents using 6 different relations: cop (183; 43% instances), aux (121; 28% instances), aux:pass (117; 27% instances), root (4; 1% instances), conj (2; 0% instances), acl (1; 0% instances)
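The percentages above appear to be each relation's share of all 428 AUX tokens, rounded to a whole number. The sketch below reproduces them under that assumption; the counts are copied from the line above and the rounding rule is inferred, not documented here.

```python
# Relation counts for AUX nodes, copied from this page.
relation_counts = {
    "cop": 183,
    "aux": 121,
    "aux:pass": 117,
    "root": 4,
    "conj": 2,
    "acl": 1,
}

total = sum(relation_counts.values())

# Assumption: each percentage is count / total, rounded to the nearest integer.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)        # 428
print(percentages)  # {'cop': 43, 'aux': 28, 'aux:pass': 27, 'root': 1, 'conj': 0, 'acl': 0}
```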

Parents of AUX nodes belong to 8 different parts of speech: VERB (230; 54% instances), ADJ (101; 24% instances), NOUN (67; 16% instances), ADV (18; 4% instances), NUM (6; 1% instances), no parent, i.e. the AUX node is the root (4; 1% instances), PRON (1; 0% instances), X (1; 0% instances)

421 (98%) AUX nodes are leaves.

0 (0%) AUX nodes have one child.

3 (1%) AUX nodes have two children.

4 (1%) AUX nodes have three or more children.

The highest child degree of an AUX node is 6.
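Child degrees like those above can be counted from the HEAD column of a CoNLL-U file. The sketch below uses a made-up toy sentence ("Casa este mare ."), which is not taken from the treebank, and a hypothetical helper `child_degrees`. It shows the typical case in which a copula is attached to the predicate and therefore has no children of its own.

```python
from collections import Counter

# Hypothetical toy sentence (NOT from the treebank): "Casa este mare ."
# Each tuple is (ID, FORM, UPOS, HEAD), mirroring the CoNLL-U columns.
sentence = [
    (1, "Casa", "NOUN", 3),
    (2, "este", "AUX", 3),   # copula attached to the predicate "mare"
    (3, "mare", "ADJ", 0),   # HEAD 0 marks the root
    (4, ".", "PUNCT", 3),
]

def child_degrees(rows, upos="AUX"):
    """Map the ID of each node with the given UPOS to its number of children."""
    n_children = Counter(head for _, _, _, head in rows)
    return {tok_id: n_children[tok_id] for tok_id, _, tag, _ in rows if tag == upos}

degrees = child_degrees(sentence)
print(degrees)  # {2: 0}
```

A degree of 0 means the node is a leaf, as is the case for 421 of the 428 AUX nodes here.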

Children of AUX nodes are attached using 8 different relations: obj (8; 35% instances), nsubj (4; 17% instances), punct (4; 17% instances), advcl (2; 9% instances), parataxis (2; 9% instances), cc (1; 4% instances), conj (1; 4% instances), obl (1; 4% instances)

Children of AUX nodes belong to 7 different parts of speech: NOUN (11; 48% instances), PUNCT (4; 17% instances), ADJ (2; 9% instances), NUM (2; 9% instances), VERB (2; 9% instances), CCONJ (1; 4% instances), PRON (1; 4% instances)