home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-Nonstandard: POS Tags: ADV

There are 580 ADV lemmas (4%), 951 ADV types (3%) and 34596 ADV tokens (6%). Out of 16 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent ADV lemmas: nu, și, mai, când, cum, atunci, numai, așa, tot, unde

The 10 most frequent ADV types: nu, mai, și, cum, n-, cînd, numai, şi, tot, unde

The 10 most frequent ambiguous lemmas: nu (ADV 7837, PART 3, NOUN 1), și (CCONJ 24473, ADV 3128, PRON 14, SCONJ 9), mai (ADV 2739, NOUN 9, ADJ 1, PROPN 1), numai (ADV 922, ADP 2), așa (ADV 819, NOUN 4, ADJ 2, VERB 2, PROPN 1), tot (DET 2499, PRON 1429, ADV 638, NOUN 5, ADJ 1), unde (ADV 625, NOUN 1), acolo (ADV 614, NOUN 1), nici (ADV 603, CCONJ 586), înainte (ADV 541, ADP 26, NOUN 8)

The 10 most frequent ambiguous types: nu (ADV 5887, NOUN 1), mai (ADV 2610, NOUN 7, ADJ 1), și (CCONJ 13522, ADV 2325, PRON 168, SCONJ 2), n- (ADV 1189, PART 3), numai (ADV 643, ADP 2), şi (CCONJ 2042, ADV 719, PRON 6), tot (DET 579, ADV 518, PRON 124, ADJ 1, NOUN 1), unde (ADV 435, NOUN 7), bine (ADV 438, NOUN 135), așa (ADV 298, ADJ 1)

Morphology

The form / lemma ratio of ADV is 1.639655 (the average of all parts of speech is 2.492163).

The 1st highest number of forms (28) was observed with the lemma “aici”: -acia, -acii, -aciia, -aice, -aici, -ciia, aceiși, aci, acia, acice, acicea, acicia, acie, acieși, acii, aciia, aciiași, aciiș, aciișe, aciiși, aice, aicea, aici, cicea, cii, ice, icea, ici.

The 2nd highest number of forms (12) was observed with the lemma “atunci”: -atunce, -atunci, Atuncia, atunce, atuncea, atunceași, atunceș, atunceși, atuncești, atunci, atuncii, atunciși.

The 3rd highest number of forms (12) was observed with the lemma “și”: -şi, ca, i, si, Ş-, Şî, şi, şi-, ș, ș-, și, și-.

ADV occurs with 7 features: Polarity (8506; 25% instances), PronType (5423; 16% instances), Compound (2; 0% instances), Case (1; 0% instances), Gender (1; 0% instances), Number (1; 0% instances), Person (1; 0% instances)

ADV occurs with 8 feature-value pairs: Case=Acc,Nom, Compound=Yes, Gender=Masc, Number=Sing, Person=3, Polarity=Neg, PronType=Ind, PronType=Int,Rel

ADV occurs with 6 feature combinations. The most frequent feature combination is _ (20667 tokens). Examples: mai, și, numai, şi, încă, acolo, bine, atunce, atunci, așa

Relations

ADV nodes are attached to their parents using 31 different relations: advmod (27999; 81% instances), advmod:tmod (4537; 13% instances), conj (559; 2% instances), root (390; 1% instances), advcl (180; 1% instances), mark (134; 0% instances), parataxis (117; 0% instances), compound (91; 0% instances), acl (86; 0% instances), ccomp (83; 0% instances), case (79; 0% instances), xcomp (77; 0% instances), fixed (58; 0% instances), cc (45; 0% instances), nmod (43; 0% instances), csubj (24; 0% instances), obl (24; 0% instances), advcl:tcl (12; 0% instances), cc:preconj (10; 0% instances), appos (8; 0% instances), obj (8; 0% instances), obl:pmod (7; 0% instances), nsubj (6; 0% instances), discourse (5; 0% instances), orphan (4; 0% instances), amod (3; 0% instances), nmod:tmod (2; 0% instances), vocative (2; 0% instances), ccomp:pmod (1; 0% instances), csubj:pass (1; 0% instances), iobj (1; 0% instances)

Parents of ADV nodes belong to 13 different parts of speech: VERB (23464; 68% instances), NOUN (4729; 14% instances), ADV (2060; 6% instances), ADJ (1335; 4% instances), PRON (1321; 4% instances), PROPN (754; 2% instances), (390; 1% instances), NUM (219; 1% instances), DET (135; 0% instances), AUX (116; 0% instances), ADP (54; 0% instances), INTJ (18; 0% instances), CCONJ (1; 0% instances)

28453 (82%) ADV nodes are leaves.

3500 (10%) ADV nodes have one child.

1243 (4%) ADV nodes have two children.

1400 (4%) ADV nodes have three or more children.

The highest child degree of a ADV node is 11.

Children of ADV nodes are attached using 38 different relations: punct (2386; 20% instances), mark (2276; 19% instances), advmod (1728; 15% instances), obl (945; 8% instances), cop (776; 7% instances), nsubj (631; 5% instances), cc (496; 4% instances), conj (480; 4% instances), advcl (379; 3% instances), case (257; 2% instances), csubj (242; 2% instances), iobj (234; 2% instances), obl:pmod (157; 1% instances), aux (139; 1% instances), nmod:tmod (101; 1% instances), advmod:tmod (70; 1% instances), parataxis (69; 1% instances), fixed (63; 1% instances), det (58; 0% instances), vocative (58; 0% instances), advcl:tcl (47; 0% instances), nmod (47; 0% instances), obj (37; 0% instances), xcomp (30; 0% instances), discourse (29; 0% instances), cc:preconj (12; 0% instances), ccomp (11; 0% instances), compound (11; 0% instances), appos (9; 0% instances), expl (9; 0% instances), nummod (9; 0% instances), acl (7; 0% instances), amod (6; 0% instances), orphan (6; 0% instances), ccomp:pmod (4; 0% instances), expl:pv (3; 0% instances), obl:agent (2; 0% instances), expl:pass (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PUNCT (2386; 20% instances), ADP (2252; 19% instances), ADV (2060; 17% instances), NOUN (1465; 12% instances), AUX (924; 8% instances), VERB (801; 7% instances), PRON (637; 5% instances), CCONJ (514; 4% instances), PROPN (274; 2% instances), SCONJ (212; 2% instances), PART (115; 1% instances), DET (63; 1% instances), ADJ (61; 1% instances), NUM (32; 0% instances), INTJ (29; 0% instances)