home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: ADJ

There are 3470 ADJ lemmas (19%), 5647 ADJ types (17%) and 15298 ADJ tokens (7%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: mare, prezent, european, nou, asemenea, mic, necesar, român, național, general

The 10 most frequent ADJ types: mare, asemenea, europene, prezentul, nou, necesare, prezenta, europeană, mari, european

The 10 most frequent ambiguous lemmas: mare (ADJ 302, NOUN 46, ADV 1), prezent (ADJ 267, NOUN 36), european (ADJ 221, NOUN 1), nou (ADJ 189, ADV 8, NOUN 1), asemenea (ADJ 141, ADV 17), necesar (ADJ 111, ADV 48), român (ADJ 109, NOUN 35), general (ADJ 89, NOUN 39), bun (ADJ 86, NOUN 25, ADV 3), întreg (ADJ 83, NOUN 4)

The 10 most frequent ambiguous types: mare (ADJ 162, NOUN 14, ADV 1), asemenea (ADJ 138, ADV 17), prezentul (ADJ 57, NOUN 4), nou (ADJ 74, ADV 7, NOUN 1), prezenta (ADJ 53, VERB 15), general (ADJ 37, NOUN 30), română (ADJ 23, NOUN 2), vechi (ADJ 37, NOUN 1), diferite (ADJ 36, VERB 5), standard (ADJ 35, NOUN 1)

Morphology

The form / lemma ratio of ADJ is 1.627378 (the average of all parts of speech is 1.814756).

The 1st highest number of forms (9) was observed with the lemma “adevărat”: adevărat, adevărata, adevărate, adevăratele, adevăratul, adevăratului, adevărată, adevărați, adevărații.

The 2nd highest number of forms (9) was observed with the lemma “nou”: noi, noii, noile, noilor, nou, noua, noul, noului, nouă.

The 3rd highest number of forms (9) was observed with the lemma “prezent”: prezent, prezenta, prezente, prezentei, prezentele, prezentul, prezentului, prezentă, prezenți.

ADJ occurs with 9 features: Degree (15276; 100% instances), Definite (15010; 98% instances), Number (14769; 97% instances), Gender (14474; 95% instances), Case (5612; 37% instances), Abbr (18; 0% instances), Foreign (6; 0% instances), Typo (6; 0% instances), Variant (3; 0% instances)

ADJ occurs with 15 feature-value pairs: Abbr=Yes, Case=Acc,Nom, Case=Dat,Gen, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Typo=Yes, Variant=Short

ADJ occurs with 42 feature combinations. The most frequent feature combination is Definite=Ind|Degree=Pos|Gender=Masc|Number=Sing (4182 tokens). Examples: nou, european, general, mic, național, bun, românesc, singur, oficial, scurt

Relations

ADJ nodes are attached to their parents using 26 different relations: amod (12643; 83% instances), conj (879; 6% instances), xcomp (381; 2% instances), root (316; 2% instances), fixed (284; 2% instances), acl (195; 1% instances), advcl (124; 1% instances), flat (117; 1% instances), ccomp (66; 0% instances), csubj (66; 0% instances), nsubj (57; 0% instances), advmod (46; 0% instances), appos (30; 0% instances), parataxis (21; 0% instances), obj (19; 0% instances), iobj (15; 0% instances), nmod (12; 0% instances), nsubj:pass (6; 0% instances), ccomp:pmod (4; 0% instances), dep (4; 0% instances), advcl:tcl (3; 0% instances), case (3; 0% instances), obl (3; 0% instances), csubj:pass (2; 0% instances), obl:pmod (1; 0% instances), orphan (1; 0% instances)

Parents of ADJ nodes belong to 14 different parts of speech: NOUN (12812; 84% instances), ADJ (829; 5% instances), VERB (822; 5% instances), (316; 2% instances), ADP (199; 1% instances), PROPN (153; 1% instances), PRON (101; 1% instances), ADV (28; 0% instances), NUM (18; 0% instances), DET (14; 0% instances), AUX (2; 0% instances), X (2; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)

11514 (75%) ADJ nodes are leaves.

2067 (14%) ADJ nodes have one child.

711 (5%) ADJ nodes have two children.

1006 (7%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 40 different relations: punct (1364; 18% instances), advmod (1223; 16% instances), conj (1014; 13% instances), obl (786; 10% instances), cop (656; 8% instances), cc (622; 8% instances), nsubj (521; 7% instances), det (280; 4% instances), mark (240; 3% instances), advcl (201; 3% instances), iobj (183; 2% instances), obl:pmod (126; 2% instances), aux (78; 1% instances), xcomp (61; 1% instances), parataxis (55; 1% instances), amod (48; 1% instances), case (47; 1% instances), nsubj:pass (39; 1% instances), obl:agent (28; 0% instances), aux:pass (25; 0% instances), nummod (22; 0% instances), csubj (21; 0% instances), appos (20; 0% instances), ccomp:pmod (19; 0% instances), obj (16; 0% instances), flat (15; 0% instances), obl:tmod (15; 0% instances), acl (14; 0% instances), fixed (11; 0% instances), ccomp (10; 0% instances), goeswith (6; 0% instances), advmod:tmod (5; 0% instances), cc:preconj (3; 0% instances), dep (3; 0% instances), expl (3; 0% instances), expl:poss (2; 0% instances), advcl:tcl (1; 0% instances), csubj:pass (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (1626; 21% instances), PUNCT (1364; 18% instances), ADV (1106; 14% instances), ADJ (829; 11% instances), AUX (759; 10% instances), CCONJ (628; 8% instances), VERB (440; 6% instances), DET (242; 3% instances), PRON (213; 3% instances), ADP (163; 2% instances), PART (132; 2% instances), SCONJ (130; 2% instances), PROPN (90; 1% instances), NUM (57; 1% instances), X (7; 0% instances)