
This page pertains to UD version 2.

Treebank Statistics: UD_German: POS Tags: ADJ

There are 5149 ADJ lemmas (11%), 7730 ADJ types (14%) and 20887 ADJ tokens (7%). Out of 15 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.
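As a rough illustration of how the three counts above relate, the following minimal sketch counts distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U data. The sample rows and the helper name `pos_counts` are hypothetical, and the sketch assumes that forms are compared case-sensitively; it is not the script that generated this page.

```python
# Hypothetical mini-sample in CoNLL-U column order:
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
# It is illustrative only and is not taken from UD_German.
ROWS = [
    "1 Die der DET _ _ 3 det _ _",
    "2 große groß ADJ _ _ 3 amod _ _",
    "3 Stadt Stadt NOUN _ _ 5 nsubj _ _",
    "4 ist sein AUX _ _ 5 cop _ _",
    "5 größer groß ADJ _ _ 0 root _ _",
    "6 als als ADP _ _ 9 case _ _",
    "7 das der DET _ _ 9 det _ _",
    "8 alte alt ADJ _ _ 9 amod _ _",
    "9 Dorf Dorf NOUN _ _ 5 obl _ _",
    "10 . . PUNCT _ _ 5 punct _ _",
]
# Real CoNLL-U uses tabs between columns, so rebuild the sample that way.
SAMPLE = "\n".join("\t".join(row.split(" ")) for row in ROWS)


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank separators and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges ("1-2") and empty nodes ("1.1")
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "ADJ"))  # (2, 3, 3)
```

In the sample, "große" and "größer" share the lemma "groß", so three ADJ tokens yield three types but only two lemmas.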

The 10 most frequent ADJ lemmas: erst, groß, gut, neu, weit, spät, ander, deutsch, alt, hoch

The 10 most frequent ADJ types: ersten, später, erste, anderen, weitere, neue, bekannt, neuen, großen, große

The 10 most frequent ambiguous lemmas: erst (ADJ 398, ADV 116, PROPN 37, NOUN 12, NUM 1), groß (ADJ 375, PROPN 23, NOUN 3, ADV 1, VERB 1), gut (ADJ 341, ADV 79, PROPN 11, NOUN 7), neu (ADJ 301, PROPN 17, ADV 11), weit (ADJ 273, ADV 6, NOUN 5), spät (ADJ 235, ADV 36, ADP 1, NOUN 1), ander (ADJ 223, PRON 157, NOUN 5, ADV 2), deutsch (ADJ 179, PROPN 79, NOUN 7), alt (ADJ 177, PROPN 17, NOUN 5), hoch (ADJ 172, PROPN 5, ADV 2, NOUN 1)

The 10 most frequent ambiguous types: ersten (ADJ 199, NOUN 4), später (ADJ 158, ADV 34, ADP 1, NOUN 1), erste (ADJ 156, NOUN 5, ADV 1), anderen (ADJ 125, PRON 31, NOUN 1), weitere (ADJ 89, NOUN 1), neue (ADJ 107, PROPN 1), bekannt (ADJ 95, ADV 8, VERB 4), großen (ADJ 91, PROPN 2, VERB 1), große (ADJ 81, PROPN 1), gut (ADJ 76, ADV 73, PROPN 1)

Morphology

The form / lemma ratio of ADJ is 1.501262 (the average of all parts of speech is 1.186689).
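The ratio above appears to be the number of ADJ types divided by the number of ADJ lemmas reported earlier on this page. A quick check, using only those two published counts (the variable names are illustrative):

```python
# Counts reported on this page for ADJ in UD_German.
adj_types = 7730
adj_lemmas = 5149

# Form / lemma ratio: average number of distinct forms per lemma.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")  # 1.501262
```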

The highest number of forms (21) was observed with the lemma “groß”: gross, grosse, grossen, grosser, groß, große, großem, großen, großer, großes, grössere, grösste, größer, größere, größerem, größeren, größerer, größeres, größte, größten, größter.

The 2nd highest number of forms (17) was observed with the lemma “hoch”: Höchstes, hoch, hohe, hohem, hohen, hoher, hohes, höchste, höchstem, höchsten, höchster, höher, höhere, höherem, höheren, höherer, höheres.

The 3rd highest number of forms (16) was observed with the lemma “gut”: besser, bessere, besserem, besseren, besserer, besseres, beste, besten, bester, bestes, gut, gute, gutem, guten, guter, gutes.
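A ranking like the one above can be sketched by grouping distinct forms under each lemma and sorting by group size. The pairs below are a small hand-picked subset of the forms listed on this page, and `forms_per_lemma` is a hypothetical helper, not the tool that produced these statistics.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct forms by lemma; return lemmas ranked by number of forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)  # a set ignores repeated occurrences of a form
    # Most forms first; ties broken alphabetically by lemma.
    return sorted(forms.items(), key=lambda item: (-len(item[1]), item[0]))


# (form, lemma) pairs; a subset chosen for illustration only.
pairs = [
    ("gut", "gut"), ("gute", "gut"), ("besser", "gut"), ("gut", "gut"),
    ("hoch", "hoch"), ("hohe", "hoch"),
]

for lemma, forms in forms_per_lemma(pairs):
    print(lemma, len(forms), sorted(forms))
```

With this subset, "gut" ranks first with three distinct forms and "hoch" second with two; the repeated "gut" token does not add a form.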

ADJ occurs with 6 features: Degree (19237; 92% instances), Case (9535; 46% instances), Number (9500; 45% instances), Gender (6577; 31% instances), NumType (629; 3% instances), Polarity (2; 0% instances)

ADJ occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Degree=Cmp, Degree=Cmp,Pos, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, NumType=Ord, Number=Plur, Number=Sing, Polarity=Neg

ADJ occurs with 172 feature combinations. The most frequent feature combination is Degree=Pos (8742 tokens). Examples: ersten, anderen, erste, gut, großen, kurz, neue, neuen, deutschen, freundlich

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (14318; 69% instances), advmod (2835; 14% instances), root (1517; 7% instances), conj (1182; 6% instances), xcomp (241; 1% instances), acl (159; 1% instances), compound (117; 1% instances), advcl (108; 1% instances), ccomp (101; 0% instances), appos (74; 0% instances), parataxis (71; 0% instances), flat (43; 0% instances), cop (31; 0% instances), det (29; 0% instances), nsubj (16; 0% instances), dep (11; 0% instances), csubj (9; 0% instances), obj (9; 0% instances), csubj:pass (3; 0% instances), fixed (3; 0% instances), nmod:poss (3; 0% instances), obl (3; 0% instances), nmod (2; 0% instances), mark (1; 0% instances), nummod (1; 0% instances)

Parents of ADJ nodes belong to 14 different parts of speech: NOUN (13959; 67% instances), VERB (2341; 11% instances), ADJ (1928; 9% instances), [no parent; root node] (1517; 7% instances), PROPN (914; 4% instances), ADP (67; 0% instances), ADV (60; 0% instances), NUM (44; 0% instances), PRON (38; 0% instances), X (5; 0% instances), AUX (4; 0% instances), SCONJ (4; 0% instances), DET (3; 0% instances), PART (3; 0% instances)
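The parent statistics can be approximated by looking up the tag of each ADJ token's head. In the sketch below, a head of 0 marks the sentence root, which has no parent and therefore no parent tag; this would explain why one entry in the list carries the same count (1517) as the root relation. The sample rows and the function name `parent_pos_counts` are hypothetical.

```python
from collections import Counter

# Hypothetical single sentence in CoNLL-U column order (space-separated here
# for readability): ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
ROWS = [
    "1 Die der DET _ _ 3 det _ _",
    "2 große groß ADJ _ _ 3 amod _ _",
    "3 Stadt Stadt NOUN _ _ 5 nsubj _ _",
    "4 ist sein AUX _ _ 5 cop _ _",
    "5 größer groß ADJ _ _ 0 root _ _",
]


def parent_pos_counts(rows, upos):
    """Count the UPOS tags of the parents of all tokens tagged `upos`."""
    tokens = [row.split(" ") for row in rows]
    pos_by_id = {tok[0]: tok[3] for tok in tokens}
    counts = Counter()
    for tok in tokens:
        if tok[3] == upos:
            head = tok[6]
            # HEAD "0" is the artificial root: there is no parent tag,
            # so the empty string is used as the key.
            counts[pos_by_id.get(head, "")] += 1
    return counts


print(parent_pos_counts(ROWS, "ADJ"))
```

Here "große" has a NOUN parent, while "größer" is the root and is counted under the empty tag.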

14599 (70%) ADJ nodes are leaves.

2915 (14%) ADJ nodes have one child.

870 (4%) ADJ nodes have two children.

2503 (12%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 12.
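As a sanity check on the child-count figures above, the four buckets should add up to the total number of ADJ tokens (20887), and the percentages follow from simple rounding. The dictionary keys are illustrative labels; the numbers are the ones reported on this page.

```python
# Total ADJ tokens and child-count buckets as reported on this page.
total_adj_tokens = 20887
buckets = {
    "leaves": 14599,
    "one child": 2915,
    "two children": 870,
    "three or more children": 2503,
}

# The buckets partition all ADJ nodes, so they must sum to the token total.
bucket_sum = sum(buckets.values())
print(bucket_sum == total_adj_tokens)  # True

# Percentages rounded to whole numbers, as shown in the statistics.
percentages = {name: round(100 * n / total_adj_tokens) for name, n in buckets.items()}
print(percentages)
```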

Children of ADJ nodes are attached using 31 different relations: advmod (3266; 20% instances), punct (2987; 18% instances), nmod (2494; 15% instances), cop (2233; 14% instances), nsubj (1972; 12% instances), conj (1223; 7% instances), cc (914; 6% instances), case (195; 1% instances), mark (169; 1% instances), compound (130; 1% instances), det (128; 1% instances), advcl (117; 1% instances), parataxis (79; 0% instances), amod (77; 0% instances), appos (71; 0% instances), csubj (62; 0% instances), obj (56; 0% instances), iobj (51; 0% instances), ccomp (44; 0% instances), nsubj:pass (41; 0% instances), xcomp (40; 0% instances), dep (39; 0% instances), nummod (38; 0% instances), expl (28; 0% instances), acl (27; 0% instances), aux (9; 0% instances), aux:pass (3; 0% instances), compound:prt (3; 0% instances), fixed (3; 0% instances), obl (3; 0% instances), det:poss (2; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (3316; 20% instances), PUNCT (2990; 18% instances), ADV (2353; 14% instances), AUX (2146; 13% instances), ADJ (1928; 12% instances), CCONJ (918; 6% instances), PRON (762; 5% instances), PROPN (653; 4% instances), VERB (597; 4% instances), NUM (221; 1% instances), ADP (209; 1% instances), SCONJ (159; 1% instances), PART (116; 1% instances), DET (112; 1% instances), X (24; 0% instances)