This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home nl/pos issue tracker

ADJ: adjective

This document is a placeholder for the language-specific documentation for ADJ.


Treebank Statistics (UD_Dutch)

There are 2496 ADJ lemmas (10%), 3230 ADJ types (11%) and 12878 ADJ tokens (6%). Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 9 in number of tokens.

The 10 most frequent ADJ lemmas: groot, goed, eerste, nieuw, Nederlands, lang, tweede, belangrijk, klein, vorig

The 10 most frequent ADJ types: eerste, nieuwe, grote, goed, Nederlandse, laatste, tweede, groot, verder, goede

The 10 most frequent ambiguous lemmas: groot (ADJ 388, X 1), goed (ADJ 336, NOUN 13, X 2), eerste (ADJ 256, PROPN 3, X 2), nieuw (ADJ 232, NOUN 2, X 2), Nederlands (ADJ 173, PROPN 5, X 1), lang (ADJ 163, NOUN 1), hoog (ADJ 99, X 2), laatst (ADJ 94, X 3), snel (ADJ 86, VERB 1), oud (ADJ 83, X 1)

The 10 most frequent ambiguous types: eerste (ADJ 252, PROPN 3, X 2), nieuwe (ADJ 189, X 2), goed (ADJ 147, NOUN 3, X 2), Nederlandse (ADJ 140, PROPN 22, X 5), laatste (ADJ 115, X 1), groot (ADJ 89, X 1), goede (ADJ 85, X 1), Amerikaanse (ADJ 86, PROPN 1, X 1), hele (ADJ 65, X 1), bekend (ADJ 60, VERB 1)

Morphology

The form / lemma ratio of ADJ is 1.294071 (the average of all parts of speech is 1.258498).

The 1st highest number of forms (53) was observed with the lemma “jarig”: 12-jarige, 15-jarige, 15-jarigen, 16jarige, 17-jarige, 18-jarige, 19jarige, 20-jarig, 20-jarige, 20jarige, 21-jarige, 22-jarige, 23-jarige, 23jarige, 24-jarige, 24jarige, 25-jarig, 25-jarige, 26-jarige, 26jarige, 27-jarige, 30-jarige, 31-jarige, 32-jarige, 33jarige, 34-jarige, 35-jarige, 37-jarige, 38-jarige, 40-jarige, 43-jarige, 45-jarige, 46-jarige, 47-jarige, 49-jarige, 5-jarig, 50-jarige, 54-jarige, 55-jarige, 58-jarige, 59-jarige, 63-jarige, 66-jarige, 67-jarige, 68-jarige, 73jarige, 78-jarige, 79-jarige, 80-jarige, 85-jarige, 86-jarige, achttienjarigen, vijftigjarig.

The 2nd highest number of forms (8) was observed with the lemma “goed”: best, beste, beter, betere, goed, goede, goeds, goeie.

The 3rd highest number of forms (8) was observed with the lemma “loos”: bosloze, emotieloze, klauwloze, leuningloze, loos, loze, pretentieloos, zouteloze.

ADJ occurs with 6 features: Degree (12307; 96% instances), Case (6067; 47% instances), Variant (2733; 21% instances), Definite (536; 4% instances), NumType (536; 4% instances), Number (130; 1% instances)

ADJ occurs with 10 feature-value pairs: Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, NumType=Ord, Number=Plur, Number=Sing, Variant=Short

ADJ occurs with 16 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos (5471 tokens). Examples: nieuwe, grote, Nederlandse, goede, Amerikaanse, hele, kleine, politieke, Franse, Duitse

Relations

ADJ nodes are attached to their parents using 24 different relations: amod (7382; 57% instances), advmod (2827; 22% instances), root (941; 7% instances), conj (431; 3% instances), compound:prt (176; 1% instances), dobj (146; 1% instances), xcomp (142; 1% instances), nsubj (129; 1% instances), ccomp (95; 1% instances), parataxis (94; 1% instances), acl (91; 1% instances), aux (89; 1% instances), advcl (86; 1% instances), mark (86; 1% instances), dep (83; 1% instances), csubj (23; 0% instances), appos (22; 0% instances), cc (17; 0% instances), cop (5; 0% instances), nmod (5; 0% instances), case (3; 0% instances), iobj (2; 0% instances), name (2; 0% instances), neg (1; 0% instances)

Parents of ADJ nodes belong to 16 different parts of speech: NOUN (7339; 57% instances), VERB (2190; 17% instances), ROOT (941; 7% instances), ADJ (748; 6% instances), AUX (613; 5% instances), PROPN (300; 2% instances), PRON (245; 2% instances), ADV (239; 2% instances), NUM (76; 1% instances), X (64; 0% instances), CONJ (38; 0% instances), DET (30; 0% instances), SCONJ (30; 0% instances), ADP (15; 0% instances), INTJ (9; 0% instances), SYM (1; 0% instances)

9562 (74%) ADJ nodes are leaves.

1023 (8%) ADJ nodes have one child.

588 (5%) ADJ nodes have two children.

1705 (13%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 14.

Children of ADJ nodes are attached using 28 different relations: advmod (1738; 17% instances), cop (1593; 15% instances), punct (1406; 13% instances), nsubj (1251; 12% instances), nmod (907; 9% instances), dobj (728; 7% instances), conj (470; 4% instances), cc (430; 4% instances), mark (321; 3% instances), det (274; 3% instances), case (270; 3% instances), advcl (219; 2% instances), neg (216; 2% instances), csubj (149; 1% instances), aux (98; 1% instances), parataxis (73; 1% instances), dep (71; 1% instances), xcomp (58; 1% instances), ccomp (55; 1% instances), expl (43; 0% instances), compound:prt (31; 0% instances), nummod (25; 0% instances), amod (21; 0% instances), appos (13; 0% instances), compound (6; 0% instances), det:nummod (6; 0% instances), name (4; 0% instances), acl (2; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: NOUN (1529; 15% instances), ADV (1499; 14% instances), AUX (1406; 13% instances), PUNCT (1406; 13% instances), PRON (1215; 12% instances), VERB (898; 9% instances), ADJ (748; 7% instances), CONJ (402; 4% instances), SCONJ (353; 3% instances), ADP (305; 3% instances), DET (293; 3% instances), PROPN (262; 3% instances), NUM (73; 1% instances), X (66; 1% instances), SYM (20; 0% instances), INTJ (3; 0% instances)


Treebank Statistics (UD_Dutch-LassySmall)

There are 1335 ADJ lemmas (10%), 1763 ADJ types (11%) and 6954 ADJ tokens (7%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent ADJ lemmas: Vlaams, Belgisch, groot, ander, laat, nieuw, Brussels, bekend, Frans, belangrijk

The 10 most frequent ADJ types: Vlaamse, belgische, andere, grote, nieuwe, externe, Vlaams, later, Franse, eigen

The 10 most frequent ambiguous lemmas: Vlaams (ADJ 286, PROPN 45), groot (ADJ 203, PROPN 9), ander (ADJ 164, ADV 1), nieuw (ADJ 101, NOUN 1), Brussels (ADJ 86, PROPN 6, X 1), Frans (ADJ 84, PROPN 53), politiek (ADJ 74, NOUN 28), goed (ADJ 66, NOUN 2), groen (ADJ 65, PROPN 20), koninklijk (ADJ 50, PROPN 3)

The 10 most frequent ambiguous types: Vlaams (ADJ 77, PROPN 45), Franse (ADJ 67, PROPN 1), bekend (ADJ 46, VERB 1), Brussels (ADJ 45, PROPN 6, X 1), vaak (ADJ 40, ADV 3), Europese (ADJ 38, PROPN 3), bekende (ADJ 21, VERB 1), verder (ADJ 22, ADV 8), Waals (ADJ 19, PROPN 6), politiek (NOUN 17, ADJ 12)

Morphology

The form / lemma ratio of ADJ is 1.320599 (the average of all parts of speech is 1.179900).

The 1st highest number of forms (7) was observed with the lemma “laat”: laat, laatst, laatste, laatsten, late, later, latere.

The 2nd highest number of forms (6) was observed with the lemma “goed”: best, beste, beter, betere, goed, goede.

The 3rd highest number of forms (6) was observed with the lemma “groot”: groot, grootste, grote, groten, groter, grotere.

ADJ occurs with 1 features: Degree (6954; 100% instances)

ADJ occurs with 3 feature-value pairs: Degree=Cmp, Degree=Pos, Degree=Sup

ADJ occurs with 3 feature combinations. The most frequent feature combination is Degree=Pos (6431 tokens). Examples: Vlaamse, belgische, andere, grote, nieuwe, externe, Vlaams, Franse, eigen, federale

Relations

ADJ nodes are attached to their parents using 18 different relations: amod (5086; 73% instances), nmod (421; 6% instances), root (345; 5% instances), mwe (316; 5% instances), conj (291; 4% instances), nsubj (147; 2% instances), acl (74; 1% instances), advcl (64; 1% instances), compound (49; 1% instances), det (42; 1% instances), parataxis (30; 0% instances), dobj (29; 0% instances), appos (26; 0% instances), ccomp (20; 0% instances), advmod (7; 0% instances), cc (4; 0% instances), iobj (2; 0% instances), xcomp (1; 0% instances)

Parents of ADJ nodes belong to 16 different parts of speech: NOUN (4572; 66% instances), VERB (1023; 15% instances), ADJ (475; 7% instances), ROOT (345; 5% instances), PROPN (295; 4% instances), DET (69; 1% instances), PRON (49; 1% instances), ADP (33; 0% instances), NUM (32; 0% instances), ADV (27; 0% instances), X (25; 0% instances), SYM (4; 0% instances), INTJ (2; 0% instances), CONJ (1; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)

5429 (78%) ADJ nodes are leaves.

428 (6%) ADJ nodes have one child.

320 (5%) ADJ nodes have two children.

777 (11%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 43.

Children of ADJ nodes are attached using 27 different relations: punct (746; 15% instances), mwe (745; 15% instances), nmod (540; 11% instances), det (450; 9% instances), cop (366; 7% instances), nsubj (350; 7% instances), advmod (306; 6% instances), conj (298; 6% instances), case (292; 6% instances), cc (265; 5% instances), amod (170; 3% instances), mark (85; 2% instances), advcl (61; 1% instances), parataxis (58; 1% instances), appos (44; 1% instances), acl (38; 1% instances), neg (36; 1% instances), name (33; 1% instances), dobj (23; 0% instances), auxpass (19; 0% instances), aux (17; 0% instances), nummod (15; 0% instances), ccomp (8; 0% instances), expl (7; 0% instances), compound (2; 0% instances), csubj (2; 0% instances), iobj (2; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: NOUN (1051; 21% instances), PUNCT (751; 15% instances), ADJ (475; 10% instances), DET (467; 9% instances), AUX (402; 8% instances), ADV (382; 8% instances), ADP (354; 7% instances), PROPN (307; 6% instances), CONJ (273; 5% instances), PRON (155; 3% instances), VERB (153; 3% instances), NUM (68; 1% instances), SCONJ (61; 1% instances), X (33; 1% instances), PART (24; 0% instances), SYM (22; 0% instances)


ADJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]