
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: ADJ

There are 3544 ADJ lemmas (15%), 4853 ADJ types (15%) and 27149 ADJ tokens (9%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: mykje, mange, stor, god, ny, heil, norsk, liten, viktig, lang

The 10 most frequent ADJ types: meir, mange, fleire, mykje, nye, store, heile, godt, heilt, norske

The 10 most frequent ambiguous lemmas: stor (ADJ 738, X 1), god (ADJ 706, X 1), ny (ADJ 621, X 1), norsk (ADJ 460, NOUN 75), liten (ADJ 433, X 1), lang (ADJ 309, X 1), rett (ADJ 250, NOUN 98, X 1), sist (ADJ 231, ADV 31), politisk (ADJ 215, X 3), klar (ADJ 187, VERB 1)

The 10 most frequent ambiguous types: nye (ADJ 333, X 2), store (ADJ 258, X 1), heile (ADJ 251, NOUN 2), norske (ADJ 227, X 2), norsk (ADJ 209, NOUN 70), litt (ADJ 171, X 1), mest (ADJ 168, X 1), rett (ADJ 155, NOUN 26, X 1), stor (ADJ 157, X 1), ny (ADJ 155, X 1)

Morphology

The form / lemma ratio of ADJ is 1.369357 (the average of all parts of speech is 1.352830).
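The ADJ ratio matches the number of ADJ types divided by the number of ADJ lemmas reported at the top of this page. The following is a minimal sketch of that arithmetic, assuming this is how the figure is derived; the variable names are illustrative only.

```python
# Counts copied from the ADJ summary line on this page.
adj_lemmas = 3544
adj_types = 4853

# Assumption: "form / lemma ratio" means distinct word forms (types)
# divided by distinct lemmas for the tag.
form_lemma_ratio = adj_types / adj_lemmas

print(f"{form_lemma_ratio:.6f}")  # 1.369357
```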

Three lemmas are tied for the highest number of forms (9):

“høg”: høg, høgare, høgast, høgaste, høge, høgre, høgst, høgste, høgt.

“liten”: lita, lite, liten, mindre, minst, minste, små, smått, vesle.

“nær”: nermare, nær, nærare, nærast, næraste, nære, nærmare, nærmast, nært.

ADJ occurs with 7 features: Degree (24498; 90% instances), Number (22802; 84% instances), Definite (17992; 66% instances), Gender (7889; 29% instances), VerbForm (2367; 9% instances), Abbr (25; 0% instances), Case (18; 0% instances)

ADJ occurs with 13 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, VerbForm=Part

ADJ occurs with 27 feature combinations. The most frequent feature combination is Definite=Ind|Degree=Pos|Gender=Neut|Number=Sing (7272 tokens). Examples: mykje, godt, heilt, langt, svært, litt, rett, veldig, viktig, norsk
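A feature combination such as the one above is a single string of feature-value pairs joined by `|`, as in the FEATS column of a CoNLL-U file. The sketch below shows one way to split such a string into individual pairs; `parse_feats` is a hypothetical helper name, not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    # CoNLL-U uses "_" for a token that carries no features.
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = parse_feats("Definite=Ind|Degree=Pos|Gender=Neut|Number=Sing")
print(combo["Degree"])  # Pos
```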

Relations

ADJ nodes are attached to their parents using 24 different relations: amod (14779; 54% instances), advmod (5766; 21% instances), root (1871; 7% instances), conj (1497; 6% instances), xcomp (1214; 4% instances), nsubj (538; 2% instances), obj (382; 1% instances), advcl (271; 1% instances), ccomp (238; 1% instances), acl:relcl (209; 1% instances), acl (80; 0% instances), flat:name (77; 0% instances), appos (51; 0% instances), csubj (50; 0% instances), orphan (33; 0% instances), nmod (27; 0% instances), nsubj:pass (20; 0% instances), acl:cleft (15; 0% instances), iobj (12; 0% instances), compound (10; 0% instances), parataxis (4; 0% instances), flat:foreign (2; 0% instances), reparandum (2; 0% instances), discourse (1; 0% instances)

Parents of ADJ nodes belong to 15 different parts of speech: NOUN (15385; 57% instances), VERB (5564; 20% instances), ADJ (3320; 12% instances), ROOT (1871; 7% instances), PROPN (338; 1% instances), PRON (164; 1% instances), ADV (153; 1% instances), ADP (107; 0% instances), DET (95; 0% instances), NUM (94; 0% instances), SCONJ (25; 0% instances), PART (23; 0% instances), INTJ (4; 0% instances), X (4; 0% instances), AUX (2; 0% instances)
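Counts like the relation and parent-tag figures above can be reproduced from a CoNLL-U file by tallying, for every ADJ token, its DEPREL column and the UPOS of its head. The following is a rough sketch under stated assumptions: the sample sentence is invented for illustration and is not taken from the treebank, the function names are hypothetical, and tokens whose head is 0 are counted under the label `ROOT`.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U fragment (not from UD_Norwegian-Nynorsk).
SAMPLE = """\
1\tEit\tein\tDET\t_\t_\t3\tdet\t_\t_
2\tstort\tstor\tADJ\t_\t_\t3\tamod\t_\t_
3\thus\thus\tNOUN\t_\t_\t5\tnsubj\t_\t_
4\ter\tvere\tAUX\t_\t_\t5\tcop\t_\t_
5\tgamalt\tgamal\tADJ\t_\t_\t0\troot\t_\t_
6\t.\t.\tPUNCT\t_\t_\t5\tpunct\t_\t_
"""


def read_sentences(text):
    """Yield each sentence as a list of 10-column token rows."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            if rows:
                yield rows
                rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Keep plain tokens; skip multiword ranges (1-2) and empty nodes (1.1).
            if cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows


def parent_stats(text, upos="ADJ"):
    """Count incoming relations and parent tags for nodes with the given UPOS."""
    relations, parent_tags = Counter(), Counter()
    for sent in read_sentences(text):
        tag_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # Head "0" has no token row, so it falls back to the ROOT label.
            parent_tags[tag_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_tags


relations, parent_tags = parent_stats(SAMPLE)
print(relations.most_common())
print(parent_tags.most_common())
```

On a real treebank file, the same function could be applied to the file contents read as one string.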

17591 (65%) ADJ nodes are leaves.

4637 (17%) ADJ nodes have one child.

1455 (5%) ADJ nodes have two children.

3466 (13%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 13.
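The four child-count figures above add up to the 27149 ADJ tokens, so together they form a histogram of child degree. The sketch below shows one way such a histogram could be computed. It assumes a sentence is given as `(id, upos, head)` triples; the sample sentence and the function name are hypothetical.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; not from the treebank.
SENTENCE = [
    (1, "DET", 3),
    (2, "ADJ", 3),
    (3, "NOUN", 5),
    (4, "AUX", 5),
    (5, "ADJ", 0),
    (6, "PUNCT", 5),
]


def child_degree_histogram(sentence, upos="ADJ"):
    """Map number of children -> number of nodes carrying the given tag."""
    # Each occurrence of an id in the head column is one child of that node.
    children = Counter(head for _, _, head in sentence)
    return Counter(children[node_id] for node_id, tag, _ in sentence if tag == upos)


hist = child_degree_histogram(SENTENCE)
leaves = hist[0]
max_degree = max(hist)
print(dict(hist), leaves, max_degree)
```

Here token 2 is a leaf and token 5 has three children, so the histogram has one node at degree 0 and one at degree 3.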

Children of ADJ nodes are attached using 30 different relations: advmod (4346; 18% instances), punct (3327; 14% instances), cop (3016; 12% instances), obl (2289; 9% instances), nsubj (2267; 9% instances), cc (1546; 6% instances), conj (1491; 6% instances), mark (830; 3% instances), det (811; 3% instances), case (786; 3% instances), advcl (784; 3% instances), expl (639; 3% instances), csubj (567; 2% instances), nmod (471; 2% instances), aux (388; 2% instances), parataxis (245; 1% instances), obj (113; 0% instances), amod (103; 0% instances), acl:relcl (76; 0% instances), acl (67; 0% instances), nummod (47; 0% instances), appos (43; 0% instances), xcomp (36; 0% instances), orphan (23; 0% instances), discourse (20; 0% instances), acl:cleft (15; 0% instances), ccomp (6; 0% instances), reparandum (4; 0% instances), flat:name (3; 0% instances), compound (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: AUX (3404; 14% instances), NOUN (3387; 14% instances), PUNCT (3327; 14% instances), ADJ (3320; 14% instances), VERB (1949; 8% instances), PRON (1948; 8% instances), ADV (1876; 8% instances), CCONJ (1545; 6% instances), ADP (917; 4% instances), DET (862; 4% instances), SCONJ (763; 3% instances), PART (497; 2% instances), PROPN (419; 2% instances), NUM (124; 1% instances), INTJ (20; 0% instances), X (2; 0% instances)