
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: DET

There are 38 DET lemmas (0%), 66 DET types (0%) and 14380 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 11 in number of types and 9 in number of tokens.
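As an illustration of how the three counts above relate to the treebank data, here is a minimal sketch that tallies lemmas, types and tokens for one tag from CoNLL-U text. The function name `upos_counts` and the tiny sample sentence are hypothetical, and lowercasing word forms before counting types is an assumption; the tool that generated this page may normalize forms differently.

```python
def upos_counts(conllu_text, upos="DET"):
    """Return (distinct lemmas, distinct word forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1].lower())  # assumption: types are case-insensitive
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


# Hypothetical hand-made sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = En bil og et hus",
    "\t".join(["1", "En", "en", "DET", "_", "_", "2", "det", "_", "_"]),
    "\t".join(["2", "bil", "bil", "NOUN", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", "og", "og", "CCONJ", "_", "_", "5", "cc", "_", "_"]),
    "\t".join(["4", "et", "en", "DET", "_", "_", "5", "det", "_", "_"]),
    "\t".join(["5", "hus", "hus", "NOUN", "_", "_", "2", "conj", "_", "_"]),
])

print(upos_counts(SAMPLE))  # (1, 2, 2): one lemma, two types, two tokens
```

In the sample, "En" and "et" are two types of the single lemma "en", which is why the lemma count is lower than the type count.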

The 10 most frequent DET lemmas: en, den, de, det, annen, noen, all, denne, egen, hver

The 10 most frequent DET types: en, et, den, de, det, andre, alle, denne, noen, noe

The 10 most frequent ambiguous lemmas: en (DET 6185, PRON 77, X 2), den (DET 1494, PRON 437), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), det (PRON 5440, DET 1116, X 3), noen (DET 536, PRON 109), all (DET 477, X 2), denne (DET 354, PRON 20), selv (ADV 300, DET 201), ingen (DET 197, PRON 104), slik (ADV 228, DET 190)

The 10 most frequent ambiguous types: en (DET 3932, PRON 68, ADP 5, SCONJ 2, X 2), et (DET 1784, PRON 1), den (DET 1275, PRON 354), de (DET 1170, PRON 1054, PROPN 11, X 6, ADV 1), det (PRON 3781, DET 931, X 3), andre (DET 473, ADJ 53), alle (DET 337, PRON 137, ADV 1), denne (DET 306, PRON 18), noen (DET 288, PRON 87), noe (PRON 311, DET 221)

Morphology

The form / lemma ratio of DET is 1.736842 (the average of all parts of speech is 1.381903).
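The ratio is consistent with dividing the number of DET types by the number of DET lemmas reported at the top of this page. A short check, assuming that is indeed how the figure is derived:

```python
det_types = 66   # DET types reported on this page
det_lemmas = 38  # DET lemmas reported on this page

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 1.736842
```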

The highest number of forms (7) was observed with the lemma “en”: at, ei, en, ens, er, et, ett.

The 2nd highest number of forms (5) was observed with the lemma “annen”: andre, andres, annen, annens, annet.

The 3rd highest number of forms (4) was observed with the lemma “all”: all, alle, alles, alt.

DET occurs with 6 features: PronType (14380; 100% instances), Number (13821; 96% instances), Gender (10811; 75% instances), Definite (884; 6% instances), Polarity (197; 1% instances), Case (47; 0% instances)

DET occurs with 17 feature-value pairs: Case=Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Dem,Ind, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 44 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Art (4216 tokens). Examples: en

Relations

DET nodes are attached to their parents using 20 different relations: det (13554; 94% instances), obl (214; 1% instances), nmod (142; 1% instances), nsubj (141; 1% instances), obj (84; 1% instances), conj (78; 1% instances), root (75; 1% instances), appos (20; 0% instances), nsubj:pass (15; 0% instances), xcomp (15; 0% instances), flat:name (8; 0% instances), orphan (7; 0% instances), acl:relcl (6; 0% instances), ccomp (5; 0% instances), acl (4; 0% instances), iobj (4; 0% instances), reparandum (4; 0% instances), advcl (2; 0% instances), compound (1; 0% instances), expl (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (12380; 86% instances), ADJ (790; 5% instances), VERB (394; 3% instances), PRON (258; 2% instances), PROPN (199; 1% instances), DET (193; 1% instances), NUM (79; 1% instances), [root] (75; 1% instances), ADV (6; 0% instances), ADP (3; 0% instances), SCONJ (3; 0% instances). Here [root] stands for DET nodes with no parent, matching the 75 root attachments listed above.
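The relation and parent statistics above can be sketched as a lookup of each DET node's HEAD within its sentence. This is a minimal sketch, not the script that produced this page; `det_attachments` and both sample sentences are hypothetical. A node whose HEAD is 0 is the sentence root and has no parent, so its parent part of speech is recorded as an empty string.

```python
from collections import Counter


def det_attachments(rows, upos="DET"):
    """Count incoming relations and parent UPOS tags for nodes of one tag.

    `rows` holds the 10-column CoNLL-U token rows of a single sentence.
    """
    pos_by_id = {row[0]: row[3] for row in rows}
    relations, parent_pos = Counter(), Counter()
    for row in rows:
        if row[3] != upos:
            continue
        relations[row[7]] += 1
        # HEAD "0" is not a token ID, so root nodes fall back to "".
        parent_pos[pos_by_id.get(row[6], "")] += 1
    return relations, parent_pos


# Hypothetical hand-made sentences, not taken from the treebank.
ROWS = [
    ["1", "En", "en", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "bil", "bil", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "og", "og", "CCONJ", "_", "_", "5", "cc", "_", "_"],
    ["4", "et", "en", "DET", "_", "_", "5", "det", "_", "_"],
    ["5", "hus", "hus", "NOUN", "_", "_", "2", "conj", "_", "_"],
]
ROOT_ROWS = [["1", "Det", "det", "DET", "_", "_", "0", "root", "_", "_"]]

relations, parent_pos = det_attachments(ROWS)
print(relations.most_common())   # [('det', 2)]
print(parent_pos.most_common())  # [('NOUN', 2)]
print(det_attachments(ROOT_ROWS)[1].most_common())  # [('', 1)]
```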

13449 (94%) DET nodes are leaves.

615 (4%) DET nodes have one child.

162 (1%) DET nodes have two children.

154 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 9.
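The four child-count groups above partition all DET tokens, and the percentages follow from rounding each share of the 14380 DET tokens to a whole number. A small check of that arithmetic, using only figures from this page (the dictionary labels are illustrative):

```python
total_det = 14380  # DET tokens reported on this page

degree_counts = {
    "leaves": 13449,
    "one child": 615,
    "two children": 162,
    "three or more children": 154,
}

# The groups are mutually exclusive and cover every DET node.
covered = sum(degree_counts.values())

# Whole-number percentage of DET tokens in each group.
shares = {label: round(100 * n / total_det) for label, n in degree_counts.items()}

print(covered)  # 14380
print(shares)   # {'leaves': 94, 'one child': 4, 'two children': 1, 'three or more children': 1}
```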

Children of DET nodes are attached using 24 different relations: case (287; 18% instances), nmod (248; 15% instances), advmod (166; 10% instances), punct (166; 10% instances), det (159; 10% instances), obl (107; 7% instances), cop (95; 6% instances), nsubj (86; 5% instances), cc (73; 5% instances), conj (57; 4% instances), advcl (39; 2% instances), amod (35; 2% instances), acl:relcl (30; 2% instances), mark (18; 1% instances), expl (10; 1% instances), acl (8; 0% instances), aux (8; 0% instances), acl:cleft (6; 0% instances), appos (4; 0% instances), nummod (4; 0% instances), csubj (3; 0% instances), orphan (3; 0% instances), parataxis (3; 0% instances), flat:name (2; 0% instances)

Children of DET nodes belong to 15 different parts of speech: ADP (300; 19% instances), NOUN (289; 18% instances), DET (193; 12% instances), PUNCT (166; 10% instances), PRON (113; 7% instances), ADV (109; 7% instances), AUX (103; 6% instances), ADJ (89; 6% instances), VERB (85; 5% instances), CCONJ (73; 5% instances), PROPN (58; 4% instances), SCONJ (18; 1% instances), PART (11; 1% instances), NUM (9; 1% instances), X (1; 0% instances)