
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: DET

There are 47 DET lemmas (0%), 75 DET types (0%) and 14396 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 9 in number of types and 9 in number of tokens.
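
As a rough illustration of how counts like these could be derived, the sketch below tallies lemmas, types and tokens for one UPOS tag from CoNLL-U text. The function name `upos_counts` and the sample sentence are hypothetical, and lowercasing forms before counting types is an assumption; the script that generated this page may count differently.

```python
from collections import Counter

# Tiny made-up CoNLL-U fragment (not taken from the treebank); columns are tab-separated:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = "\n".join([
    "# text = en bil og et hus",
    "1\ten\ten\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tbil\tbil\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tog\tog\tCCONJ\t_\t_\t5\tcc\t_\t_",
    "4\tet\ten\tDET\t_\t_\t5\tdet\t_\t_",
    "5\thus\thus\tNOUN\t_\t_\t2\tconj\t_\t_",
])


def upos_counts(conllu_text, upos="DET"):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types = Counter(), Counter()
    tokens = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] != upos:
            continue
        tokens += 1
        lemmas[cols[2]] += 1
        types[cols[1].lower()] += 1  # assumption: types are case-insensitive
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE))
```

In the sample, both DET tokens share the lemma "en" but have different forms, so the function reports one lemma, two types and two tokens.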

The 10 most frequent DET lemmas: en, den, de, det, annen, noen, all, denne, egen, hver

The 10 most frequent DET types: en, et, den, de, det, andre, alle, denne, noen, noe

The 10 most frequent ambiguous lemmas: en (DET 6185, PRON 77, X 2), den (DET 1494, PRON 437), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), det (PRON 5440, DET 1116, X 3), noen (DET 536, PRON 109), all (DET 477, X 2), denne (DET 354, PRON 20), selv (ADV 300, DET 201), ingen (DET 197, PRON 104), slik (ADV 228, DET 190)

The 10 most frequent ambiguous types: en (DET 3932, PRON 68, ADP 6, X 2, SCONJ 1), et (DET 1784, PRON 1), den (DET 1275, PRON 354), de (DET 1170, PRON 1054, PROPN 11, X 6, ADV 1), det (PRON 3781, DET 931, X 3), andre (DET 473, ADJ 53), alle (DET 337, PRON 137, ADV 1), denne (DET 306, PRON 18), noen (DET 288, PRON 87), noe (PRON 311, DET 221)

Morphology

The form / lemma ratio of DET is 1.595745 (the average of all parts of speech is 1.381699).
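
The reported ratio appears to be the number of DET types divided by the number of DET lemmas given earlier on this page (75 and 47). A minimal check of that arithmetic, assuming this is indeed how the figure is computed:

```python
# Figures taken from this page; the interpretation "types / lemmas" is an assumption.
det_types = 75
det_lemmas = 47

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # prints 1.595745
```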

The highest number of forms (7) was observed with the lemma “en”: at, ei, en, ens, er, et, ett.

The 2nd highest number of forms (5) was observed with the lemma “annen”: andre, andres, annen, annens, annet.

The 3rd highest number of forms (4) was observed with the lemma “all”: all, alle, alles, alt.

DET occurs with 6 features: PronType (14199; 99% instances), Number (13821; 96% instances), Gender (10811; 75% instances), Definite (884; 6% instances), Polarity (182; 1% instances), Case (47; 0% instances)

DET occurs with 16 feature-value pairs: Case=Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Dem,Ind, PronType=Ind, PronType=Int, PronType=Prs, PronType=Tot

DET occurs with 47 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Art (4216 tokens). Examples: en

Relations

DET nodes are attached to their parents using 17 different relations: det (13596; 94% instances), obl (219; 2% instances), nsubj (151; 1% instances), nmod (110; 1% instances), obj (85; 1% instances), conj (78; 1% instances), root (72; 1% instances), xcomp (20; 0% instances), flat:name (19; 0% instances), appos (15; 0% instances), ccomp (8; 0% instances), nsubj:outer (7; 0% instances), dislocated (6; 0% instances), iobj (4; 0% instances), reparandum (4; 0% instances), compound (1; 0% instances), expl (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (12391; 86% instances), ADJ (791; 5% instances), VERB (404; 3% instances), PRON (255; 2% instances), PROPN (199; 1% instances), DET (194; 1% instances), NUM (79; 1% instances), no parent, i.e. DET attached as root (72; 1% instances), ADV (7; 0% instances), SCONJ (3; 0% instances), ADP (1; 0% instances)

13464 (94%) DET nodes are leaves.

615 (4%) DET nodes have one child.

162 (1%) DET nodes have two children.

155 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 9.
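
The four child-degree counts above should add up to the total number of DET tokens (14396), and the percentages follow from that total. A small sketch that checks this, using only the figures on this page; the dictionary keys are labels chosen for illustration:

```python
# Child-degree counts for DET nodes, as reported on this page.
degree_counts = {"0": 13464, "1": 615, "2": 162, "3+": 155}

total = sum(degree_counts.values())

# Share of each degree class, rounded to whole percent as on the page.
shares = {degree: round(100 * count / total) for degree, count in degree_counts.items()}

print(total)
print(shares)
```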

Children of DET nodes are attached using 24 different relations: case (300; 19% instances), nmod (244; 15% instances), advmod (176; 11% instances), det (166; 10% instances), punct (165; 10% instances), obl (99; 6% instances), cop (95; 6% instances), nsubj (79; 5% instances), cc (73; 5% instances), conj (57; 4% instances), advcl (39; 2% instances), amod (36; 2% instances), acl:relcl (32; 2% instances), mark (13; 1% instances), expl (10; 1% instances), aux (8; 0% instances), acl (6; 0% instances), appos (6; 0% instances), nummod (4; 0% instances), csubj (3; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), obj (2; 0% instances), nsubj:outer (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: ADP (302; 19% instances), NOUN (295; 18% instances), DET (194; 12% instances), PUNCT (165; 10% instances), ADV (120; 7% instances), PRON (107; 7% instances), AUX (103; 6% instances), ADJ (89; 6% instances), VERB (82; 5% instances), CCONJ (73; 5% instances), PROPN (54; 3% instances), SCONJ (13; 1% instances), PART (11; 1% instances), NUM (9; 1% instances), X (1; 0% instances)