home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: DET

There are 44 DET lemmas (0%), 68 DET types (0%) and 14988 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 9 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: ein, den, dei, det, annan, all, denne, nokon, eigen, slik

The 10 most frequent DET types: ein, den, eit, dei, ei, det, andre, alle, denne, anna

The 10 most frequent ambiguous lemmas: ein (DET 5926, PRON 824), den (DET 1927, PRON 148, X 12, PROPN 1), dei (PRON 1668, DET 1515), det (PRON 5532, DET 1337, X 16), all (DET 583, X 3, ADV 2), denne (DET 442, PRON 23, X 1), nokon (DET 430, PRON 74), slik (ADV 368, DET 238), same (DET 227, ADV 1), kvar (DET 211, ADV 27)

The 10 most frequent ambiguous types: ein (DET 2404, PRON 728, ADP 1), den (DET 1665, PRON 115, X 12), dei (PRON 1439, DET 1346), ei (DET 1288, PART 2, PRON 2), det (PRON 4104, DET 1165, X 16, ADV 1), andre (DET 472, ADJ 55, X 1), alle (DET 409, PRON 110), denne (DET 389, PRON 19, X 1), noko (PRON 274, DET 183, NUM 23), dette (PRON 472, DET 161, X 2)

Morphology

The form / lemma ratio of DET is 1.545455 (the average of all parts of speech is 1.346455).

The 1st highest number of forms (6) was observed with the lemma “ein”: ei, ein, eir, eit, eitt, en.

The 2nd highest number of forms (5) was observed with the lemma “eigen”: egen, eiga, eige, eigen, eigne.

The 3rd highest number of forms (5) was observed with the lemma “nokon”: noka, noko, nokon, nokor, nokre.

DET occurs with 6 features: PronType (14850; 99% instances), Gender (11269; 75% instances), Number (2971; 20% instances), Definite (548; 4% instances), Polarity (120; 1% instances), Case (5; 0% instances)

DET occurs with 14 feature-value pairs: Case=Gen, Definite=Ind, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Tot

DET occurs with 36 feature combinations. The most frequent feature combination is Gender=Masc|PronType=Art (2568 tokens). Examples: ein, en

Relations

DET nodes are attached to their parents using 20 different relations: det (14115; 94% instances), obl (279; 2% instances), nsubj (150; 1% instances), nmod (98; 1% instances), conj (93; 1% instances), obj (86; 1% instances), root (72; 0% instances), xcomp (21; 0% instances), flat:name (19; 0% instances), appos (16; 0% instances), nsubj:outer (10; 0% instances), dislocated (7; 0% instances), ccomp (6; 0% instances), nsubj:pass (6; 0% instances), iobj (4; 0% instances), compound (2; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (12904; 86% instances), ADJ (892; 6% instances), VERB (425; 3% instances), PROPN (213; 1% instances), PRON (201; 1% instances), DET (180; 1% instances), NUM (78; 1% instances), (72; 0% instances), ADV (14; 0% instances), SCONJ (7; 0% instances), ADP (1; 0% instances), X (1; 0% instances)

14002 (93%) DET nodes are leaves.

649 (4%) DET nodes have one child.

179 (1%) DET nodes have two children.

158 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 8.

Children of DET nodes are attached using 26 different relations: case (335; 20% instances), nmod (252; 15% instances), punct (179; 11% instances), advmod (159; 9% instances), det (150; 9% instances), obl (122; 7% instances), cop (98; 6% instances), cc (88; 5% instances), nsubj (76; 4% instances), conj (51; 3% instances), acl:relcl (39; 2% instances), advcl (39; 2% instances), amod (29; 2% instances), mark (18; 1% instances), appos (11; 1% instances), nmod:poss (9; 1% instances), expl (7; 0% instances), flat:name (7; 0% instances), aux (5; 0% instances), acl (4; 0% instances), csubj (3; 0% instances), flat (3; 0% instances), nummod (3; 0% instances), parataxis (2; 0% instances), dislocated (1; 0% instances), reparandum (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: NOUN (342; 20% instances), ADP (339; 20% instances), DET (180; 11% instances), PUNCT (179; 11% instances), AUX (103; 6% instances), ADJ (101; 6% instances), ADV (98; 6% instances), CCONJ (88; 5% instances), VERB (86; 5% instances), PRON (85; 5% instances), PROPN (45; 3% instances), SCONJ (17; 1% instances), NUM (14; 1% instances), PART (14; 1% instances)