home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: DET

There are 43 DET lemmas (0%), 67 DET types (0%) and 15006 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 11 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: ein, den, dei, det, annan, all, denne, nokon, eigen, slik

The 10 most frequent DET types: ein, den, eit, dei, ei, det, andre, alle, denne, anna

The 10 most frequent ambiguous lemmas: ein (DET 5926, PRON 824), den (DET 1927, PRON 148, X 12, PROPN 1), dei (PRON 1668, DET 1515), det (PRON 5532, DET 1337, X 16), all (DET 583, X 3, ADV 2), denne (DET 442, PRON 23, X 1), nokon (DET 430, PRON 74), slik (ADV 368, DET 238), same (DET 227, ADV 1), kvar (DET 211, ADV 27)

The 10 most frequent ambiguous types: ein (DET 2404, PRON 728, ADP 1), den (DET 1665, PRON 115, X 12), dei (PRON 1439, DET 1346), ei (DET 1287, PART 2, PRON 2, NUM 1), det (PRON 4104, DET 1165, X 16, ADV 1), andre (DET 472, ADJ 55, X 1), alle (DET 409, PRON 110), denne (DET 389, PRON 19, X 1), noko (PRON 274, DET 206), dette (PRON 472, DET 161, X 2)

Morphology

The form / lemma ratio of DET is 1.558140 (the average of all parts of speech is 1.352830).

The 1st highest number of forms (6) was observed with the lemma “ein”: ei, ein, eir, eit, eitt, en.

The 2nd highest number of forms (5) was observed with the lemma “eigen”: egen, eiga, eige, eigen, eigne.

The 3rd highest number of forms (5) was observed with the lemma “nokon”: noka, noko, nokon, nokor, nokre.

DET occurs with 6 features: PronType (15006; 100% instances), Number (14293; 95% instances), Gender (11254; 75% instances), Definite (1024; 7% instances), Polarity (138; 1% instances), Case (5; 0% instances)

DET occurs with 16 feature-value pairs: Case=Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 36 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Art (2565 tokens). Examples: ein, en

Relations

DET nodes are attached to their parents using 22 different relations: det (14125; 94% instances), obl (276; 2% instances), nsubj (158; 1% instances), conj (93; 1% instances), nmod (92; 1% instances), obj (87; 1% instances), root (75; 0% instances), appos (21; 0% instances), xcomp (20; 0% instances), flat:name (19; 0% instances), acl (7; 0% instances), nsubj:pass (7; 0% instances), acl:relcl (6; 0% instances), ccomp (4; 0% instances), iobj (4; 0% instances), orphan (3; 0% instances), advcl (2; 0% instances), compound (2; 0% instances), csubj (2; 0% instances), expl (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (12948; 86% instances), ADJ (862; 6% instances), VERB (448; 3% instances), PRON (203; 1% instances), PROPN (189; 1% instances), DET (180; 1% instances), NUM (78; 1% instances), (75; 0% instances), ADV (14; 0% instances), SCONJ (7; 0% instances), ADP (1; 0% instances), X (1; 0% instances)

14016 (93%) DET nodes are leaves.

648 (4%) DET nodes have one child.

185 (1%) DET nodes have two children.

157 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 8.

Children of DET nodes are attached using 25 different relations: case (322; 19% instances), nmod (263; 15% instances), punct (177; 10% instances), det (150; 9% instances), advmod (148; 9% instances), obl (128; 8% instances), cop (98; 6% instances), cc (88; 5% instances), nsubj (82; 5% instances), conj (51; 3% instances), acl:relcl (39; 2% instances), advcl (39; 2% instances), amod (29; 2% instances), mark (25; 1% instances), appos (13; 1% instances), flat:name (10; 1% instances), expl (7; 0% instances), orphan (7; 0% instances), acl (6; 0% instances), aux (5; 0% instances), parataxis (4; 0% instances), csubj (3; 0% instances), nummod (3; 0% instances), acl:cleft (2; 0% instances), reparandum (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: NOUN (356; 21% instances), ADP (335; 20% instances), DET (180; 11% instances), PUNCT (177; 10% instances), AUX (103; 6% instances), ADJ (95; 6% instances), VERB (94; 6% instances), PRON (93; 5% instances), ADV (89; 5% instances), CCONJ (88; 5% instances), PROPN (38; 2% instances), SCONJ (25; 1% instances), NUM (14; 1% instances), PART (13; 1% instances)