Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: DET
There are 43 DET lemmas (0%), 67 DET types (0%) and 14987 DET tokens (5%).
Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 9 in number of types and 9 in number of tokens.
The 10 most frequent DET lemmas: ein, den, dei, det, annan, all, denne, nokon, eigen, slik
The 10 most frequent DET types: ein, den, eit, dei, ei, det, andre, alle, denne, anna
The 10 most frequent ambiguous lemmas: ein (DET 5926, PRON 824), den (DET 1927, PRON 149, X 12, PROPN 1), dei (PRON 1668, DET 1515), det (PRON 5531, DET 1337, X 16), all (DET 583, X 3, ADV 2), denne (DET 442, PRON 23, X 1), nokon (DET 430, PRON 74), slik (ADV 368, DET 238), same (DET 227, ADV 1), kvar (DET 211, ADV 27)
The 10 most frequent ambiguous types: ein (DET 2404, PRON 728, ADP 1), den (DET 1665, PRON 115, X 12), dei (PRON 1439, DET 1346), ei (DET 1288, PART 2, PRON 2), det (PRON 4104, DET 1165, X 16, ADV 1), andre (DET 472, ADJ 55, X 1), alle (DET 409, PRON 110), denne (DET 389, PRON 19, X 1), noko (PRON 274, DET 183, NUM 23), dette (PRON 472, DET 161, X 2)
- ein
- den
- dei
- ei
- det
- PRON 4104: Slik gjer eg det :
- DET 1165: … Er ofte det beste .
- X 16: den trenger det . »
- ADV 1: Dette samanfallet , pluss det faktum at Ronald Reagan gjekk i sin andre presidentperiode i 1984 , fekk nyleg ein filmkritikar i New York til å skriva at « ein ny episode i Terminator-franchisen er på veg , det må det bety at ein republikansk president stiller til attval » .
- andre
- alle
- denne
- noko
- dette
Morphology
The form / lemma ratio of DET is 1.558140 (the average of all parts of speech is 1.346300).
The 1st highest number of forms (6) was observed with the lemma “ein”: ei, ein, eir, eit, eitt, en.
The 2nd highest number of forms (5) was observed with the lemma “eigen”: egen, eiga, eige, eigen, eigne.
The 3rd highest number of forms (5) was observed with the lemma “nokon”: noka, noko, nokon, nokor, nokre.
DET occurs with 6 features: PronType (14849; 99% instances), Gender (11269; 75% instances), Number (2971; 20% instances), Definite (548; 4% instances), Polarity (120; 1% instances), Case (5; 0% instances)
DET occurs with 15 feature-value pairs: Case=Gen, Definite=Ind, Gender=Com, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Polarity=Neg, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Tot
DET occurs with 37 feature combinations.
The most frequent feature combination is Gender=Masc|PronType=Art (3848 tokens).
Examples: ein, den, en
Relations
DET nodes are attached to their parents using 20 different relations: det (14114; 94% instances), obl (279; 2% instances), nsubj (150; 1% instances), nmod (98; 1% instances), conj (93; 1% instances), obj (86; 1% instances), root (72; 0% instances), xcomp (21; 0% instances), flat:name (19; 0% instances), appos (16; 0% instances), nsubj:outer (10; 0% instances), dislocated (7; 0% instances), ccomp (6; 0% instances), nsubj:pass (6; 0% instances), iobj (4; 0% instances), compound (2; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)
Parents of DET nodes belong to 12 different parts of speech: NOUN (12908; 86% instances), ADJ (893; 6% instances), VERB (425; 3% instances), PROPN (211; 1% instances), PRON (201; 1% instances), DET (174; 1% instances), NUM (78; 1% instances), (72; 0% instances), ADV (14; 0% instances), SCONJ (7; 0% instances), X (3; 0% instances), ADP (1; 0% instances)
14208 (95%) DET nodes are leaves.
453 (3%) DET nodes have one child.
169 (1%) DET nodes have two children.
157 (1%) DET nodes have three or more children.
The highest child degree of a DET node is 8.
Children of DET nodes are attached using 26 different relations: case (324; 22% instances), nmod (252; 17% instances), punct (178; 12% instances), det (148; 10% instances), cop (98; 7% instances), cc (86; 6% instances), nsubj (76; 5% instances), advmod (51; 3% instances), obl (51; 3% instances), conj (50; 3% instances), acl:relcl (39; 3% instances), amod (29; 2% instances), advcl (18; 1% instances), mark (18; 1% instances), appos (11; 1% instances), nmod:poss (9; 1% instances), expl (7; 0% instances), flat:name (7; 0% instances), aux (5; 0% instances), acl (4; 0% instances), csubj (3; 0% instances), nummod (3; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), dislocated (1; 0% instances), reparandum (1; 0% instances)
Children of DET nodes belong to 14 different parts of speech: ADP (328; 22% instances), NOUN (304; 21% instances), PUNCT (178; 12% instances), DET (174; 12% instances), AUX (103; 7% instances), CCONJ (86; 6% instances), PRON (69; 5% instances), VERB (65; 4% instances), ADJ (62; 4% instances), ADV (33; 2% instances), PROPN (32; 2% instances), SCONJ (17; 1% instances), NUM (12; 1% instances), PART (10; 1% instances)