
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: PRON

There are 49 PRON lemmas (0%), 72 PRON types (0%) and 22645 PRON tokens (7%). Out of 17 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 6 in number of tokens.
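Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the helper names `iter_tokens` and `pos_counts` and the sample sentence are hypothetical, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are excluded from the counts.

```python
SAMPLE = (
    "# text = Jeg ser det.\n"
    "1\tJeg\tjeg\tPRON\t_\tCase=Nom|Number=Sing|Person=1|PronType=Prs\t2\tnsubj\t_\t_\n"
    "2\tser\tse\tVERB\t_\tMood=Ind|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_\n"
    "3\tdet\tdet\tPRON\t_\tGender=Neut|Number=Sing|Person=3|PronType=Prs\t2\tobj\t_\t_\n"
    "4\t.\t$.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def iter_tokens(conllu_text):
    """Yield the 10 CoNLL-U columns of every ordinary token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens with this tag, all tokens)."""
    lemmas, types = set(), set()
    tagged = total = 0
    for cols in iter_tokens(conllu_text):
        total += 1
        if cols[3] == upos:  # column 4 is UPOS
            tagged += 1
            types.add(cols[1])   # column 2 is FORM (assumed case-sensitive)
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tagged, total


print(pos_counts(SAMPLE, "PRON"))
```

On the full treebank, the first three values would correspond to the lemma, type and token counts reported above, and the ratio of the third to the fourth gives the token percentage.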

The 10 most frequent PRON lemmas: det, jeg, han, vi, de, seg, sin, hun, du, dette

The 10 most frequent PRON types: det, jeg, han, vi, de, seg, hun, du, dette, man

The 10 most frequent ambiguous lemmas: det (PRON 5440, DET 1116, X 3), jeg (PRON 2795, NOUN 4), vi (PRON 2214, NOUN 1), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), seg (PRON 1231, X 1), du (PRON 799, NOUN 1), dette (PRON 587, DET 171), man (PRON 479, NOUN 1, X 1), den (DET 1494, PRON 437), min (PRON 335, DET 1, X 1)

The 10 most frequent ambiguous types: det (PRON 3781, DET 931, X 3), jeg (PRON 1466, NOUN 4), vi (PRON 1246, NOUN 1), de (DET 1170, PRON 1054, PROPN 11, X 6, ADV 1), seg (PRON 1231, X 1), du (PRON 567, NOUN 1), dette (PRON 375, DET 142), man (PRON 428, NOUN 1, X 1), meg (PRON 441, ADP 1), den (DET 1275, PRON 354)

Morphology

The form / lemma ratio of PRON is 1.469388 (the average of all parts of speech is 1.381699).
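The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given at the top of this page. A short check, assuming that this is how the figure is defined:

```python
pron_types = 72    # PRON types reported on this page
pron_lemmas = 49   # PRON lemmas reported on this page

ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 1.469388
```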

The highest number of forms (5) was observed with the lemma “jeg”: Eg, jeg, meg, mig, mæ.

The second highest number of forms (4) was observed with the lemma “din”: di, din, dine, ditt.

The third highest number of forms (4) was observed with the lemma “min”: mi, min, mine, mitt.

PRON occurs with 8 features: PronType (22598; 100% instances), Number (20916; 92% instances), Person (18148; 80% instances), Case (12386; 55% instances), Gender (11798; 52% instances), Animacy (9745; 43% instances), Poss (2099; 9% instances), Reflex (1231; 5% instances)

PRON occurs with 22 feature-value pairs: Animacy=Hum, Case=Acc, Case=Gen, Case=Gen,Nom, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art,Prs, PronType=Ind,Prs, PronType=Int, PronType=Prs, PronType=Prs,Tot, PronType=Rcp, Reflex=Yes

PRON occurs with 39 feature combinations. The most frequent feature combination is Gender=Neut|Number=Sing|Person=3|PronType=Prs (6239 tokens). Examples: det, dette, alt, slikt, sånt, intet, dét, et
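The three levels of counting used here (features, feature-value pairs, full feature combinations) can be illustrated on the FEATS column of a CoNLL-U file. This is a sketch under assumptions: `parse_feats` is a hypothetical helper, the four FEATS strings are illustrative rather than taken from the treebank, and a multi-valued feature such as `Case=Gen,Nom` is treated as a single pair, as in the list above.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string such as 'Number=Sing|Person=3' into a dict."""
    if feats == "_":
        return {}  # '_' marks a token without features
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Illustrative FEATS values for four PRON tokens (not real treebank data).
feats_column = [
    "Gender=Neut|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|Number=Sing|Person=1|PronType=Prs",
    "Animacy=Hum|Case=Nom|Number=Sing|Person=1|PronType=Prs",
    "_",
]

feature_counts = Counter()  # e.g. PronType
pair_counts = Counter()     # e.g. PronType=Prs
combo_counts = Counter()    # the whole FEATS string

for feats in feats_column:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{name}={value}" for name, value in parsed.items())
    combo_counts[feats] += 1

print(feature_counts.most_common())
print(len(pair_counts), "feature-value pairs,", len(combo_counts), "combinations")
```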

Relations

PRON nodes are attached to their parents using 19 different relations: nsubj (11925; 53% instances), expl (3240; 14% instances), obj (2621; 12% instances), det (2112; 9% instances), obl (1224; 5% instances), nmod (534; 2% instances), iobj (386; 2% instances), root (226; 1% instances), conj (118; 1% instances), dislocated (60; 0% instances), appos (44; 0% instances), ccomp (43; 0% instances), nsubj:outer (43; 0% instances), xcomp (39; 0% instances), compound (10; 0% instances), flat:name (9; 0% instances), parataxis (5; 0% instances), csubj (4; 0% instances), reparandum (2; 0% instances)
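The percentages in this list appear to be shares of the 22645 PRON tokens, rounded to whole numbers. A quick check on the three most frequent relations, with that denominator as the stated assumption:

```python
pron_tokens = 22645  # total PRON tokens reported on this page

relation_counts = {"nsubj": 11925, "expl": 3240, "obj": 2621}

# Share of each relation among all PRON tokens, rounded to a whole percent.
shares = {rel: round(100 * n / pron_tokens) for rel, n in relation_counts.items()}
print(shares)  # {'nsubj': 53, 'expl': 14, 'obj': 12}
```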

Parents of PRON nodes belong to 12 different parts of speech: VERB (15989; 71% instances), NOUN (3830; 17% instances), ADJ (1876; 8% instances), PRON (241; 1% instances), no part of speech, i.e. the artificial root (226; 1% instances), ADV (146; 1% instances), PROPN (125; 1% instances), DET (107; 0% instances), ADP (54; 0% instances), NUM (46; 0% instances), AUX (4; 0% instances), CCONJ (1; 0% instances)

19609 (87%) PRON nodes are leaves.

1915 (8%) PRON nodes have one child.

672 (3%) PRON nodes have two children.

449 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.
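The child-degree figures above can be derived from the HEAD column: a node's degree is the number of tokens in the same sentence whose head is that node. The sketch below uses a hypothetical five-token sentence given as `(id, upos, head)` triples; the function name `pron_child_degrees` is likewise an assumption, not part of any UD tool.

```python
from collections import Counter

# One hypothetical sentence: (ID, UPOS, HEAD); HEAD 0 marks the root.
sentence = [
    (1, "PRON", 2),   # leaf
    (2, "VERB", 0),
    (3, "ADP", 4),    # case marker attached to the pronoun below
    (4, "PRON", 2),   # has one child (token 3)
    (5, "PUNCT", 2),
]


def pron_child_degrees(sentence, upos="PRON"):
    """Map child degree -> number of nodes with that tag and that many children."""
    n_children = Counter(head for _, _, head in sentence)
    return Counter(n_children[tok_id] for tok_id, tag, _ in sentence if tag == upos)


degrees = pron_child_degrees(sentence)
print(degrees[0], "leaves,", degrees[1], "with one child")

# The four buckets reported on this page add up to the PRON token count.
print(19609 + 1915 + 672 + 449)  # 22645
```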

Children of PRON nodes are attached using 29 different relations: case (1839; 36% instances), acl:relcl (727; 14% instances), punct (635; 12% instances), cop (272; 5% instances), advmod (264; 5% instances), det (244; 5% instances), nmod (207; 4% instances), conj (165; 3% instances), nsubj (160; 3% instances), cc (137; 3% instances), acl (100; 2% instances), expl (93; 2% instances), amod (76; 1% instances), obl (71; 1% instances), appos (46; 1% instances), mark (44; 1% instances), advcl (21; 0% instances), aux (21; 0% instances), discourse (10; 0% instances), xcomp (10; 0% instances), csubj (4; 0% instances), dislocated (4; 0% instances), reparandum (4; 0% instances), flat:name (2; 0% instances), nsubj:outer (2; 0% instances), ccomp (1; 0% instances), flat (1; 0% instances), nummod (1; 0% instances), obj (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1853; 36% instances), VERB (867; 17% instances), PUNCT (635; 12% instances), NOUN (372; 7% instances), AUX (293; 6% instances), DET (255; 5% instances), PRON (241; 5% instances), ADV (194; 4% instances), CCONJ (138; 3% instances), ADJ (135; 3% instances), PROPN (75; 1% instances), PART (56; 1% instances), SCONJ (34; 1% instances), INTJ (10; 0% instances), NUM (4; 0% instances)