home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: PRON

There are 50 PRON lemmas (0%), 73 PRON types (0%) and 25962 PRON tokens (8%). Out of 17 observed tags, the rank of PRON is: 11 in number of lemmas, 10 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: det, som, jeg, han, vi, de, seg, sin, hun, du

The 10 most frequent PRON types: det, som, jeg, han, vi, de, seg, hun, du, dette

The 10 most frequent ambiguous lemmas: det (PRON 5440, DET 1116, X 3), som (PRON 3317, SCONJ 784, ADP 654, X 4), jeg (PRON 2795, NOUN 4), vi (PRON 2214, NOUN 1), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), seg (PRON 1231, X 1), du (PRON 799, NOUN 1), dette (PRON 587, DET 171), man (PRON 479, NOUN 1, X 1), den (DET 1494, PRON 437)

The 10 most frequent ambiguous types: det (PRON 3781, DET 931, X 3), som (PRON 3317, SCONJ 718, ADP 618, X 4), jeg (PRON 1466, NOUN 4), vi (PRON 1246, NOUN 1), de (DET 1170, PRON 1054, PROPN 11, X 6, ADV 1), seg (PRON 1231, X 1), du (PRON 567, NOUN 1), dette (PRON 375, DET 142), man (PRON 428, NOUN 1, X 1), meg (PRON 441, ADP 1)

Morphology

The form / lemma ratio of PRON is 1.460000 (the average of all parts of speech is 1.381903).

The 1st highest number of forms (5) was observed with the lemma “jeg”: Eg, jeg, meg, mig, mæ.

The 2nd highest number of forms (4) was observed with the lemma “din”: di, din, dine, ditt.

The 3rd highest number of forms (4) was observed with the lemma “min”: mi, min, mine, mitt.

PRON occurs with 9 features: PronType (25962; 100% instances), Number (20916; 81% instances), Person (18148; 70% instances), Case (12386; 48% instances), Gender (11798; 45% instances), Animacy (9745; 38% instances), Poss (2099; 8% instances), Reflex (1231; 5% instances), Polarity (151; 1% instances)

PRON occurs with 26 feature-value pairs: Animacy=Hum, Case=Acc, Case=Gen, Case=Gen,Nom, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, PronType=Art,Prs, PronType=Ind,Prs, PronType=Int, PronType=Neg, PronType=Neg,Prs, PronType=Prs, PronType=Prs,Tot, PronType=Rcp, PronType=Rel, Reflex=Yes

PRON occurs with 42 feature combinations. The most frequent feature combination is Gender=Neut|Number=Sing|Person=3|PronType=Prs (6239 tokens). Examples: det, dette, alt, slikt, sånt, intet, dét, et

Relations

PRON nodes are attached to their parents using 22 different relations: nsubj (14138; 54% instances), expl (3237; 12% instances), obj (2918; 11% instances), nmod (2746; 11% instances), obl (1200; 5% instances), nsubj:pass (627; 2% instances), iobj (482; 2% instances), root (243; 1% instances), conj (118; 0% instances), appos (86; 0% instances), xcomp (52; 0% instances), det (28; 0% instances), advcl (18; 0% instances), compound (11; 0% instances), ccomp (9; 0% instances), flat:name (9; 0% instances), orphan (9; 0% instances), acl (8; 0% instances), csubj (7; 0% instances), acl:relcl (6; 0% instances), parataxis (5; 0% instances), reparandum (5; 0% instances)

Parents of PRON nodes belong to 11 different parts of speech: VERB (18809; 72% instances), NOUN (3972; 15% instances), ADJ (2126; 8% instances), PRON (267; 1% instances), (243; 1% instances), ADV (166; 1% instances), PROPN (144; 1% instances), DET (113; 0% instances), ADP (73; 0% instances), NUM (48; 0% instances), CCONJ (1; 0% instances)

22739 (88%) PRON nodes are leaves.

2092 (8%) PRON nodes have one child.

676 (3%) PRON nodes have two children.

455 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.

Children of PRON nodes are attached using 29 different relations: case (1794; 33% instances), acl:relcl (749; 14% instances), punct (632; 12% instances), cop (272; 5% instances), advmod (258; 5% instances), acl (250; 5% instances), det (244; 5% instances), nmod (183; 3% instances), conj (165; 3% instances), nsubj (164; 3% instances), cc (137; 3% instances), expl (93; 2% instances), appos (87; 2% instances), mark (87; 2% instances), acl:cleft (80; 1% instances), obl (68; 1% instances), advcl (24; 0% instances), aux (21; 0% instances), amod (19; 0% instances), parataxis (18; 0% instances), discourse (10; 0% instances), orphan (10; 0% instances), xcomp (6; 0% instances), csubj (4; 0% instances), reparandum (4; 0% instances), flat:name (3; 0% instances), ccomp (1; 0% instances), nummod (1; 0% instances), obj (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (1810; 34% instances), VERB (892; 17% instances), PUNCT (632; 12% instances), NOUN (458; 9% instances), AUX (293; 5% instances), PRON (267; 5% instances), DET (258; 5% instances), ADV (203; 4% instances), ADJ (190; 4% instances), CCONJ (138; 3% instances), PROPN (94; 2% instances), SCONJ (76; 1% instances), PART (56; 1% instances), INTJ (10; 0% instances), NUM (7; 0% instances), X (1; 0% instances)