Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: X
There are 467 X lemmas (2%), 467 X types (1%) and 795 X tokens (0%).
Out of 17 observed tags, X ranks 6th in number of lemmas, 6th in number of types and 15th in number of tokens.
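The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` is hypothetical, types are assumed to be case-sensitive word forms, and multiword-token ranges and empty nodes are assumed to be excluded from the counts. The toy sentence is invented for illustration.

```python
def pos_counts(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag.

    Assumes standard 10-column CoNLL-U: ID, FORM, LEMMA, UPOS, ...
    """
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


# Toy sentence, invented for illustration (not copied from the treebank).
rows = [
    ("1", "What", "what", "X", "0", "root"),
    ("2", "lies", "lies", "X", "1", "flat:foreign"),
    ("3", "beyond", "beyond", "X", "1", "flat:foreign"),
    ("4", ",", "$,", "X", "1", "flat:foreign"),
    ("5", "and", "and", "X", "1", "flat:foreign"),
    ("6", "what", "what", "X", "1", "flat:foreign"),
]
sample = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, tag, head, rel in rows
)

print(pos_counts(sample, "X"))  # (5, 6, 6)
```

In the toy sample, "What" and "what" count as two types but share one lemma, which is why the lemma count is lower than the type count.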
The 10 most frequent X lemmas: $,, the, of, and, in, to, you, a, i, is
The 10 most frequent X types: ,, the, of, and, in, to, you, a, i, is
The 10 most frequent ambiguous lemmas: $, (PUNCT 11516, X 34), the (X 31, DET 4), of (X 25, ADV 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 356, X 11), you (X 9, PRON 1), a (X 8, NOUN 7, INTJ 1), i (ADP 8577, ADV 54, X 3, PROPN 1), is (NOUN 13, X 7, VERB 1)
The 10 most frequent ambiguous types: , (PUNCT 11516, X 34), the (X 31, DET 4), of (X 25, ADV 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 331, X 11), you (X 9, PRON 1), a (X 8, ADJ 5, NOUN 2, INTJ 1), i (ADP 7800, ADV 53, X 3), is (X 7, NOUN 4, VERB 1)
- ,
- the
- of
  - X 25: - President Annan of the world , erklærte Jean til klampetrapp .
  - ADV 1: Sist jeg sjekka var Tupperware homeparties noe veldig amerikansk og veldig kjedelig som utdaterte husmødre holdt på med når de innimellom ikke hadde syklubb ( speaking of ; syklubb er vel også noe vi plutselig driver med i fullt alvor , eller ? ) .
- and
  - X 20: What lies beyond , and what lay before ?
  - CCONJ 1: En geriljakrig handler om « hearts and minds » og om presise nålestikk mot geriljaens våpen .
  - NOUN 1: Med et utvidet « Gjærbägst and the Homöcidal Sirupsnipps » -konsept og et trailerlass med plast ble stua raskt forvandlet til et improvisert øvingslokale med høy jallafaktor og lyden skrudd opp til 11 .
- in
- to
- you
- a
  - X 8: « It seemed like a great idea at the time . »
  - ADJ 5: Den gir følgelig hjemmel for bestemmelser bl a om fredning , jakt , fangst og fiske , turisme og ymse næringsvirksomhet .
  - NOUN 2: 5 Saltvannsfiskeloven ( lov 3 juni 1983 nr 40 om saltvannsfiske m.v. ) gjelder i fiskerisonen ved Jan Mayen ( jf lovens § 1 første ledd bokstav a ) .
  - INTJ 1: Og da var det sånn « a , hvordan står det til med tingene ellers i livet , har du noen utestående regninger og sånn ? »
- i
- is
Morphology
The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.381641).
No X lemma was observed with more than one form. The first three lemmas in the ranking, each with a single form, are “$,” (form: ,), “$-” (form: -) and “$.” (form: .).
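A form / lemma ratio of this kind can be sketched as the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. This definition is an assumption: it is consistent with the figures on this page (467 types over 467 lemmas gives 1.0), but the page does not spell out the exact formula. The function name `form_lemma_ratio` and the toy pairs are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples, one per token.
    Assumed definition: distinct (lemma, form) pairs / distinct lemmas.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0  # no tokens: report 0.0 rather than divide by zero
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Every lemma has exactly one form, as reported for X on this page.
invariant = [(",", "$,"), ("the", "the"), ("the", "the"), ("of", "of")]
print(form_lemma_ratio(invariant))  # 1.0

# Invented inflecting example: one lemma with two forms, one with one form.
inflecting = [("is", "be"), ("was", "be"), ("the", "the")]
print(form_lemma_ratio(inflecting))  # 1.5
```

A ratio of exactly 1.0 means no lemma in the class has more than one surface form, which fits a class dominated by uninflected foreign material.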
X occurs with 1 feature: Foreign (696; 88% instances)
X occurs with 1 feature-value pair: Foreign=Yes
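X occurs with 2 feature combinations. Since Foreign=Yes is the only feature-value pair, the second combination is the empty feature set, which covers the remaining 99 tokens.
The most frequent feature combination is Foreign=Yes (696 tokens).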
Examples: ,, the, and, in, to, you, of, a, i, it
Relations
X nodes are attached to their parents using 13 different relations: flat:foreign (590; 74% instances), flat:name (79; 10% instances), root (47; 6% instances), obl (14; 2% instances), obj (13; 2% instances), ccomp (10; 1% instances), nsubj (10; 1% instances), compound (9; 1% instances), xcomp (8; 1% instances), appos (6; 1% instances), nmod (6; 1% instances), conj (2; 0% instances), nsubj:pass (1; 0% instances)
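The percentages in the relation list are each count divided by the 795 X tokens, rounded to the nearest whole percent. A short sketch of that arithmetic, using the counts listed above (the function name `relation_shares` is hypothetical, and rounding to the nearest integer is an assumption that happens to match every figure shown):

```python
def relation_shares(counts):
    """Map each relation to its share of all instances, in whole percent."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


# Counts copied from the relation list for X nodes on this page.
x_relations = {
    "flat:foreign": 590, "flat:name": 79, "root": 47, "obl": 14,
    "obj": 13, "ccomp": 10, "nsubj": 10, "compound": 9, "xcomp": 8,
    "appos": 6, "nmod": 6, "conj": 2, "nsubj:pass": 1,
}

shares = relation_shares(x_relations)
print(sum(x_relations.values()))  # 795
print(shares["flat:foreign"])     # 74
print(shares["conj"])             # 0
```

The counts sum to 795, the number of X tokens, so every X node is accounted for by exactly one incoming relation. Relations shown as 0% are rare, not absent.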
Parents of X nodes belong to 8 different parts of speech, counting the artificial root (which has no part of speech) as one: X (590; 74% instances), PROPN (83; 10% instances), VERB (54; 7% instances), [root, no part of speech] (47; 6% instances), NOUN (18; 2% instances), ADJ (1; 0% instances), ADP (1; 0% instances), DET (1; 0% instances)
666 (84%) X nodes are leaves.
18 (2%) X nodes have one child.
8 (1%) X nodes have two children.
103 (13%) X nodes have three or more children.
The highest child degree of an X node is 46.
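The leaf and child-degree figures above (666 + 18 + 8 + 103 = 795) come from counting, for each X node, how many other nodes name it as their head. A minimal sketch for a single sentence, assuming the heads are available as a mapping from node ID to head ID; the function name `child_degree_buckets` and the toy tree are hypothetical:

```python
from collections import Counter


def child_degree_buckets(heads, targets):
    """Bucket target nodes by number of children.

    `heads` maps node ID -> head ID (0 for the root) within one sentence.
    `targets` lists the IDs of the nodes to classify (e.g. the X nodes).
    """
    # Each appearance of an ID as a head value is one child of that node.
    degree = Counter(heads.values())
    buckets = Counter()
    for node in targets:
        d = degree[node]
        if d == 0:
            buckets["leaf"] += 1
        elif d == 1:
            buckets["one"] += 1
        elif d == 2:
            buckets["two"] += 1
        else:
            buckets["three+"] += 1
    return dict(buckets)


# Invented tree: node 1 is the root with children 2, 3, 4; node 5 hangs off 2.
toy_heads = {1: 0, 2: 1, 3: 1, 4: 1, 5: 2}
print(child_degree_buckets(toy_heads, [1, 2, 3]))
```

Here node 1 falls in the "three+" bucket, node 2 in "one" and node 3 in "leaf". Summing such buckets over all sentences in a treebank would give totals comparable to those reported above.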
Children of X nodes are attached using 17 different relations: flat:foreign (590; 64% instances), punct (217; 24% instances), flat:name (55; 6% instances), case (21; 2% instances), appos (4; 0% instances), nmod:poss (4; 0% instances), obl (4; 0% instances), advmod (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), cop (2; 0% instances), nsubj (2; 0% instances), cc (1; 0% instances), det (1; 0% instances), mark (1; 0% instances)
Children of X nodes belong to 13 different parts of speech: X (590; 64% instances), PUNCT (217; 24% instances), PROPN (59; 6% instances), ADP (22; 2% instances), NOUN (11; 1% instances), ADJ (3; 0% instances), ADV (3; 0% instances), VERB (3; 0% instances), AUX (2; 0% instances), NUM (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)