Statistics of NUM in UD_Dutch-LassySmall

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: `NUM`

There are 1026 NUM lemmas (4%), 1052 NUM types (3%) and 7702 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: één, twee, 1, drie, 2, 2004, 2005, vier, 3, 2003

The 10 most frequent NUM types: twee, één, 1, een, drie, 2, 2004, 2005, vier, 3

The 10 most frequent ambiguous lemmas: één (ADJ 572, NUM 371, PROPN 6), twee (NUM 317, ADJ 128), 1 (NUM 159, ADJ 13, PROPN 3, X 1), drie (NUM 151, ADJ 51), 2 (NUM 108, ADJ 14, X 6, SYM 2, PROPN 1), vier (NUM 94, ADJ 27), 3 (NUM 93, ADJ 11), 2003 (NUM 89, X 1), 10 (NUM 85, ADJ 5, PROPN 1, X 1), 5 (NUM 77, ADJ 1, PROPN 1, X 1)

The 10 most frequent ambiguous types: één (NUM 195, PROPN 6), 1 (NUM 159, PROPN 3, X 1), een (DET 5510, NUM 126, CCONJ 1), 2 (NUM 108, X 6, SYM 2, PROPN 1), vier (NUM 84, VERB 1), 2003 (NUM 89, X 1), 10 (NUM 85, PROPN 1, X 1), 5 (NUM 77, PROPN 1, X 1), 7 (NUM 77, SYM 1), 4 (NUM 76, X 5)

één
- NUM 195: België is één van de dichtstbevolkte landen in Europa .
- PROPN 6: « Vive le Tour » , wieleromkadering voor Sporza op één ( 2005 )
1
- NUM 159: Deze scheiding voltrok zich op 1 januari 1995 .
- PROPN 3: Britse Mark 1 tank ; tank C15 op 26 september 1916
- X 1: Ook zijn er inmiddels avonturenspellen in de wereld van Star Trek , zoals Star Trek : Elite Force 1 & 2 .
een
- DET 5510: Het Vlaams Gewest is een deel van België , met als hoofdstad Brussel .
- NUM 126: Ze zou een van zijn favoriete modellen worden .
- CCONJ 1: De camera zoomt uit een we zien nu dat de legerkazerne een decor is , en dat een filmcamera het hele tafereel vastlegt .
2
- NUM 108: 2 bestuurlijke arrondissementen : Halle-Vilvoorde , Leuven
- X 6: Één nummer van dit album , de ballade When 2 R In Love , is eveneens op Lovesexy te vinden .
- SYM 2: Daarom deed Montgomery aan Generaal Browning de toezegging dat het Britse 2e Leger op de tweede dag van de operatie in Arnhem zal zijn ( 2 ) .
- PROPN 1: In 2006 is Urbanus ook te zien op de Vlaamse zender één en het Nederlandse Nederland 2 met een real-life-soap genaamd Urbain .
vier
- NUM 84: Na vier dagen komen de larven uit de eitjes .
- VERB 1: ( Willy Vandersteen in : “ Ik vier het elke dag , 65 “ door Erik Durnez , Standaard Uitgeverij , 1978 )
2003
- NUM 89: Van 1993 tot 2003 verscheen er een weekblad rond hen .
- X 1: Intégrale 2003
10
- NUM 85: Vlaams Belang : 10
- PROPN 1: In 1982 haalde zijn dubbelelpee « 10 jaar Urbanus live » platina , en zijn single « Quand les Zosiaux chantent dans les bois » ( “ Als de vogeltjens zingen in ‘t woud “ ) was wederom een groot succes .
- X 1: De surrealistische Dylan treedt opnieuw op in I Shall Be Free # 10 en Motorpsycho Nightmare , met een bepaalde humor die kenmerkend is voor zijn gehele loopbaan .
5
- NUM 77: N-VA : 5
- PROPN 1: Deze tijd , meer dan honderd jaar voor de oorspronkelijke serie , is de tijd waarop de mensheid begint aan het onderzoeken van de ruimte vanuit ruimteschepen , nu zij schepen ontwikkeld heeft die warp 5 kunnen vliegen .
- X 1: Enkele hoogtepunten zijn tegenwoordig verkrijgbaar op The Bootleg Series 5 .
7
- NUM 77: Ecolo : 7 zetels
- SYM 1: 7 .
4
- NUM 76: Regering van de Duitstalige Gemeenschap ( 4 ministers )
- X 5: Prince leverde voor Live Aid het nummer 4 The Tears In Your Eyes aan , dat op het album van USA For Africa terecht kwam .

Morphology

The form / lemma ratio of NUM is 1.025341 (the average of all parts of speech is 1.223065).

The 1st highest number of forms (5) was observed with the lemma “één”: Eén, een, eentje, en, één.

The 2nd highest number of forms (3) was observed with the lemma “1975”: (1975), 1955-1975, 1975.

The 3rd highest number of forms (2) was observed with the lemma “150”: 125-150, 150.

NUM occurs with 1 features: ExtPos (944; 12% instances)

NUM occurs with 3 feature-value pairs: ExtPos=ADP, ExtPos=PRON, ExtPos=PROPN

NUM occurs with 4 feature combinations. The most frequent feature combination is _ (6758 tokens). Examples: twee, één, een, drie, 2004, 2005, vier, 2003, 2006, 2

Relations

NUM nodes are attached to their parents using 22 different relations: nummod (2505; 33% instances), obl (1804; 23% instances), nmod (742; 10% instances), flat (676; 9% instances), root (653; 8% instances), appos (344; 4% instances), conj (322; 4% instances), parataxis (227; 3% instances), nsubj (86; 1% instances), fixed (74; 1% instances), acl (65; 1% instances), advcl (37; 0% instances), obj (37; 0% instances), obl:arg (34; 0% instances), det (26; 0% instances), orphan (20; 0% instances), xcomp (20; 0% instances), nsubj:pass (15; 0% instances), acl:relcl (11; 0% instances), ccomp (2; 0% instances), amod (1; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 14 different parts of speech: NOUN (2845; 37% instances), VERB (1969; 26% instances), NUM (1003; 13% instances), PROPN (703; 9% instances), (653; 8% instances), SYM (228; 3% instances), ADJ (110; 1% instances), DET (73; 1% instances), X (66; 1% instances), ADV (21; 0% instances), ADP (15; 0% instances), PRON (14; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)

3183 (41%) NUM nodes are leaves.

2455 (32%) NUM nodes have one child.

900 (12%) NUM nodes have two children.

1164 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 11.

Children of NUM nodes are attached using 25 different relations: case (2533; 31% instances), punct (1851; 22% instances), flat (1543; 19% instances), parataxis (642; 8% instances), nmod (384; 5% instances), conj (330; 4% instances), fixed (209; 3% instances), cc (165; 2% instances), advmod (162; 2% instances), cop (94; 1% instances), nsubj (94; 1% instances), amod (86; 1% instances), det (46; 1% instances), mark (32; 0% instances), appos (22; 0% instances), acl:relcl (19; 0% instances), obl (19; 0% instances), acl (14; 0% instances), advcl (11; 0% instances), nmod:poss (9; 0% instances), orphan (4; 0% instances), cc:preconj (3; 0% instances), nummod (3; 0% instances), aux (1; 0% instances), csubj (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (2538; 31% instances), PUNCT (1851; 22% instances), PROPN (1374; 17% instances), NUM (1003; 12% instances), NOUN (528; 6% instances), CCONJ (219; 3% instances), ADV (167; 2% instances), PRON (107; 1% instances), AUX (95; 1% instances), ADJ (88; 1% instances), SYM (74; 1% instances), VERB (70; 1% instances), X (70; 1% instances), DET (67; 1% instances), SCONJ (26; 0% instances)

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM

Morphology

Relations

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: `NUM`