NUM

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

`NUM`: numeral

The English `NUM` tag corresponds exactly to the Penn Treebank (PTB) tag `CD` (cardinal number).

Treebank Statistics (UD_English)

There are 1184 NUM lemmas (6%), 1184 NUM types (5%) and 4496 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: one, two, 2, 3, 1, 5, 4, 10, three, 20

The 10 most frequent NUM types: one, two, 2, 3, 1, 5, 4, 10, three, 20
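The lemma, type and token counts above can be reproduced from a CoNLL-U file by reading the FORM, LEMMA and UPOS columns. The following is a minimal sketch, not the script that generated this page: the sample sentences and the names `read_tokens` and `tag_summary` are hypothetical, and it assumes that types are counted as case-sensitive word forms.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U sample (not taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
# sent_id = 1
1\ttwo\ttwo\tNUM\tCD\tNumType=Card\t2\tnummod\t_\t_
2\tdogs\tdog\tNOUN\tNNS\tNumber=Plur\t0\troot\t_\t_

# sent_id = 2
1\tone\tone\tNUM\tCD\tNumType=Card\t0\troot\t_\t_
2\tor\tor\tCONJ\tCC\t_\t1\tcc\t_\t_
3\ttwo\ttwo\tNUM\tCD\tNumType=Card\t1\tconj\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges such as 1-2
        yield cols[1], cols[2], cols[3]


def tag_summary(text, tag):
    """Count distinct lemmas, distinct forms (types) and tokens for one tag."""
    forms, lemmas = Counter(), Counter()
    for form, lemma, upos in read_tokens(text):
        if upos == tag:
            forms[form] += 1
            lemmas[lemma] += 1
    return {
        "lemmas": len(lemmas),
        "types": len(forms),
        "tokens": sum(forms.values()),
    }


summary = tag_summary(SAMPLE, "NUM")
```

In this sample `two` occurs twice and `one` once, so the summary holds 2 lemmas, 2 types and 3 tokens.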

The 10 most frequent ambiguous lemmas: one (NUM 446, NOUN 144, PRON 26, VERB 1), 2 (NUM 140, X 30, PROPN 2, ADP 1, PART 1), 3 (NUM 119, X 17, NOUN 1), 1 (NUM 103, X 31), 5 (NUM 103, X 4, PROPN 1), 4 (NUM 95, X 13, ADP 1, SCONJ 1), 10 (NUM 93, X 2), 20 (NUM 63, NOUN 5), 6 (NUM 61, X 2), 12 (NUM 37, X 1)

The 10 most frequent ambiguous types: one (NUM 393, NOUN 104, PRON 22), 2 (NUM 140, X 30, PROPN 2, ADP 1, PART 1), 3 (NUM 119, X 17), 1 (NUM 103, X 31), 5 (NUM 103, X 4, PROPN 1), 4 (NUM 95, X 13, SCONJ 1, ADP 1), 10 (NUM 93, X 2), 20 (NUM 63, NOUN 3), 6 (NUM 61, X 2), 12 (NUM 37, X 1)

one
- NUM 393: This is one thought - provoking film .
- NOUN 104: I think a test like this one is much needed .
- PRON 22: What a neat gem of a restaurant in a corner one would n’t expect it .
2
- NUM 140: Analyst Team 2 : Coach : Doug Sewell
- X 30: * 2 . The second ingredient is words , more precisely lies . *
- PROPN 2: and it seems this is the FIRST site of ragnarok 2 hahaha since the site is new send me your suggestions and comments
- ADP 1: go 2 starbucks do nt spend more than 20 bucks :)
- PART 1: hi everyone …. just hav my hands on my new OLYMPUS X940 digital camera .. wel , i always wanted 2 hav one by sony .. but anyways , ended up having olympus X940 from my dad ……. does any1 already has it ?
3
- NUM 119: 3 TO 4 DAYS if you are lucky on average it takes about 6 days .
- X 17: * 3 . The third aspect is money . *
1
- NUM 103: Analyst Team 1 : Coach : Lisa Gilette
- X 31: Price : 3,40 Euros , 5 Euros or 7,5 Euros ( 1 ) for a 72 heures lenght .
5
- NUM 103: I was thinking Kenneally ‘s at around 5 .
- X 4: 5 ) W. Brumbley 4632 Hilton Ave Suite # 31 Columbus , Ohio 43228
- PROPN 1: Lunar landers and other gear needed for extended visits to the moon will be lofted by gargantuan launchers as big as the Apollo - era Saturn 5 , the most powerful rockets ever flown .
4
- NUM 95: The US troops fired into the hostile crowd , killing 4 .
- X 13: 4 . Alan Greenspan , Chairman , Federal Reserve , U.S.A .
- SCONJ 1: home team - thanks 4 playin !!!
- ADP 1: thanks guys goes really well and thaks 4 the cheap price ..
10
- NUM 93: see you there on court 10
- X 2: 10 . Jack Welch , CEO , General Motors
20
- NUM 63: September 20 , 1888 ?
- NOUN 3: Mary « MEH-risk Oct 20 »
6
- NUM 61: 3 TO 4 DAYS if you are lucky on average it takes about 6 days .
- X 2: 6 . Oprah Winfrey , talkshow host
12
- NUM 37: August 12 , 2000
- X 1: 12 . Remedies should be deleted as it is already covered in Section 6.1 .
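
An ambiguous type, as listed above, is a word form that appears with more than one tag. The sketch below shows one way such a list could be built; the observations are hypothetical, and ordering the items by their NUM count is an assumption inferred from the lists on this page rather than a documented rule.

```python
from collections import Counter, defaultdict

# Hypothetical (form, tag) observations, not real treebank counts.
OBSERVATIONS = [
    ("one", "NUM"), ("one", "NUM"), ("one", "NOUN"),
    ("2", "NUM"), ("2", "X"),
    ("three", "NUM"),
]


def ambiguous_items(pairs, tag):
    """Return items seen with `tag` and at least one other tag."""
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    ambiguous = {
        item: counts
        for item, counts in by_item.items()
        if tag in counts and len(counts) > 1
    }
    # Assumed ordering: most frequent under `tag` first, then alphabetical.
    return sorted(ambiguous.items(), key=lambda kv: (-kv[1][tag], kv[0]))


def format_item(item, counts):
    """Render one entry in the 'form (TAG n, TAG n)' style used above."""
    parts = ", ".join(f"{upos} {n}" for upos, n in counts.most_common())
    return f"{item} ({parts})"


lines = [format_item(item, counts)
         for item, counts in ambiguous_items(OBSERVATIONS, "NUM")]
```

Here `three` is dropped because it only ever appears as NUM, while `one` and `2` are kept.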

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.173735).

The 1st highest number of forms (1) was observed with the lemma “'02”: '02.

The 2nd highest number of forms (1) was observed with the lemma “'05”: '05.

The 3rd highest number of forms (1) was observed with the lemma “'07”: '07.
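
The figures on this page are consistent with reading the form / lemma ratio as the number of distinct forms divided by the number of distinct lemmas. The sketch below works under that assumption; the function name and the sample pairs are hypothetical.

```python
def form_lemma_stats(pairs):
    """Return (forms per lemma, lemma with the most forms).

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    """
    forms_by_lemma = {}
    for form, lemma in pairs:
        forms_by_lemma.setdefault(lemma, set()).add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    ratio = n_forms / len(forms_by_lemma)
    richest = max(forms_by_lemma, key=lambda lemma: len(forms_by_lemma[lemma]))
    return ratio, richest


# Every form is its own lemma, so the ratio is 1.0.
regular = [("one", "one"), ("two", "two"), ("two", "two")]

# All lemmas are the placeholder "_", so every form falls under one lemma.
placeholder = [("one", "_"), ("two", "_"), ("2002", "_")]

regular_stats = form_lemma_stats(regular)
placeholder_stats = form_lemma_stats(placeholder)
```

When a treebank leaves every lemma as `_`, the ratio under this reading simply equals the number of distinct forms.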

NUM occurs with 1 feature: en-feat/NumType (4496; 100% instances)

NUM occurs with 1 feature-value pair: NumType=Card

NUM occurs with 1 feature combination. The most frequent feature combination is NumType=Card (4496 tokens). Examples: one, two, 2, 3, 1, 5, 4, 10, three, 20
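
Feature statistics like these come from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` pairs joined by `|` and `_` marks an empty set. The following is a minimal sketch of how features and feature combinations could be tallied; the input values and variable names are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'NumType=Card' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of three NUM tokens.
FEATS_VALUES = ["NumType=Card", "NumType=Card", "_"]

feature_counts = Counter()      # how many tokens carry each feature
combination_counts = Counter()  # how many tokens carry each full combination
for raw in FEATS_VALUES:
    feats = parse_feats(raw)
    feature_counts.update(feats.keys())
    if feats:
        combination_counts[raw] += 1
```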

Relations

NUM nodes are attached to their parents using 26 different relations: en-dep/nummod (2738; 61% instances), en-dep/nmod (500; 11% instances), en-dep/root (272; 6% instances), en-dep/appos (211; 5% instances), en-dep/compound (205; 5% instances), en-dep/list (115; 3% instances), en-dep/dobj (103; 2% instances), en-dep/nsubj (103; 2% instances), en-dep/conj (90; 2% instances), en-dep/nmod:tmod (64; 1% instances), en-dep/parataxis (18; 0% instances), en-dep/amod (13; 0% instances), en-dep/advcl (9; 0% instances), en-dep/nmod:npmod (9; 0% instances), en-dep/xcomp (9; 0% instances), en-dep/remnant (8; 0% instances), en-dep/ccomp (7; 0% instances), en-dep/advmod (6; 0% instances), en-dep/nsubjpass (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/case (2; 0% instances), en-dep/det (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/iobj (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/vocative (1; 0% instances)

Parents of NUM nodes belong to 13 different parts of speech: NOUN (2310; 51% instances), PROPN (737; 16% instances), VERB (412; 9% instances), NUM (389; 9% instances), SYM (295; 7% instances), ROOT (272; 6% instances), ADJ (39; 1% instances), X (15; 0% instances), ADV (14; 0% instances), PRON (6; 0% instances), DET (5; 0% instances), AUX (1; 0% instances), PUNCT (1; 0% instances)
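
Each percentage in the two lists above is a count divided by the 4496 NUM tokens and rounded to a whole number. A short sketch of that arithmetic follows, using the first three relation counts quoted above. The function name is hypothetical, and rounding with Python's `round` is an assumption that happens to reproduce the printed values.

```python
def format_distribution(counts, total, prefix="en-dep/"):
    """Render counts as 'prefix+name (n; p% instances)', most frequent first."""
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ", ".join(
        f"{prefix}{name} ({n}; {round(100 * n / total)}% instances)"
        for name, n in ranked
    )


# First three relation counts from the list above; 4496 is the NUM token total.
relations = {"nummod": 2738, "nmod": 500, "root": 272}
rendered = format_distribution(relations, total=4496)
```

For example, 2738 / 4496 is about 60.9%, which is printed as 61%.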

3019 (67%) NUM nodes are leaves.

959 (21%) NUM nodes have one child.

269 (6%) NUM nodes have two children.

249 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 13.
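
The leaf and child counts above describe how many dependents each NUM node has, which can be derived from the HEAD column alone. The sketch below is one way to compute such a histogram. The three sentences are hypothetical and are reduced to (id, tag, head) triples, with head 0 standing for the artificial root.

```python
from collections import Counter

# Hypothetical sentences as (id, upos, head) triples.
SENTENCES = [
    [(1, "ADV", 2), (2, "NUM", 3), (3, "NOUN", 0)],   # NUM has one child
    [(1, "NUM", 2), (2, "NOUN", 0)],                  # NUM is a leaf
    [(1, "NUM", 0), (2, "ADP", 4), (3, "DET", 4),
     (4, "NOUN", 1), (5, "PUNCT", 1)],                # NUM has two children
]


def child_degree_histogram(sentences, tag="NUM"):
    """Count nodes of `tag` by number of children; 3 means 'three or more'."""
    histogram = Counter()
    for sentence in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _id, _upos, head in sentence)
        for token_id, upos, _head in sentence:
            if upos == tag:
                histogram[min(n_children[token_id], 3)] += 1
    return histogram


histogram = child_degree_histogram(SENTENCES)
```

Key 0 of the histogram counts leaves, keys 1 and 2 count nodes with one and two children, and key 3 collects everything with three or more.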

Children of NUM nodes are attached using 34 different relations: en-dep/case (517; 21% instances), en-dep/punct (399; 16% instances), en-dep/nmod (355; 14% instances), en-dep/advmod (207; 8% instances), en-dep/nmod:tmod (197; 8% instances), en-dep/compound (115; 5% instances), en-dep/conj (97; 4% instances), en-dep/cop (91; 4% instances), en-dep/cc (89; 4% instances), en-dep/nsubj (88; 4% instances), en-dep/nummod (85; 3% instances), en-dep/det (57; 2% instances), en-dep/parataxis (46; 2% instances), en-dep/amod (24; 1% instances), en-dep/appos (23; 1% instances), en-dep/acl:relcl (20; 1% instances), en-dep/mark (15; 1% instances), en-dep/nmod:npmod (13; 1% instances), en-dep/aux (11; 0% instances), en-dep/advcl (9; 0% instances), en-dep/remnant (7; 0% instances), en-dep/discourse (5; 0% instances), en-dep/neg (5; 0% instances), en-dep/acl (4; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/cc:preconj (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/csubj (1; 0% instances), en-dep/det:predet (1; 0% instances), en-dep/dobj (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/list (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 17 different parts of speech: ADP (441; 18% instances), NOUN (420; 17% instances), PUNCT (390; 16% instances), NUM (389; 16% instances), ADV (187; 8% instances), VERB (163; 7% instances), SYM (112; 4% instances), CONJ (86; 3% instances), PRON (81; 3% instances), ADJ (77; 3% instances), DET (69; 3% instances), PROPN (46; 2% instances), AUX (11; 0% instances), SCONJ (8; 0% instances), PART (5; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)

Treebank Statistics (UD_English-ESL)

There is 1 NUM lemma (6%), 1 NUM type (6%) and 844 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 15635, VERB 15080, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

The 10 most frequent ambiguous types: _ (NOUN 15635, VERB 15080, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

_
- NOUN 15635: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 15080: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 10618: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 10057: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 9580: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 8546: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 5857: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 5704: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 4533: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 3531: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CONJ 3198: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 2516: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 1795: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 844: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 80: _ _ _ _ _ _ _ _ _ _ _ _
- X 68: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SYM 39: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 19 different relations: en-dep/nummod (525; 62% instances), en-dep/nmod (156; 18% instances), en-dep/root (31; 4% instances), en-dep/nsubj (28; 3% instances), en-dep/conj (27; 3% instances), en-dep/dobj (22; 3% instances), en-dep/compound (13; 2% instances), en-dep/appos (10; 1% instances), en-dep/nmod:tmod (6; 1% instances), en-dep/advcl (5; 1% instances), en-dep/ccomp (4; 0% instances), en-dep/parataxis (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/xcomp (3; 0% instances), en-dep/goeswith (2; 0% instances), en-dep/nmod:npmod (2; 0% instances), en-dep/amod (1; 0% instances), en-dep/csubjpass (1; 0% instances), en-dep/det (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (507; 60% instances), VERB (180; 21% instances), PROPN (38; 5% instances), NUM (36; 4% instances), SYM (35; 4% instances), ROOT (31; 4% instances), ADJ (11; 1% instances), ADV (3; 0% instances), PRON (2; 0% instances), PUNCT (1; 0% instances)

509 (60%) NUM nodes are leaves.

209 (25%) NUM nodes have one child.

52 (6%) NUM nodes have two children.

74 (9%) NUM nodes have three or more children.

The highest child degree of a NUM node is 9.

Children of NUM nodes are attached using 25 different relations: en-dep/case (163; 26% instances), en-dep/nmod (103; 16% instances), en-dep/punct (53; 8% instances), en-dep/cop (51; 8% instances), en-dep/nsubj (48; 8% instances), en-dep/advmod (47; 7% instances), en-dep/conj (32; 5% instances), en-dep/cc (30; 5% instances), en-dep/det (21; 3% instances), en-dep/compound (17; 3% instances), en-dep/amod (15; 2% instances), en-dep/mark (12; 2% instances), en-dep/acl:relcl (7; 1% instances), en-dep/parataxis (6; 1% instances), en-dep/appos (4; 1% instances), en-dep/neg (4; 1% instances), en-dep/advcl (3; 0% instances), en-dep/aux (3; 0% instances), en-dep/goeswith (3; 0% instances), en-dep/acl (2; 0% instances), en-dep/nummod (2; 0% instances), en-dep/csubj (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: ADP (159; 25% instances), NOUN (99; 16% instances), VERB (80; 13% instances), ADV (54; 9% instances), PUNCT (52; 8% instances), PRON (37; 6% instances), NUM (36; 6% instances), CONJ (30; 5% instances), ADJ (26; 4% instances), DET (23; 4% instances), PROPN (15; 2% instances), SCONJ (8; 1% instances), PART (5; 1% instances), AUX (3; 0% instances), SYM (2; 0% instances), X (1; 0% instances)

Treebank Statistics (UD_English-LinES)

There is 1 NUM lemma (6%), 125 NUM types (1%) and 581 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: one, two, three, 2002, six, five, 2000, 1, 2, ten

The 10 most frequent ambiguous lemmas: _ (NOUN 14939, VERB 11076, PUNCT 10025, ADP 8281, DET 7865, PRON 7793, ADJ 5305, ADV 4610, AUX 3168, PROPN 2792, CONJ 2535, PART 2131, SCONJ 1512, NUM 581, INTJ 159, X 43, SYM 6)

The 10 most frequent ambiguous types: one (PRON 115, NUM 102, DET 10), 1 (NUM 14, ADJ 2), 12 (NUM 7, ADJ 1), 3 (NUM 5, ADJ 1), 5 (NUM 3, ADJ 1), 30 (NUM 2, ADJ 1), U (NUM 2, NOUN 1), 14 (NUM 1, ADJ 1), 22 (ADJ 2, NUM 1)

one
- PRON 115: Margot Wentz said , looking at no one , That one ca n’t say .
- NUM 102: The North Pole was one of these places , I remember .
- DET 10: In that one brief moment he knew that he was in trouble .
1
- NUM 14: 1 Filter field
- ADJ 2: The European Union put a new and revised banana regime in place on 1 January 1999 .
12
- NUM 7: The vote will take place tomorrow at 12 noon .
- ADJ 1: On July 12 , after the raid , Israel was accused of giving comfort to the reactionaries of Rhodesia and South Africa by its demonstration of military superiority and its use of Western arms and techniques , upsetting the balance between poor and rich countries , disturbing the work of men of good will in Paris who were trying to create a new climate and to treat the countries of the Third World as equals and partners .
3
- NUM 5: 3 Outer field items
- ADJ 1: On July 3 , 1976 , before Israel had freed the hostages at Entebbe , the paper observed with some satisfaction that Amin , “ the disquieting Marshal , “ maligned by everyone , had now become the support and the hope of his foolish detractors .
5
- NUM 3: 5 Aggregate field
- ADJ 1: We have done so : on 5 February we published an extremely detailed press release dealing with the questions you have raised .
30
- NUM 2: A mounted escort of some 30 men , all armed .
- ADJ 1: In the proposal for a directive , summer begins on 1 April and ends on 30 September .
U
- NUM 2: SELECT asterisk FROM Customers WHERE Country Like U %
- NOUN 1: It returns all customers from a (country|region) named “ U % “ , not all (countries|regions) beginning with the letter “ U “ , because the percent sign ( % ) is not a wildcard character in ANSI-89 SQL .
14
- NUM 1: Then there is the famous project No-8 of the 14 very important projects endorsed by the Essen summit .
- ADJ 1: Through you I urge the Commission and the Council to reach decisions and take urgent action following the meeting on 14 October .
22
- ADJ 2: Parliament agreed to urgent procedure for 22 March
- NUM 1: These are Amendments Nos 22 , 23 , 37 and 38 .

Morphology

The form / lemma ratio of NUM is 125.000000 (the average of all parts of speech is 597.705882).

The 1st highest number of forms (125) was observed with the lemma “_”: 01-Jul-1999, 08-Jul-1999, 1, 1-100, 10, 100, 100c, 101-200, 11.25, 11.30, 111, 12, 12.00, 12:30, 13, 14, 1857, 1875, 1910, 1945, 1947, 1950s, 1952, 1953, 1955, 1972, 1973, 1976, 1996, 1996-1997, 1997, 1998, 1999, 2, 2.6, 2000, 2002, 2005, 22, 23, 25, 3, 30, 31-Dec-1999, 37, 38, 4, 4-5, 40, 43, 4:30, 5, 5.5, 50, 50000, 6, 60, 6500, 7, 7.0, 7.15, 747, 84, 9, 96/23, 96/96/EC, 97, A4-0029/99, A4-0072/97, A4-0090/99, C4-0497/98-98/0126, H-0002/99, H-0045/99, H-0209/99, H-0218/97, H-0237/97, No-12, No-15, No-4, No-44, No-46, No-49, No-59, No-6, No-8, U, billion, eight, eight-and-a-half-by-eleven, eighteen, eleven, fifteen, fifty, five, forty, forty-eight, four, fourteen, half-a-dozen, hundred, million, n, nine, nineteen, nn, one, seven, six, six-forty-one, six-thirty, sixteen, sixty, ten, thirty, thirty-eight, thirty-five, thousand, three, twelve, twenty, twenty-five, twenty-four, twenty-six, twenty-two, two.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 15 different relations: en-dep/nummod (396; 68% instances), en-dep/nmod (57; 10% instances), en-dep/conj (36; 6% instances), en-dep/discourse (24; 4% instances), en-dep/nsubj (16; 3% instances), en-dep/root (15; 3% instances), en-dep/appos (13; 2% instances), en-dep/dobj (10; 2% instances), en-dep/name (4; 1% instances), en-dep/nsubjpass (3; 1% instances), en-dep/xcomp (3; 1% instances), en-dep/advmod (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/compound (1; 0% instances), en-dep/dislocated (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (360; 62% instances), VERB (95; 16% instances), NUM (52; 9% instances), PROPN (39; 7% instances), ROOT (15; 3% instances), ADJ (5; 1% instances), SYM (5; 1% instances), ADV (4; 1% instances), PRON (4; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances)

354 (61%) NUM nodes are leaves.

114 (20%) NUM nodes have one child.

69 (12%) NUM nodes have two children.

44 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 16.

Children of NUM nodes are attached using 19 different relations: en-dep/case (92; 22% instances), en-dep/nmod (60; 14% instances), en-dep/punct (59; 14% instances), en-dep/advmod (43; 10% instances), en-dep/conj (41; 10% instances), en-dep/compound (31; 7% instances), en-dep/cc (25; 6% instances), en-dep/det (16; 4% instances), en-dep/nummod (16; 4% instances), en-dep/mwe (12; 3% instances), en-dep/appos (8; 2% instances), en-dep/cop (6; 1% instances), en-dep/nsubj (6; 1% instances), en-dep/amod (5; 1% instances), en-dep/acl (2; 0% instances), en-dep/acl:relcl (1; 0% instances), en-dep/aux (1; 0% instances), en-dep/mark (1; 0% instances), en-dep/parataxis (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (91; 21% instances), NOUN (87; 20% instances), PUNCT (59; 14% instances), NUM (52; 12% instances), ADV (46; 11% instances), CONJ (31; 7% instances), DET (16; 4% instances), ADJ (12; 3% instances), VERB (11; 3% instances), PROPN (9; 2% instances), PRON (7; 2% instances), AUX (2; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)
