Treebank Statistics: UD_English-EWT: POS Tags: NUM
There are 1236 NUM
lemmas (7%), 1246 NUM
types (5%) and 5043 NUM
tokens (2%).
Out of 17 observed tags, the rank of NUM
is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: one, two, 2, 1, 3, 5, 4, 10, three, 20
The 10 most frequent NUM
types: one, two, 2, 1, 3, 5, 4, 10, three, 20
The 10 most frequent ambiguous lemmas: one (NUM 448, NOUN 148, PRON 26), 2 (NUM 173, PROPN 2, X 2), 1 (NUM 140, X 2), 3 (NUM 138, NOUN 1, X 1), 5 (NUM 116, PROPN 1), 4 (NUM 108, X 1), 20 (NUM 66, NOUN 2), m (NUM 46, NOUN 17, PROPN 3), 30 (NUM 37, NOUN 1), million (NUM 37, NOUN 8)
The 10 most frequent ambiguous types: one (NUM 395, NOUN 105, PRON 22, X 3), 2 (NUM 173, PROPN 2, X 2, ADP 1, PART 1), 1 (NUM 140, X 2), 3 (NUM 138, X 1), 5 (NUM 116, PROPN 1), 4 (NUM 108, ADP 1, SCONJ 1, X 1), 20 (NUM 66, X 3), m (NUM 41, AUX 21, NOUN 11, PROPN 3, VERB 1), million (NUM 37, NOUN 1), 8 (NUM 30, PROPN 2)
- one
- 2
- NUM 173: Analyst Team 2 : Coach : Doug Sewell
- PROPN 2: and it seems this is the FIRST site of ragnarok 2 hahaha since the site is new send me your suggestions and comments
- X 2: 2
- ADP 1: go 2 starbucks do nt spend more than 20 bucks :)
- PART 1: hi everyone …. just hav my hands on my new OLYMPUS X940 digital camera .. wel , i always wanted 2 hav one by sony .. but anyways , ended up having olympus X940 from my dad ……. does any1 already has it ?
- 1
- 3
- 5
- 4
- 20
- m
- million
- 8
Morphology
The form / lemma ratio of NUM
is 1.008091 (the average of all parts of speech is 1.228673).
The 1st highest number of forms (3) was observed with the lemma “billion”: b, billion, bn.
The 2nd highest number of forms (2) was observed with the lemma “’72”: ‘72, ’72.
The 3rd highest number of forms (2) was observed with the lemma “’73”: ‘73, ’73.
NUM
occurs with 5 features: NumType (4921; 98% instances), Abbr (3; 0% instances), ExtPos (2; 0% instances), Number (1; 0% instances), Typo (1; 0% instances)
NUM
occurs with 5 feature-value pairs: Abbr=Yes
, ExtPos=PRON
, NumType=Card
, Number=Sing
, Typo=Yes
NUM
occurs with 6 feature combinations.
The most frequent feature combination is NumType=Card
(4915 tokens).
Examples: one, two, 2, 3, 5, 1, 10, 4, three, 20
Relations
NUM
nodes are attached to their parents using 27 different relations: nummod (3077; 61% instances), root (421; 8% instances), nmod (291; 6% instances), obl (253; 5% instances), compound (218; 4% instances), appos (179; 4% instances), list (150; 3% instances), nsubj (107; 2% instances), obj (104; 2% instances), conj (90; 2% instances), nmod:tmod (58; 1% instances), parataxis (19; 0% instances), amod (11; 0% instances), obl:tmod (9; 0% instances), xcomp (9; 0% instances), advcl (7; 0% instances), ccomp (7; 0% instances), nmod:npmod (7; 0% instances), obl:npmod (7; 0% instances), flat (6; 0% instances), nsubj:pass (4; 0% instances), acl:relcl (3; 0% instances), reparandum (2; 0% instances), iobj (1; 0% instances), nmod:poss (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)
Parents of NUM
nodes belong to 12 different parts of speech: NOUN (2346; 47% instances), PROPN (857; 17% instances), VERB (500; 10% instances), NUM (450; 9% instances), (421; 8% instances), SYM (375; 7% instances), ADJ (54; 1% instances), X (17; 0% instances), ADV (14; 0% instances), PRON (5; 0% instances), DET (3; 0% instances), AUX (1; 0% instances)
3111 (62%) NUM
nodes are leaves.
1243 (25%) NUM
nodes have one child.
312 (6%) NUM
nodes have two children.
377 (7%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 10.
Children of NUM
nodes are attached using 34 different relations: punct (832; 26% instances), case (563; 17% instances), nmod (349; 11% instances), advmod (216; 7% instances), appos (212; 7% instances), nmod:tmod (199; 6% instances), compound (175; 5% instances), conj (101; 3% instances), cop (90; 3% instances), nummod (90; 3% instances), nsubj (86; 3% instances), cc (82; 3% instances), det (64; 2% instances), parataxis (47; 1% instances), amod (29; 1% instances), acl:relcl (19; 1% instances), mark (13; 0% instances), nmod:npmod (12; 0% instances), obl (11; 0% instances), aux (10; 0% instances), advcl (8; 0% instances), discourse (5; 0% instances), acl (3; 0% instances), det:predet (2; 0% instances), fixed (2; 0% instances), nmod:poss (2; 0% instances), reparandum (2; 0% instances), cc:preconj (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), goeswith (1; 0% instances), list (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)
Children of NUM
nodes belong to 17 different parts of speech: PUNCT (832; 26% instances), NOUN (600; 19% instances), ADP (472; 15% instances), NUM (450; 14% instances), ADV (175; 5% instances), SYM (120; 4% instances), ADJ (109; 3% instances), AUX (101; 3% instances), CCONJ (80; 2% instances), PRON (80; 2% instances), DET (76; 2% instances), VERB (67; 2% instances), PROPN (45; 1% instances), SCONJ (10; 0% instances), PART (8; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)