Treebank Statistics: UD_English-ParTUT: POS Tags: NUM
There are 261 NUM
lemmas (4%), 261 NUM
types (3%) and 908 NUM
tokens (2%).
Out of 17 observed tags, the rank of NUM
is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: two, one, 1, three, 2, four, 18, 3, 6, five
The 10 most frequent NUM
types: two, one, 1, three, 2, four, 18, 3, 6, five
The 10 most frequent ambiguous lemmas: two (NUM 60, NOUN 2), one (NUM 58, PRON 31, DET 2, NOUN 1), three (NUM 27, NOUN 1), million (NUM 9, NOUN 2), ten (NUM 3, NOUN 2), - (PUNCT 359, NUM 1, X 1)
The 10 most frequent ambiguous types: two (NUM 50, NOUN 1), one (NUM 51, PRON 27, DET 2, NOUN 1), three (NUM 27, NOUN 1), ten (NUM 3, NOUN 1)
- two
- one
- NUM 51: I should like to address one final point .
- PRON 27: I love a challenge , and saving the Earth is probably a good one .
- DET 2: This was the first book Balzac released under his own name , and it gave him what one critic called “ passage into the Promised Land “ .
- NOUN 1: The late romances , with their shifts in time and surprising turns of plot , inspired a last poetic style in which long and short sentences are set against one another , clauses are piled up , subject and object are reversed , and words are omitted , creating an effect of spontaneity .
- three
- ten
Morphology
The form / lemma ratio of NUM
is 1.000000 (the average of all parts of speech is 1.198926).
The 1st highest number of forms (1) was observed with the lemma “-”: -20º.
The 2nd highest number of forms (1) was observed with the lemma “-40°”: -40°.
The 3rd highest number of forms (1) was observed with the lemma “0083”: 0083.
NUM
occurs with 1 features: NumType (908; 100% instances)
NUM
occurs with 1 feature-value pairs: NumType=Card
NUM
occurs with 1 feature combinations.
The most frequent feature combination is NumType=Card
(908 tokens).
Examples: two, one, 1, three, 2, four, 18, 3, 6, five
Relations
NUM
nodes are attached to their parents using 14 different relations: nummod (571; 63% instances), obl (140; 15% instances), flat (86; 9% instances), conj (35; 4% instances), nmod (23; 3% instances), compound (17; 2% instances), root (11; 1% instances), obj (10; 1% instances), nsubj (8; 1% instances), appos (2; 0% instances), ccomp (2; 0% instances), advcl (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM
nodes belong to 10 different parts of speech: NOUN (409; 45% instances), VERB (220; 24% instances), NUM (146; 16% instances), PROPN (57; 6% instances), SYM (42; 5% instances), ADJ (17; 2% instances), (11; 1% instances), X (4; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances)
447 (49%) NUM
nodes are leaves.
172 (19%) NUM
nodes have one child.
167 (18%) NUM
nodes have two children.
122 (13%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 7.
Children of NUM
nodes are attached using 22 different relations: punct (326; 34% instances), case (207; 22% instances), flat (119; 13% instances), nmod (66; 7% instances), nummod (40; 4% instances), conj (39; 4% instances), amod (27; 3% instances), cc (25; 3% instances), dep (17; 2% instances), compound (16; 2% instances), advmod (15; 2% instances), det (13; 1% instances), cop (12; 1% instances), nsubj (11; 1% instances), advcl (3; 0% instances), appos (3; 0% instances), mark (3; 0% instances), aux (2; 0% instances), csubj (2; 0% instances), ccomp (1; 0% instances), nmod:poss (1; 0% instances), obj (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: PUNCT (326; 34% instances), ADP (206; 22% instances), NUM (146; 15% instances), PROPN (93; 10% instances), NOUN (51; 5% instances), ADJ (32; 3% instances), CCONJ (25; 3% instances), X (17; 2% instances), AUX (14; 1% instances), DET (14; 1% instances), ADV (13; 1% instances), VERB (6; 1% instances), SCONJ (3; 0% instances), PRON (2; 0% instances), PART (1; 0% instances)