home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: POS Tags: NUM

There are 350 NUM lemmas (5%), 350 NUM types (4%) and 1649 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: deux, 5, 2006, trois, 2, 10, 30, 4, 3, 1

The 10 most frequent NUM types: deux, 5, 2006, trois, 2, 10, 30, 4, 3, 1

The 10 most frequent ambiguous lemmas: deux (NUM 89, PRON 6), 5 (NUM 57, PRON 1), trois (NUM 51, PRON 2), 2 (NUM 50, PRON 1), 10 (NUM 38, PRON 3), 3 (NUM 32, PRON 1), 1 (NUM 31, PRON 9), 15 (NUM 28, PRON 1), 6 (NUM 21, PRON 2), 8 (NUM 18, PRON 1)

The 10 most frequent ambiguous types: deux (NUM 85, PRON 6), 5 (NUM 57, PRON 1), trois (NUM 46, PRON 2), 2 (NUM 50, PRON 1), 10 (NUM 38, PRON 3), 3 (NUM 32, PRON 1), 1 (NUM 31, PRON 9), 15 (NUM 28, PRON 1), 6 (NUM 21, PRON 2), 8 (NUM 18, PRON 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.401447).

The 1st highest number of forms (1) was observed with the lemma “-6”: -6.

The 2nd highest number of forms (1) was observed with the lemma “0,0001”: 0,0001.

The 3rd highest number of forms (1) was observed with the lemma “0,001”: 0,001.

NUM occurs with 3 features: NumType (1648; 100% instances), Gender (2; 0% instances), Number (2; 0% instances)

NUM occurs with 4 feature-value pairs: Gender=Masc, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 4 feature combinations. The most frequent feature combination is NumType=Card (1646 tokens). Examples: deux, 5, 2006, trois, 2, 10, 30, 4, 3, 1

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (1049; 64% instances), nmod (341; 21% instances), obl:mod (187; 11% instances), conj (39; 2% instances), obl:arg (9; 1% instances), obj (7; 0% instances), dep (4; 0% instances), root (4; 0% instances), fixed (2; 0% instances), nsubj (2; 0% instances), orphan (2; 0% instances), amod (1; 0% instances), appos (1; 0% instances), parataxis (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (1314; 80% instances), VERB (200; 12% instances), NUM (61; 4% instances), PROPN (56; 3% instances), ADJ (7; 0% instances), (4; 0% instances), DET (3; 0% instances), PRON (2; 0% instances), X (2; 0% instances)

1238 (75%) NUM nodes are leaves.

281 (17%) NUM nodes have one child.

81 (5%) NUM nodes have two children.

49 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 19 different relations: nmod (207; 34% instances), case (165; 27% instances), det (92; 15% instances), conj (35; 6% instances), punct (33; 5% instances), cc (29; 5% instances), advmod (23; 4% instances), amod (5; 1% instances), dep (5; 1% instances), nummod (3; 0% instances), appos (2; 0% instances), fixed (2; 0% instances), orphan (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 13 different parts of speech: NOUN (186; 31% instances), ADP (166; 27% instances), DET (95; 16% instances), NUM (61; 10% instances), PUNCT (33; 5% instances), CCONJ (29; 5% instances), ADV (22; 4% instances), ADJ (7; 1% instances), VERB (4; 1% instances), PROPN (3; 0% instances), AUX (1; 0% instances), PRON (1; 0% instances), X (1; 0% instances)