home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Sequoia: POS Tags: NUM

There are 429 NUM lemmas (6%), 429 NUM types (4%) and 1826 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4

The 10 most frequent NUM types: deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4

The 10 most frequent ambiguous lemmas: neuf (ADJ 2, NUM 2), II (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: neuf (NUM 2, ADJ 1), II (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.408837).

The 1st highest number of forms (1) was observed with the lemma “-1,5”: -1,5.

The 2nd highest number of forms (1) was observed with the lemma “-2,5”: -2,5.

The 3rd highest number of forms (1) was observed with the lemma “-6”: -6.

NUM occurs with 3 features: NumType (1775; 97% instances), Number (1691; 93% instances), Gender (3; 0% instances)

NUM occurs with 4 feature-value pairs: Gender=Masc, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 7 feature combinations. The most frequent feature combination is Number=Plur|NumType=Card (909 tokens). Examples: deux, trois, 5, 10, 2, 1, 30, 4, 3, 100

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (913; 50% instances), nmod (546; 30% instances), obl:mod (193; 11% instances), conj (51; 3% instances), obl:arg (40; 2% instances), parataxis:insert (25; 1% instances), appos (17; 1% instances), parataxis (14; 1% instances), obj (8; 0% instances), nsubj:pass (5; 0% instances), orphan (5; 0% instances), nsubj (4; 0% instances), root (3; 0% instances), flat (2; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (1246; 68% instances), VERB (215; 12% instances), SYM (159; 9% instances), NUM (91; 5% instances), PROPN (61; 3% instances), ADJ (24; 1% instances), X (14; 1% instances), ADP (6; 0% instances), ADV (3; 0% instances), DET (3; 0% instances), (3; 0% instances), PRON (1; 0% instances)

1240 (68%) NUM nodes are leaves.

309 (17%) NUM nodes have one child.

163 (9%) NUM nodes have two children.

114 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 20 different relations: punct (310; 30% instances), case (230; 22% instances), nmod (216; 21% instances), det (102; 10% instances), conj (43; 4% instances), cc (36; 4% instances), obl:arg (23; 2% instances), advmod (15; 1% instances), obl:mod (14; 1% instances), amod (7; 1% instances), dep (5; 0% instances), nsubj (5; 0% instances), appos (4; 0% instances), parataxis (3; 0% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), flat (2; 0% instances), orphan (2; 0% instances), cop (1; 0% instances), flat:name (1; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: PUNCT (310; 30% instances), ADP (224; 22% instances), NOUN (204; 20% instances), DET (102; 10% instances), NUM (91; 9% instances), CCONJ (41; 4% instances), SYM (15; 1% instances), ADV (11; 1% instances), ADJ (6; 1% instances), PROPN (6; 1% instances), VERB (5; 0% instances), PRON (4; 0% instances), X (3; 0% instances), AUX (1; 0% instances)