
This page pertains to UD version 2.

Treebank Statistics: UD_French-ParTUT: POS Tags: NUM

There are 132 NUM lemmas (4%), 132 NUM types (3%) and 428 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
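The counts and ranks above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the names `tag_statistics`, `rank` and `SAMPLE` are hypothetical, the sample sentence is invented, and it assumes that a "type" is a distinct word form compared exactly as written.

```python
from collections import defaultdict

# Invented CoNLL-U sample (not taken from UD_French-ParTUT).
SAMPLE = """\
1\tdeux\tdeux\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tchats\tchat\tNOUN\t_\t_\t0\troot\t_\t_
3\t,\t,\tPUNCT\t_\t_\t5\tpunct\t_\t_
4\t3\t3\tNUM\t_\tNumType=Card\t5\tnummod\t_\t_
5\tchiens\tchien\tNOUN\t_\t_\t2\tconj\t_\t_
6\t,\t,\tPUNCT\t_\t_\t8\tpunct\t_\t_
7\t2\t2\tNUM\t_\tNumType=Card\t8\tnummod\t_\t_
8\tchats\tchat\tNOUN\t_\t_\t2\tconj\t_\t_
9\tchiens\tchien\tNOUN\t_\t_\t2\tconj\t_\t_
"""


def tag_statistics(conllu_text):
    """Return {upos: (distinct lemmas, distinct types, tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token rows: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges and empty nodes.
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(stats, upos, index):
    """1-based rank of `upos` by one count (0 = lemmas, 1 = types, 2 = tokens).

    Ties are broken by first appearance in the data, which is an arbitrary choice.
    """
    ordered = sorted(stats, key=lambda tag: stats[tag][index], reverse=True)
    return ordered.index(upos) + 1


stats = tag_statistics(SAMPLE)
print(stats["NUM"])            # (3, 3, 3)
print(rank(stats, "NUM", 0))   # 1: NUM has the most lemmas in the sample
print(rank(stats, "NUM", 2))   # 2: NOUN has more tokens than NUM
```

As on this page, a tag can rank high by lemmas and lower by tokens when its vocabulary is varied but each item is rare.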

The 10 most frequent NUM lemmas: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005

The 10 most frequent NUM types: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005

The 10 most frequent ambiguous lemmas: neuf (ADJ 1, NUM 1), un (DET 615, PRON 11, NUM 1)

The 10 most frequent ambiguous types: un (DET 209, PRON 5, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.361807).

The 1st highest number of forms (1) was observed with the lemma “-20”: -20.

The 2nd highest number of forms (1) was observed with the lemma “-20º”: ­20º.

The 3rd highest number of forms (1) was observed with the lemma “-40”: -40.
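One plausible reading of the form / lemma ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. The sketch below implements that reading. It is an assumption about the definition, not the generator's actual code, and `form_lemma_ratio` and `SAMPLE` are hypothetical names with invented data.

```python
from collections import defaultdict

# Invented CoNLL-U sample (not taken from UD_French-ParTUT).
SAMPLE = """\
1\tdeux\tdeux\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tchats\tchat\tNOUN\t_\t_\t0\troot\t_\t_

1\tDeux\tdeux\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tchiens\tchien\tNOUN\t_\t_\t0\troot\t_\t_
"""


def form_lemma_ratio(conllu_text, upos):
    """Average number of distinct forms per lemma for one UPOS tag."""
    forms_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges and empty nodes.
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            # Assumption: forms are compared exactly as written (case-sensitive).
            forms_by_lemma[cols[2]].add(cols[1])
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


print(form_lemma_ratio(SAMPLE, "NUM"))   # 2.0: "deux" and "Deux" share one lemma
print(form_lemma_ratio(SAMPLE, "NOUN"))  # 1.0: one form per lemma
```

A ratio of exactly 1, as reported for NUM here, means that under this definition every lemma was seen with a single form.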

NUM occurs with 1 feature: NumType (428; 100% instances)

NUM occurs with 1 feature-value pair: NumType=Card

NUM occurs with 1 feature combination: NumType=Card (428 tokens). Examples: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (314; 73% instances), flat (44; 10% instances), nmod (28; 7% instances), obl (20; 5% instances), conj (14; 3% instances), appos (3; 1% instances), obj (2; 0% instances), goeswith (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech, counting the artificial root, which has no tag: NOUN (222; 52% instances), VERB (86; 20% instances), NUM (81; 19% instances), PROPN (14; 3% instances), ADJ (12; 3% instances), SYM (10; 2% instances), PRON (1; 0% instances), PUNCT (1; 0% instances), no tag (1; 0% instances)
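The two distributions above (incoming relation and parent part of speech) can be gathered in one pass over a CoNLL-U file. This is a minimal sketch with hypothetical names (`read_sentences`, `incoming_relations`, `SAMPLE`) and invented data. It assumes that a node whose head is 0 has a parent with no part of speech, which is how a NUM node attached as `root` ends up with an untagged parent.

```python
from collections import Counter

# Invented CoNLL-U sample (not taken from UD_French-ParTUT).
SAMPLE = """\
# text = deux chats et 3 chiens
1\tdeux\tdeux\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tchats\tchat\tNOUN\t_\t_\t0\troot\t_\t_
3\tet\tet\tCCONJ\t_\t_\t5\tcc\t_\t_
4\t3\t3\tNUM\t_\tNumType=Card\t5\tnummod\t_\t_
5\tchiens\tchien\tNOUN\t_\t_\t2\tconj\t_\t_

# text = 2005
1\t2005\t2005\tNUM\t_\tNumType=Card\t0\troot\t_\t_
"""


def read_sentences(conllu_text):
    """Yield each sentence as a list of 10-column token rows."""
    sentence = []
    for line in conllu_text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword ranges (1-2) and empty nodes (1.1).
            if len(cols) == 10 and cols[0].isdigit():
                sentence.append(cols)
    if sentence:
        yield sentence


def incoming_relations(conllu_text, upos):
    """Count DEPREL values and parent UPOS tags for nodes tagged `upos`."""
    deprels = Counter()
    parent_tags = Counter()
    for sentence in read_sentences(conllu_text):
        upos_by_id = {cols[0]: cols[3] for cols in sentence}
        for cols in sentence:
            if cols[3] != upos:
                continue
            deprels[cols[7]] += 1
            # HEAD 0 is the artificial root; it has no tag, so use "".
            parent_tags[upos_by_id.get(cols[6], "")] += 1
    return deprels, parent_tags


deprels, parent_tags = incoming_relations(SAMPLE, "NUM")
print(deprels.most_common())
print(parent_tags.most_common())
```

Both counters always sum to the number of NUM tokens, which is a quick consistency check on figures like those above (each list sums to 428).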

216 (50%) NUM nodes are leaves.

78 (18%) NUM nodes have one child.

71 (17%) NUM nodes have two children.

63 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
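The leaf counts and child degrees above amount to a histogram of how many tokens name each NUM node as their head. A minimal sketch follows, using the hypothetical names `child_degree_histogram` and `SAMPLE` and an invented sample; it is not the script that produced these figures.

```python
from collections import Counter

# Invented CoNLL-U sample (not taken from UD_French-ParTUT).
SAMPLE = """\
# text = entre 1999 et 2001
1\tentre\tentre\tADP\t_\t_\t2\tcase\t_\t_
2\t1999\t1999\tNUM\t_\tNumType=Card\t0\troot\t_\t_
3\tet\tet\tCCONJ\t_\t_\t4\tcc\t_\t_
4\t2001\t2001\tNUM\t_\tNumType=Card\t2\tconj\t_\t_

# text = deux chats
1\tdeux\tdeux\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tchats\tchat\tNOUN\t_\t_\t0\troot\t_\t_
"""


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes tagged `upos` with that many."""
    histogram = Counter()
    sentence = []
    # The extra "" guarantees that the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if line.strip():
            cols = line.split("\t")
            # Keep regular token rows only (no comments, ranges or empty nodes).
            if not line.startswith("#") and len(cols) == 10 and cols[0].isdigit():
                sentence.append(cols)
            continue
        # Blank line: the sentence is complete, so count children per head ID.
        children = Counter(cols[6] for cols in sentence)
        for cols in sentence:
            if cols[3] == upos:
                histogram[children[cols[0]]] += 1
        sentence = []
    return histogram


histogram = child_degree_histogram(SAMPLE, "NUM")
print(histogram[0])    # 1 leaf: "deux"
print(max(histogram))  # 2: "1999" has the children "entre" and "2001"
```

Summing the histogram values gives the total number of NUM nodes, matching the way the four figures above add up to 428.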

Children of NUM nodes are attached using 13 different relations: punct (214; 46% instances), flat (71; 15% instances), case (48; 10% instances), det (36; 8% instances), nummod (32; 7% instances), nmod (29; 6% instances), conj (14; 3% instances), cc (8; 2% instances), advmod (5; 1% instances), goeswith (4; 1% instances), amod (1; 0% instances), cop (1; 0% instances), nsubj (1; 0% instances)

Children of NUM nodes belong to 13 different parts of speech: PUNCT (214; 46% instances), NUM (81; 17% instances), ADP (48; 10% instances), NOUN (44; 9% instances), DET (36; 8% instances), PROPN (21; 5% instances), CCONJ (8; 2% instances), ADV (5; 1% instances), PRON (2; 0% instances), X (2; 0% instances), ADJ (1; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)