Treebank Statistics: UD_French-ParTUT: POS Tags: NUM
There are 131 NUM lemmas (4%), 131 NUM types (3%) and 424 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005
The 10 most frequent NUM types: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005
The 10 most frequent ambiguous lemmas: neuf (ADJ 1, NUM 1), un (DET 646, PRON 14, NOUN 1, NUM 1)
The 10 most frequent ambiguous types: un (DET 205, PRON 8, NUM 1)
- un
- DET 205: Je voudrais encore aborder un dernier point :
- PRON 8: Le mandat de l’ un de les juges de le tribunal nommés conformément à le paragraphe 1 expire le 31 août 2007 .
- NUM 1: La période de prorogation proposée est de six ans et le montant de référence reste identique , à savoir d’ un million d’ euros par an .
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.464850).
The 1st highest number of forms (1) was observed with the lemma “-20”: -20.
The 2nd highest number of forms (1) was observed with the lemma “-20º”: 20º.
The 3rd highest number of forms (1) was observed with the lemma “-40”: -40.
NUM occurs with 1 features: NumType (424; 100% instances)
NUM occurs with 1 feature-value pairs: NumType=Card
NUM occurs with 1 feature combinations.
The most frequent feature combination is NumType=Card (424 tokens).
Examples: 1, 2, deux, 3, 6, 2000, 2002, 1999, 2001, 2005
Relations
NUM nodes are attached to their parents using 9 different relations: nummod (312; 74% instances), flat (44; 10% instances), nmod (24; 6% instances), obl (23; 5% instances), conj (14; 3% instances), appos (3; 1% instances), obj (2; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)
Parents of NUM nodes belong to 7 different parts of speech: NOUN (218; 51% instances), VERB (92; 22% instances), NUM (80; 19% instances), PROPN (14; 3% instances), SYM (10; 2% instances), ADJ (9; 2% instances), (1; 0% instances)
218 (51%) NUM nodes are leaves.
74 (17%) NUM nodes have one child.
69 (16%) NUM nodes have two children.
63 (15%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
Children of NUM nodes are attached using 11 different relations: punct (215; 47% instances), flat (69; 15% instances), case (48; 11% instances), det (35; 8% instances), nummod (33; 7% instances), nmod (30; 7% instances), conj (14; 3% instances), cc (8; 2% instances), advmod (2; 0% instances), cop (1; 0% instances), nsubj (1; 0% instances)
Children of NUM nodes belong to 11 different parts of speech: PUNCT (215; 47% instances), NUM (80; 18% instances), ADP (48; 11% instances), NOUN (42; 9% instances), DET (35; 8% instances), PROPN (22; 5% instances), CCONJ (8; 2% instances), ADV (2; 0% instances), PRON (2; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)