Treebank Statistics: UD_French-Sequoia: POS Tags: NUM
There are 399 NUM lemmas (6%), 399 NUM types (4%) and 1790 NUM tokens (3%).
Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent NUM lemmas: deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
The 10 most frequent NUM types: deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
The 10 most frequent ambiguous lemmas: neuf (ADJ 2, NUM 2), II (ADJ 1, NUM 1)
The 10 most frequent ambiguous types: neuf (NUM 2, ADJ 1), II (ADJ 1, NUM 1)
- neuf
- II
- ADJ 1: ANNEXE II
- NUM 1: Plus gravement , l’ affaire de les fiches entamera profondément le moral et la cohésion de le corps militaire à une époque où , à l’ inverse de les français , le gouvernement allemand se persuade de plus en plus , comme l’ empereur Guillaume II dès son avènement , qu’ une guerre est à terme une nécessité inéluctable pour le développement et la prospérité politique et économique de son pays .
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.408433).
The 1st highest number of forms (1) was observed with the lemma “-6”: -6.
The 2nd highest number of forms (1) was observed with the lemma “0,0001”: 0,0001.
The 3rd highest number of forms (1) was observed with the lemma “0,001”: 0,001.
NUM occurs with 1 features: NumType (1739; 97% instances)
NUM occurs with 1 feature-value pairs: NumType=Card
NUM occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card (1739 tokens).
Examples: deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
Relations
NUM nodes are attached to their parents using 14 different relations: nummod (914; 51% instances), nmod (523; 29% instances), obl:mod (186; 10% instances), conj (50; 3% instances), obl:arg (38; 2% instances), parataxis:insert (25; 1% instances), parataxis (14; 1% instances), appos (13; 1% instances), obj (8; 0% instances), nsubj:pass (5; 0% instances), orphan (5; 0% instances), nsubj (4; 0% instances), root (3; 0% instances), dep (2; 0% instances)
Parents of NUM nodes belong to 12 different parts of speech: NOUN (1382; 77% instances), VERB (207; 12% instances), NUM (91; 5% instances), PROPN (58; 3% instances), ADJ (23; 1% instances), X (9; 1% instances), ADP (6; 0% instances), SYM (4; 0% instances), ADV (3; 0% instances), DET (3; 0% instances), (3; 0% instances), PRON (1; 0% instances)
1231 (69%) NUM nodes are leaves.
294 (16%) NUM nodes have one child.
154 (9%) NUM nodes have two children.
111 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
Children of NUM nodes are attached using 19 different relations: punct (277; 28% instances), case (222; 23% instances), nmod (216; 22% instances), det (97; 10% instances), conj (43; 4% instances), cc (35; 4% instances), obl:arg (23; 2% instances), advmod (15; 2% instances), obl:mod (14; 1% instances), amod (7; 1% instances), dep (7; 1% instances), nsubj (5; 1% instances), appos (4; 0% instances), parataxis (3; 0% instances), acl (2; 0% instances), orphan (2; 0% instances), acl:relcl (1; 0% instances), cop (1; 0% instances), flat:name (1; 0% instances)
Children of NUM nodes belong to 14 different parts of speech: PUNCT (277; 28% instances), ADP (216; 22% instances), NOUN (207; 21% instances), DET (97; 10% instances), NUM (91; 9% instances), CCONJ (40; 4% instances), SYM (12; 1% instances), ADV (11; 1% instances), ADJ (6; 1% instances), PROPN (6; 1% instances), PRON (4; 0% instances), VERB (4; 0% instances), X (3; 0% instances), AUX (1; 0% instances)