Treebank Statistics: UD_French-ParisStories: POS Tags: X
There are 20 X
lemmas (1%), 20 X
types (1%) and 65 X
tokens (0%).
Out of 15 observed tags, the rank of X
is: 11 in number of lemmas, 12 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: XXX, s~, d~, m~, pl~, _, b~, comple~, c~, c~…
The 10 most frequent X
types: XXX, s~, d~, m~, pl~, b~, comple~, c~, c~…, dans
The 10 most frequent ambiguous lemmas: s~ (X 6, VERB 3), d~ (X 3, ADP 2, NOUN 1, VERB 1), _ (NOUN 15, VERB 2, ADV 1, PUNCT 1, X 1), dans (ADP 202, X 1)
The 10 most frequent ambiguous types: s~ (X 6, VERB 3), d~ (X 3, ADP 2, NOUN 1, VERB 1), dans (ADP 202, X 1)
- s~
- d~
- dans
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.332572).
The 1st highest number of forms (1) was observed with the lemma “XXX”: XXX.
The 2nd highest number of forms (1) was observed with the lemma “_”: port.
The 3rd highest number of forms (1) was observed with the lemma “b~”: b~.
X
occurs with 1 features: ExtPos (15; 23% instances)
X
occurs with 2 feature-value pairs: ExtPos=PROPN
, ExtPos=VERB
X
occurs with 3 feature combinations.
The most frequent feature combination is _
(50 tokens).
Examples: XXX, s~, d~, m~, pl~, b~, comple~, c~, c~…, dans
Relations
X
nodes are attached to their parents using 13 different relations: reparandum (13; 20% instances), dep (10; 15% instances), discourse (8; 12% instances), obj (7; 11% instances), dislocated (5; 8% instances), nmod (4; 6% instances), obl:arg (4; 6% instances), root (4; 6% instances), conj (3; 5% instances), vocative (3; 5% instances), obl:mod (2; 3% instances), ccomp (1; 2% instances), xcomp (1; 2% instances)
Parents of X
nodes belong to 10 different parts of speech: VERB (36; 55% instances), NOUN (9; 14% instances), ADV (4; 6% instances), PRON (4; 6% instances), (4; 6% instances), ADP (2; 3% instances), AUX (2; 3% instances), X (2; 3% instances), ADJ (1; 2% instances), DET (1; 2% instances)
24 (37%) X
nodes are leaves.
25 (38%) X
nodes have one child.
6 (9%) X
nodes have two children.
10 (15%) X
nodes have three or more children.
The highest child degree of a X
node is 8.
Children of X
nodes are attached using 16 different relations: punct (29; 37% instances), case (9; 12% instances), det (6; 8% instances), discourse (6; 8% instances), conj (5; 6% instances), reparandum (5; 6% instances), cc (4; 5% instances), nsubj (3; 4% instances), cop (2; 3% instances), mark (2; 3% instances), nmod (2; 3% instances), acl:relcl (1; 1% instances), aux:tense (1; 1% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), nummod (1; 1% instances)
Children of X
nodes belong to 13 different parts of speech: PUNCT (29; 37% instances), ADP (8; 10% instances), DET (6; 8% instances), INTJ (6; 8% instances), PRON (5; 6% instances), CCONJ (4; 5% instances), NOUN (4; 5% instances), VERB (4; 5% instances), ADV (3; 4% instances), AUX (3; 4% instances), SCONJ (3; 4% instances), X (2; 3% instances), NUM (1; 1% instances)