Treebank Statistics: UD_French-FTB: POS Tags: X
There are 23 X lemmas (1%), 23 X types (1%) and 2192 X tokens (0%).
Out of 16 observed tags, the rank of X is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.
The 10 most frequent X lemmas: _, NEW, New, British, Grand, In, A, Altus, BUENOS, Body
The 10 most frequent X types: _, NEW, New, British, Grand, In, A, Altus, BUENOS, Body
The 10 most frequent ambiguous lemmas: _ (NOUN 115984, ADP 89082, DET 79465, PUNCT 73863, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), Moody’ (PROPN 1, X 1)
The 10 most frequent ambiguous types: _ (NOUN 115984, ADP 89082, DET 79465, PUNCT 73863, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), Grand (ADJ 3, X 2), A (ADP 388, AUX 1, NOUN 1, X 1), Moody’ (PROPN 1, X 1)
- _
- NOUN 115984: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 89082: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 79465: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 73863: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 47092: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 36213: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 22183: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 21225: - _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 20877: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 17577: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 12831: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 11039: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 4969: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- X 2163: In _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 239: L’ _ _ _ _ _ _ _ _ _ _ _
- INTJ 33: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- Grand
- A
- Moody’
Morphology
The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.170225).
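As a rough illustration of the ratio above, the following sketch assumes it is the number of distinct word forms divided by the number of distinct lemmas for a tag, which is consistent with the 23 types and 23 lemmas reported for X. The function name `form_lemma_ratio` and the sample pairs are hypothetical and are not part of the treebank tooling.

```python
def form_lemma_ratio(pairs):
    """Return distinct forms / distinct lemmas for (form, lemma) pairs.

    Assumption: the ratio is defined over distinct, case-sensitive
    forms and lemmas of a single part-of-speech tag.
    """
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    if not lemmas:
        raise ValueError("no lemmas given")
    return len(forms) / len(lemmas)


# Hypothetical sample: every form is its own lemma, as with the X words listed above.
sample = [("NEW", "NEW"), ("New", "New"), ("British", "British"), ("NEW", "NEW")]
ratio = form_lemma_ratio(sample)
print(f"{ratio:.6f}")
```

A ratio of exactly 1 means that no lemma was seen with more than one form, which matches the single-form lemmas listed under Morphology.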
The 1st highest number of forms (1) was observed with the lemma “A”: A.
The 2nd highest number of forms (1) was observed with the lemma “Altus”: Altus.
The 3rd highest number of forms (1) was observed with the lemma “BUENOS”: BUENOS.
X occurs with 4 features: Mood (1; 0% instances), Number (1; 0% instances), Tense (1; 0% instances), VerbForm (1; 0% instances)
X occurs with 4 feature-value pairs: Mood=Ind, Number=Plur, Tense=Pres, VerbForm=Fin
X occurs with 2 feature combinations.
The most frequent feature combination is _ (2191 tokens).
Examples: _, NEW, New, British, Grand, In, A, Altus, BUENOS, Body
Relations
X nodes are attached to their parents using 19 different relations: fixed (1095; 50% instances), nmod (345; 16% instances), dep (199; 9% instances), obl (159; 7% instances), advmod (111; 5% instances), nsubj (96; 4% instances), conj (91; 4% instances), obj (37; 2% instances), root (19; 1% instances), amod (13; 1% instances), orphan (7; 0% instances), flat:name (5; 0% instances), cc (4; 0% instances), mark (4; 0% instances), flat (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), aux (1; 0% instances), punct (1; 0% instances)
Parents of X nodes belong to 14 different parts of speech: X (992; 45% instances), NOUN (539; 25% instances), VERB (322; 15% instances), PROPN (239; 11% instances), ADJ (41; 2% instances), no parent, i.e. root nodes (19; 1% instances), PRON (10; 0% instances), NUM (9; 0% instances), ADP (6; 0% instances), AUX (4; 0% instances), CCONJ (4; 0% instances), DET (4; 0% instances), ADV (2; 0% instances), PUNCT (1; 0% instances)
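Counts like the relation and parent-tag distributions above can be derived from a CoNLL-U file, whose ten tab-separated columns are ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS and MISC. The sketch below is only an illustration under that assumption: the function name `x_attachment_stats`, the `(root)` label and the sample sentence are hypothetical, and this is not the script that generated this page.

```python
from collections import Counter


def x_attachment_stats(conllu_text, upos="X"):
    """Count incoming relations and parent tags of nodes with the given UPOS."""
    deprels, parent_tags = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = {}
        for line in block.splitlines():
            if not line or line.startswith("#"):  # skip comment lines
                continue
            cols = line.split("\t")
            if "-" in cols[0] or "." in cols[0]:  # skip multiword ranges and empty nodes
                continue
            rows[cols[0]] = cols
        for cols in rows.values():
            if cols[3] != upos:
                continue
            deprels[cols[7]] += 1
            parent = rows.get(cols[6])  # HEAD 0 has no row, so the node is a root
            parent_tags[parent[3] if parent else "(root)"] += 1
    return deprels, parent_tags


# Hypothetical sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# hypothetical example",
    "1\tIn\tIn\tX\t_\t_\t0\troot\t_\t_",
    "2\tNew\tNew\tX\t_\t_\t1\tfixed\t_\t_",
    "3\tBody\tBody\tX\t_\t_\t1\tfixed\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

deprels, parent_tags = x_attachment_stats(SAMPLE)
print(deprels.most_common())
print(parent_tags.most_common())
```

Tokens whose HEAD is 0 have no parent row, which is why the sketch reports them under a separate label instead of a part-of-speech tag.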
1308 (60%) X nodes are leaves.
212 (10%) X nodes have one child.
216 (10%) X nodes have two children.
456 (21%) X nodes have three or more children.
The highest child degree of an X node is 14.
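The child-degree breakdown above can be reproduced by counting how many tokens name each X node as their head and bucketing the result. The sketch below is a minimal illustration: the function name `degree_buckets`, the `(id, head, upos)` tuple layout and the toy sentence are assumptions made for the example. The final lines recompute the percentages from the counts reported on this page.

```python
from collections import Counter


def degree_buckets(tokens, upos="X"):
    """Bucket nodes of the given tag by number of children: 0, 1, 2 or 3+.

    tokens: list of (id, head, upos) tuples for one sentence; head 0 marks the root.
    """
    child_count = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, tag in tokens:
        if tag == upos:
            n = child_count[tok_id]  # a Counter returns 0 for nodes without children
            buckets["3+" if n >= 3 else str(n)] += 1
    return buckets


# Hypothetical toy sentence: node 1 has three children, node 2 has one.
toy = [(1, 0, "X"), (2, 1, "X"), (3, 1, "X"), (4, 1, "PUNCT"), (5, 2, "X")]
toy_buckets = degree_buckets(toy)
print(dict(toy_buckets))

# Counts reported above for X nodes; the buckets sum to the 2192 X tokens.
reported = {"0": 1308, "1": 212, "2": 216, "3+": 456}
total = sum(reported.values())
percentages = {k: round(100 * v / total) for k, v in reported.items()}
print(total, percentages)
```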
Children of X nodes are attached using 21 different relations: fixed (1232; 47% instances), case (358; 14% instances), det (303; 12% instances), punct (300; 11% instances), nmod (127; 5% instances), conj (76; 3% instances), dep (74; 3% instances), cc (59; 2% instances), amod (29; 1% instances), acl:relcl (27; 1% instances), acl (14; 1% instances), advmod (8; 0% instances), orphan (6; 0% instances), nsubj (4; 0% instances), nummod (4; 0% instances), cop (3; 0% instances), flat (2; 0% instances), obl (2; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), flat:name (1; 0% instances)
Children of X nodes belong to 13 different parts of speech: X (992; 38% instances), PUNCT (539; 20% instances), ADP (357; 14% instances), DET (305; 12% instances), PROPN (130; 5% instances), NOUN (119; 5% instances), ADJ (65; 2% instances), CCONJ (54; 2% instances), VERB (47; 2% instances), NUM (11; 0% instances), ADV (7; 0% instances), PRON (3; 0% instances), AUX (2; 0% instances)