home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Rhapsodie: POS Tags: X

There are 112 X lemmas (3%), 112 X types (2%) and 328 X tokens (1%). Out of 15 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: XXX, j~, l~, s~, c~, a~, f~, m~, p~, d~

The 10 most frequent X types: XXX, j~, l~, s~, c~, a~, f~, m~, p~, d~

The 10 most frequent ambiguous lemmas: A (X 3, PROPN 2)

The 10 most frequent ambiguous types: A (X 3, PROPN 2)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.361064).

The 1st highest number of forms (1) was observed with the lemma “A”: A.

The 2nd highest number of forms (1) was observed with the lemma “A~”: A~.

The 3rd highest number of forms (1) was observed with the lemma “Bel~”: Bel~.

X occurs with 2 features: ExtPos (155; 47% instances), Foreign (18; 5% instances)

X occurs with 12 feature-value pairs: ExtPos=ADJ, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=DET, ExtPos=NOUN, ExtPos=NUM, ExtPos=PRON, ExtPos=PROPN, ExtPos=SCONJ, ExtPos=VERB, Foreign=Yes

X occurs with 13 feature combinations. The most frequent feature combination is _ (155 tokens). Examples: XXX, c~, s~, l~, a~, m~, d~, com, de~, f~

Relations

X nodes are attached to their parents using 22 different relations: reparandum (152; 46% instances), dep (69; 21% instances), root (45; 14% instances), obj (10; 3% instances), flat:foreign (8; 2% instances), obl:mod (7; 2% instances), discourse (5; 2% instances), nmod (5; 2% instances), nsubj (4; 1% instances), conj (3; 1% instances), flat (3; 1% instances), nmod:appos (3; 1% instances), obl:arg (3; 1% instances), ccomp (2; 1% instances), xcomp (2; 1% instances), advcl:cleft (1; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), dep:comp (1; 0% instances), dislocated:subj (1; 0% instances), flat:name (1; 0% instances), parataxis:parenth (1; 0% instances)

Parents of X nodes belong to 15 different parts of speech: VERB (87; 27% instances), NOUN (63; 19% instances), (45; 14% instances), PRON (36; 11% instances), X (20; 6% instances), ADV (17; 5% instances), PROPN (13; 4% instances), ADJ (12; 4% instances), DET (12; 4% instances), INTJ (11; 3% instances), ADP (4; 1% instances), AUX (3; 1% instances), CCONJ (3; 1% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)

83 (25%) X nodes are leaves.

113 (34%) X nodes have one child.

76 (23%) X nodes have two children.

56 (17%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 23 different relations: punct (223; 48% instances), det (41; 9% instances), nsubj (39; 8% instances), discourse (36; 8% instances), case (30; 6% instances), reparandum (22; 5% instances), cc (14; 3% instances), advmod (12; 3% instances), cop (8; 2% instances), flat:foreign (8; 2% instances), dep (7; 2% instances), obj (5; 1% instances), amod (3; 1% instances), obl:mod (3; 1% instances), advcl (2; 0% instances), aux:tense (2; 0% instances), dep:comp (2; 0% instances), dislocated:subj (2; 0% instances), nmod (2; 0% instances), iobj (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances), parataxis:parenth (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (223; 48% instances), PRON (48; 10% instances), DET (43; 9% instances), ADP (32; 7% instances), ADV (27; 6% instances), INTJ (26; 6% instances), X (20; 4% instances), CCONJ (14; 3% instances), AUX (12; 3% instances), NOUN (6; 1% instances), ADJ (5; 1% instances), VERB (5; 1% instances), PROPN (3; 1% instances), NUM (1; 0% instances)