home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-Rhapsodie: POS Tags: X

There are 107 X lemmas (3%), 107 X types (2%) and 321 X tokens (1%). Out of 15 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: XXX, j~, l~, s~, c~, a~, f~, m~, p~, d~

The 10 most frequent X types: XXX, j~, l~, s~, c~, a~, f~, m~, p~, d~

The 10 most frequent ambiguous lemmas: A (X 3, PROPN 2)

The 10 most frequent ambiguous types: A (X 3, PROPN 2)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.357880).

The 1st highest number of forms (1) was observed with the lemma “A”: A.

The 2nd highest number of forms (1) was observed with the lemma “A~”: A~.

The 3rd highest number of forms (1) was observed with the lemma “Bel~”: Bel~.

X occurs with 2 features: ExtPos (155; 48% instances), Foreign (14; 4% instances)

X occurs with 12 feature-value pairs: ExtPos=ADJ, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=DET, ExtPos=NOUN, ExtPos=NUM, ExtPos=PRON, ExtPos=PROPN, ExtPos=SCONJ, ExtPos=VERB, Foreign=Yes

X occurs with 13 feature combinations. The most frequent feature combination is _ (152 tokens). Examples: XXX, c~, s~, l~, a~, m~, d~, de~, f~, p~

Relations

X nodes are attached to their parents using 21 different relations: reparandum (152; 47% instances), dep (69; 21% instances), root (45; 14% instances), obj (10; 3% instances), obl:mod (7; 2% instances), flat:foreign (6; 2% instances), discourse (5; 2% instances), nmod (4; 1% instances), nsubj (4; 1% instances), conj (3; 1% instances), nmod:appos (3; 1% instances), obl:arg (3; 1% instances), ccomp (2; 1% instances), advcl:cleft (1; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), dep:comp (1; 0% instances), dislocated (1; 0% instances), flat:name (1; 0% instances), parataxis:parenth (1; 0% instances), xcomp (1; 0% instances)

Parents of X nodes belong to 15 different parts of speech: VERB (86; 27% instances), NOUN (63; 20% instances), (45; 14% instances), PRON (36; 11% instances), X (18; 6% instances), ADV (16; 5% instances), ADJ (12; 4% instances), DET (12; 4% instances), INTJ (11; 3% instances), PROPN (10; 3% instances), ADP (4; 1% instances), AUX (3; 1% instances), CCONJ (3; 1% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)

80 (25%) X nodes are leaves.

111 (35%) X nodes have one child.

75 (23%) X nodes have two children.

55 (17%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 22 different relations: punct (223; 49% instances), det (40; 9% instances), nsubj (39; 9% instances), discourse (36; 8% instances), case (29; 6% instances), reparandum (22; 5% instances), cc (14; 3% instances), advmod (11; 2% instances), cop (8; 2% instances), dep (7; 2% instances), flat:foreign (6; 1% instances), obj (5; 1% instances), obl:mod (4; 1% instances), advcl (2; 0% instances), amod (2; 0% instances), aux:tense (2; 0% instances), dep:comp (2; 0% instances), dislocated (2; 0% instances), iobj (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances), parataxis:parenth (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (223; 49% instances), PRON (49; 11% instances), DET (42; 9% instances), ADP (31; 7% instances), ADV (26; 6% instances), INTJ (26; 6% instances), X (18; 4% instances), CCONJ (14; 3% instances), AUX (12; 3% instances), NOUN (6; 1% instances), VERB (5; 1% instances), ADJ (4; 1% instances), NUM (1; 0% instances), PROPN (1; 0% instances)