This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home fr/pos issue tracker

X: other

Definition

The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.

Note: Some acronyms and foreign words which could be assigned a real part-of-speech category have not yet been fixed and are still marked as X.

Examples


Treebank Statistics (UD_French)

There are 534 X lemmas (1%), 534 X types (1%) and 698 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: etc., a, k, B, s, ‘s, GMT, of, D, e

The 10 most frequent X types: etc., a, k, B, s, ‘s, GMT, of, D, e

The 10 most frequent ambiguous lemmas: a (DET 14, ADP 5, X 4, NOUN 1, PROPN 1), k (NOUN 2, X 1), B (X 8, PROPN 6), s (NOUN 6, X 6), ’s (PART 29, X 5), of (ADP 73, X 5, PROPN 5), D (X 4, PROPN 4), e (CONJ 3, NOUN 1, X 1), AC (PROPN 7, X 3), ARNm (X 3, NOUN 1)

The 10 most frequent ambiguous types: a (AUX 1834, VERB 372, ADP 22, X 4, DET 4, PROPN 1, NOUN 1), k (NOUN 2, X 1), B (X 8, PROPN 6), s (NOUN 6, X 6), ’s (PART 30, X 5, VERB 3, AUX 1), of (ADP 72, PROPN 5, X 5), D (X 4, PROPN 4), e (CONJ 3, X 1, NOUN 1), AC (PROPN 7, X 3), ARNm (X 3, NOUN 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.307036).

The 1st highest number of forms (1) was observed with the lemma “’06”: ‘06.

The 2nd highest number of forms (1) was observed with the lemma “’07”: ‘07.

The 3rd highest number of forms (1) was observed with the lemma “’s”: ’s.

X does not occur with any features.

Relations

X nodes are attached to their parents using 20 different relations: fr-dep/appos (219; 31% instances), fr-dep/compound (134; 19% instances), fr-dep/conj (121; 17% instances), fr-dep/nmod (85; 12% instances), fr-dep/name (53; 8% instances), fr-dep/nsubj (22; 3% instances), fr-dep/dobj (17; 2% instances), fr-dep/dep (7; 1% instances), fr-dep/root (7; 1% instances), fr-dep/advmod (5; 1% instances), fr-dep/xcomp (5; 1% instances), fr-dep/case (4; 1% instances), fr-dep/foreign (4; 1% instances), fr-dep/mwe (3; 0% instances), fr-dep/nsubjpass (3; 0% instances), fr-dep/amod (2; 0% instances), fr-dep/det (2; 0% instances), fr-dep/goeswith (2; 0% instances), fr-dep/nummod (2; 0% instances), fr-dep/parataxis (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: NOUN (283; 41% instances), PROPN (156; 22% instances), X (140; 20% instances), VERB (77; 11% instances), NUM (23; 3% instances), ADJ (8; 1% instances), ROOT (7; 1% instances), PRON (3; 0% instances), SYM (1; 0% instances)

393 (56%) X nodes are leaves.

70 (10%) X nodes have one child.

127 (18%) X nodes have two children.

108 (15%) X nodes have three or more children.

The highest child degree of a X node is 20.

Children of X nodes are attached using 22 different relations: fr-dep/punct (297; 37% instances), fr-dep/case (81; 10% instances), fr-dep/conj (81; 10% instances), fr-dep/compound (75; 9% instances), fr-dep/det (71; 9% instances), fr-dep/appos (50; 6% instances), fr-dep/nummod (39; 5% instances), fr-dep/cc (32; 4% instances), fr-dep/nmod (31; 4% instances), fr-dep/acl:relcl (10; 1% instances), fr-dep/amod (7; 1% instances), fr-dep/advmod (6; 1% instances), fr-dep/acl (5; 1% instances), fr-dep/name (5; 1% instances), fr-dep/foreign (4; 0% instances), fr-dep/cop (3; 0% instances), fr-dep/nsubj (3; 0% instances), fr-dep/mwe (2; 0% instances), fr-dep/nmod:poss (2; 0% instances), fr-dep/advcl (1; 0% instances), fr-dep/dep (1; 0% instances), fr-dep/dobj (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (291; 36% instances), X (140; 17% instances), ADP (81; 10% instances), DET (73; 9% instances), NOUN (66; 8% instances), NUM (46; 6% instances), CONJ (30; 4% instances), PROPN (25; 3% instances), VERB (23; 3% instances), SYM (12; 1% instances), ADJ (8; 1% instances), ADV (8; 1% instances), PRON (4; 0% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]