
This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: POS Tags: X

There are 529 X lemmas (1%), 529 X types (1%) and 672 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 14 in number of tokens.
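As a rough illustration of how counts like these could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens per UPOS tag from CoNLL-U text. It is not the script that generated this page. The helper names (`conllu_line`, `iter_words`, `tag_counts`) and the toy sentence are hypothetical, and it assumes that types are compared case-sensitively.

```python
from collections import defaultdict


def conllu_line(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U word line (unused columns set to '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


def iter_words(text):
    """Yield (form, lemma, upos) for every syntactic word in CoNLL-U text."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(text):
    """Return {upos: (distinct lemmas, distinct forms, tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in iter_words(text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: case-sensitive comparison
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}


# Toy sentence invented for this example; it is not taken from UD_French-GSD.
SAMPLE = "\n".join(
    [
        "# text = vitamine B , B etc.",
        conllu_line(1, "vitamine", "vitamine", "NOUN", 0, "root"),
        conllu_line(2, "B", "B", "X", 1, "appos"),
        conllu_line(3, ",", ",", "PUNCT", 4, "punct"),
        conllu_line(4, "B", "B", "X", 2, "conj"),
        conllu_line(5, "etc.", "etc.", "X", 2, "conj"),
    ]
)

print(tag_counts(SAMPLE)["X"])  # (2, 2, 3)
```

Ranking the tags by each of the three numbers would then give ranks comparable to those reported above.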

The 10 most frequent X lemmas: etc., k, a, B, s, ’s, GMT, of, e, v.

The 10 most frequent X types: etc., k, a, B, s, ’s, GMT, of, e, v.

The 10 most frequent ambiguous lemmas: k (NOUN 2, X 2), a (DET 14, ADP 5, X 4, NOUN 1, PROPN 1), B (X 8, PROPN 6), s (X 6, NOUN 5), ’s (PART 29, X 5), of (ADP 73, PROPN 5, X 5), e (CCONJ 3, NOUN 1, X 1), AC (PROPN 7, X 3), ARNm (X 3, NOUN 1), D (PROPN 4, X 3)

The 10 most frequent ambiguous types: k (NOUN 2, X 2), a (AUX 1821, VERB 370, ADP 22, DET 4, X 4, NOUN 1, PROPN 1), B (X 8, PROPN 6), s (X 6, NOUN 5), ’s (PART 29, X 5, AUX 3, VERB 1), of (ADP 72, PROPN 5, X 5), e (CCONJ 3, NOUN 1, X 1), AC (PROPN 7, X 3), ARNm (X 3, NOUN 1), D (PROPN 4, X 3)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.305352).
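One plausible reading of this ratio is the average number of distinct word forms per lemma within a tag. The sketch below computes it under that assumption; the exact definition used to generate this page is not stated here. The function name `form_lemma_ratio` and the toy word list are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(words, upos):
    """Average number of distinct forms per lemma for one UPOS tag.

    `words` is an iterable of (form, lemma, upos) tuples.
    Assumption: the ratio is distinct (lemma, form) pairs / distinct lemmas.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in words:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0  # tag not observed
    n_pairs = sum(len(forms) for forms in forms_by_lemma.values())
    return n_pairs / len(forms_by_lemma)


# Toy data invented for this example; it is not taken from UD_French-GSD.
WORDS = [
    ("etc.", "etc.", "X"),
    ("B", "B", "X"),
    ("chats", "chat", "NOUN"),
    ("chat", "chat", "NOUN"),
    ("chien", "chien", "NOUN"),
]

print(form_lemma_ratio(WORDS, "X"))     # 1.0
print(form_lemma_ratio(WORDS, "NOUN"))  # 1.5
```

A ratio of exactly 1 for X, as reported above, means that every X lemma was observed with a single form.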

The 1st highest number of forms (1) was observed with the lemma “’06”: ’06.

The 2nd highest number of forms (1) was observed with the lemma “’07”: ’07.

The 3rd highest number of forms (1) was observed with the lemma “’s”: ’s.

X occurs with 3 features: Gender (2; 0% instances), Number (2; 0% instances), Case (1; 0% instances)

X occurs with 3 feature-value pairs: Case=Voc, Gender=Masc, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is _ (670 tokens). Examples: etc., k, a, B, s, ’s, GMT, of, e, v.

Relations

X nodes are attached to their parents using 25 different relations: appos (205; 31% instances), conj (120; 18% instances), compound (101; 15% instances), flat:name (55; 8% instances), nmod (53; 8% instances), nsubj (24; 4% instances), obj (17; 3% instances), flat:foreign (15; 2% instances), obl (12; 2% instances), obl:arg (9; 1% instances), root (7; 1% instances), xcomp (7; 1% instances), nsubj:pass (6; 1% instances), advmod (5; 1% instances), case (5; 1% instances), cc (4; 1% instances), dep (4; 1% instances), fixed (4; 1% instances), obl:agent (4; 1% instances), amod (3; 0% instances), flat (3; 0% instances), obl:mod (3; 0% instances), goeswith (2; 0% instances), nummod (2; 0% instances), parataxis (2; 0% instances)

Parents of X nodes belong to 11 different parts of speech: NOUN (274; 41% instances), X (139; 21% instances), PROPN (136; 20% instances), VERB (80; 12% instances), NUM (22; 3% instances), ADJ (8; 1% instances), no tag, i.e. the artificial root of the 7 X nodes attached as root (7; 1% instances), PRON (3; 0% instances), ADP (1; 0% instances), ADV (1; 0% instances), SYM (1; 0% instances)

297 (44%) X nodes are leaves.

138 (21%) X nodes have one child.

113 (17%) X nodes have two children.

124 (18%) X nodes have three or more children.

The highest child degree of an X node is 11.
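The leaf and child-degree figures above can be reproduced in spirit by counting, for each X node, how many other nodes name it as their head. The sketch below is a minimal version under assumed inputs: each sentence is a list of (id, upos, head) tuples, and both the function name `child_degree_stats` and the toy tree are hypothetical.

```python
from collections import Counter


def child_degree_stats(sentences, upos):
    """Bucket nodes of one UPOS tag by number of children (0, 1, 2, 3+).

    `sentences` is a list of sentences, each a list of (id, upos, head)
    tuples with integer ids and heads (head 0 is the artificial root).
    Returns (buckets, max_degree); bucket key 3 means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for idx, tag, _ in sent:
            if tag != upos:
                continue
            degree = n_children[idx]  # 0 for leaves
            buckets[min(degree, 3)] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree


# Toy tree invented for this example; it is not taken from UD_French-GSD.
SENTENCES = [
    [
        (1, "NOUN", 0),
        (2, "X", 1),      # two children: nodes 4 and 5
        (3, "PUNCT", 4),
        (4, "X", 2),      # one child: node 3
        (5, "X", 2),      # leaf
    ]
]

buckets, max_degree = child_degree_stats(SENTENCES, "X")
print(dict(buckets), max_degree)  # {2: 1, 1: 1, 0: 1} 2
```

As a consistency check on the reported numbers, the four buckets sum to the token count: 297 + 138 + 113 + 124 = 672.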

Children of X nodes are attached using 21 different relations: punct (318; 37% instances), det (95; 11% instances), conj (81; 9% instances), case (77; 9% instances), compound (62; 7% instances), appos (51; 6% instances), cc (43; 5% instances), nmod (35; 4% instances), nummod (33; 4% instances), flat:foreign (15; 2% instances), acl:relcl (9; 1% instances), amod (7; 1% instances), advmod (6; 1% instances), acl (5; 1% instances), flat:name (5; 1% instances), cop (3; 0% instances), flat (3; 0% instances), nsubj (3; 0% instances), advcl (2; 0% instances), fixed (2; 0% instances), parataxis (2; 0% instances)

Children of X nodes belong to 15 different parts of speech: PUNCT (314; 37% instances), X (139; 16% instances), DET (95; 11% instances), ADP (77; 9% instances), NOUN (62; 7% instances), NUM (43; 5% instances), CCONJ (41; 5% instances), PROPN (29; 3% instances), VERB (20; 2% instances), ADJ (12; 1% instances), SYM (10; 1% instances), ADV (6; 1% instances), PRON (5; 1% instances), AUX (3; 0% instances), SCONJ (1; 0% instances)