home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FQB: POS Tags: X

There are 174 X lemmas (5%), 174 X types (4%) and 244 X tokens (1%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: the, of, ‘s, and, caliente, is, west, hazmat, in, to

The 10 most frequent X types: the, of, ‘s, and, caliente, is, west, hazmat, in, to

The 10 most frequent ambiguous lemmas: the (X 13, PROPN 1), of (X 10, PROPN 1), Lion (X 2, PROPN 1), American (PROPN 1, X 1), Computer (PROPN 1, X 1), Gateway (PROPN 2, X 1), King (PROPN 4, X 1), New (PROPN 16, X 1), Scarlett (PROPN 1, X 1), Star (PROPN 2, X 1)

The 10 most frequent ambiguous types: the (X 13, PROPN 1), of (X 10, PROPN 1), Lion (X 2, PROPN 1), a (AUX 326, VERB 52, X 1), American (PROPN 1, X 1), Computer (PROPN 1, X 1), Gateway (PROPN 2, X 1), King (PROPN 4, X 1), New (PROPN 16, X 1), Scarlett (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.164044).

The 1st highest number of forms (1) was observed with the lemma “’”: .

The 2nd highest number of forms (1) was observed with the lemma “’n”: ‘n.

The 3rd highest number of forms (1) was observed with the lemma “’s”: ’s.

X occurs with 1 features: Foreign (243; 100% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (243 tokens). Examples: the, of, ‘s, and, caliente, is, west, hazmat, in, to

Relations

X nodes are attached to their parents using 11 different relations: flat:foreign (131; 54% instances), dep (73; 30% instances), obj (16; 7% instances), obl:mod (7; 3% instances), nsubj (6; 2% instances), xcomp (5; 2% instances), nmod (2; 1% instances), appos (1; 0% instances), dislocated (1; 0% instances), goeswith (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 7 different parts of speech: X (131; 54% instances), NOUN (42; 17% instances), PROPN (34; 14% instances), VERB (32; 13% instances), NUM (3; 1% instances), ADJ (1; 0% instances), PRON (1; 0% instances)

180 (74%) X nodes are leaves.

3 (1%) X nodes have one child.

10 (4%) X nodes have two children.

51 (21%) X nodes have three or more children.

The highest child degree of a X node is 15.

Children of X nodes are attached using 6 different relations: flat:foreign (131; 49% instances), punct (115; 43% instances), case (13; 5% instances), det (5; 2% instances), dep (1; 0% instances), mark (1; 0% instances)

Children of X nodes belong to 6 different parts of speech: X (131; 49% instances), PUNCT (115; 43% instances), ADP (13; 5% instances), DET (5; 2% instances), SCONJ (1; 0% instances), VERB (1; 0% instances)