home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X

There are 207 X lemmas (13%), 217 X types (8%) and 395 X tokens (2%). Out of 16 observed tags, the rank of X is: 3 in number of lemmas, 3 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: ʃèː, XX, x, nan, kura, shi, ba, a, tunda, ʧét

The 10 most frequent X types: ʃèː, XX, nan, shi, ba, kura, a, tunda, ʧét, kafin

The 10 most frequent ambiguous lemmas: ʃèː (X 24, PART 2), XX (X 14, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 6, SCONJ 1), bâː (PART 9, X 3), dâːmáː (X 3, ADV 1), hár (ADP 25, SCONJ 5, ADV 4, X 3), swǎːt (X 3, ADV 1), wéy (PART 29, X 3), yànzú (X 2, ADV 1), Lim (X 2, PROPN 1)

The 10 most frequent ambiguous types: ʃèː (AUX 42, X 23, PART 2), XX (X 15, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 6, SCONJ 1), ʧét (X 6, VERB 1), bâː (PART 9, X 3), dâːmáː (X 3, ADV 1), hár (ADP 24, SCONJ 5, ADV 4, X 3), wéy (PART 29, X 3), yànzú (X 3, ADV 1), Lim (X 2, PROPN 1)

Morphology

The form / lemma ratio of X is 1.048309 (the average of all parts of speech is 1.611418).

The 1st highest number of forms (8) was observed with the lemma “X”: X, ki, kira, wace, yírtə, ƙasa, ɣá~, ʧi.

The 2nd highest number of forms (2) was observed with the lemma “dàːmuwa”: dàːmuwa, dàːmuwá.

The 3rd highest number of forms (2) was observed with the lemma “kura”: kura, kurâs.

X occurs with 1 features: Foreign (263; 67% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (263 tokens). Examples: nan, shi, ba, kura, a, tunda, kafin, wannan, ɗaya, OK

Relations

X nodes are attached to their parents using 26 different relations: flat:foreign (89; 23% instances), obl (74; 19% instances), root (40; 10% instances), dep (26; 7% instances), discourse (24; 6% instances), obj (21; 5% instances), compound:redup (19; 5% instances), nmod (18; 5% instances), xcomp (12; 3% instances), fixed (10; 3% instances), parataxis (10; 3% instances), reparandum (9; 2% instances), dislocated (7; 2% instances), nsubj (7; 2% instances), advmod (4; 1% instances), conj (4; 1% instances), advcl (3; 1% instances), appos (3; 1% instances), obl:arg (3; 1% instances), vocative (3; 1% instances), cc (2; 1% instances), compound (2; 1% instances), flat (2; 1% instances), cc:preconj (1; 0% instances), flat:name (1; 0% instances), mark (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: VERB (152; 38% instances), X (147; 37% instances), (40; 10% instances), NOUN (17; 4% instances), PART (13; 3% instances), INTJ (7; 2% instances), AUX (5; 1% instances), PROPN (5; 1% instances), ADV (3; 1% instances), NUM (3; 1% instances), PRON (2; 1% instances), SCONJ (1; 0% instances)

214 (54%) X nodes are leaves.

83 (21%) X nodes have one child.

51 (13%) X nodes have two children.

47 (12%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 34 different relations: punct (95; 25% instances), flat:foreign (89; 23% instances), discourse (30; 8% instances), compound:redup (19; 5% instances), nmod (18; 5% instances), advmod (13; 3% instances), case (13; 3% instances), fixed (11; 3% instances), dep (10; 3% instances), ccomp (9; 2% instances), aux (8; 2% instances), acl (7; 2% instances), nmod:poss (6; 2% instances), obj (5; 1% instances), parataxis (5; 1% instances), reparandum (5; 1% instances), appos (4; 1% instances), conj (4; 1% instances), det (4; 1% instances), dislocated (3; 1% instances), obl:arg (3; 1% instances), acl:relcl (2; 1% instances), advcl (2; 1% instances), flat (2; 1% instances), mark (2; 1% instances), nsubj (2; 1% instances), obl (2; 1% instances), vocative (2; 1% instances), xcomp (2; 1% instances), amod (1; 0% instances), cc (1; 0% instances), compound (1; 0% instances), compound:prt (1; 0% instances), flat:name (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (147; 38% instances), PUNCT (95; 25% instances), PART (24; 6% instances), VERB (23; 6% instances), NOUN (22; 6% instances), INTJ (14; 4% instances), PRON (12; 3% instances), SCONJ (9; 2% instances), ADP (8; 2% instances), ADV (8; 2% instances), AUX (8; 2% instances), DET (5; 1% instances), PROPN (5; 1% instances), ADJ (1; 0% instances), NUM (1; 0% instances)