home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-CSUI: POS Tags: X

There are 216 X lemmas (5%), 216 X types (5%) and 378 X tokens (1%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: rate, year, rating, mortgage, subprime, on, listed, net, netto, outlook

The 10 most frequent X types: rate, year, rating, mortgage, subprime, on, listed, net, netto, outlook

The 10 most frequent ambiguous lemmas: and (PROPN 5, X 3), travel (X 3, PROPN 1), of (PROPN 7, X 2), tender (NOUN 2, X 2), Economic (PROPN 2, X 1), International (PROPN 2, X 1), National (PROPN 2, X 1), Partnership (PROPN 1, X 1), Power (PROPN 1, X 1), Ratings (PROPN 6, X 1)

The 10 most frequent ambiguous types: and (PROPN 5, X 3), travel (X 3, PROPN 1), of (PROPN 7, X 2), tender (NOUN 2, X 2), Economic (PROPN 2, X 1), International (PROPN 2, X 1), NPL (PROPN 12, X 1), National (PROPN 2, X 1), Partnership (PROPN 1, X 1), Power (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.085880).

The 1st highest number of forms (1) was observed with the lemma “Ad”: Ad.

The 2nd highest number of forms (1) was observed with the lemma “Agreement”: Agreement.

The 3rd highest number of forms (1) was observed with the lemma “Alternate”: Alternate.

X occurs with 1 features: Foreign (378; 100% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 1 feature combinations. The most frequent feature combination is Foreign=Yes (378 tokens). Examples: rate, year, rating, mortgage, subprime, on, listed, net, netto, outlook

Relations

X nodes are attached to their parents using 15 different relations: flat:foreign (134; 35% instances), nmod (105; 28% instances), appos (27; 7% instances), obj (24; 6% instances), obl (24; 6% instances), conj (19; 5% instances), amod (14; 4% instances), nsubj (12; 3% instances), advcl (5; 1% instances), acl:relcl (4; 1% instances), nsubj:pass (3; 1% instances), dep (2; 1% instances), root (2; 1% instances), xcomp (2; 1% instances), discourse (1; 0% instances)

Parents of X nodes belong to 6 different parts of speech: X (146; 39% instances), NOUN (124; 33% instances), VERB (70; 19% instances), PROPN (31; 8% instances), ADJ (5; 1% instances), (2; 1% instances)

191 (51%) X nodes are leaves.

63 (17%) X nodes have one child.

40 (11%) X nodes have two children.

84 (22%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 25 different relations: flat:foreign (134; 28% instances), punct (116; 25% instances), nmod (83; 18% instances), case (26; 6% instances), appos (16; 3% instances), cc (16; 3% instances), amod (14; 3% instances), acl:relcl (13; 3% instances), conj (11; 2% instances), det (10; 2% instances), case:adv (5; 1% instances), nsubj (5; 1% instances), nmod:poss (4; 1% instances), nmod:tmod (3; 1% instances), nummod (3; 1% instances), advmod (2; 0% instances), mark (2; 0% instances), nmod:lmod (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), cop (1; 0% instances), flat:name (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), parataxis (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: X (146; 31% instances), PUNCT (116; 25% instances), NOUN (65; 14% instances), PROPN (40; 8% instances), ADP (31; 7% instances), ADJ (17; 4% instances), CCONJ (16; 3% instances), VERB (14; 3% instances), DET (10; 2% instances), PRON (8; 2% instances), NUM (4; 1% instances), ADV (2; 0% instances), SCONJ (2; 0% instances), AUX (1; 0% instances)