home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sindhi-Isra: POS Tags: ADP

There are 90 ADP lemmas (2%), 117 ADP types (1%) and 14305 ADP tokens (15%). Out of 15 observed tags, the rank of ADP is: 7 in number of lemmas, 8 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: جي, ۾, کي, جو, تي, سان, کان, لاءِ, وارو, مان

The 10 most frequent ADP types: جي, ۾, کي, جو, تي, سان, کان, لاءِ, جا, مان

The 10 most frequent ambiguous lemmas: جي (ADP 3320, PRON 73, NOUN 12, SCONJ 7, CCONJ 5, DET 5), کي (ADP 1966, PRON 12, NOUN 3), جو (ADP 1858, SCONJ 47, DET 12, PRON 1), تي (ADP 1005, ADV 6, PRON 2), وارو (ADP 336, NOUN 6), مان (ADP 296, PRON 38, DET 3), تائين (ADP 139, ADV 4), پوءِ (ADP 135, ADV 105), وٽ (ADP 131, VERB 1), خلاف (ADP 93, NOUN 1)

The 10 most frequent ambiguous types: جي (ADP 3320, NOUN 12, SCONJ 7, CCONJ 5, PRON 2, PROPN 1), کي (ADP 1903, NOUN 3), جو (ADP 1354, SCONJ 47, DET 12), جا (ADP 380, PRON 1), مان (ADP 296, PRON 38, DET 3), واري (ADP 190, NOUN 4, VERB 1), تائين (ADP 139, ADV 4), پوءِ (ADP 135, ADV 105, SCONJ 3), جون (ADP 124, PROPN 1), خلاف (ADP 93, NOUN 1)

Morphology

The form / lemma ratio of ADP is 1.300000 (the average of all parts of speech is 1.872520).

The 1st highest number of forms (12) was observed with the lemma “_”: آهر, بس, جهڙن, جهڙيون, جوڳي, لڳ, منجهس, وانگيان, وٺان, وچان, ڀي, کائونئس.

The 2nd highest number of forms (6) was observed with the lemma “وارو”: وارا, وارن, وارو, واري, وارين, واريون.

The 3rd highest number of forms (6) was observed with the lemma “کان”: کان, کانئس, کانئن, کانسواءِ, کانپو, کانپوءِ.

ADP occurs with 5 features: Case (5571; 39% instances), Number (5506; 38% instances), Gender (5360; 37% instances), ExtPos (85; 1% instances), Person (72; 1% instances)

ADP occurs with 8 feature-value pairs: Case=Acc, Case=Nom, ExtPos=ADP, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3

ADP occurs with 27 feature combinations. The most frequent feature combination is _ (8638 tokens). Examples: ۾, کي, تي, سان, لاءِ, کان, مان, تائين, پوءِ, وٽ

Relations

ADP nodes are attached to their parents using 12 different relations: case (12795; 89% instances), mark (1247; 9% instances), fixed (94; 1% instances), obl (40; 0% instances), dep (38; 0% instances), nsubj (36; 0% instances), xcomp (17; 0% instances), iobj (15; 0% instances), compound (10; 0% instances), obj (10; 0% instances), root (2; 0% instances), flat (1; 0% instances)

Parents of ADP nodes belong to 13 different parts of speech: NOUN (9336; 65% instances), VERB (1375; 10% instances), DET (1347; 9% instances), PROPN (1312; 9% instances), PRON (465; 3% instances), ADJ (175; 1% instances), NUM (128; 1% instances), ADP (100; 1% instances), ADV (62; 0% instances), (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)

14134 (99%) ADP nodes are leaves.

164 (1%) ADP nodes have one child.

4 (0%) ADP nodes have two children.

3 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 6.

Children of ADP nodes are attached using 13 different relations: fixed (94; 51% instances), advmod:emph (55; 30% instances), obl (10; 5% instances), nmod (7; 4% instances), case (6; 3% instances), punct (4; 2% instances), amod (2; 1% instances), nsubj (2; 1% instances), aux (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), mark (1; 1% instances), obj (1; 1% instances)

Children of ADP nodes belong to 11 different parts of speech: ADP (100; 54% instances), PART (55; 30% instances), NOUN (18; 10% instances), PUNCT (4; 2% instances), DET (2; 1% instances), ADJ (1; 1% instances), ADV (1; 1% instances), AUX (1; 1% instances), PROPN (1; 1% instances), SCONJ (1; 1% instances), VERB (1; 1% instances)