home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hindi-HDTB: POS Tags: ADP

There are 151 ADP lemmas (1%), 155 ADP types (1%) and 74145 ADP tokens (21%). Out of 16 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: का, में, के, को, ने, से, पर, लिए, वाला, तक

The 10 most frequent ADP types: के, में, की, को, ने, से, का, पर, लिए, तक

The 10 most frequent ambiguous lemmas: का (ADP 20916, PROPN 32, NOUN 4, PART 1), में (ADP 10545, PART 4, PROPN 2), के (ADP 7578, PROPN 35, NOUN 2, PART 1), को (ADP 7461, ADV 1), से (ADP 5807, PART 119, DET 4), पर (ADP 4036, CCONJ 108, NOUN 3), लिए (ADP 2220, VERB 1), वाला (ADP 930, AUX 68, PROPN 1), तक (ADP 830, PART 3, PUNCT 1), साथ (ADP 760, ADV 86, NOUN 31, X 1)

The 10 most frequent ambiguous types: के (ADP 15933, PROPN 45, NOUN 4, PART 2, AUX 1), में (ADP 10544, PART 4, PROPN 2), की (ADP 8161, VERB 1264, PROPN 12, AUX 1), से (ADP 5808, PART 122, DET 2), का (ADP 4416, PROPN 4, NOUN 1), पर (ADP 4036, CCONJ 108, NOUN 3), लिए (ADP 2222, VERB 45, AUX 27), तक (ADP 830, PART 3), साथ (ADP 760, ADV 86, NOUN 31, X 1), बाद (ADP 675, ADV 448)

Morphology

The form / lemma ratio of ADP is 1.026490 (the average of all parts of speech is 1.207153).

The 1st highest number of forms (4) was observed with the lemma “का”: का, की, के, को.

The 2nd highest number of forms (4) was observed with the lemma “वाला”: वाला, वाली, वाले, वालों.

The 3rd highest number of forms (3) was observed with the lemma “को”: की, के, को.

ADP occurs with 11 features: AdpType (72950; 98% instances), Case (27279; 37% instances), Gender (27147; 37% instances), Number (25282; 34% instances), Person (3885; 5% instances), Polite (261; 0% instances), Poss (25; 0% instances), VerbForm (5; 0% instances), Aspect (4; 0% instances), Mood (1; 0% instances), Tense (1; 0% instances)

ADP occurs with 17 feature-value pairs: AdpType=Post, Aspect=Perf, Case=Acc, Case=Acc,Gen, Case=Nom, Gender=Fem, Gender=Masc, Mood=Ind, Number=Plur, Number=Sing, Person=2, Person=3, Polite=Form, Poss=Yes, Tense=Past, VerbForm=Fin, VerbForm=Part

ADP occurs with 59 feature combinations. The most frequent feature combination is AdpType=Post (46607 tokens). Examples: के, में, को, ने, से, की, पर, लिए, का, तक

Relations

ADP nodes are attached to their parents using 8 different relations: case (66759; 90% instances), mark (7056; 10% instances), dislocated (297; 0% instances), punct (20; 0% instances), conj (6; 0% instances), nmod (4; 0% instances), nsubj (2; 0% instances), compound (1; 0% instances)

Parents of ADP nodes belong to 11 different parts of speech: NOUN (43986; 59% instances), PROPN (19669; 27% instances), VERB (7062; 10% instances), PRON (2477; 3% instances), NUM (321; 0% instances), ADP (308; 0% instances), ADV (283; 0% instances), DET (18; 0% instances), ADJ (12; 0% instances), PART (8; 0% instances), CCONJ (1; 0% instances)

73547 (99%) ADP nodes are leaves.

557 (1%) ADP nodes have one child.

38 (0%) ADP nodes have two children.

3 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 3.

Children of ADP nodes are attached using 7 different relations: case (298; 46% instances), dep (261; 41% instances), punct (39; 6% instances), compound (25; 4% instances), dislocated (14; 2% instances), det (3; 0% instances), nmod (2; 0% instances)

Children of ADP nodes belong to 6 different parts of speech: ADP (308; 48% instances), PART (261; 41% instances), PUNCT (39; 6% instances), NOUN (30; 5% instances), DET (3; 0% instances), X (1; 0% instances)