home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-PUD: POS Tags: PRON

There are 28 PRON lemmas (1%), 87 PRON types (1%) and 374 PRON tokens (2%). Out of 13 observed tags, the rank of PRON is: 5 in number of lemmas, 8 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: 그, 그녀, 자신, 그것, _, 그들, 이, 나, 우리, 이것

The 10 most frequent PRON types: 그는, 자신의, 그의, 그녀는, 나는, 그것은, 그들은, 이는, 그녀가, 그녀의

The 10 most frequent ambiguous lemmas: _ (NOUN 4325, VERB 1625, PROPN 1035, ADJ 609, ADV 517, DET 463, AUX 458, CCONJ 125, NUM 27, PRON 24, X 3, PUNCT 1), 이 (PRON 23, PART 16), 내 (PRON 7, NOUN 5), 자신들 (PRON 3, PROPN 1), 자기 (NOUN 1, PRON 1)

The 10 most frequent ambiguous types: 나는 (PRON 18, VERB 1), 누군 (PRON 4, NOUN 1), 우리 (PRON 3, NOUN 1), 내 (NOUN 12, PRON 2), 자신들의 (PRON 2, PROPN 1), 이 (DET 52, PART 16, NOUN 3, PRON 1, PROPN 1), 자기 (NOUN 3, PRON 1)

Morphology

The form / lemma ratio of PRON is 3.107143 (the average of all parts of speech is 3.165468).

The 1st highest number of forms (14) was observed with the lemma “_”: 그녀, 내, 너희들, 누가, 누구, 누군, 뭔, 여러분, 우리, 이, 이거, 이들, 자기, 자신.

The 2nd highest number of forms (9) was observed with the lemma “그”: 그가, 그는, 그로써, 그를, 그만큼, 그에, 그에게, 그와, 그의.

The 3rd highest number of forms (6) was observed with the lemma “우리”: 우리가, 우리는, 우리를, 우리에, 우리에게, 우리의.

PRON occurs with 4 features: Case (347; 93% instances), Polite (344; 92% instances), Person (233; 62% instances), Number (41; 11% instances)

PRON occurs with 10 feature-value pairs: Case=Acc, Case=Advb, Case=Comp, Case=Gen, Case=Nom, Number=Plur, Person=1, Person=2, Person=3, Polite=Form

PRON occurs with 33 feature combinations. The most frequent feature combination is Case=Nom|Person=3|Polite=Form (94 tokens). Examples: 그는, 그녀는, 그녀가, 그가

Relations

PRON nodes are attached to their parents using 12 different relations: nsubj (208; 56% instances), nmod:poss (75; 20% instances), advmod (35; 9% instances), obj (24; 6% instances), nsubj:pass (11; 3% instances), compound (7; 2% instances), csubj (6; 2% instances), iobj (3; 1% instances), root (2; 1% instances), compound:lvc (1; 0% instances), det (1; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: NOUN (214; 57% instances), VERB (110; 29% instances), ADJ (29; 8% instances), PART (11; 3% instances), ADV (3; 1% instances), PRON (2; 1% instances), PROPN (2; 1% instances), (2; 1% instances), DET (1; 0% instances)

344 (92%) PRON nodes are leaves.

22 (6%) PRON nodes have one child.

5 (1%) PRON nodes have two children.

3 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 11 different relations: acl:relcl (9; 21% instances), cop (8; 19% instances), dep:prt (6; 14% instances), punct (5; 12% instances), compound (4; 10% instances), conj (3; 7% instances), det (2; 5% instances), nsubj (2; 5% instances), advmod (1; 2% instances), appos (1; 2% instances), nmod:poss (1; 2% instances)

Children of PRON nodes belong to 9 different parts of speech: NOUN (9; 21% instances), AUX (8; 19% instances), PART (6; 14% instances), PUNCT (5; 12% instances), VERB (5; 12% instances), PROPN (4; 10% instances), DET (2; 5% instances), PRON (2; 5% instances), ADV (1; 2% instances)