Treebank Statistics: UD_English-Atis: POS Tags: PRON
There are 14 PRON lemmas (2%), 17 PRON types (2%) and 3828 PRON tokens (6%).
Out of 14 observed tags, the rank of PRON is: 9 in number of lemmas, 10 in number of types and 6 in number of tokens.
The 10 most frequent PRON lemmas: I, what, there, you, which, it, that, be, one, they
The 10 most frequent PRON types: me, i, what, there, you, which, it, that, ‘s, one
The 10 most frequent ambiguous lemmas: I (PRON 2377, DET 10), what (PRON 852, DET 535), you (PRON 223, DET 16), which (DET 141, PRON 46, ADP 28, VERB 1), that (ADP 331, DET 45, PRON 34), be (AUX 1161, VERB 209, PART 15, PRON 5), one (ADJ 240, NUM 178, PRON 4), they (DET 9, PRON 3), this (DET 18, PRON 3), each (DET 6, PRON 1)
The 10 most frequent ambiguous types: what (PRON 852, DET 535), which (DET 141, PRON 46, ADP 28, VERB 1), that (ADP 331, DET 41, PRON 33), ’s (AUX 91, PART 15, PRON 5), one (NUM 178, PRON 4), this (DET 18, PRON 3), each (DET 6, PRON 1), those (DET 4, PRON 1)
- what
- which
- DET 141: show me the flights on delta or twa which go through atlanta
- PRON 46: are there any flights from boston to san francisco which stop in denver
- ADP 28: are there any flights from denver to atlanta which connect in pittsburgh
- VERB 1: which are the least expensive flights between dallas and baltimore on july nineteenth
- that
- ’s
- one
- this
- each
- those
Morphology
The form / lemma ratio of PRON is 1.214286 (the average of all parts of speech is 1.144766).
The 1st highest number of forms (2) was observed with the lemma “I”: i, me.
The 2nd highest number of forms (2) was observed with the lemma “that”: that, those.
The 3rd highest number of forms (2) was observed with the lemma “they”: them, they.
PRON occurs with 5 features: PronType (3828; 100% instances), Number (2640; 69% instances), Person (2640; 69% instances), Case (2638; 69% instances), Gender (34; 1% instances)
PRON occurs with 11 feature-value pairs: Case=Acc, Case=Nom, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Int,Rel, PronType=Prs
PRON occurs with 10 feature combinations.
The most frequent feature combination is Case=Acc|Number=Sing|Person=1|PronType=Prs (1260 tokens).
Examples: me
Relations
PRON nodes are attached to their parents using 16 different relations: nsubj (1543; 40% instances), iobj (1241; 32% instances), root (631; 16% instances), expl (239; 6% instances), obj (60; 2% instances), det (37; 1% instances), xcomp (33; 1% instances), obl (26; 1% instances), nmod (5; 0% instances), ccomp (3; 0% instances), conj (3; 0% instances), list (2; 0% instances), nsubj:outer (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), parataxis (1; 0% instances)
Parents of PRON nodes belong to 11 different parts of speech: VERB (2830; 74% instances), (631; 16% instances), NOUN (138; 4% instances), ADP (88; 2% instances), ADJ (84; 2% instances), AUX (32; 1% instances), PROPN (14; 0% instances), NUM (4; 0% instances), ADV (3; 0% instances), PRON (3; 0% instances), DET (1; 0% instances)
3158 (82%) PRON nodes are leaves.
29 (1%) PRON nodes have one child.
619 (16%) PRON nodes have two children.
22 (1%) PRON nodes have three or more children.
The highest child degree of a PRON node is 4.
Children of PRON nodes are attached using 21 different relations: cop (630; 47% instances), nsubj (610; 46% instances), obj (21; 2% instances), case (20; 1% instances), aux (8; 1% instances), nmod (7; 1% instances), acl:relcl (6; 0% instances), cc (6; 0% instances), dislocated (5; 0% instances), amod (3; 0% instances), det (3; 0% instances), discourse (3; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), conj (2; 0% instances), flat (2; 0% instances), advmod (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), nmod:tmod (1; 0% instances), xcomp (1; 0% instances)
Children of PRON nodes belong to 12 different parts of speech: AUX (638; 48% instances), NOUN (611; 46% instances), PROPN (35; 3% instances), ADP (22; 2% instances), VERB (10; 1% instances), CCONJ (6; 0% instances), ADJ (4; 0% instances), DET (3; 0% instances), PRON (3; 0% instances), ADV (2; 0% instances), INTJ (2; 0% instances), NUM (1; 0% instances)