home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-ESLSpok: POS Tags: PROPN

There are 1 PROPN lemmas (6%), 219 PROPN types (9%) and 490 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 12 in number of lemmas, 4 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: _

The 10 most frequent PROPN types: Charlie, Japan, Tokyo, Hokkaido, Okinawa, English, Hiroshima, Osaka, New, Saitama

The 10 most frequent ambiguous lemmas: _ (PUNCT 3316, NOUN 3083, PRON 2869, VERB 2552, ADV 1444, AUX 1302, DET 1271, ADP 1136, CCONJ 1124, ADJ 1032, PART 891, PROPN 490, SCONJ 267, INTJ 235, NUM 228, X 72)

The 10 most frequent ambiguous types: English (PROPN 14, ADJ 8), Japanese (ADJ 13, PROPN 7), Line (PROPN 4, NOUN 3), French (PROPN 3, ADJ 1), City (NOUN 2, PROPN 2), S (PROPN 2, NOUN 1), Spanish (PROPN 2, ADJ 1), A (DET 4, PROPN 1, X 1), Good (ADJ 4, INTJ 1, PROPN 1), Italian (ADJ 6, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 219.000000 (the average of all parts of speech is 146.187500).

The 1st highest number of forms (219) was observed with the lemma “_”: A, ABC, Al, Alien, Alpine, America, Americans, Aoyama, April, Arabic, August, Australia, Ayutthaya, Bali, Beijing, Belgium, Benson, Bob, Bon, Brazil, Brisbane, Broadway, Building, California, Canada, Canadian, Canyon, Charlie, Chiba, China, Chocola, Christmas, Cindy, Cinema, Circus, City, Club, Cosmos, Cruise, Cube, Daikanyama, Daimaru, Day, December, Di, Disneyland, Egypt, England, English, Europe, Exorcist, February, France, French, Friday, Fridays, Fuji, G, Garden, George, Germany, Gibson, Gifu, Ginza, Golden, Good, Grand, Green, Guatemala, Gusto, Hachikou, Hakone, Harajuku, Harvard, Hawaii, Higashiguchi, Hiroshima, Hokkaido, Hungary, Hunting, Ikebukuro, Indianapolis, Isetan, Ishikawa, Island, Italian, Italy, J, JAF, JICA, January, Japan, Japanese, John, Jonathan, Jovi, K, Kagoshima, Kami, Kansaiben, Kawagoe, Ken, Kichijoji, Kingdom, Korea, Kumagaya, Kyoto, Kyusyu, Line, London, Makuhari, March, Marion, Martin, May, Mel, Milan, Mile, Mitsukoshi, Monday, Moscow, Naha, New, Niro, Nogizaka, Noh, Norton, O, Odawara, Office, Okinawa, Osaka, Pacino, Pakistan, Paris, Pat, Piccadilly, Pittsburgh, Play, Poland, R, Rag, Robotch, Roppongi, Ryukyu, S, S., Saginomiya, Saitama, Saturday, Saturdays, Seattle, Seibu, Sendai, September, Service, Shakespeare, Shibuya, Shinjuku, Silent, Singapore, Spanish, Spielberg, Stanford, Star, Starbucks, States, Station, Steve, Steven, Studios, Stump, Sunday, Sundays, Switzerland, T, Tag, Takadanobaba, Takahagi, Takashimaya, Tanaka, Taro, Test, Thailand, Thursday, Tobu, Tojo, Tokyo, Tokyu, Tom, Tozai, Turkey, U, U., Ueno, United, Universal, University, VISA, Valentine, Walkman, Wars, Wednesday, Will, Wisconsin, Y, Yamanoko, Yamanote, Yamanouchi, Yamaoka, Yankees, Year, Yodobashi, York, Yurakucho, hamburg, mike, mister, san.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 19 different relations: obl (127; 26% instances), root (96; 20% instances), compound (87; 18% instances), obj (43; 9% instances), nmod (32; 7% instances), nsubj (24; 5% instances), conj (23; 5% instances), flat (23; 5% instances), xcomp (7; 1% instances), obl:tmod (6; 1% instances), appos (5; 1% instances), advcl (4; 1% instances), dislocated (4; 1% instances), dep (2; 0% instances), list (2; 0% instances), nmod:poss (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), nmod:tmod (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: VERB (191; 39% instances), PROPN (99; 20% instances), (96; 20% instances), NOUN (86; 18% instances), ADJ (11; 2% instances), ADV (4; 1% instances), AUX (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances)

156 (32%) PROPN nodes are leaves.

141 (29%) PROPN nodes have one child.

80 (16%) PROPN nodes have two children.

113 (23%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 12.

Children of PROPN nodes are attached using 26 different relations: case (195; 27% instances), punct (170; 23% instances), nsubj (65; 9% instances), cop (64; 9% instances), compound (50; 7% instances), det (30; 4% instances), cc (28; 4% instances), conj (26; 4% instances), flat (23; 3% instances), amod (14; 2% instances), advmod (11; 2% instances), mark (9; 1% instances), dep (8; 1% instances), discourse (6; 1% instances), nmod:poss (5; 1% instances), obl (5; 1% instances), nmod (4; 1% instances), appos (3; 0% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), advcl (2; 0% instances), aux (2; 0% instances), flat:foreign (2; 0% instances), list (2; 0% instances), obl:tmod (2; 0% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (170; 23% instances), ADP (145; 20% instances), PROPN (99; 14% instances), AUX (66; 9% instances), PART (52; 7% instances), NOUN (51; 7% instances), PRON (32; 4% instances), DET (30; 4% instances), CCONJ (29; 4% instances), ADJ (15; 2% instances), ADV (10; 1% instances), VERB (10; 1% instances), SCONJ (9; 1% instances), X (7; 1% instances), INTJ (6; 1% instances)