PROPN: proper noun
Definition
A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object. Western Armenian proper nouns are always capitalized. Note that in Western Armenian, the names of months and days of the week may appear either capitalized or in lowercase. Regardless of their capitalization, they are not considered proper nouns and are therefore not tagged PROPN.
Single-word named entities are tagged PROPN even if they originate from a common noun (Աշտարակ/Aštarak, Իջեւան/Iǰewan) (towns) or an adjective (Սիզավետ/Sizavet, Որոտան/Orotan) (village, river). Even if սիզավետ/sizavet “grassy” is originally an adjective, syntactically it behaves as a noun. As a geographical name it denotes a concrete location (rather than a property of something).
Note that names of languages (հայերեն/hayeren “Armenian”, անգլերեն/angleren “English”) and adjectives derived from geographical names (հայկական/haykakan “Armenian”, հայոց/hayoc’ “Armenian”, անգլիական/angliakan “English”, երևանյան__erewanyan/ “of Yerevan”) are written in lowercase and are not tagged PROPN.
Personal names are typically treated as a sequence of proper nouns (one or more given names and one or more surnames). If the name contains prepositions, conjunctions or articles (foreign names), these are tagged X and the morphological analysis does not include their original part of speech.
When other phrases or sentences are used as names, the component words retain their original tags. For example, in «Մենք ենք մեր սարերը»/‘Menk’ enk’ mer sarerë’ “We are our mountains”, Մենք/Menk’ “we” is PRON, ենք/enk’ “are” is AUX, մեր/mer “our” is DET, etc.
Acronyms of proper nouns, such as ՄԱԿ “UN” and ՆԱՏՕ “NATO”, are tagged PROPN. Even if they contain numbers (as in various product names), they are tagged PROPN and not SYM: ՏՈՒ-154Մ, ԻԼ-76. However, if the token consists entirely of digits (like 7 in Windows 7), it is tagged NUM.
Western Armenian multi-word named entities have internal syntactic structure, which is preserved in the annotation. The headword is always a noun and there may be other nouns involved. They will be tagged PROPN if they are proper nouns as single-word named entities. Even if an adjective is the first word of a multi-word name, and thus it starts with an uppercase letter, it is still tagged ADJ.
Note that the additional feature NameType is used to encode the types of named entities.
Examples
- Մայնի/Mayni
PROPNՖրանքֆուրթ/Frank’fowrt’PROPN“Frankfurt am Main”. Ֆրանքֆորթ/Frank’fort’ is the heads. The Մայնի/Mayni part refers to the river flowing through the city, and is tagged asPROPNtoo. - Լուտուիկ/Lowtowik
PROPNվան/vanXՊեթհովըն/Pet’hovënPROPN“Ludwig van Beethoven”, Միկէլ/MikēlPROPNտէ/tēXՍերվանտես/ServantesPROPN“Miguel de Cervantes”, Շիրին/Širin օղլու/òġlow Յակոբ/Yakob. The non-capitalized parts are prepositions (van, de) or common words (òġlow) in the languages of the names’ origin, and are taggedXin Armenian. - Մեծ/Meç
ADJՄասիս/MasisPROPNis a mountain. Մասիս/Masis is the head and it is taggedPROPNas it is a geographical name. Մեծ/Meç is an adjective meaning “big” and it is taggedADJ. - Հիւսիսային/Hiwsisayin
ADJսառուցեալ/saṙowc’ealADJովկիանոս/ovkianosNOUN“The Arctic Ocean”. Although the three words together are a name of an ocean, ովկիանոս/ovkianos is a common noun and is tagged as such. - Միացեալ/Miac’eal
ADJԱզգերու/AzgerowNOUNԿազմակերպութիւն/Kazmakerpowt’iwnNOUN“United Nations” consists of three words, none of which is proper noun. However, the acronym ՄԱԿ “UNO” is a single-token name and is taggedPROPN. In ԵԱՀԿ/EAHKPROPNՄինսքի/Minsk’iPROPNխումբ/xowmbNOUN“The OSCE Minsk Group” the first two words as single-tokens are proper nouns and are tagged as such.
PROPN in other languages: [axm] [bej] [bg] [bm] [cs] [cy] [da] [el] [en] [es] [et] [eu] [fi] [fro] [fr] [ga] [grc] [hu] [hy] [hyw] [it] [ja] [ka] [kk] [kpv] [ky] [myv] [naq] [no] [oge] [pal] [pcm] [pt] [qpm] [ru] [sl] [sv] [tr] [tt] [uk] [u] [urj] [xcl] [xmf] [yue] [zh]