
This page pertains to UD version 2.

PROPN: proper noun

Definition

A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object. Armenian proper nouns are always written with an initial uppercase letter. Note that the names of the days of the week (երկուշաբթի, երեքշաբթի, չորեքշաբթի, հինգշաբթի, ուրբաթ, շաբաթ, կիրակի) and the names of the months (հունվար, փետրվար, մարտ, ապրիլ, մայիս, հունիս, հուլիս, օգոստոս, սեպտեմբեր, հոկտեմբեր, նոյեմբեր, դեկտեմբեր) are not capitalized (unlike in English) and are not considered proper nouns.
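The day and month rule lends itself to a simple consistency check on annotated data. The sketch below is only an illustration: `violates_calendar_rule` is a hypothetical helper, not part of any UD tool. It assumes tokens arrive as (form, UPOS) pairs and matches only the citation forms listed above, so inflected forms are not covered.

```python
# Hypothetical helper (not part of any UD tool): flags day and month
# names tagged PROPN, which the guideline above rules out.
# Assumption: only the citation forms listed in the text are matched.

DAYS = {
    "երկուշաբթի", "երեքշաբթի", "չորեքշաբթի", "հինգշաբթի",
    "ուրբաթ", "շաբաթ", "կիրակի",
}
MONTHS = {
    "հունվար", "փետրվար", "մարտ", "ապրիլ", "մայիս", "հունիս",
    "հուլիս", "օգոստոս", "սեպտեմբեր", "հոկտեմբեր", "նոյեմբեր", "դեկտեմբեր",
}
CALENDAR_NAMES = DAYS | MONTHS


def violates_calendar_rule(form: str, upos: str) -> bool:
    """Return True if a day or month name carries the PROPN tag."""
    # lower() also catches sentence-initial capitalization.
    return upos == "PROPN" and form.lower() in CALENDAR_NAMES
```

A check like this only reports suspicious tags. Whether a flagged token is really a calendar name still needs human review, since a form such as մարտ can also be a different word.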

Single-word named entities should be tagged PROPN even if they originate from a common noun (Աշտարակ, Իջևան) (towns) or an adjective (Սիզավետ, Որոտան) (village, river). Although սիզավետ “grassy” is originally an adjective, as a geographical name it behaves syntactically as a noun and denotes a concrete location (rather than a property of something).

Note that names of languages (հայերեն, անգլերեն) and adjectives derived from geographical names (հայկական, հայոց, անգլիական, երևանյան “Armenian, English, of Yerevan”) are written in lowercase and are not tagged PROPN.

Personal names are typically treated as a sequence of proper nouns (one or more given names and one or more surnames). If the name contains prepositions, conjunctions or articles (foreign names), these are also tagged as X and the morphological analysis does not include their original part of speech.

When other phrases or sentences are used as names, the component words retain their original tags. For example, in «Մենք ենք մեր սարերը», Մենք is PRON, ենք is AUX, մեր is DET, etc.

Acronyms of proper nouns, such as ՄԱԿ “UN” and ՆԱՏՕ “NATO”, should be tagged PROPN. Even if they contain numbers (as in various product names), they are tagged PROPN and not SYM: ՏՈՒ-154Մ, ԻԼ-76. However, if a token consists entirely of digits (as in Windows 7), it is tagged NUM.
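The digit rule for acronyms and product names can be sketched as a small function. This is a minimal illustration under stated assumptions: `upos_for_name_token` is a hypothetical name, the token is assumed to be already known to belong to an acronym or product name, and "digits" is taken to mean the ASCII digits 0-9.

```python
import re


def upos_for_name_token(token: str) -> str:
    """Pick a UPOS tag for a token inside an acronym or product name.

    Assumption: the caller has already decided that the token belongs
    to such a name; this function applies only the digit rule.
    """
    # A token made up entirely of digits is a numeral.
    if re.fullmatch(r"[0-9]+", token):
        return "NUM"
    # Mixed letter-and-digit tokens and plain acronyms stay PROPN, not SYM.
    return "PROPN"
```

Under these assumptions, ՏՈՒ-154Մ and ԻԼ-76 come out as PROPN, while the 7 in Windows 7 comes out as NUM.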

Armenian multi-word named entities have internal syntactic structure, which is preserved in the annotation. The headword is always a noun, and other nouns may be involved. A component noun is tagged PROPN only if it would be a proper noun as a single-word named entity. Even if an adjective is the first word of a multi-word name, and thus starts with an uppercase letter, it is still tagged ADJ.

Note that the additional feature NameType is used to encode the types of named entities.

Examples

