Quoted-Printable , or QP encoding , is a binary-to-text encoding system using printable ASCII characters ( alphanumeric and the equals sign = ) to transmit 8-bit data over a 7-bit data path or, generally, over a medium which is not 8-bit clean . Historically, because of the wide range of systems and protocols that could be used to transfer messages, e-mail was often assumed to be non-8-bit-clean – however, modern SMTP servers are in most cases 8-bit clean and support 8BITMIME extension. It can also be used with data that contains non-permitted octets or line lengths exceeding SMTP limits. It is defined as a MIME content transfer encoding for use in e-mail .
21-574: QP , Qp , or Q p may refer to: Computing [ edit ] Quoted-printable , an encoding to send 8-bit data over 7-bit path '=09' QP (Quantum Platform) , a framework for building real-time embedded applications Medicine [ edit ] Qualified person (European Union) , in European Union pharmaceutical regulation ATCvet code QP , designation for antiparasitic veterinary medication Mathematics [ edit ] Q p ,
42-646: A script is a collection of letters and other written signs used to represent textual information in one or more writing systems . Some scripts support one and only one writing system and language , for example, Armenian . Other scripts support many different writing systems; for example, the Latin script supports English , French , German , Italian , Vietnamese , Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish ,
63-583: A Japanese hardcore punk and industrial metal band Quarter Pounder , a hamburger Qatar Petroleum , former name of the oil and gas company QatarEnergy Topics referred to by the same term [REDACTED] This disambiguation page lists articles associated with the title QP . If an internal link led you here, you may wish to change the link to point directly to the intended article. Retrieved from " https://en.wikipedia.org/w/index.php?title=QP&oldid=1208794528 " Category : Disambiguation pages Hidden categories: Short description
84-559: A general category property for each character. So in addition to belonging to a script every character also has a general category. Typically scripts include letter characters including: uppercase letters, lowercase letter and modifier letters. Some characters are considered titlecase letters for a few precomposed ligatures such as Dz (U+01F2). Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters , and therefore Unicode discourages their use by authors. It
105-456: A particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters. As of version 16.0 , Unicode defines 168 scripts (called "Alias" or "Property value alias") based on the ISO 15924 list. In addition, Unicode assigns
126-540: A system. The term complex system is sometimes used to describe those where the admixture makes classification problematic. Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text-processing algorithms. In addition to explicit or specific script properties, Unicode uses three special values: Unicode provides
147-500: Is different from Wikidata All article disambiguation pages All disambiguation pages Quoted-printable QP works by using the equals sign = as an escape character . It also limits line length to 76, as some software has limits on line length. MIME defines mechanisms for sending other kinds of information in e-mail, including text in languages other than English , using character encodings other than ASCII. However, these encodings often use byte values outside
168-834: Is sometimes treated as a synonym for "script". However, it also can be used as the specific concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script; for example, the Japanese writing system makes use of the Han , Hiragana and Katakana scripts. Most writing systems can be broadly divided into several categories: logographic , syllabic , alphabetic (or segmental ), abugida , abjad and featural ; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize
189-796: Is unlikely that new titlecase letters will be added in the future. Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letters". A few scripts do differentiate between uppercase and lowercase however: Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even for these scripts there are some letters that are neither uppercase nor lowercase. Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation , separators (word separators such as spaces), symbols and non-graphical format characters. These are included in
210-641: Is used in message headers; see MIME#Encoded-Word . The following example is a French text (encoded in UTF-8), with a high frequency of letters with diacritical marks (such as the é ). This encodes the following quotation: J'interdis aux marchands de vanter trop leurs marchandises. Car ils se font vite pédagogues et t'enseignent comme but ce qui n'est par essence qu'un moyen, et te trompant ainsi sur la route à suivre les voilà bientôt qui te dégradent, car si leur musique est vulgaire ils te fabriquent pour te la vendre une âme vulgaire. Script (Unicode) In Unicode ,
231-773: The Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. More or less complementary to scripts are symbols and Unicode control characters . The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics , so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters. Unicode 16.0 defines 168 separate scripts, including 99 modern scripts and 69 ancient or historic scripts. More scripts are in
SECTION 10
#1732858942769252-455: The ASCII range so they need to be encoded further before they are suitable for use in a non-8-bit-clean environment. Quoted-Printable encoding is one method used for mapping arbitrary bytes into sequences of ASCII characters. So, Quoted-Printable is not a character encoding scheme itself, but a data coding layer to be used under some byte-oriented character encoding. QP encoding is reversible, meaning
273-626: The byte's numeric value. For example, an ASCII form feed character (decimal value 12) can be represented by =0C , and an ASCII equal sign (decimal value 61) must be represented by =3D . All characters except printable ASCII characters or end of line characters (but also = ) must be encoded in this fashion. All printable ASCII characters (decimal values between 33 and 126) may be represented by themselves, except = (decimal 61, hexadecimal 3D, therefore =3D ). ASCII tab and space characters , decimal values 9 and 32, may be represented by themselves, except if these characters would appear at
294-575: The diacritic combining ring above for any character. In general, the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems, they are said to use the same Latin script. Thus, the Unicode abstraction of scripts is a basic organizing technique. The differences among different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms. Writing system
315-477: The encoded text, soft line breaks may be added as desired. A soft line break consists of an = at the end of an encoded line, and does not appear as a line break in the decoded text. These soft line breaks also allow encoding text without line breaks (or containing very long lines) for an environment where line size is limited, such as the 1000 characters per line limit of some SMTP software, as allowed by RFC 2821. A slightly modified version of Quoted-Printable
336-789: The end of the encoded line. In that case, they would need to be escaped as =09 (tab) or =20 (space), or be followed by a = (soft line break) as the last character of the encoded line. This last solution is valid because it prevents the tab or space from being the last character of the encoded line. If the data being encoded contains meaningful line breaks, they must be encoded as an ASCII CR LF sequence, not as their original byte values, neither directly nor via = signs. Conversely, if byte values 13 and 10 have meanings other than end of line (in media types, for example), then they must be encoded as =0D and =0A respectively. Lines of Quoted-Printable encoded data must not be longer than 76 characters. To satisfy this requirement without altering
357-462: The field of p -adic numbers Quadratic programming , a special type of mathematical optimization problem Quasi-polynomial time , relating to time complexity in computer science QP or EQP , Exact Quantum Polynomial time in computational complexity theory Other uses [ edit ] Akasa Air (IATA code: QP), an Indian low-cost airline qp ligature , a ligature of Latin Qp-Crazy ,
378-412: The input has many 8-bit characters, then Quoted-Printable becomes both unreadable and extremely inefficient. Base64 is not human-readable, but has a uniform overhead for all data and is the more sensible choice for binary formats or text in a script other than the Latin script . Any 8-bit byte value may be encoded with 3 characters: an = followed by two hexadecimal digits (0–9 or A–F) representing
399-495: The name "Common" to ISO 15924's Zyyy code for undetermined scripts, "Inherited" to ISO 15924's Zinh code for inherited scripts, and "Unknown" to ISO 15924's Zzzz code for uncoded scripts. There are script codes defined by ISO 15924 but are not used in Unicode, including Zsym (Symbols) and Zmth (Mathematical notation). The project Missing Scripts—with contributors from the Mainz University of Applied Sciences ,
420-406: The original bytes and hence the non-ASCII characters they represent can be identically recovered. Quoted-Printable and Base64 are the two MIME content transfer encodings, if the trivial "7bit" and "8bit" encoding are not counted. If the text to be encoded does not contain many non-ASCII characters, then Quoted-Printable results in a fairly readable and compact encoded result. On the other hand, if
441-432: The process for encoding or have been tentatively allocated for encoding in roadmaps. When multiple languages make use of the same script, there are frequently some differences, particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character å (sometimes called a Swedish O ), while English has no such character. Nor does English make use of
SECTION 20
#1732858942769#768231