Misplaced Pages

Space (punctuation)

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.

In writing, a space ( ) is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex. Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.

46-405: Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and

92-403: A Compugraphics system for typesetting and page layout. The magazine did not yet accept articles on floppy disks, but hoped to do so "as matters progress". Before the 1980s, practically all typesetting for publishers and advertisers was performed by specialist typesetting companies. These companies performed keyboarding, editing and production of paper or film output, and formed a large component of

138-486: A Latin-derived alphabet have used various methods of sentence spacing since the advent of movable type in the 15th century. There has been some controversy regarding the proper amount of sentence spacing in typeset material. The Elements of Typographic Style states that only a single word space is required for sentence spacing. Psychological studies suggest "readers benefit from having two spaces after periods." The International System of Units (SI) prescribes inserting

184-402: A case, contained cast metal sorts, each with a single letter or symbol, but backwards (so they would print correctly). The compositor assembled these sorts into words, then lines, then pages of text, which were then bound tightly together by a frame, making up a form or page. If done correctly, all letters were of the same height, and a flat surface of type was created. The form was placed in

230-528: A custom LaTeX-inspired markup (SIL) or in XML. With the addition of third-party modules, composition in Markdown or Djot is also possible.

Thin space

In typography, a thin space is a space character whose width is usually 1⁄5 or 1⁄6 of an em. It is used to add a narrow space, such as between nested quotation marks or to separate glyphs that interfere with one another. It

276-401: A family of typesetting languages with names that were derivatives of the word "SCRIPT". Later versions of SCRIPT included advanced features, such as automatic generation of a table of contents and index, multicolumn page layout, footnotes, boxes, automatic hyphenation and spelling verification. NSCRIPT was a port of SCRIPT to OS and TSO from CP-67/CMS SCRIPT. Waterloo Script was created at

322-479: A keyboard or other devices could produce the desired text. Most of the successful systems involved the in-house casting of the type to be used, hence are termed "hot metal" typesetting. The Linotype machine, invented in 1884, used a keyboard to assemble the casting matrices, and cast an entire line of type at a time (hence its name). In the Monotype System, a keyboard was used to punch a paper tape, which

368-489: A language's orthography for visual display. Typesetting requires one or more fonts (which are widely but erroneously confused with and substituted for typefaces). One significant effect of typesetting was that authorship of works could be spotted more easily, making it difficult for copiers who have not gained permission. During much of the letterpress era, movable type was composed by hand for each page by workers called compositors. A tray with many dividers, called

414-408: A light source to selectively expose characters onto light-sensitive paper. Originally they were driven by pre-punched paper tapes. Later they were connected to computer front ends. One of the earliest electronic photocomposition systems was introduced by Fairchild Semiconductor. The typesetter typed a line of text on a Fairchild keyboard that had no display. To verify correct content of the line it

460-512: A press and inked, and then printed (an impression made) on paper. Metal type read backwards, from right to left, and a key skill of the compositor was their ability to read this backwards text. Before computers were invented, and with them computerized (or digital) typesetting, font sizes were changed by replacing the characters with a different size of type. In letterpress printing, individual letters and punctuation marks were cast on small metal blocks, known as "sorts," and then arranged to form

506-522: A single whitespace character, with various properties; the more commonly encountered variations include: In URLs, spaces are percent-encoded with their ASCII/UTF-8 representation, %20.

Typesetting

Typesetting is the composition of text for publication, display, or distribution by means of arranging physical type (or sort) in mechanical systems or glyphs in digital systems representing characters (letters and other symbols). Stored types are retrieved and ordered according to
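The percent-encoding of spaces in URLs mentioned in this fragment can be checked with Python's standard library. This is only a minimal sketch: the helper name `encode_for_url` is hypothetical, and it relies on `urllib.parse.quote`, which encodes a space as `%20` (unlike `quote_plus`, which uses `+`).

```python
from urllib.parse import quote, unquote


def encode_for_url(text: str) -> str:
    """Percent-encode text for use in a URL path (hypothetical helper)."""
    # quote() replaces a space with the percent-encoded byte 0x20.
    return quote(text)


encoded = encode_for_url("now here")
print(encoded)           # percent-encoded form of the phrase
print(unquote(encoded))  # decoding restores the original space
```

Non-ASCII space characters are encoded byte by byte from their UTF-8 representation, so a thin space (U+2009) becomes three percent-encoded bytes rather than `%20`.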

552-479: A solvent the expensive sorts had to be redistributed into the typecase - called sorting or dissing - so they would be ready for reuse. Errors in sorting could later produce misprints if, say, a p was put into the b compartment. The diagram at right illustrates a cast metal sort: a face, b body or shank, c point size, 1 shoulder, 2 nick, 3 groove, 4 foot. Wooden printing sorts were used for centuries in combination with metal type. Not shown, and more

598-438: A space between a number and a unit of measurement (the space being regarded as an implied multiplication sign) but never between a prefix and a base unit; a space (or a multiplication dot) should also be used between units in compound units. The only exception to this rule is the traditional symbolic notation of angles: degree (e.g., 30°), minute of arc (e.g., 22′), and second of arc (e.g., 8″). The SI also prescribes

644-624: Is considered fairly difficult to learn on its own, and deals more with appearance than structure. The LaTeX macro package, written by Leslie Lamport at the beginning of the 1980s, offered a simpler interface and an easier way to systematically encode the structure of a document. LaTeX markup is widely used in academic circles for published papers and books. Although standard TeX does not provide an interface of any sort, there are programs that do. These programs include Scientific Workplace and LyX, which are graphical/interactive editors; TeXmacs, while being an independent typesetting system, can also aid

690-664: Is not as narrow as the hair space. It is also used in the International System of Units and in many countries as a thousands separator when writing numbers in groups of three digits, in order to facilitate reading. It also avoids the ambiguity of the comma, used as a thousands separator in many countries but as a decimal point in Europe. In Unicode, thin space is encoded at U+2009 THIN SPACE (&thinsp;, &ThinSpace;). Some text editors, such as IntelliJ IDEA and Android Studio, will display
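As a rough illustration of the thin space used as a thousands separator, the Python sketch below groups digits in threes and joins them with U+2009. The function name `group_digits` and the choice to build on Python's comma grouping are assumptions made for this example, not something the article specifies.

```python
import unicodedata

THIN_SPACE = "\u2009"  # U+2009 THIN SPACE


def group_digits(n: int, sep: str = THIN_SPACE) -> str:
    """Format an integer in groups of three digits joined by `sep`."""
    # Python's "," format spec groups digits in threes; swap the comma
    # for the requested separator afterwards.
    return f"{n:,}".replace(",", sep)


print(unicodedata.name(THIN_SPACE))  # Unicode name of the separator
print(group_digits(1234567))         # digits grouped with thin spaces
```

Passing `sep="\u202f"` (NARROW NO-BREAK SPACE) instead keeps the digit groups from being split across lines.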

736-574: Is still included with a number of Unix and Unix-like systems, and has been used to typeset a number of high-profile technical and computer books. Some versions, as well as a GNU work-alike called groff, are now open source. The TeX system, developed by Donald E. Knuth at the end of the 1970s, is another widespread and powerful automated typesetting system that has set high standards, especially for typesetting mathematics. LuaTeX and LuaLaTeX are variants of TeX and of LaTeX scriptable in Lua. TeX

782-657: The Apple Macintosh, Aldus PageMaker (and later QuarkXPress) and PostScript and on the PC platform with Xerox Ventura Publisher under DOS as well as PageMaker under Windows. Improvements in software and hardware, and rapidly lowering costs, popularized desktop publishing and enabled very fine control of typeset results much less expensively than the minicomputer dedicated systems. At the same time, word processing systems, such as Wang, WordPerfect and Microsoft Word, revolutionized office documents. They did not, however, have

828-574: The lack of vowels. The earliest Greek script also used interpuncts to divide words rather than spacing, although this practice was soon displaced by the scriptura continua. Word spacing was later used by Irish and Anglo-Saxon scribes, beginning after the creation of the Carolingian minuscule by Alcuin of York and the scribes' adoption of it. Spacing would become standard in Renaissance Italy and France, and then Byzantium by

874-591: The 1970s and early 1980s, such as Datalogics Pager, Penta, Atex, Miles 33, Xyvision, troff from Bell Labs, and IBM's Script product with CRT terminals, were better able to drive these electromechanical devices, and used text markup languages to describe type and other page formatting information. The descendants of these text markup languages include SGML, XML and HTML. The minicomputer systems output columns of text on film for paste-up and eventually produced entire pages and signatures of 4, 8, 16 or more pages using imposition software on devices such as

920-542: The 1980s by fully digital systems employing a raster image processor to render an entire page to a single high-resolution digital image, now known as imagesetting. The first commercially successful laser imagesetter, able to make use of a raster image processor, was the Monotype Lasercomp. ECRM, Compugraphic (later purchased by Agfa) and others rapidly followed suit with machines of their own. Early minicomputer-based typesetting software introduced in

966-601: The Israeli-made Scitex Dolev. The data stream used by these systems to drive page layout on printers and imagesetters, often proprietary or specific to a manufacturer or device, drove development of generalized printer control languages, such as Adobe Systems' PostScript and Hewlett-Packard's PCL. Computerized typesetting was so rare that BYTE magazine (comparing itself to "the proverbial shoemaker's children who went barefoot") did not use any computers in production until its August 1979 issue used

1012-582: The University of Waterloo (UW) later. One version of SCRIPT was created at MIT and the AA/CS at UW took over project development in 1974. The program was first used at UW in 1975. In the 1970s, SCRIPT was the only practical way to word process and format documents using a computer. By the late 1980s, the SCRIPT system had been extended to incorporate various upgrades. The initial implementation of SCRIPT at UW

1058-426: The bed of a press. In this process, called stereotyping, the entire form is pressed into a fine matrix such as plaster of Paris or papier mâché to create a flong, from which a positive form is cast in type metal. Advances such as the typewriter and computer would push the state of the art even farther ahead. Still, hand composition and letterpress printing have not fallen completely out of use, and since

1104-560: The character as its suggested abbreviation of "THSP". Unicode's U+202F NARROW NO-BREAK SPACE is a non-breaking space with a width similar to that of the thin space. In LaTeX and Plain TeX, \thinspace produces a narrow, non-breaking space. Inside and outside of math formulae in LaTeX, \, also produces a narrow, non-breaking space. In all versions of LibreOffice and in some of Microsoft Word,

1150-484: The concern of the casterman, is the "set", or width of each sort. Set width, like body size, is measured in points. In order to extend the working life of type, and to account for the finite sorts in a case of type, copies of forms were cast when anticipating subsequent printings of a text, freeing the costly type for other work. This was particularly prevalent in book and newspaper work where rotary presses required type forms to wrap an impression cylinder rather than set in

1196-490: The conversion to do-it-yourself easier, but also opened up a gap between skilled designers and amateurs. The advent of PostScript, supplemented by the PDF file format, provided a universal method of proofing designs and layouts, readable on major computers and operating systems. QuarkXPress had enjoyed a market share of 95% in the 1990s, but lost its dominance to Adobe InDesign from the mid-2000s onward. IBM created and inspired

1242-475: The design of printed works. Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways: Modern English uses a space to separate words, but not all languages follow this practice. Spaces were not used to separate words in Latin until roughly 600–800 AD. Ancient Hebrew and Arabic did use spaces partly to compensate in clarity for

1288-519: The end of the 16th century; then entering into the Slavic languages in Cyrillic in the 17th century, and only in modern times entering modern Sanskrit. CJK languages do not use spaces when dealing with text containing mostly Chinese characters and kana. In Japanese, spaces may occasionally be used to separate people's family names from given names, to denote omitted particles (especially

1334-566: The graphic arts industry. In the United States, these companies were located in rural Pennsylvania, New England or the Midwest, where labor was cheap and paper was produced nearby, but still within a few hours' travel time of the major publishing centers. In 1985, with the new concept of WYSIWYG (for What You See Is What You Get) in text editing and word processing on personal computers, desktop publishing became available, starting with

1380-466: The help of scripting languages. YesLogic's Prince is another one, which is based on CSS Paged Media. During the mid-1970s, Joe Ossanna, working at Bell Laboratories, wrote the troff typesetting program to drive a Wang C/A/T phototypesetter owned by the Labs; it was later enhanced by Brian Kernighan to support output to different equipment, such as laser printers. While its use has fallen off, it

1426-508: The introduction of digital typesetting, it has seen a revival as an artisanal pursuit. However, it is a small niche within the larger typesetting market. The time and effort required to manually compose the text led to several efforts in the 19th century to produce mechanical typesetting. While some, such as the Paige compositor, met with limited success, by the end of the 19th century, several methods had been devised whereby an operator working

1472-418: The negative film, resulting in a column of black type on white paper, or a galley. The galley was then cut up and used to create a mechanical drawing or paste up of a whole page. A large film negative of the page is shot and used to make plates for offset printing. The next generation of phototypesetting machines to emerge were those that generated characters on a cathode-ray tube display. Typical of

1518-411: The photo of the composing stick, a lower case 'q' looks like a 'd', a lower case 'b' looks like a 'p', a lower case 'p' looks like a 'b' and a lower case 'd' looks like a 'q'. This is reputed to be the origin of the expression "mind your p's and q's". It might just as easily have been "mind your b's and d's". A forgotten but important part of the process took place after the printing: after cleaning with

1564-410: The phrase for "Republic of Korea" is usually spelled without spaces as 대한민국 rather than with a space as 대한 민국. Runic texts use either an interpunct-like or a colon-like punctuation mark to separate words. There are two Unicode characters dedicated for this: U+16EB ᛫ RUNIC SINGLE PUNCTUATION and U+16EC ᛬ RUNIC MULTIPLE PUNCTUATION. Languages with

1610-473: The preparation of TeX documents through its export capability. GNU TeXmacs (whose name is a combination of TeX and Emacs, although it is independent from both of these programs) is a typesetting system which is at the same time a WYSIWYG word processor. SILE borrows some algorithms from TeX and relies on other libraries such as HarfBuzz and ICU, with an extensible core engine developed in Lua. By default, SILE's input documents can be composed in

1656-599: The special characters and symbols dialog (often available via Insert > Symbol or Insert > Special Characters) has both the thin space and the narrow no-break space available for point-and-click insertion. In LibreOffice's Symbol dialog, there is an easy-to-find box field to narrow the search; in Word's Symbol dialog, under font = "(normal text)", the characters are found in subset = "General Punctuation", Unicode character 2009 and nearby. Other word processing programs, and many Linux configurations, have ways of producing

1702-431: The text for a page. The size of the type was determined by the size of the character on the face of the sort. A compositor would need to physically swap out the sorts for a different size to change the font size. During typesetting, individual sorts are picked from a type case with the right hand, and set from left to right into a composing stick held in the left hand, appearing to the typesetter as upside down. As seen in

1748-543: The topic particle wa), and for certain literary or artistic effects. Modern Korean, however, has spaces as an essential part of its writing system (because of Western influence), given the phonetic nature of the hangul script that requires word dividers to avoid ambiguity, as opposed to Chinese characters which are mostly very distinguishable from each other. In Korean, spaces are used to separate chunks of nouns, nouns and particles, adjectives, and verbs; for certain compounds or phrases, spaces may be used or not, for example

1794-646: The type were the Alphanumeric APS2 (1963), IBM 2680 (1967), I.I.I. VideoComp (1973?), Autologic APS5 (1975), and Linotron 202 (1978). These machines were the mainstay of phototypesetting for much of the 1970s and 1980s. Such machines could be "driven online" by a computer front-end system or took their data from magnetic tape. Type fonts were stored digitally on conventional magnetic disk drives. Computers excel at automatically typesetting and correcting documents. Character-by-character, computer-aided phototypesetting was, in turn, rapidly rendered obsolete in

1840-542: The typographic ability or flexibility required for complicated book layout, graphics, mathematics, or advanced hyphenation and justification rules (H and J). By 2000, this industry segment had shrunk because publishers were now capable of integrating typesetting and graphic design on their own in-house computers. Many found the cost of maintaining high standards of typographic design and technical skill made it more economical to outsource to freelancers and graphic design specialists. The availability of cheap or free fonts made

1886-474: The use of a space (often typographically a thin space) as a thousands separator where required. Both the point and the comma are reserved as decimal markers. Sometimes a narrow non-breaking space or non-breaking space, respectively, is recommended (as in, for example, IEEE Standards and IEC standards) to avoid the separation of units and values or parts of compound units, due to automatic line wrap and word wrap. Unicode defines many variants of
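The SI spacing conventions described in these fragments (a space between a value and its unit, no space before the angle symbols, and a non-breaking space to keep value and unit on one line) can be sketched in Python as follows. The function name `format_quantity`, the `narrow` flag and the set of angle symbols are assumptions made for this illustration only.

```python
NBSP = "\u00a0"    # U+00A0 NO-BREAK SPACE
NNBSP = "\u202f"   # U+202F NARROW NO-BREAK SPACE
ANGLE_SYMBOLS = {"°", "′", "″"}  # degree, minute of arc, second of arc


def format_quantity(value, unit: str, narrow: bool = False) -> str:
    """Join a value and a unit symbol following the spacing rule above."""
    if unit in ANGLE_SYMBOLS:
        # Traditional angle notation is the exception: no space at all.
        return f"{value}{unit}"
    # A non-breaking space stops line wrapping from separating the
    # value from its unit.
    sep = NNBSP if narrow else NBSP
    return f"{value}{sep}{unit}"


print(format_quantity(30, "°"))                # angle, written without a space
print(format_quantity(5, "kg"))                # value, no-break space, unit
print(format_quantity(5, "kg", narrow=True))   # narrow no-break space variant
```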

1932-521: Was a SCRIPT variant developed at IBM in the 1980s. DWScript is a version of SCRIPT for MS-DOS, named after its author, D. D. Williams, but was never released to the public and only used internally by IBM. Script is still available from IBM as part of the Document Composition Facility for the z/OS operating system. The standard generalized markup language (SGML) was based upon IBM Generalized Markup Language (GML). GML

1978-631: Was a set of macros on top of IBM Script. DSSSL is an international standard developed to provide stylesheets for SGML documents. XML is a successor of SGML. XSL-FO is most often used to generate PDF files from XML files. The arrival of SGML/XML as the document model made other typesetting engines popular. Such engines include Datalogics Pager, Penta, Miles 33's OASYS, Xyvision's XML Professional Publisher, FrameMaker, and Arbortext. XSL-FO compatible engines include Apache FOP, Antenna House Formatter, and RenderX's XEP. These products allow users to program their SGML/XML typesetting process with

2024-556: Was documented in the May 1975 issue of the Computing Centre Newsletter, which noted some of the advantages of using SCRIPT: The article also pointed out SCRIPT had over 100 commands to assist in formatting documents, though 8 to 10 of these commands were sufficient to complete most formatting jobs. Thus, SCRIPT had many of the capabilities computer users generally associate with contemporary word processors. SCRIPT/VS

2070-460: Was then fed to control a casting machine. The Ludlow Typograph involved hand-set matrices, but otherwise used hot metal. By the early 20th century, the various systems were nearly universal in large newspapers and publishing houses. Phototypesetting or "cold type" systems first appeared in the early 1960s and rapidly displaced continuous casting machines. These devices consisted of glass or film disks or strips (one per font) that spun in front of

2116-419: Was typed a second time. If the two lines were identical a bell rang and the machine produced a punched paper tape corresponding to the text. With the completion of a block of lines the typesetter fed the corresponding paper tapes into a phototypesetting device that mechanically set type outlines printed on glass sheets into place for exposure onto a negative film. Photosensitive paper was exposed to light through
