Misplaced Pages

Spreadsheet

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

A spreadsheet is a computer application for computation , organization, analysis and storage of data in tabular form. Spreadsheets were developed as computerized analogs of paper accounting worksheets . The program operates on data entered in cells of a table. Each cell may contain either numeric or text data, or the results of formulas that automatically calculate and display a value based on the contents of other cells. The term spreadsheet may also refer to one such electronic document.

#770229

98-640: Spreadsheet users can adjust any stored value and observe the effects on calculated values. This makes the spreadsheet useful for "what-if" analysis since many cases can be rapidly investigated without manual recalculation. Modern spreadsheet software can have multiple interacting sheets and can display data either as text and numerals or in graphical form. Besides performing basic arithmetic and mathematical functions , modern spreadsheets provide built-in functions for common financial accountancy and statistical operations. Such calculations as net present value or standard deviation can be applied to tabular data with

196-432: A data modeling construct for the relational model, and the difference between the two has become irrelevant. The 1980s ushered in the age of desktop computing . The new computers empowered their users with spreadsheets like Lotus 1-2-3 and database software like dBASE . The dBASE product was lightweight and easy for any computer user to understand out of the box. C. Wayne Ratliff , the creator of dBASE, stated: "dBASE

294-666: A database management system. Programs within a suite use similar commands for similar functions. Usually, sharing data between the components is easier than with a non-integrated collection of functionally equivalent programs. This was particularly an advantage at a time when many personal computer systems used text-mode displays and commands instead of a graphical user interface . Humans have organized data into tables , that is, grids of columns and rows, since ancient times. The Babylonians used clay tablets to store data as far back as 1800 BCE. Other examples can be found in book-keeping ledgers and astronomical records. Since at least 1906

392-526: A workbook . A workbook is physically represented by a file containing all the data for the book, the sheets, and the cells with the sheets. Worksheets are normally represented by tabs that flip between pages, each one containing one of the sheets, although Numbers changes this model significantly. Cells in a multi-sheet book add the sheet name to their reference, for instance, "Sheet 1!C10". Some systems extend this syntax to allow cell references to different workbooks. Users interact with sheets primarily through

490-484: A 1962 report by the System Development Corporation of California as the first to use the term "data-base" in a specific technical sense. As computers grew in speed and capability, a number of general-purpose database systems emerged; by the mid-1960s a number of such systems had come into commercial use. Interest in a standard began to grow, and Charles Bachman , author of one such product,

588-551: A box for holding data . A single cell is usually referenced by its column and row (C2 would represent the cell containing the value 30 in the example table below). Usually rows, representing the dependent variables , are referenced in decimal notation starting from 1, while columns representing the independent variables use 26-adic bijective numeration using the letters A-Z as numerals. Its physical size can usually be tailored to its content by dragging its height or width at box intersections (or for entire columns or rows by dragging

686-440: A custom multitasking kernel with built-in networking support, but modern DBMSs typically rely on a standard operating system to provide these functions. Since DBMSs comprise a significant market , computer and storage vendors often take into account DBMS requirements in their own development plans. Databases and DBMSs can be categorized according to the database model(s) that they support (such as relational or XML ),

784-443: A database management system. Existing DBMSs provide various functions that allow management of a database and its data which can be classified into four main functional groups: Both a database and its DBMS conform to the principles of a particular database model . "Database system" refers collectively to the database model, database management system, and database. Physically, database servers are dedicated computers that hold

882-404: A database. One way to classify databases involves the type of their contents, for example: bibliographic , document-text, statistical, or multimedia objects. Another way is by their application area, for example: accounting, music compositions, movies, banking, manufacturing, or insurance. A third way is by some technical aspect, such as the database structure or interface type. This section lists

980-543: A different chain, based on IBM's papers on System R. Though Oracle V1 implementations were completed in 1978, it was not until Oracle Version 2 when Ellison beat IBM to market in 1979. Stonebraker went on to apply the lessons from INGRES to develop a new database, Postgres, which is now known as PostgreSQL . PostgreSQL is often used for global mission-critical applications (the .org and .info domain name registries use it as their primary data store , as do many large companies and financial institutions). In Sweden, Codd's paper

1078-463: A different type of entity . Only in the mid-1980s did computing hardware become powerful enough to allow the wide deployment of relational systems (DBMSs plus applications). By the early 1990s, however, relational systems dominated in all large-scale data processing applications, and as of 2018 they remain dominant: IBM Db2 , Oracle , MySQL , and Microsoft SQL Server are the most searched DBMS . The dominant database language, standardized SQL for

SECTION 10

#1732783477771

1176-423: A few of the adjectives used to characterize different kinds of databases. Connolly and Begg define database management system (DBMS) as a "software system that enables users to define, create, maintain and control access to the database." Examples of DBMS's include MySQL , MariaDB , PostgreSQL , Microsoft SQL Server , Oracle Database , and Microsoft Access . The DBMS acronym is sometimes extended to indicate

1274-436: A manual request to recalculate since the recalculation of large or complex spreadsheets often reduced data entry speed. Many modern spreadsheets still retain this option. Recalculation generally requires that there are no circular dependencies in a spreadsheet. A dependency graph is a graph that has a vertex for each object to be updated, and an edge connecting two objects whenever one of them needs to be updated earlier than

1372-473: A pre-programmed function in a formula. Spreadsheet programs also provide conditional expressions, functions to convert between text and numbers, and functions that operate on strings of text. Spreadsheets have replaced paper-based systems throughout the business world. Although they were first developed for accounting or bookkeeping tasks, they now are used extensively in any context where tabular lists are built, sorted, and shared. LANPAR, available in 1969,

1470-449: A set of operations based on the mathematical system of relational calculus (from which the model takes its name). Splitting the data into a set of normalized tables (or relations ) aimed to ensure that each "fact" was only stored once, thus simplifying update operations. Virtual tables called views could present the data in different ways for different users, but views could not be directly updated. Codd used mathematical terms to define

1568-447: A single large "chunk". Subsequent multi-user versions were tested by customers in 1978 and 1979, by which time a standardized query language – SQL – had been added. Codd's ideas were establishing themselves as both workable and superior to CODASYL, pushing IBM to develop a true production version of System R, known as SQL/DS , and, later, Database 2 ( IBM Db2 ). Larry Ellison 's Oracle Database (or more simply, Oracle ) started from

1666-452: A strong demand for massively distributed databases with high partition tolerance, but according to the CAP theorem , it is impossible for a distributed system to simultaneously provide consistency , availability, and partition tolerance guarantees. A distributed system can satisfy any two of these guarantees at the same time, but not all three. For that reason, many NoSQL databases are using what

1764-454: A time by navigating the links, they would use a declarative query language that expressed what data was required, rather than the access path by which it should be found. Finding an efficient access path to the data became the responsibility of the database management system, rather than the application programmer. This process, called query optimization, depended on the fact that queries were expressed in terms of mathematical logic. Codd's paper

1862-581: A variant used in VisiCalc and known as "A1 notation". Additionally, spreadsheets have the concept of a range , a group of cells, normally contiguous. For instance, one can refer to the first ten cells in the first column with the range "A1:A10". LANPAR innovated forward referencing/natural order calculation which didn't re-appear until Lotus 123 and Microsoft's MultiPlan Version 2. In modern spreadsheet applications, several spreadsheets, often known as worksheets or simply sheets , are gathered together to form

1960-416: A version that ran on IBM mainframes was introduced under the name AutoTab . ( National CSS offered a similar product, CSSTAB, which had a moderate timesharing user base by the early 1970s. A major application was opinion research tabulation.) AutoPlan/AutoTab was not a WYSIWYG interactive spreadsheet program, it was a simple scripting language for spreadsheets. The user defined the names and labels for

2058-467: A web-based spreadsheet application XL2Web developed by 2Web Technologies , combined with DocVerse which enabled multiple-user online collaboration of Office documents. In 2016 Collabora Online Calc was launched, notable in that the web based spreadsheet could be hosted and integrated into any environment without dependency on a 3rd party for authentication or maintenance. Collabora Online runs LibreOffice kit at its core, which grew from StarOffice that

SECTION 20

#1732783477771

2156-960: Is called eventual consistency to provide both availability and partition tolerance guarantees with a reduced level of data consistency. NewSQL is a class of modern relational databases that aims to provide the same scalable performance of NoSQL systems for online transaction processing (read-write) workloads while still using SQL and maintaining the ACID guarantees of a traditional database system. Databases are used to support internal operations of organizations and to underpin online interactions with customers and suppliers (see Enterprise software ). Databases are used to hold administrative information and more specialized data, such as engineering data or economic models. Examples include computerized library systems, flight reservation systems , computerized parts inventory systems , and many content management systems that store websites as collections of webpages in

2254-515: Is classified by IBM as a hierarchical database . IDMS and Cincom Systems ' TOTAL databases are classified as network databases. IMS remains in use as of 2014 . Edgar F. Codd worked at IBM in San Jose, California , in one of their offshoot offices that were primarily involved in the development of hard disk systems. He was unhappy with the navigational model of the CODASYL approach, notably

2352-463: Is indistinguishable from a batch compiler with added input data, producing an output report, i.e. , a 4GL or conventional, non-interactive, batch computer program. However, this concept of an electronic spreadsheet was outlined in the 1961 paper "Budgeting Models and System Simulation" by Richard Mattessich . The subsequent work by Mattessich (1964a, Chpt. 9, Accounting and Analytical Methods ) and its companion volume, Mattessich (1964b, Simulation of

2450-462: Is organized. Because of the close relationship between them, the term "database" is often used casually to refer to both a database and the DBMS used to manipulate it. Outside the world of professional information technology , the term database is often used to refer to any collection of related data (such as a spreadsheet or a card index) as size and usage requirements typically necessitate use of

2548-421: Is still pursued in certain applications by some companies like Netezza and Oracle ( Exadata ). IBM started working on a prototype system loosely based on Codd's concepts as System R in the early 1970s. The first version was ready in 1974/5, and work then started on multi-table systems in which the data could be split so that all of the data for a record (some of which is optional) did not have to be stored in

2646-404: Is the basis of query optimization. There is no loss of expressiveness compared with the hierarchic or network models, though the connections between tables are no longer so explicit. In the hierarchic and network models, records were allowed to have a complex internal structure. For example, the salary history of an employee might be represented as a "repeating group" within the employee record. In

2744-667: The Integrated Data Store (IDS), founded the Database Task Group within CODASYL , the group responsible for the creation and standardization of COBOL . In 1971, the Database Task Group delivered their standard, which generally became known as the CODASYL approach , and soon a number of commercial products based on this approach entered the market. The CODASYL approach offered applications

2842-599: The Michigan Terminal System . The system remained in production until 1998. In the 1970s and 1980s, attempts were made to build database systems with integrated hardware and software. The underlying philosophy was that such integration would provide higher performance at a lower cost. Examples were IBM System/38 , the early offering of Teradata , and the Britton Lee, Inc. database machine. Another approach to hardware support for database management

2940-434: The database models that they support. Relational databases became dominant in the 1980s. These model data as rows and columns in a series of tables , and the vast majority use SQL for writing and querying data. In the 2000s, non-relational databases became popular, collectively referred to as NoSQL , because they use different query languages . Formally, a "database" refers to a set of related data accessed through

3038-471: The hierarchical model and the CODASYL model ( network model ). These were characterized by the use of pointers (often physical disk addresses) to follow relationships from one record to another. The relational model , first proposed in 1970 by Edgar F. Codd , departed from this tradition by insisting that applications should search for data by content, rather than by following links. The relational model employs sets of ledger-style tables, each used for

Spreadsheet - Misplaced Pages Continue

3136-426: The personal computer from a hobby for computer enthusiasts into a business tool. VisiCalc was the first spreadsheet that combined many of the essential features of modern spreadsheet applications, such as a WYSIWYG interactive user interface, automatic recalculation, status and formula lines, range copying with relative and absolute references, and formula building by selecting referenced cells. Unaware of LANPAR at

3234-461: The professor and manipulate it to represent it and show ratios etc. In 1964, a book entitled Business Computer Language was written by Kimball, Stoffells and Walsh. Both the book and program were copyrighted in 1966 and years later that copyright was renewed. Applied Data Resources had a FORTRAN preprocessor called Empires. In the late 1960s, Xerox used BCL to develop a more sophisticated version for their timesharing system. A key invention in

3332-400: The relational structure of a database. Spreadsheets and databases are interoperable—sheets can be imported into databases to become tables within them, and database queries can be exported into spreadsheets for further analysis. A spreadsheet program is one of the main components of an office productivity suite , which usually also contains a word processor , a presentation program , and

3430-402: The software that interacts with end users , applications , and the database itself to capture and analyze the data. The DBMS additionally encompasses the core facilities provided to administer the database. The sum total of the database, the DBMS and the associated applications can be referred to as a database system . Often the term "database" is also used loosely to refer to any of the DBMS,

3528-622: The 1980s and early 1990s. The 1990s, along with a rise in object-oriented programming , saw a growth in how data in various databases were handled. Programmers and designers began to treat the data in their databases as objects . That is to say that if a person's data were in a database, that person's attributes, such as their address, phone number, and age, were now considered to belong to that person instead of being extraneous data. This allows for relations between data to be related to objects and their attributes and not to individual fields. The term " object–relational impedance mismatch " described

3626-740: The Apple II, this helped it grow in popularity. Lotus 1-2-3 was the leading spreadsheet for several years. Microsoft released the first version of Excel for the Apple Macintosh on September 30, 1985, and then ported it to Windows, with the first version being numbered 2.05 (to synchronize with the Macintosh version 2.2) and released in November 1987. Microsoft's Windows 3.x platforms of the early 1990s made it possible for their Excel spreadsheet application to take market share from Lotus. By

3724-584: The Federal Circuit (CCPA), overturning the Patent Office in 1983 — establishing that "something does not cease to become patentable merely because the point of novelty is in an algorithm." However, in 1995 a federal district court ruled the patent unenforceable due to inequitable conduct by the inventors during the application process. The United States Court of Appeals for the Federal Circuit upheld that decision in 1996. The actual software

3822-519: The Firm through a Budget Computer Program ) applied computerized spreadsheets to accounting and budgeting systems (on mainframe computers programmed in FORTRAN IV ). These batch Spreadsheets dealt primarily with the addition or subtraction of entire columns or rows (of input variables), rather than individual cells . In 1962, this concept of the spreadsheet, called BCL for Business Computer Language,

3920-686: The University of Michigan began development of the MICRO Information Management System based on D.L. Childs ' Set-Theoretic Data model. MICRO was used to manage very large data sets by the US Department of Labor , the U.S. Environmental Protection Agency , and researchers from the University of Alberta , the University of Michigan , and Wayne State University . It ran on IBM mainframe computers using

4018-499: The Wikimedia System Administrators, please include the details below. Request from 172.68.168.236 via cp1112 cp1112, Varnish XID 950573317 Upstream caches: cp1112 int Error: 429, Too Many Requests at Thu, 28 Nov 2024 08:44:38 GMT Database In computing , a database is an organized collection of data or a type of data store based on the use of a database management system ( DBMS ),

Spreadsheet - Misplaced Pages Continue

4116-461: The X and Y locations. X locations, the columns, are normally represented by letters, "A," "B," "C," etc., while rows are normally represented by numbers, 1, 2, 3, etc. A single cell can be referred to by addressing its row and column, "C10". This electronic concept of cell references was first introduced in LANPAR (Language for Programming Arrays at Random) (co-invented by Rene Pardo and Remy Landau) and

4214-539: The ability to navigate around a linked data set which was formed into a large network. Applications could find records by one of three methods: Later systems added B-trees to provide alternate access paths. Many CODASYL databases also added a declarative query language for end users (as distinct from the navigational API ). However, CODASYL databases were complex and required significant training and effort to produce useful applications. IBM also had its own DBMS in 1966, known as Information Management System (IMS). IMS

4312-438: The actual databases and run only the DBMS and related software. Database servers are usually multiprocessor computers, with generous memory and RAID disk arrays used for stable storage. Hardware database accelerators, connected to one or more servers via a high-speed channel, are also used in large-volume transaction processing environments . DBMSs are found at the heart of most database applications . DBMSs may be built around

4410-438: The business plans that they were presenting to venture capitalists. They decided to save themselves a lot of effort and wrote a computer program that produced their tables for them. This program, originally conceived as a simple utility for their personal use, would turn out to be the first software product offered by the company that would become known as Capex Corporation . "AutoPlan" ran on GE's Time-sharing service; afterward,

4508-416: The cell itself. Alternatively, a value can be based on a formula (see below), which might perform a calculation, display the current date or time, or retrieve external data such as a stock quote or a database value. The Spreadsheet Value Rule Computer scientist Alan Kay used the term value rule to summarize a spreadsheet's operation: a cell's value relies solely on the formula the user has typed into

4606-403: The cell. The formula may rely on the value of other cells, but those cells are likewise restricted to user-entered data or formulas. There are no 'side effects' to calculating a formula: the only output is to display the calculated result inside its occupying cell. There is no natural mechanism for permanently modifying the contents of a cell unless the user manually modifies the cell's contents. In

4704-420: The cells. A given cell can hold data by simply entering it in, or a formula, which is normally created by preceding the text with an equals sign. Data might include the string of text hello world , the number 5 or the date 10-Sep-97 . A formula would begin with the equals sign, =5*3 , but this would normally be invisible because the display shows the result of the calculation, 15 in this case, not

4802-418: The column- or row-headers). An array of cells is called a sheet or worksheet . It is analogous to an array of variables in a conventional computer program (although certain unchanging values, once entered, could be considered, by the same analogy, constants ). In most implementations, many worksheets may be located within a single spreadsheet. A worksheet is simply a subset of the spreadsheet divided for

4900-425: The context of programming languages, this yields a limited form of first-order functional programming . A standard of spreadsheets since the 1980s, this optional feature eliminates the need to manually request the spreadsheet program to recalculate values (nowadays typically the default option unless specifically 'switched off' for large spreadsheets, usually to improve performance). Some earlier spreadsheets required

4998-586: The database system or an application associated with the database. Small databases can be stored on a file system , while large databases are hosted on computer clusters or cloud storage . The design of databases spans formal techniques and practical considerations, including data modeling , efficient data representation and storage, query languages , security and privacy of sensitive data, and distributed computing issues, including supporting concurrent access and fault tolerance . Computer scientists may classify database management systems according to

SECTION 50

#1732783477771

5096-470: The development of electronic spreadsheets was made by Rene K. Pardo and Remy Landau, who filed in 1970 U.S. patent 4,398,249 on a spreadsheet automatic natural order calculation algorithm . While the patent was initially rejected by the patent office as being a purely mathematical invention, following 12 years of appeals, Pardo and Landau won a landmark court case at the Predecessor Court of

5194-487: The entire spreadsheet) can optionally be "locked" to prevent accidental overwriting. Typically this would apply to cells containing formulas but might apply to cells containing "constants" such as a kilogram/pounds conversion factor (2.20462262 to eight decimal places). Even though individual cells are marked as locked, the spreadsheet data are not protected until the feature is activated in the file preferences. Arithmetic Too Many Requests If you report this error to

5292-403: The first "non-procedural" computer languages) as opposed to left-to-right, top to bottom sequence for calculating the results in each cell that was used by VisiCalc , SuperCalc , and the first version of MultiPlan . Without forward referencing/natural order calculation, the user had to refresh the spreadsheet until the values in all cells remained unchanged. Once the cell values stayed constant,

5390-411: The formula itself. This may lead to confusion in some cases. The key feature of spreadsheets is the ability for a formula to refer to the contents of other cells, which may, in turn, be the result of a formula. To make such a formula, one replaces a number with a cell reference. For instance, the formula =5*C10 would produce the result of multiplying the value in cell C10 by the number 5. If C10 holds

5488-400: The inconvenience of translating between programmed objects and database tables. Object databases and object–relational databases attempt to solve this problem by providing an object-oriented language (sometimes as extensions to SQL) that programmers can use as alternative to purely relational SQL. On the programming side, libraries known as object–relational mappings (ORMs) attempt to solve

5586-430: The lack of a "search" facility. In 1970, he wrote a number of papers that outlined a new approach to database construction that eventually culminated in the groundbreaking A Relational Model of Data for Large Shared Data Banks . In this paper, he described a new system for storing and working with large databases. Instead of records being stored in some sort of linked list of free-form records as in CODASYL, Codd's idea

5684-528: The largest market share on the Windows and Macintosh platforms. A spreadsheet program is a standard feature of an office productivity suite . In 2006 Google launched a beta release spreadsheet web application , this is currently known as Google Sheets and one of the applications provided in Google Drive . A spreadsheet consists of a table of cells arranged into rows and columns and referred to by

5782-610: The legacy batch system into each user's spreadsheet monthly. It was designed to optimize the power of APL through object kernels, increasing program efficiency by as much as 50 fold over traditional programming approaches. An example of an early "industrial weight" spreadsheet was APLDOT, developed in 1976 at the United States Railway Association on an IBM 360/91, running at The Johns Hopkins University Applied Physics Laboratory in Laurel, MD. The application

5880-534: The model to view results of underlying formulas. His idea became VisiCalc. VisiCalc for the Apple II went on to become the first killer application , a program so compelling, people would buy a particular computer just to use it. It was ported to other computers, including CP/M machines, Atari 8-bit computers , and the Commodore PET , but VisiCalc remains best known as an Apple II program. SuperCalc

5978-576: The model: relations, tuples, and domains rather than tables, rows, and columns. The terminology that is now familiar came from early implementations. Codd would later criticize the tendency for practical implementations to depart from the mathematical foundations on which the model was based. The use of primary keys (user-oriented identifiers) to represent cross-table relationships, rather than disk addresses, had two primary motivations. From an engineering perspective, it enabled tables to be relocated and resized without expensive database reorganization. But Codd

SECTION 60

#1732783477771

6076-438: The numbers within a range. Spreadsheets share many principles and traits of databases , but spreadsheets and databases are not the same things. A spreadsheet is essentially just one table, whereas a database is a collection of many tables with machine-readable semantic relationships. While it is true that a workbook that contains three sheets is indeed a file containing multiple tables that can interact with each other, it lacks

6174-541: The other. Dependency graphs without circular dependencies form directed acyclic graphs , representations of partial orderings (in this case, across a spreadsheet) that can be relied upon to give a definite result. This feature refers to updating a cell's contents periodically with a value from an external source—such as a cell in a "remote" spreadsheet. For shared, Web-based spreadsheets, it applies to "immediately" updating cells another user has updated. All dependent cells must be updated also. Once entered, selected cells (or

6272-418: The programming language from the end-user. Through IBM's VM operating system , it was among the first programs to auto-update each copy of the application as new versions were released. Users could specify simple mathematical relationships between rows and between columns. Compared to any contemporary alternatives, it could support very large spreadsheets. It loaded actual financial planning data drawn from

6370-480: The relational approach, the data would be normalized into a user table, an address table and a phone number table (for instance). Records would be created in these optional tables only if the address or phone numbers were actually provided. As well as identifying rows/records using logical identifiers rather than disk addresses, Codd changed the way in which applications assembled data from multiple records. Rather than requiring applications to gather data one record at

6468-599: The relational model, has influenced database languages for other data models. Object databases were developed in the 1980s to overcome the inconvenience of object–relational impedance mismatch , which led to the coining of the term "post-relational" and also the development of hybrid object–relational databases . The next generation of post-relational databases in the late 2000s became known as NoSQL databases, introducing fast key–value stores and document-oriented databases . A competing "next generation" known as NewSQL databases attempted new implementations that retained

6566-419: The relational model, the process of normalization led to such internal structures being replaced by data held in multiple tables, connected only by logical keys. For instance, a common use of a database system is to track information about users, their name, login information, various addresses and phone numbers. In the navigational approach, all of this data would be placed in a single variable-length record. In

6664-455: The relational/SQL model while aiming to match the high performance of NoSQL compared to commercially available relational DBMSs. The introduction of the term database coincided with the availability of direct-access storage (disks and drums) from the mid-1960s onwards. The term represented a contrast with the tape-based systems of the past, allowing shared interactive use rather than daily batch processing . The Oxford English Dictionary cites

6762-426: The right order ("Forward Referencing/Natural Order Calculation"). Pardo and Landau developed and implemented the software in 1969. LANPAR was used by Bell Canada, AT&T, and the 18 operating telephone companies nationwide for their local and national budgeting operations. LANPAR was also used by General Motors. Its uniqueness was Pardo's co-invention incorporating forward referencing/natural order calculation (one of

6860-461: The rows and columns, then the formulas that defined each row or column. In 1975, Autotab-II was advertised as extending the original to a maximum of " 1,500 rows and columns, combined in any proportion the user requires... " GE Information Services, which operated the time-sharing service, also launched its own spreadsheet system, Financial Analysis Language (FAL), circa 1974. It was later supplemented by an additional spreadsheet language, TABOL, which

6958-415: The sake of clarity. Functionally, the spreadsheet operates as a whole and all cells operate as global variables within the spreadsheet (each variable having 'read' access only except its containing cell). A cell may contain a value or a formula , or it may simply be left empty. By convention, formulas usually begin with = sign. A value can be entered from the computer keyboard by directly typing into

7056-623: The same problem. XML databases are a type of structured document-oriented database that allows querying based on XML document attributes. XML databases are mostly used in applications where the data is conveniently viewed as a collection of documents, with a structure that can vary from the very flexible to the highly rigid: examples include scientific articles, patents, tax filings, and personnel records. NoSQL databases are often very fast, do not require fixed table schemas, avoid join operations by storing denormalized data, and are designed to scale horizontally . In recent years, there has been

7154-582: The technology progress in the areas of processors , computer memory , computer storage , and computer networks . The concept of a database was made possible by the emergence of direct access storage media such as magnetic disks , which became widely available in the mid-1960s; earlier systems relied on sequential storage of data on magnetic tape . The subsequent development of database technology can be divided into three eras based on data model or structure: navigational , SQL/ relational , and post-relational. The two main early navigational data models were

7252-477: The term "spread sheet" has been used in accounting to mean a grid of columns and rows in a ledger. And prior to the rise of computerized spreadsheets, "spread" referred to a newspaper or magazine item (text or graphics) that covers two facing pages, extending across the centerfold and treating the two pages as one large page. The compound word 'spread-sheet' came to mean the format used to present book-keeping ledgers—with columns for categories of expenditures across

7350-521: The time Lotus responded with usable Windows products, Microsoft had begun to assemble their Office suite. By 1995, Excel was the market leader, edging out Lotus 1-2-3, and in 2013, IBM discontinued Lotus 1-2-3 altogether. In 2006 Google launched their beta release Google Sheets , a web based spreadsheet application that can be accessed by multiple users from any device type using a compatible web browser, it can be used online and offline (with or without internet connectivity). Google Sheets originated from

7448-415: The time, PC World magazine called VisiCalc the first electronic spreadsheet. Bricklin has spoken of watching his university professor create a table of calculation results on a blackboard . When the professor found an error, he had to tediously erase and rewrite several sequential entries in the table, triggering Bricklin to think that he could replicate the process on a computer, using the blackboard as

7546-416: The top, invoices listed down the left margin, and the amount of each payment in the cell where its row and column intersect—which were, traditionally, a "spread" across facing pages of a bound ledger (book for keeping accounting records) or on oversized sheets of paper (termed 'analysis paper') ruled into rows and columns in that format and approximately twice as wide as ordinary paper. A batch "spreadsheet"

7644-423: The type(s) of computer they run on (from a server cluster to a mobile phone ), the query language (s) used to access the database (such as SQL or XQuery ), and their internal engineering, which affects performance, scalability , resilience, and security. The sizes, capabilities, and performance of databases and their respective DBMSs have grown in orders of magnitude. These performance increases were enabled by

7742-410: The underlying database model , with RDBMS for the relational , OODBMS for the object (oriented) and ORDBMS for the object–relational model . Other extensions can indicate some other characteristics, such as DDBMS for a distributed database management systems. The functionality provided by a DBMS can vary enormously. The core functionality is the storage, retrieval and update of data. Codd proposed

7840-455: The use of a "database management system" (DBMS), which is an integrated set of computer software that allows users to interact with one or more databases and provides access to all of the data contained in the database (although restrictions may exist that limit access to particular data). The DBMS provides various functions that allow entry, storage and retrieval of large quantities of information and provides ways to manage how that information

7938-460: The use of a "language" for data access , known as QUEL . Over time, INGRES moved to the emerging SQL standard. IBM itself did one test implementation of the relational model, PRTV , and a production one, Business System 12 , both now discontinued. Honeywell wrote MRDS for Multics , and now there are two new implementations: Alphora Dataphor and Rel. Most other DBMS implementations usually called relational are actually SQL DBMSs. In 1970,

8036-521: The user was assured that there were no remaining forward references within the spreadsheet. In 1968, three former employees from the General Electric computer company headquartered in Phoenix, Arizona set out to start their own software development house . A. Leroy Ellison, Harry N. Cantrell, and Russell E. Edwards found themselves doing a large number of calculations when making tables for

8134-476: The value 3 the result will be 15 . But C10 might also hold its formula referring to other cells, and so on. The ability to chain formulas together is what gives a spreadsheet its power. Many problems can be broken down into a series of individual mathematical steps, and these can be assigned to individual formulas in cells. Some of these formulas can apply to ranges as well, like the SUM function that adds up all

8232-443: Was ICL 's CAFS accelerator, a hardware disk controller with programmable search capabilities. In the long term, these efforts were generally unsuccessful because specialized database machines could not keep pace with the rapid development and progress of general-purpose computers. Thus most database systems nowadays are software systems running on general-purpose hardware, using general-purpose computer data storage. However, this idea

8330-538: Was a development of software written for the Apollo program on the System/360 . IMS was generally similar in concept to CODASYL, but used a strict hierarchy for its model of data navigation instead of CODASYL's network model. Both concepts later became known as navigational databases due to the way data was accessed: the term was popularized by Bachman's 1973 Turing Award presentation The Programmer as Navigator . IMS

8428-649: Was a spreadsheet application published by Sorcim in 1980, and originally bundled (along with WordStar) as part of the CP/M software package included with the Osborne 1 portable computer. It quickly became the de facto standard spreadsheet for CP/M. The introduction of Lotus 1-2-3 in November 1982 accelerated the acceptance of the IBM Personal Computer . It was written especially for IBM PC DOS and had improvements in speed and graphics compared to VisiCalc on

8526-412: Was also read and Mimer SQL was developed in the mid-1970s at Uppsala University . In 1984, this project was consolidated into an independent enterprise. Another data model, the entity–relationship model , emerged in 1976 and gained popularity for database design as it emphasized a more familiar description than the earlier relational model. Later on, entity–relationship constructs were retrofitted as

8624-475: Was called LANPAR — LANguage for Programming Arrays at Random. This was conceived and entirely developed in the summer of 1969, following Pardo and Landau's recent graduation from Harvard University. Co-inventor Rene Pardo recalls that he felt that one manager at Bell Canada should not have to depend on programmers to program and modify budgeting forms, and he thought of letting users type out forms in any order and having an electronic computer calculate results in

8722-466: Was developed by an independent author, Oliver Vellacott in the UK. Both FAL and TABOL were integrated with GEIS's database system, DMS. The IBM Financial Planning and Control System was developed in 1976, by Brian Ingham at IBM Canada. It was implemented by IBM in at least 30 countries. It ran on an IBM mainframe and was the first application for financial planning developed with APL that completely hid

8820-403: Was different from programs like BASIC, C, FORTRAN, and COBOL in that a lot of the dirty work had already been done. The data manipulation is done by dBASE instead of by the user, so the user can concentrate on what he is doing, rather than having to mess with the dirty details of opening, reading, and closing files, and managing space allocation." dBASE was one of the top selling software titles in

8918-546: Was generally not successful. The main concepts are those of a grid of cells , called a sheet, with either raw data, called values, or formulas in the cells. Formulas say how to mechanically compute new values from existing values. Values are general numbers, but can also be pure text, dates, months, etc. Extensions of these concepts include logical spreadsheets. Various tools for programming sheets, visualizing data, remotely connecting sheets, displaying cells' dependencies, etc. are commonly provided. A "cell" can be thought of as

9016-560: Was implemented on an IBM 1130 and in 1963 was ported to an IBM 7040 by R. Brian Walsh at Marquette University , Wisconsin . This program was written in Fortran . Primitive timesharing was available on those machines. In 1968 BCL was ported by Walsh to the IBM 360 /67 timesharing machine at Washington State University . It was used to assist in the teaching of finance to business students. Students were able to take information prepared by

9114-543: Was launched 39 years ago in 1985. Notable current spreadsheet software: Discontinued spreadsheet software: Several companies have attempted to break into the spreadsheet market with programs based on very different paradigms. Lotus introduced what is likely the most successful example, Lotus Improv , which saw some commercial success, notably in the financial world where its powerful data mining capabilities remain well respected to this day. Spreadsheet 2000 attempted to dramatically simplify formula construction, but

9212-422: Was more interested in the difference in semantics: the use of explicit identifiers made it easier to define update operations with clean mathematical definitions, and it also enabled query operations to be defined in terms of the established discipline of first-order predicate calculus ; because these operations have clean mathematical properties, it becomes possible to rewrite queries in provably correct ways, which

9310-422: Was picked up by two people at Berkeley, Eugene Wong and Michael Stonebraker . They started a project known as INGRES using funding that had already been allocated for a geographical database project and student programmers to produce code. Beginning in 1973, INGRES delivered its first test products which were generally ready for widespread use in 1979. INGRES was similar to System R in a number of ways, including

9408-416: Was the first electronic spreadsheet on mainframe and time sharing computers. LANPAR was an acronym: LANguage for Programming Arrays at Random. VisiCalc (1979) was the first electronic spreadsheet on a microcomputer, and it helped turn the Apple II into a popular and widely used personal computer. Lotus 1-2-3 was the leading spreadsheet when DOS was the dominant operating system. Microsoft Excel now has

9506-490: Was to organize the data as a number of " tables ", each table being used for a different type of entity. Each table would contain a fixed number of columns containing the attributes of the entity. One or more columns of each table were designated as a primary key by which the rows of the table could be uniquely identified; cross-references between tables always used these primary keys, rather than disk addresses, and queries would join tables based on these key relationships, using

9604-552: Was used successfully for many years in developing such applications as financial and costing models for the US Congress and for Conrail . APLDOT was dubbed a "spreadsheet" because financial analysts and strategic planners used it to solve the same problems they addressed with paper spreadsheet pads. The concept of spreadsheets became widely known due to VisiCalc , developed for the Apple II in 1979 by VisiCorp staff Dan Bricklin and Bob Frankston . Significantly, it also turned

#770229