Misplaced Pages

Digital object memory

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

A digital object memory ( DOMe ) is a digital storage space intended to keep permanently all related information about a concrete physical object instance that is collected during the lifespan of this object and thus forms a basic building block for the Internet of Things (IoT) by connecting digital information with physical objects.

#907092

44-424: Such memories require each object instance to be uniquely identified and this ID to be attached to the physical object. The underlying techniques to create identification codes and to attach them to objects are manifold but machine-readable techniques are mandatory. Commonly used are barcodes with one or two dimensions (e.g. QRcode or DataMatrix ) and radio based tags like RFID or NFC . Such codes or tags are

88-780: A common framework for processing this information to extract meaning and create structured data about the information. Software that creates machine-processable structure can utilize the linguistic, auditory, and visual structure that exist in all forms of human communication. Algorithms can infer this inherent structure from text, for instance, by examining word morphology , sentence syntax, and other small- and large-scale patterns. Unstructured information can then be enriched and tagged to address ambiguities and relevancy-based techniques then used to facilitate search and discovery. Examples of "unstructured data" may include books, journals, documents, metadata , health records , audio , video , analog data , images, files, and unstructured text such as

132-437: A data model, generating summaries of subsets of documents (i.e., cells within a text cube) may be performed with phrase-based approaches. Biomedical research generates one major source of unstructured data as researchers often publish their findings in scholarly journals. Though the language in these documents is challenging to derive structural elements from (e.g., due to the complicated technical vocabulary contained within and

176-413: A decentralized way using RFID-tags. In addition order-related information are stored in centralized manufacturer databases. Such an architecture allows for transport of product-related data at the location of the physical object and process related data in real-time via backend systems. Identifier (computer science) An identifier is a name that identifies (that is, labels the identity of) either

220-490: A design space for exploring the value of a user-generated Internet of Old Things in which people's memories are linked to objects. The project Smart Product Networks (SmaProN), funded by the German Ministry of Education and Research, explores the dynamic linking of smart products to product bundles and product hierarchies, based on the developed Tip'n'Tell architecture (to integrate distributed product information) and

264-499: A digital memory and thus support intelligent applications along the product's lifecycle . By the use of integrated sensors , relations in the production process become transparent and supply chains as well as environmental influences retraceable. The producer gets supported and the consumer better informed about the product. Funded by the German Ministry of Education and Research, the project RES-COM (Resource Conservation by Context-Activated Machine-to-Machine Communication) focuses on

308-611: A good case example in a recent-decades, technical-nomenclature context. The capitalization variations seen with specific designators reveals an instance of this problem occurring in natural languages , where the proper noun/common noun distinction (and its complications) must be dealt with. A universe in which every object had a UID would not need any namespaces, which is to say that it would constitute one gigantic namespace; but human minds could never keep track of, or semantically interrelate, so many UIDs. Unstructured data Unstructured data (or unstructured information )

352-596: A leading innovation project, sponsored by the German Ministry of Education and Research that aims at obtaining comprehensive access to product information through the use of semantic technologies. The project follows an approach which does not only consult structured data from company-owned information sources, such as product databases, to respond to inquiries, it also looks at unstructured data from office documents and web 2.0 sources, such as wikis , blogs , and internet forums , as well as sensor and RFID data. The ADiWa project (Alliance Digital Product Flow), funded by

396-603: A low cost solution but demand an underlying server infrastructure to host the memory data. In contrast to the mentioned memories providing only a passive storage space, the more sophisticated active digital object memories ( ADOMe ) are based on embedded systems in terms of cyber-physical systems (CPS) and provide on the hardware side and on the software side there are Such active memories allow for "on-object" processing of object-related tasks, such as condition monitoring, compilation of associated data, and memory clean-up. In addition to strictly passive memories (storage space

440-480: A marble plate or a lump of gold. Such objects don't have all the attributes of industrial products, but nevertheless a digital black box attached to them for lifelogging can make sense for specific applications. Semantic product memories ( SemProM ) go beyond that, since they provide a machine-understandable meaning description of their contents based on semantic web technologies. If a product memory has no explicit semantic markup, only propriety software can exploit

484-852: A number of businesses to research applications, leading to the development of fields like sentiment analysis , voice of the customer mining, and call center optimization. The emergence of Big Data in the late 2000s led to a heightened interest in the applications of unstructured data analytics in contemporary fields such as predictive analytics and root cause analysis . The term is imprecise for several reasons: Techniques such as data mining , natural language processing (NLP), and text analytics provide different methods to find patterns in, or otherwise interpret, this information. Common techniques for structuring text usually involve manual tagging with metadata or part-of-speech tagging for further text mining -based structuring. The Unstructured Information Management Architecture (UIMA) standard provided

SECTION 10

#1732790691908

528-686: A unique object or a unique class of objects, where the "object" or class may be an idea, person, physical countable object (or class thereof), or physical noncountable substance (or class thereof). The abbreviation ID often refers to identity, identification (the process of identifying), or an identifier (that is, an instance of identification). An identifier may be a word, number, letter, symbol, or any combination of those. The words, numbers, letters, or symbols may follow an encoding system (wherein letters, digits, words, or symbols stand for [represent] ideas or longer names) or they may simply be arbitrary. When an identifier follows an encoding system, it

572-434: Is a language-independent label, sign or token that uniquely identifies an object within an identification scheme . The suffix "identifier" is also used as a representation term when naming a data element . ID codes may inherently carry metadata along with them. For example, when you know that the food package in front of you has the identifier "2011-09-25T15:42Z-MFR5-P02-243-45", you not only have that data, you also have

616-429: Is essential for any kind of symbolic processing. In computer languages , identifiers are tokens (also called symbols ) which name language entities. Some of the kinds of entities an identifier might denote include variables , types , labels , subroutines , and packages . A resource may carry multiple identifiers. Typical examples are: The inverse is also possible, where multiple resources are represented with

660-511: Is information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text -heavy, but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities that make it difficult to understand using traditional programs as compared to data stored in fielded form in databases or annotated ( semantically tagged ) in documents. In 1998, Merrill Lynch said "unstructured data comprises

704-497: Is located in the web as mentioned above) and active memories (with "on-object" storage) hybrid forms are also available, that perform simple tasks "on-product", outsource more complex tasks to server-based infrastructures, and keep both representations in sync. Digital product memories ( DPM ) are a subclass of digital object memories, which include memories for all artifacts that were intentionally created such as containers and pieces of art or valuable and rare natural objects such as

748-547: Is often referred to as a code or id code . For instance the ISO/IEC 11179 metadata registry standard defines a code as system of valid symbols that substitute for longer values in contrast to identifiers without symbolic meaning. Identifiers that do not follow any encoding scheme are often said to be arbitrary Ids ; they are arbitrarily assigned and have no greater meaning. (Sometimes identifiers are called "codes" even when they are actually arbitrary, whether because

792-426: Is tagged, but HTML mark-up typically serves solely for rendering. It does not capture the meaning or function of tagged elements in ways that support automated processing of the information content of the page. XHTML tagging does allow machine processing of elements, although it typically does not capture or convey the semantic meaning of tagged terms. Since unstructured data commonly occurs in electronic documents ,

836-465: The German tank problem ). Opaque identifiers—identifiers designed to avoid leaking even that small amount of information—include "really opaque pointers " and Version 4 UUIDs . In computer science , identifiers (IDs) are lexical tokens that name entities . Identifiers are used extensively in virtually all information processing systems. Identifying entities makes it possible to refer to them, which

880-410: The domain knowledge required to fully contextualize observations), the results of these activities may yield links between technical and medical studies and clues regarding new disease therapies. Recent efforts to enforce structure upon biomedical documents include self-organizing map approaches for identifying topics among documents, general-purpose unsupervised algorithms , and an application of

924-575: The identifier "Model T" identifies the class (model) of automobiles that Ford's Model T comprises; whereas the unique identifier "Model T Serial Number 159,862" identifies one specific member of that class—that is, one particular Model T car, owned by one specific person. The concepts of name and identifier are denotatively equal, and the terms are thus denotatively synonymous ; but they are not always connotatively synonymous, because code names and Id numbers are often connotatively distinguished from names in

SECTION 20

#1732790691908

968-560: The CaseOLAP workflow to determine associations between protein names and cardiovascular disease topics in the literature. CaseOLAP defines phrase-category relationships in an accurate (identifies relationships), consistent (highly reproducible), and efficient manner. This platform offers enhanced accessibility and empowers the biomedical community with phrase-mining tools for widespread biomedical research applications. In Sweden (EU), pre 2018, some data privacy regulations did not apply if

1012-575: The German Ministry of Education and Research, makes the huge potential of information from the Internet of Things accessible for business-relevant workflows that can be strategically planned and manipulated. For the data-level connection of objects from the real world, results from available solutions and from the SemProM project shall be used. ADiWa focuses on business processes , which can be controlled and manipulated based on evaluated information from

1056-583: The United Kingdom. A project aim is to explore the implications of Internet of Things technologies for the design of novel forms of augmented memory systems. While the potential implications of the Internet of Things for supply chain management and energy consumption have been acknowledged and discussed, its application for the engagement with personal and social memories has been rarely mentioned. More and more newly manufactured objects are often tagged at production and made traceable. Tales of Things provides

1100-407: The body of an e-mail message, Web page , or word-processor document. While the main content being conveyed does not have a defined structure, it generally comes packaged in objects (e.g. in files or documents, ...) that themselves have structure and are thus a mix of structured and unstructured data, but collectively this is still referred to as "unstructured data". For example, an HTML web page

1144-482: The data in question was confirmed as "unstructured". This terminology, unstructured data, is rarely used in the EU after GDPR came into force in 2018. GDPR does neither mention nor define "unstructured data". It does use the word "structured" as follows (without defining it); GDPR Case-law on what defines a "filing system"; "the specific criterion and the specific form in which the set of personal data collected by each of

1188-401: The development of technologies (containing interfaces, protocols and data models) for proactive resource conservation based on M2M -communication. With a defined interaction with active digital object memories the project tries to leverage the integration of distributed and active components to existing centralized structures in the field of industry and manufacturing. The Aletheia project is

1232-415: The environment. The project thereby also focuses on small devices with limited storage capabilities and thus also requires efficient storage mechanisms. Moreover, the project aims to apply the results achieved by the incubator group for optimizing the data exchange between different smart products. Tales of Things and electronic Memory (TOTeM) is a three-year collaborative project between five universities in

1276-606: The extraction and classification of unstructured text. However, only since the turn of the century has the technology caught up with the research interest. In 2004, the SAS Institute developed the SAS Text Miner, which uses Singular Value Decomposition (SVD) to reduce a hyper-dimensional textual space into smaller dimensions for significantly more efficient machine-analysis. The mathematical and technological advances sparked by machine textual analysis prompted

1320-459: The global datasphere will grow to 163 zettabytes by 2025 and majority of that will be unstructured. The Computer World magazine states that unstructured information might account for more than 70–80% of all data in organizations. The earliest research into business intelligence focused in on unstructured textual data, rather than numerical data. As early as 1958, computer science researchers like H.P. Luhn were particularly concerned with

1364-718: The information stored in the memory. In contrast, semantic product memories can be interpreted by any software that has access to the semantic description of the epistemological primitives and the ontologies used for capturing memory contents. In the context of the Object Memory Modeling Incubator Group, part of the W3C Incubator Activity, an object memory format, which allows for modeling of events or other information about individual physical artifacts (ideally over their lifetime) and thus implements an object memory model ( OMM ),

Digital object memory - Misplaced Pages Continue

1408-433: The members who engage in preaching is actually structured is irrelevant, so long as that set of data makes it possible for the data relating to a specific person who has been contacted to be easily retrieved , which is however for the referring court to ascertain in the light of all the circumstances of the case in the main proceedings.” ( CJEU , Todistajat v. Tietosuojavaltuutettu, Jehovan, Paragraph 61 ). If personal data

1452-692: The metadata that tells you that it was packaged on September 25, 2011, at 3:42pm UTC, manufactured by Licensed Vendor Number 5, at the Peoria, IL, USA plant, in Building 2, and was the 243rd package off the line in that shift, and was inspected by Inspector Number 45. Arbitrary identifiers might lack metadata. For example, if a food package just says 100054678214, its ID may not tell anything except identity—no date, manufacturer name, production sequence rank, or inspector number. In some cases, arbitrary identifiers such as sequential serial numbers leak information (i.e.

1496-434: The original naming convention, which had formerly been latent and moot, become painfully apparent, often necessitating retronymy , synonymity , translation/ transcoding , and so on. Such limitations generally accompany the shift away from the original context to the broader one. Typically the system shows implicit context (context was formerly assumed, and narrow), lack of capacity (e.g., low number of possible IDs, reflecting

1540-451: The outmoded narrow context), lack of extensibility (no features defined and reserved against future needs), and lack of specificity and disambiguating capability (related to the context shift, where longstanding uniqueness encounters novel nonuniqueness). Within computer science, this problem is called naming collision . The story of the origination and expansion of the CODEN system provides

1584-440: The product description model SPDO (using semantic web-based representation languages). The project RFID-Based Automotive Network (RAN), founded by the German Ministry of Education and Research, focuses on the development of a RFID-based hybrid control architecture and valuation methods for value added chains. Using the example of automobile industry they develop a combined data management that exchanges product-related information in

1628-537: The real world. The SmartProducts project, funded by the European Union in the 7th Research Framework Programme (FP7), develops the scientific and technological basis for building "smart products" with embedded proactive knowledge. Smart products help customers, designers and workers to deal with the ever-increasing complexity and variety of modern products. Such smart products leverage proactive knowledge to communicate and co-operate with humans, other products and

1672-406: The same identifier (discussed below). Many codes and nomenclatural systems originate within a small namespace . Over the years, some of them bleed into larger namespaces (as people interact in ways they formerly had not, e.g., cross-border trade, scientific collaboration, military alliance, and general cultural interconnection or assimilation). When such dissemination happens, the limitations of

1716-467: The sense of traditional natural language naming. For example, both " Jamie Zawinski " and " Netscape employee number 20" are identifiers for the same specific human being; but normal English-language connotation may consider "Jamie Zawinski" a "name" and not an "identifier", whereas it considers "Netscape employee number 20" an "identifier" but not a "name." This is an emic indistinction rather than an etic one. In metadata, an identifier

1760-405: The speaker believes that they have deeper meaning or simply because they are speaking casually and imprecisely.) The unique identifier ( UID ) is an identifier that refers to only one instance —only one particular object in the universe. A part number is an identifier, but it is not a unique identifier—for that, a serial number is needed, to identify each instance of the part design. Thus

1804-413: The unstructured data contained within text documents. These workflows are generally designed to handle sets of thousands or even millions of documents, or far more than manual approaches to annotation may permit. Several of these approaches are based upon the concept of online analytical processing, or OLAP , and may be supported by data models such as text cubes. Once document metadata is available through

Digital object memory - Misplaced Pages Continue

1848-455: The use of a content or document management system which can categorize entire documents is often preferred over data transfer and manipulation from within the documents. Document management thus provides the means to convey structure onto document collections . Search engines have become popular tools for indexing and searching through such data, especially text. Specific computational workflows have been developed to impose structure upon

1892-457: The vast majority of data found in an organization, some estimates run as high as 80%." It's unclear what the source of this number is, but nonetheless it is accepted by some. Other sources have reported similar or higher percentages of unstructured data. As of 2012 , IDC and Dell EMC project that data will grow to 40 zettabytes by 2020, resulting in a 50-fold growth from the beginning of 2010. More recently, IDC and Seagate predict that

1936-432: Was created. The model consists of a block-based approach to partition the entire memory to groups each with associated object-related information. Each block consists of the data itself (the so-called payload ) and a set of metadata attributes to describe the block content. Funded by the German Ministry of Education and Research, the project SemProM (Semantic Product Memory) employs smart labels in order to give products

#907092