<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-2508171737210923387</id><updated>2011-07-08T02:55:24.287-07:00</updated><title type='text'>Data Warehouse</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>5</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-2508171737210923387.post-7390108434922100746</id><published>2009-03-07T10:56:00.002-08:00</published><updated>2009-03-07T10:57:10.510-08:00</updated><title type='text'>Data warehouse</title><content type='html'>&lt;h3 id="siteSub"&gt;From Wikipedia, the free encyclopedia&lt;/h3&gt;                 &lt;!-- start content --&gt;    &lt;p&gt;&lt;b&gt;Data warehouse&lt;/b&gt; is a &lt;a href="http://en.wikipedia.org/wiki/Repository" title="Repository"&gt;repository&lt;/a&gt; of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis&lt;sup id="cite_ref-InmonDefinition_0-0" class="reference"&gt;&lt;a href="http://en.wikipedia.org/wiki/Data_warehouse#cite_note-InmonDefinition-0" title=""&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt; &lt;p&gt;This definition of the data warehouse focuses on data storage. However, the means to retrieve and analyze data, to extract, transform and load data, and to manage the &lt;a href="http://en.wikipedia.org/wiki/Data_dictionary" title="Data dictionary"&gt;data dictionary&lt;/a&gt; are also considered essential components of a data warehousing system. Many references to data warehousing use this broader context. Thus, an expanded definition for data warehousing includes &lt;a href="http://en.wikipedia.org/wiki/Business_intelligence_tools" title="Business intelligence tools"&gt;business intelligence tools&lt;/a&gt;, tools to &lt;a href="http://en.wikipedia.org/wiki/Extract,_transform,_load" title="Extract, transform, load"&gt;extract, transform, and load&lt;/a&gt; data into the repository, and tools to manage and retrieve &lt;a href="http://en.wikipedia.org/wiki/Metadata" title="Metadata"&gt;metadata&lt;/a&gt;.&lt;/p&gt; &lt;p&gt;In contrast to data warehouses are operational systems that perform day-to-day &lt;a href="http://en.wikipedia.org/wiki/Transaction_processing" title="Transaction processing"&gt;transaction processing&lt;/a&gt;.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2508171737210923387-7390108434922100746?l=iniwarehouse.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/7390108434922100746/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse_07.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/7390108434922100746'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/7390108434922100746'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse_07.html' title='Data warehouse'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2508171737210923387.post-1497809420379946991</id><published>2009-03-07T10:56:00.001-08:00</published><updated>2009-03-07T10:56:59.494-08:00</updated><title type='text'>Data warehouse</title><content type='html'>&lt;h3 id="siteSub"&gt;From Wikipedia, the free encyclopedia&lt;/h3&gt;                 &lt;!-- start content --&gt;    &lt;p&gt;&lt;b&gt;Data warehouse&lt;/b&gt; is a &lt;a href="http://en.wikipedia.org/wiki/Repository" title="Repository"&gt;repository&lt;/a&gt; of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis&lt;sup id="cite_ref-InmonDefinition_0-0" class="reference"&gt;&lt;a href="http://en.wikipedia.org/wiki/Data_warehouse#cite_note-InmonDefinition-0" title=""&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt; &lt;p&gt;This definition of the data warehouse focuses on data storage. However, the means to retrieve and analyze data, to extract, transform and load data, and to manage the &lt;a href="http://en.wikipedia.org/wiki/Data_dictionary" title="Data dictionary"&gt;data dictionary&lt;/a&gt; are also considered essential components of a data warehousing system. Many references to data warehousing use this broader context. Thus, an expanded definition for data warehousing includes &lt;a href="http://en.wikipedia.org/wiki/Business_intelligence_tools" title="Business intelligence tools"&gt;business intelligence tools&lt;/a&gt;, tools to &lt;a href="http://en.wikipedia.org/wiki/Extract,_transform,_load" title="Extract, transform, load"&gt;extract, transform, and load&lt;/a&gt; data into the repository, and tools to manage and retrieve &lt;a href="http://en.wikipedia.org/wiki/Metadata" title="Metadata"&gt;metadata&lt;/a&gt;.&lt;/p&gt; &lt;p&gt;In contrast to data warehouses are operational systems that perform day-to-day &lt;a href="http://en.wikipedia.org/wiki/Transaction_processing" title="Transaction processing"&gt;transaction processing&lt;/a&gt;.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2508171737210923387-1497809420379946991?l=iniwarehouse.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/1497809420379946991/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/1497809420379946991'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/1497809420379946991'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse.html' title='Data warehouse'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2508171737210923387.post-5134049234881124829</id><published>2009-03-07T10:55:00.002-08:00</published><updated>2009-03-07T10:56:27.561-08:00</updated><title type='text'>History of data warehousing</title><content type='html'>&lt;p&gt;The concept of data warehousing dates back to the late 1980s &lt;sup id="cite_ref-1" class="reference"&gt;&lt;a href="http://en.wikipedia.org/wiki/Data_warehouse#cite_note-1" title=""&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/sup&gt; when IBM researchers Barry Devlin and Paul Murphy developed the "business data warehouse". In essence, the data warehousing concept was intended to provide an architectural model for the flow of data from operational systems to &lt;a href="http://en.wikipedia.org/wiki/Decision_support" title="Decision support" class="mw-redirect"&gt;decision support&lt;/a&gt; environments. The concept attempted to address the various problems associated with this flow - mainly, the high costs associated with it. In the absence of a data warehousing architecture, an enormous amount of redundancy was required to support multiple decision support environments. In larger corporations it was typical for multiple decision support environments to operate independently. Each environment served different users but often required much of the same data. The process of gathering, cleaning and integrating data from various sources, usually long existing operational systems (usually referred to as &lt;a href="http://en.wikipedia.org/wiki/Legacy_systems" title="Legacy systems" class="mw-redirect"&gt;legacy systems&lt;/a&gt;), was typically in part replicated for each environment. Moreover, the operational systems were frequently reexamined as new decision support requirements emerged. Often new requirements necessitated gathering, cleaning and integrating new data from the operational systems that were logically related to prior gathered data.&lt;/p&gt; &lt;p&gt;Based on analogies with real-life warehouses, data warehouses were intended as large-scale collection/storage/staging areas for corporate data. Data could be retrieved from one central point or data could be distributed to "retail stores" or "&lt;a href="http://en.wikipedia.org/wiki/Data_mart" title="Data mart"&gt;data marts&lt;/a&gt;" that were tailored for ready access by users.&lt;/p&gt; &lt;p&gt;Key developments in early years of data warehousing were:&lt;/p&gt; &lt;ul&gt;&lt;li&gt;1960s - &lt;a href="http://en.wikipedia.org/wiki/General_Mills" title="General Mills"&gt;General Mills&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/Dartmouth_College" title="Dartmouth College"&gt;Dartmouth College&lt;/a&gt;, in a joint research project, develop the terms &lt;i&gt;dimensions&lt;/i&gt; and &lt;i&gt;facts&lt;/i&gt;.&lt;sup id="cite_ref-kimball16_2-0" class="reference"&gt;&lt;a href="http://en.wikipedia.org/wiki/Data_warehouse#cite_note-kimball16-2" title=""&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;&lt;li&gt;1970s - &lt;a href="http://en.wikipedia.org/wiki/ACNielsen" title="ACNielsen"&gt;ACNielsen&lt;/a&gt; and IRI provide dimensional data marts for retail sales.&lt;sup id="cite_ref-kimball16_2-1" class="reference"&gt;&lt;a href="http://en.wikipedia.org/wiki/Data_warehouse#cite_note-kimball16-2" title=""&gt;&lt;span&gt;&lt;/span&gt;&lt;span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;&lt;li&gt;1983 - &lt;a href="http://en.wikipedia.org/wiki/Teradata" title="Teradata"&gt;Teradata&lt;/a&gt; introduces a database management system specifically designed for decision support.&lt;/li&gt;&lt;li&gt;1988 - Barry Devlin and Paul Murphy publish the article &lt;i&gt;&lt;a href="http://www.research.ibm.com/journal/sj/271/ibmsj2701G.pdf" class="external text" title="http://www.research.ibm.com/journal/sj/271/ibmsj2701G.pdf" rel="nofollow"&gt;An architecture for a business and information systems&lt;/a&gt;&lt;/i&gt; in &lt;i&gt;IBM Systems Journal&lt;/i&gt; where they introduce the term "business data warehouse".&lt;/li&gt;&lt;li&gt;1990 - Red Brick Systems introduces Red Brick Warehouse, a database management system specifically for data warehousing.&lt;/li&gt;&lt;li&gt;1991 - Prism Solutions introduces Prism Warehouse Manager, software for developing a data warehouse.&lt;/li&gt;&lt;li&gt;1991 - &lt;a href="http://en.wikipedia.org/wiki/Bill_Inmon" title="Bill Inmon"&gt;Bill Inmon&lt;/a&gt; publishes the book &lt;i&gt;Building the Data Warehouse&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;1995 - The Data Warehousing Institute, a for-profit organization that promotes data warehousing, is founded.&lt;/li&gt;&lt;li&gt;1996 - &lt;a href="http://en.wikipedia.org/wiki/Ralph_Kimball" title="Ralph Kimball"&gt;Ralph Kimball&lt;/a&gt; publishes the book &lt;i&gt;The Data Warehouse Toolkit&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;1997 - Oracle 8, with support for star queries, is released.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2508171737210923387-5134049234881124829?l=iniwarehouse.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/5134049234881124829/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/history-of-data-warehousing.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/5134049234881124829'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/5134049234881124829'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/history-of-data-warehousing.html' title='History of data warehousing'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2508171737210923387.post-5878388435180330225</id><published>2009-03-07T10:55:00.001-08:00</published><updated>2009-03-07T10:55:47.442-08:00</updated><title type='text'>Data warehouse architecture</title><content type='html'>&lt;p&gt;Architecture, in the context of an organization's data warehousing efforts, is a conceptualization of how the data warehouse is built. There is no right or wrong architecture. The worthiness of the architecture can be judged in how the conceptualization aids in the building, maintenance, and usage of the data warehouse.&lt;/p&gt; &lt;p&gt;One possible simple conceptualization of a data warehouse architecture consists of the following interconnected layers:&lt;/p&gt; &lt;dl&gt;&lt;dt&gt;Operational database layer&lt;/dt&gt;&lt;dd&gt;The source data for the data warehouse - An organization's &lt;a href="http://en.wikipedia.org/wiki/Enterprise_resource_planning" title="Enterprise resource planning"&gt;ERP&lt;/a&gt; systems fall into this layer.&lt;/dd&gt;&lt;dt&gt;Data access layer&lt;/dt&gt;&lt;dd&gt;The interface between the operational and informational access layer - Tools to &lt;a href="http://en.wikipedia.org/wiki/Extract,_transform,_load" title="Extract, transform, load"&gt;extract, transform, load&lt;/a&gt; data into the warehouse fall into this layer.&lt;/dd&gt;&lt;dt&gt;Metadata layer&lt;/dt&gt;&lt;dd&gt;The data directory - This is usually more detailed than an operational system data directory. There are dictionaries for the entire warehouse and sometimes dictionaries for the data that can be accessed by a particular reporting and analysis tool.&lt;/dd&gt;&lt;dt&gt;Informational access layer&lt;/dt&gt;&lt;dd&gt;The data accessed for reporting and analyzing and the tools for reporting and analyzing data - &lt;a href="http://en.wikipedia.org/wiki/Business_intelligence" title="Business intelligence"&gt;Business intelligence&lt;/a&gt; tools fall into this layer. And the Inmon-Kimball differences about design methodology, discussed later in this article, have to do with this layer.&lt;/dd&gt;&lt;/dl&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2508171737210923387-5878388435180330225?l=iniwarehouse.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/5878388435180330225/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse-architecture.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/5878388435180330225'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/5878388435180330225'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/data-warehouse-architecture.html' title='Data warehouse architecture'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2508171737210923387.post-7548570722880343547</id><published>2009-03-07T10:54:00.000-08:00</published><updated>2009-03-07T10:55:24.660-08:00</updated><title type='text'>Normalized versus dimensional approach for storage of data</title><content type='html'>&lt;p&gt;There are two leading approaches to storing data in a data warehouse - the dimensional approach and the normalized approach.&lt;/p&gt; &lt;p&gt;In the dimensional approach, transaction data are partitioned into either "facts", which are generally numeric transaction data, or "dimensions", which are the reference information that gives context to the facts. For example, a sales transaction can be broken up into facts such as the number of products ordered and the price paid for the products, and into dimensions such as order date, customer name, product number, order ship-to and bill-to locations, and salesperson responsible for receiving the order. A key advantage of a dimensional approach is that the data warehouse is easier for the user to understand and to use. Also, the retrieval of data from the data warehouse tends to operate very quickly. The main disadvantages of the dimensional approach are: 1) In order to maintain the integrity of facts and dimensions, loading the data warehouse with data from different operational systems is complicated, and 2) It is difficult to modify the data warehouse structure if the organization adopting the dimensional approach changes the way in which it does business.&lt;/p&gt; &lt;p&gt;In the normalized approach, the data in the data warehouse are stored following, to a degree, &lt;a href="http://en.wikipedia.org/wiki/Database_normalization" title="Database normalization"&gt;database normalization&lt;/a&gt; rules. Tables are grouped together by &lt;b&gt;subject areas&lt;/b&gt; that reflect general data categories (e.g., data on customers, products, finance, etc.) The main advantage of this approach is that it is straightforward to add information into the database. A disadvantage of this approach is that, because of the number of tables involved, it can be difficult for users both to 1) join data from different sources into meaningful information and then 2) access the information without a precise understanding of the sources of data and of the &lt;a href="http://en.wikipedia.org/wiki/Data_structure" title="Data structure"&gt;data structure&lt;/a&gt; of the data warehouse.&lt;/p&gt; &lt;p&gt;These approaches are not mutually exclusive. Dimensional approaches can involve normalizing data to a degree.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2508171737210923387-7548570722880343547?l=iniwarehouse.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://iniwarehouse.blogspot.com/feeds/7548570722880343547/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/normalized-versus-dimensional-approach.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/7548570722880343547'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2508171737210923387/posts/default/7548570722880343547'/><link rel='alternate' type='text/html' href='http://iniwarehouse.blogspot.com/2009/03/normalized-versus-dimensional-approach.html' title='Normalized versus dimensional approach for storage of data'/><author><name>Andry Septia Nurrahman</name><uri>http://www.blogger.com/profile/11738634130176322155</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://4.bp.blogspot.com/_gmV1PiPRWCE/Sckx-3j5xjI/AAAAAAAAABA/oVWoojba14w/S220/Andry+Septia+Nurrahman.jpg'/></author><thr:total>1</thr:total></entry></feed>
