Data Warehousing and Data Mining

DATA WAREHOUSING

A data warehouse is a subject-oriented, integrated, time-variant, nonvolatile collection of data that supports management decision-making processes.

CHARACTERISTICS OF DATA WAREHOUSE:-

  1. It typically integrates several sources, e.g., sales databases from various regions.
  2. It must be optimized for access to very large amounts of data.
  3. It provides multi-user support.
  4. It maintains transparency.
  5. It is mostly read-accessed and rarely write-accessed.
  6. Data warehouses are maintained separately from operational data.
  7. It is capable of handling dynamic sparse matrices.
  8. It supports unrestricted cross-dimensional operations.  
  9. It provides consistent and flexible reporting performance.
  10. It supports unlimited dimensions and aggregation levels.
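
The cross-dimensional operations and aggregation levels listed above can be illustrated with a small roll-up. This is only a minimal sketch in plain Python: the `SALES` facts, the `DIMENSIONS` names, and the `roll_up` helper are hypothetical and stand in for what a real warehouse engine would do at far larger scale.

```python
from collections import defaultdict

# Hypothetical detailed sales facts: (region, product, year, amount).
SALES = [
    ("North", "Widget", 2023, 120.0),
    ("North", "Gadget", 2023, 80.0),
    ("South", "Widget", 2023, 200.0),
    ("South", "Widget", 2024, 150.0),
    ("North", "Gadget", 2024, 50.0),
]

# Assumed dimension names, in the same order as the tuple fields above.
DIMENSIONS = ("region", "product", "year")


def roll_up(facts, group_by):
    """Sum the amount measure over any combination of dimensions."""
    positions = [DIMENSIONS.index(name) for name in group_by]
    totals = defaultdict(float)
    for fact in facts:
        key = tuple(fact[i] for i in positions)
        totals[key] += fact[-1]  # the last field is the amount
    return dict(totals)


# Three aggregation levels computed from the same detailed data.
by_region = roll_up(SALES, ["region"])
by_region_year = roll_up(SALES, ["region", "year"])
grand_total = roll_up(SALES, [])
```

Grouping by fewer dimensions gives a coarser aggregation level, and grouping by none gives the grand total. The read-only nature of the computation reflects the mostly read-accessed workload described in the list.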

DATA WAREHOUSE COMPONENTS:-

  1. Summarized Data
  2. Operational Data-Store
  3. Integration/Transformation Programs
  4. Detailed Data
  5. Meta Data
  6. Archives 
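
As a rough illustration of how some of these components relate, the sketch below runs two small integration/transformation functions over regional extracts to produce detailed data, then derives summarized data from it. All field names, source layouts, and the `metadata` dictionary are assumptions made for the example, not a description of any particular product.

```python
from collections import defaultdict

# Hypothetical extracts from two regional operational databases.
# Each region uses its own field names and date format.
NORTH_ROWS = [
    {"sale_date": "2024-01-15", "item": "widget", "amount": 100.0},
    {"sale_date": "2024-02-03", "item": "gadget", "amount": 40.0},
]
SOUTH_ROWS = [
    {"dt": "20/01/2024", "product_name": "Widget", "total": 60.0},
]


def transform_north(row):
    """Map a North row (ISO dates) onto the common warehouse layout."""
    return {
        "region": "North",
        "month": row["sale_date"][:7],  # "YYYY-MM"
        "product": row["item"].title(),
        "amount": row["amount"],
    }


def transform_south(row):
    """Map a South row (DD/MM/YYYY dates) onto the common warehouse layout."""
    day, month, year = row["dt"].split("/")
    return {
        "region": "South",
        "month": f"{year}-{month}",
        "product": row["product_name"].title(),
        "amount": row["total"],
    }


# Detailed data: one integrated record per source transaction.
detailed = [transform_north(r) for r in NORTH_ROWS]
detailed += [transform_south(r) for r in SOUTH_ROWS]

# Summarized data: monthly totals per product across all regions.
summary = defaultdict(float)
for row in detailed:
    summary[(row["month"], row["product"])] += row["amount"]
summary = dict(summary)

# Meta data: a description of the warehouse layout and its sources.
metadata = {
    "fields": ["region", "month", "product", "amount"],
    "sources": ["North", "South"],
}
```

Here `detailed` keeps every transaction in one consistent layout, while `summary` holds the pre-aggregated figures that reports would typically read.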

DATA MINING

Data mining is a collection of techniques for finding previously undiscovered patterns in large volumes of data. It is a process of discovering new information. It is used in conjunction with data warehousing to support certain types of decisions, and it can also be applied to operational databases that hold individual transactions.

CHARACTERISTICS OF DATA MINING:-

  1. PREDICTION- Data mining helps predict how certain data attributes will behave in the future, which is especially useful in complex data scenarios. In a scientific context, for example, certain seismic wave patterns may predict an earthquake with high probability.
  2. IDENTIFICATION- A major goal of data mining is to identify the existence of an item, an event, or an activity by analyzing data patterns.
  3. CLASSIFICATION- Data mining helps classify data into different categories on the basis of certain parameters.
  4. OPTIMIZATION- A key goal of data mining is to optimize the use of limited resources such as time, cost, space, manpower, and machine power so as to maximize outputs such as profits and sales.
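
The classification goal above can be sketched with a nearest-centroid classifier, one of the simplest classification techniques. This is an illustrative choice only, not the method the text prescribes, and the customer data, labels, and function names are hypothetical.

```python
import math

# Hypothetical training data: (annual spend, visits per year) -> category.
TRAINING = [
    ((200.0, 2.0), "occasional"),
    ((250.0, 3.0), "occasional"),
    ((900.0, 12.0), "frequent"),
    ((1100.0, 15.0), "frequent"),
]


def centroids(samples):
    """Compute the mean feature vector of each category."""
    sums = {}
    counts = {}
    for features, label in samples:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {
        label: tuple(value / counts[label] for value in acc)
        for label, acc in sums.items()
    }


def classify(features, model):
    """Assign the category whose centroid is closest (Euclidean distance)."""
    return min(model, key=lambda label: math.dist(features, model[label]))


model = centroids(TRAINING)
prediction = classify((300.0, 4.0), model)
```

The parameters here are the two numeric features. In this sketch they are left unscaled, so spend dominates the distance; a practical version would normally normalize the features first.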

DATA MINING COMPONENTS:-

  1. Extraction
  2. Store and Manage
  3. Data Access
  4. Analyze
  5. Presentation
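
One possible reading of these five components is as stages of a pipeline. The sketch below wires them together around a very simple analysis: counting item pairs that occur together in shopping baskets. The function names, the raw data, and the `min_count` threshold are assumptions for illustration only.

```python
from collections import Counter
from itertools import combinations

# Hypothetical raw transaction lines from an operational system.
RAW_TRANSACTIONS = [
    "bread,milk",
    "bread,butter,milk",
    "butter,jam",
    "bread,milk,jam",
]


def extract(lines):
    """Extraction: parse raw lines into sorted, de-duplicated baskets."""
    return [sorted(set(line.split(","))) for line in lines]


def store(baskets):
    """Store and manage: keep baskets in a table keyed by transaction id."""
    return {tid: items for tid, items in enumerate(baskets, start=1)}


def access(table):
    """Data access: hand the stored baskets to the analysis step."""
    return list(table.values())


def analyze(baskets, min_count=2):
    """Analyze: keep item pairs that appear together in at least min_count baskets."""
    pairs = Counter()
    for items in baskets:
        pairs.update(combinations(items, 2))
    return {pair: n for pair, n in pairs.items() if n >= min_count}


def present(patterns):
    """Presentation: format the discovered patterns as readable lines."""
    return [f"{a} + {b}: {n} baskets" for (a, b), n in sorted(patterns.items())]


report = present(analyze(access(store(extract(RAW_TRANSACTIONS)))))
```

With this sample data, only bread and milk occur together often enough to pass the threshold, so the report contains a single pattern.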
