数据仓库,就不是数据库了吗?

楼主
讨论所有与SQL有关的内容



来源:https://www.guru99.com/database-vs-data-warehouse.html

 

本来摘自guru99, 粗略过了下,非常适合入门的读者阅读。对数据仓库好有更清晰 的了解。如果大家有需求翻译成中文的,可阅后留言。

 

What is Database?

A database is a collection of related data which represents some elements of the real world. It is designed to be built and populated with data for a specific task. It is also a building block of your data solution.

In this tutorial, you will learn

 

  • What is Database?

     

  • What is a Data Warehouse?

     

  • Why use a Database?

     

  • Why Use Data Warehouse?

     

  • Characteristics of Database

     

  • Characteristics of Data Warehouse

     

  • Difference between Database and Data Warehouse

     

  • Applications of Database

     

  • Applications of Data Warehousing

     

  • Disadvantages of Database

     

  • Disadvantages of Data Warehouse

What is a Data Warehouse?

A data warehouse is an information system which stores historical and commutative data from single or multiple sources. It is designed to analyze, report, integrate transaction data from different sources.

Data Warehouse eases the analysis and reporting process of an organization. It is also a single version of truth for the organization for decision making and forecasting process.

Why use a Database?

Here, are prime reasons for using Database system:

  • It offers the security of data and its access

  • A database offers a variety of techniques to store and retrieve data.

  • Database act as an efficient handler to balance the requirement of multiple applications using the same data

  • A DBMS offers integrity constraints to get a high level of protection to prevent access to prohibited data.

  • A database allows you to access concurrent data in such a way that only a single user can access the same data at a time.

 

Why Use Data Warehouse?

Here, are Important reasons for using Data Warehouse:

 

  • Data warehouse helps business users to access critical data from some sources all in one place.

     

  • It provides consistent information on various cross-functional activities

     

  • Helps you to integrate many sources of data to reduce stress on the production system.

     

  • Data warehouse helps you to reduce TAT (total turnaround time) for analysis and reporting.

     

  • Data warehouse helps users to access critical data from different sources in a single place so, it saves user's time of retrieving data information from multiple sources. You can also access data from the cloud easily.

     

  • Data warehouse allows you to stores a large amount of historical data to analyze different periods and trends to make future predictions.

     

  • Enhances the value of operational business applications and customer relationship management systems

     

  • Separates analytics processing from transactional databases, improving the performance of both systems

     

  • Stakeholders and users may be overestimating the quality of data in the source systems. Data warehouse provides more accurate reports.

     

Characteristics of Database

 

  • Offers security and removes redundancy

  • Allow multiple views of the data

  • Database system follows the ACID compliance ( Atomicity, Consistency, Isolation, and Durability).

  • Allows insulation between programs and data

  • Sharing of data and multiuser transaction processing

  • Relational Database support multi-user environment

 

Characteristics of Data Warehouse

 

  • A data warehouse is subject oriented as it offers information related to theme instead of companies' ongoing operations.

     

  • The data also needs to be stored in the Datawarehouse in common and unanimously acceptable manner.

     

  • The time horizon for the data warehouse is relatively extensive compared with other operational systems.

     

  • A data warehouse is non-volatile which means the previous data is not erased when new information is entered in it.

Difference between Database and Data Warehouse

 

Parameter Database Data Warehouse
Purpose Is designed to record Is designed to analyze
Processing Method The database uses the Online Transactional Processing (OLTP) Data warehouse uses Online Analytical Processing (OLAP).
Usage The database helps to perform fundamental operations for your business Data warehouse allows you to analyze your business.
Tables and Joins Tables and joins of a database are complex as they are normalized. Table and joins are simple in a data warehouse because they are denormalized.
Orientation Is an application-oriented collection of data It is a subject-oriented collection of data
Storage limit Generally limited to a single application Stores data from any number of applications
Availability Data is available real-time Data is refreshed from source systems as and when needed
Usage ER modeling techniques are used for designing. Data modeling techniques are used for designing.
Technique Capture data Analyze data
Data Type Data stored in the Database is up to date. Current and Historical Data is stored in Data Warehouse. May not be up to date.
Storage of data Flat Relational Approach method is used for data storage. Data Ware House uses dimensional and normalized approach for the data structure. Example: Star and snowflake schema.
Query Type Simple transaction queries are used. Complex queries are used for analysis purpose.
Data Summary Detailed Data is stored in a database. It stores highly summarized data.

 

Applications of Database

 

Sector Usage
Banking Use in the banking sector for customer information, account-related activities, payments, deposits, loans, credit cards, etc.
Airlines Use for reservations and schedule information.
Universities To store student information, course registrations, colleges, and results.
Telecommunication It helps to store call records, monthly bills, balance maintenance, etc.
Finance Helps you to store information related stock, sales, and purchases of stocks and bonds.
Sales & Production Use for storing customer, product and sales details.
Manufacturing It is used for the data management of the supply chain and for tracking production of items, inventories status.
HR Management Detail about employee's salaries, deduction, generation of paychecks, etc.

 

Applications of Data Warehousing

 

Sector Usage
Airline It is used for airline system management operations like crew assignment, analyzes of route, frequent flyer program discount schemes for passenger, etc.
Banking It is used in the banking sector to manage the resources available on the desk effectively.
Healthcare sector Data warehouse used to strategize and predict outcomes, create patient's treatment reports, etc. Advanced machine learning, big data enable datawarehouse systems can predict ailments.
Insurance sector Data warehouses are widely used to analyze data patterns, customer trends, and to track market movements quickly.
Retain chain It helps you to track items, identify the buying pattern of the customer, promotions and also used for determining pricing policy.
Telecommunication In this sector, data warehouse used for product promotions, sales decisions and to make distribution decisions.

 

Disadvantages of Database

 

  • Cost of Hardware and Software of an implementing Database system is high which can increase the budget of your organization.

     

  • Many DBMS systems are often complex systems, so the training for users to use the DBMS is required.

     

  • DBMS can't perform sophisticated calculations

  • Issues regarding compatibility with systems which is already in place

  • Data owners may lose control over their data, raising security, ownership, and privacy issues.

 

Disadvantages of Data Warehouse

 

  • Adding new data sources takes time, and it is associated with high cost.

     

  • Sometimes problems associated with the data warehouse may be undetected for many years.

     

  • Data warehouses are high maintenance systems. Extracting, loading, and cleaning data could be time-consuming.

  • The data warehouse may look simple, but actually, it is too complicated for the average users. You need to provide training to end-users, who end up not using the data mining and warehouse.

  • Despite best efforts at project management, the scope of data warehousing will always increase.

 

What Works Best for You?

To sum up, we can say that the database helps to perform the fundamental operation of business while the data warehouse helps you to analyze your business. You choose either one of them based on your business goals.

编辑于 2021-10-12 17:01
分享扩散:

沙发
发表于 2021-10-12 17:07:38
什么是数据库?
数据库是代表现实世界中某些元素的相关数据的集合。它旨在为特定任务构建和填充数据。它也是您的数据解决方案的构建块。

在本教程中,您将学习



什么是数据库?



什么是数据仓库?



为什么要使用数据库?



为什么要使用数据仓库?



数据库的特点



数据仓库的特点



数据库和数据仓库的区别



数据库应用



数据仓库的应用



数据库的缺点



数据仓库的缺点

什么是数据仓库?
数据仓库是一种信息系统,它存储来自单个或多个来源的历史和交换数据。它旨在分析​​、报告、整合来自不同来源的交易数据。

数据仓库简化了组织的分析和报告过程。它也是组织决策和预测过程的单一版本。

为什么要使用数据库?
以下是使用数据库系统的主要原因:

它提供数据及其访问的安全性

数据库提供了多种技术来存储和检索数据。

数据库作为一个有效的处理程序来平衡使用相同数据的多个应用程序的需求

DBMS 提供完整性约束以获得高级保护,以防止访问被禁止的数据。

数据库允许您以这样一种方式访问​​并发数据,即一次只有一个用户可以访问相同的数据。


为什么要使用数据仓库?
以下是使用数据仓库的重要原因:



数据仓库可帮助业务用户在一处访问来自某些来源的关键数据。



它提供有关各种跨职能活动的一致信息



帮助您集成多种数据源,以减轻生产系统的压力。



数据仓库可帮助您减少用于分析和报告的 TAT(总周转时间)。



数据仓库帮助用户在一个地方访问来自不同来源的关键数据,从而节省用户从多个来源检索数据信息的时间。您还可以轻松地从云端访问数据。



数据仓库可以让你存储大量的历史数据来分析不同的时期和趋势,从而做出未来的预测。



提高运营业务应用程序和客户关系管理系统的价值



将分析处理与事务数据库分开,提高两个系统的性能



利益相关者和用户可能高估了源系统中的数据质量。数据仓库提供更准确的报告。



数据库的特点


提供安全性并消除冗余

允许多个数据视图

数据库系统遵循 ACID 合规性(原子性、一致性、隔离性和持久性)。

允许程序和数据之间的隔离

数据共享和多用户事务处理

关系型数据库支持多用户环境


数据仓库的特点


数据仓库是面向主题的,因为它提供与主题相关的信息,而不是公司正在进行的运营。



数据还需要以共同且一致可接受的方式存储在数据仓库中。



与其他操作系统相比,数据仓库的时间范围相对广泛。



数据仓库是非易失性的,这意味着在输入新信息时不会删除以前的数据。

数据库和数据仓库的区别




范围        数据库        数据仓库
目的        旨在记录        旨在分析
加工方式        数据库使用在线事务处理 (OLTP)        数据仓库使用在线分析处理 (OLAP)。
用法        数据库有助于为您的业务执行基本操作        数据仓库允许您分析您的业务。
表和联接        数据库的表和连接很复杂,因为它们是规范化的。        表和联接在数据仓库中很简单,因为它们是非规范化的。
方向        是一个面向应用的数据集合        它是面向主题的数据集合
存储限制        通常仅限于单个应用程序        存储来自任意数量应用程序的数据
可用性        数据实时可用        在需要时从源系统刷新数据
用法        ER 建模技术用于设计。        数据建模技术用于设计。
技术        捕获数据        分析数据
数据类型        存储在数据库中的数据是最新的。        当前和历史数据存储在数据仓库中。可能不是最新的。
数据存储        Flat Relational Approach 方法用于数据存储。        数据仓库对数据结构使用维度和规范化方法。示例:星形和雪花模式。
查询类型        使用简单的事务查询。        复杂查询用于分析目的。
数据汇总        详细数据存储在数据库中。        它存储高度汇总的数据。

数据库应用


扇区        用法
银行业        用于银行部门的客户信息、账户相关活动、支付、存款、贷款、信用卡等。
航空公司        用于预订和日程信息。
大学        存储学生信息、课程注册、大学和结果。
电信        它有助于存储通话记录、每月账单、余额维护等。
金融        帮助您存储与股票、股票和债券的销售和购买相关的信息。
销售与生产        用于存储客户、产品和销售详细信息。
制造业        它用于供应链的数据管理和跟踪项目的生产、库存状态。
人力资源管理        员工工资、扣减、工资发放等详细信息。

数据仓库的应用


扇区        用法
航空公司        它用于航空公司系统管理操作,如机组分配、航线分析、旅客常旅客计划折扣计划等。
银行业        它用于银行业以有效管理桌面上的可用资源。
医疗保健行业        数据仓库用于制定策略和预测结果、创建患者的治疗报告等。先进的机器学习、大数据使数据仓库系统可以预测疾病。
保险业        数据仓库广泛用于分析数据模式、客户趋势以及快速跟踪市场动向。
保留链        它可以帮助您跟踪项目、识别客户的购买模式、促销活动,还用于确定定价政策。
电信        在这个领域,数据仓库用于产品促销、销售决策和分销决策。

数据库的缺点


实施数据库系统的硬件和软件成本很高,这会增加您组织的预算。



许多 DBMS 系统往往是复杂的系统,因此需要对用户进行 DBMS 使用培训。



DBMS 无法执行复杂的计算

与现有系统的兼容性问题

数据所有者可能会失去对其数据的控制,从而引发安全、所有权和隐私问题。


数据仓库的缺点


添加新数据源需要时间,并且成本高。



有时,与数据仓库相关的问题可能多年未被发现。



数据仓库是高维护系统。提取、加载和清理数据可能非常耗时。

数据仓库看似简单,但实际上对于普通用户来说太复杂了。您需要为最终不使用数据挖掘和仓库的最终用户提供培训。

尽管在项目管理方面尽了最大努力,但数据仓库的范围总是会增加。


什么最适合你?
综上所述,我们可以说数据库帮助执行业务的基本操作,而数据仓库则帮助您分析您的业务。您可以根据自己的业务目标选择其中之一。
板凳
发表于 2021-10-13 10:04:41

英文的,逃跑
地板
发表于 2021-10-13 14:42:23
这全英文的入门者也费劲呀
5楼
发表于 2021-10-13 15:05:43
what fu? are you think I can catch that?
6楼
发表于 2021-10-13 20:59:50 发布于APP客户端
翻译成中文好些
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

返回顶部 返回列表