Using Apache Hive for distributed data warehouse on Hadoop | Alan Brown | LinkedIn

Using Apache Hive for distributed data warehouse on Hadoop

Apache Hive is a data warehouse system built to run on Hadoop. It is used for querying and managing large datasets that reside in distributed storage. Hive was developed at Facebook before becoming an open source project. Hive imposes structure on the data within Hadoop and lets users query that data with a SQL-like language called HiveQL (HQL).

Hive is widely used because its tables resemble tables in a relational database, so users who know SQL will find much of HiveQL familiar. Many users can also query the same data simultaneously using HiveQL.
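To make the "SQL-like" point concrete, here is a minimal sketch of the kind of aggregate query a SQL user would carry over to HiveQL. It assumes no Hive cluster is available, so it runs the query against Python's built-in SQLite as a local stand-in. The `page_views` table, its columns, and the sample rows are hypothetical and do not come from the original article. The query uses only basic SELECT, COUNT, GROUP BY and ORDER BY, which HiveQL also supports in this form. Real Hive table definitions typically add storage clauses that SQLite does not understand.

```python
import sqlite3

# Basic SQL that a HiveQL user would also recognize.
# Table and column names are hypothetical, chosen for illustration only.
QUERY = """
SELECT url, COUNT(*) AS views
FROM page_views
GROUP BY url
ORDER BY views DESC, url
"""


def top_pages(rows):
    """Load (user_id, url) rows into an in-memory table and run QUERY.

    SQLite is only a local stand-in here; on Hive the same query text
    would be submitted to a Hive server instead.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE page_views (user_id TEXT, url TEXT)")
        conn.executemany("INSERT INTO page_views VALUES (?, ?)", rows)
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


# Made-up sample data: (user_id, url) pairs.
sample_rows = [
    ("u1", "/home"),
    ("u2", "/home"),
    ("u1", "/docs"),
    ("u3", "/home"),
    ("u2", "/docs"),
    ("u3", "/pricing"),
]

result = top_pages(sample_rows)
print(result)
```

The query text is the part that transfers: someone comfortable writing it for a relational database could write essentially the same statement in HiveQL, with Hive handling execution over the distributed data.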

https://www.linkedin.com/pulse/using-apache-hive-distributed-data-warehouse-hadoop-alan-brown

