Apache Hive Interview Questions and Answers
Freshers / Beginner level questions & answers
Ques 1. What is Apache Hive?
Apache Hive is a data warehouse system built on top of Apache Hadoop. It provides a SQL-like query language, HiveQL, for reading, writing, and managing large datasets.
Example:
SELECT * FROM table_name;
Ques 2. What is HiveQL?
Hive Query Language (HiveQL) is the SQL-like language used to define, load, and query data in Hive tables. Hive compiles HiveQL statements into jobs that run on the Hadoop cluster.
Example:
SELECT column1, column2 FROM table_name WHERE condition;
Ques 3. What is the purpose of Hive metastore?
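The Hive metastore stores metadata about Hive databases, tables, and partitions, such as column names and types, storage locations, and SerDe information. This metadata is kept in a relational database, separate from the table data itself.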
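Example:
The statements below are a minimal sketch of how metastore metadata can be inspected from HiveQL. As in the other examples, table_name is a placeholder rather than a real table.

```sql
-- List the tables registered in the current database.
SHOW TABLES;

-- Show the columns, storage location, SerDe, and table type recorded
-- for a single table (table_name is a placeholder).
DESCRIBE FORMATTED table_name;
```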
Ques 4. What is the purpose of Hive SerDe?
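A Hive SerDe (Serializer/Deserializer) tells Hive how to interpret a table's data format. The deserializer turns stored records into rows that Hive can query, and the serializer writes rows back out in that format. Custom SerDes allow Hive to read and write formats it does not handle by default.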
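Example:
A SerDe is usually named in the table definition. The sketch below assumes a hypothetical table csv_events and uses OpenCSVSerde, which ships with Hive and reads every column as a string. The table and column names are illustrative only.

```sql
-- csv_events and its columns are hypothetical names for illustration.
CREATE TABLE csv_events (
  event_id   STRING,
  event_name STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
  "separatorChar" = ",",   -- field delimiter
  "quoteChar"     = "\""   -- character that wraps quoted fields
)
STORED AS TEXTFILE;
```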
Ques 5. What is the purpose of Hive partitions?
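Hive partitions divide a table into smaller parts based on the values of one or more partition columns. Each partition is stored in its own directory, so a query that filters on a partition column can skip the partitions it does not need instead of scanning the whole table.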
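Example:
The sketch below assumes a hypothetical sales table partitioned by a sale_date column. The names and the date value are illustrative only.

```sql
-- sales, order_id, amount, and sale_date are hypothetical names.
CREATE TABLE sales (
  order_id BIGINT,
  amount   DOUBLE
)
PARTITIONED BY (sale_date STRING);

-- Filtering on the partition column lets Hive read only the
-- matching partition rather than the full table.
SELECT order_id, amount
FROM sales
WHERE sale_date = '2024-01-01';
```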
Ques 6. What is the purpose of Hive HCatalog?
HCatalog is a storage and table management layer for Hadoop that enables sharing of data between Pig, MapReduce, and Hive.
Ques 7. What is Hive's role in the Hadoop ecosystem?
Hive provides a high-level, SQL-like interface for querying and analyzing data stored in the Hadoop Distributed File System (HDFS).
Ques 8. What are the types of Hive tables?
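Hive supports managed (internal) tables and external tables. Managed tables store data in a Hive-controlled location, and dropping a managed table deletes both its metadata and its data. External tables reference data stored outside Hive's control, and dropping an external table removes only the metadata while the underlying files remain.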
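Example:
The two table types differ mainly in the CREATE statement. In this sketch, the table names and the /data/logs path are hypothetical.

```sql
-- Managed table: Hive owns the data under its warehouse directory.
CREATE TABLE managed_logs (
  line STRING
);

-- External table: Hive only records the schema and points at
-- existing files (the path below is a hypothetical location).
CREATE EXTERNAL TABLE external_logs (
  line STRING
)
LOCATION '/data/logs';
```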
Ques 9. How can you limit the number of rows returned in a Hive query?
You can use the 'LIMIT' clause to restrict the number of rows returned in a Hive query.
Example:
SELECT * FROM table_name LIMIT 10;