Hive is no longer using Mapreduce but Apache Tez that follow DAG and avoid multiple times reload of data.
@StarburstData3 ай бұрын
Yes, good call out! Although Hive/Tez isn't used in data lakehouses either, Apache Tez is used in some Hive implementations of data lakes and it does reduce some of the issues associated with traditional Hive. You can think of those Hive implementations as moving a bit closer to a data lakehouse but a true lakehouse requires one of the 3 modern table formats: Iceberg, Delta Lake, or Hudi.