big data – Zukunft-Innovation-Technik (ZuInnoTe)

Revisiting Big Data Formats: Apache Iceberg, Delta Lake and Apache Hudi

Aug. 5, 2023

—

by

in analytics, big data, cloud, flink, hadoop, hive, spark

Years ago, novel Big Data formats such as Apache Parquet, Apache ORC, and Apache Avro were a game changer for processing massive amounts of data efficiently, alongside the Big Data platforms that leverage them, as I wrote in a previous blog post. Nowadays we see the emergence of new Big Data formats, such as…

A Study on using a Rust-based dynamic Module system in WebAssembly for processing Data

Aug. 1, 2022

—

by

Jörn Franke

in artificial intelligence, big data, cloud, serverless computing, webassembly

I have also published the source code for this study on Codeberg (EU Git hosting) and GitHub (US Git hosting). Nowadays we have a plethora of programming languages and platforms at our fingertips. Each has different advantages and disadvantages, depending on the use case and preferences. Often many different combinations of components are used for data…

HadoopOffice – A Vision for the coming Years

Jan. 2, 2018

—

by

Jörn Franke

in analytics, flink, hive, office, streaming, tech

HadoopOffice has been available for more than a year (first commit: 16.10.2016). Currently it supports Excel formats based on the Apache POI parsers/writers. In the meantime, a lot of functionality has been added, such as: support for .xlsx and .xls formats (reading and writing); encryption/decryption support; support for the Hadoop mapred.* and mapreduce.* APIs; support for Spark…

Tag: big data
