Reading/Writing Excel documents with the HadoopOffice library on Hadoop and Spark – First release

2017-01-08 --- Jörn Franke

Reading/Writing office documents, such as Excel, has been always challenging on Big data platforms. Although many libraries exist for reading/writing office documents, they have never been really integrated in Hadoop or Spark and thus lead to a lot of development efforts.

There are several use cases for using office documents jointly with Big data technologies:

Hence, the HadoopOffice library was created and the first version has just been released!

It features:

Of course, further releases are planned: