Jörn Franke – Seite 4 – Zukunft-Innovation-Technik (ZuInnoTe)

Spark+Scala+Graphx: Analyzing the Bitcoin Transaction Graph

Sep. 20, 2016

—

von

in analytics, big data, bitcoin, graph, tech

The hadoopcryptoledger library provides now an example how you can generate a Bitcoin Transaction Graph using the Big Data graph analysis technologies Spark+Scala+Graphx. Basically it demonstrates how to read the Bitcoin Blockchain from HDFS, transform it into a graph with Bitcoin addresses as vertices and transactions between them as edges. The example returns the 5…

Hive & Bitcoin: Analytics on Blockchain data with SQL

Apr. 28, 2016

—

von

Jörn Franke

in analytics, big data, bitcoin, data warehouse, hive, tech

You can now analyze the Bitcoin Blockchain using Hive and the hadoopcryptoledger library with the new HiveSerde plugin. Basically you can link any data that you loaded in Hive with Bitcoin Blockchain data. For example, you can link Blockchain data with important events in history to determine what causes Bitcoin exchange rates to increase or…

Using Apache Spark to Analyze the Bitcoin Blockchain

Apr. 17, 2016

—

von

Jörn Franke

in analytics, big data, bitcoin, tech

The hadoopcryptoledger library provides now a simple example how you can analyze the Bitcoin Blockchain with Apache Spark. Previously, I described how you can use Hadoop MR or any other Hadoop ecosystem-compatible application to analyze it. Basically, it leverages the HadoopRDD API to read the Hadoop File Format of the hadoopcryptoledger library. Afterwards you can…

Analyzing the Bitcoin Blockchain using the Hadoop Ecosystem – A first Approach

Apr. 10, 2016

—

von

Jörn Franke

in analytics, big data, bitcoin, tech

Bitcoin and other crytocurrencies have drawn a lot of attention of companies, public organizations and individuals. While many use cases exists there is still a long road ahead to make them part of everybody’s life. The recently released first version of the open source hadoopycryptoledger library is a first attempt to make this happen. It…

Batch-processing & Interactive Analytics for Big Data – the Role of in-Memory

Dez. 22, 2015

—

von

Jörn Franke

in big data, business, emerging trends, enterprise application integration, in-memory

In this blog post I will discuss various aspects of in-memory technologies and describe how various Big Data technologies fit into this context. Especially, I will focus on the difference between in-memory batch analytics and interactive in-memory analytics. Additionally, I will illustrate when in-memory technology is really beneficial. In-memory technology leverages the fast main memory…

Hive Optimizations with Indexes, Bloom-Filters and Statistics

Juli 25, 2015

—

von

Jörn Franke

in big data, data warehouse, hive, tech

This blog post describes how Storage Indexes, Bitmap Indexes, Compact Indexes, Aggregate Indexes, Covering Indexes/Materialized Views, Bloom-Filters and statistics can increase performance with Apache Hive to enable a real-time datawarehouse. Furthermore, I will address how index-paradigms change due to big data volumes. Generally it is recommended to use less traditional indexes, but focus on storage indexes…

Big Data – What is next? OLTP, OLAP, Predictive Analytics, Sampling and Probabilistic Databases

Juni 28, 2015

—

von

Jörn Franke

in big data, business, emerging trends, enterprise architecture

Big Data has matured over the last years and is becoming more and more a standard technology used in various industries. Coming from established concepts, such as OLAP or OLTP, in context of Big Data, I go in this blog post beyond them describing what is needed for next generation applications, such as autonomous cars, industry…

Big Data Lab in the Cloud with Hadoop+Spark+R+Python

Apr. 18, 2015

—

von

Jörn Franke

in analytics, big data, cloud, tech

This is an update of the second big data lab for the cloud. Similar to previous versions, this document described how you can create a Big Data Lab in the cloud on Amazon EMR. Besides some major upgrades to the newest Amazon Hadoop AMI (3.6.0) Spark (1.3.0) and R, it includes now also the possibility…

Master Data Management and the Internet of Things

Jan. 1, 2015

—

von

Jörn Franke

in big data, business, master data management

Master Data Management (MDM) has matured and grown significantly over the last years. The main motivation for master data management is to have a complete and accurate view on master data objects in your organization. Master data objects describe key assets, such as machines or customers, generating value for your organization. Hence, MDM fosters processes…

Enabling WebRTC in modern Java Enterprise Web Applications

Dez. 30, 2014

—

von

Jörn Franke

in cloud, communication, tech, webrtc

I recently started a small project to create a sample enterprise Big Data web application using Spring. You can find the source code here and a demonstration here. One feature in this application WebRTC. I started working with WebRTC since its introduction around 2011/2012. Now, it became a W3C standard and has been implemented in…

Autor: Jörn Franke