big-data

Hadoop Hands-On (Process Mining with Hadoop)

(Last updated: 31 January 2020) This blog post is a supplement to the Hadoop instruction in Introduction to Data Science at RWTH Aachen. It covers three questions: What is the Hadoop Distributed File System (HDFS), and how can we use it? What is Hadoop MapReduce, and how can we use it? How can we use Hadoop to apply process mining techniques to an event log with billions of events? We are living in the world of big data.

The (easiest) Hadoop installation - MacOS/Linux

(Last updated: 24 January 2020, 14:00) This blog post is a supplement to the Hadoop instruction in Introduction to Data Science at RWTH Aachen. The goal is to make sure that everyone can run Hadoop on their own PC, so I explain how to install Hadoop by taking the simplest approach. This is not a succinct way to install Hadoop (if you are already familiar with how system-related things work, I recommend looking for other blog posts).

The (easiest) Hadoop installation - Windows

(Last updated: 24 January 2020, 14:00) This blog post is a supplement to the Hadoop instruction in Introduction to Data Science at RWTH Aachen. The goal is to make sure that everyone can run Hadoop on their own PC, so I explain how to install Hadoop by taking the simplest approach. This is not a succinct way to install Hadoop (if you are already familiar with how system-related things work, I recommend looking for other blog posts).