Distributed File System

Data-intensive Applications may not use Database Managments Systems (DBMS) at all. Hadoop is a processing Framework for running operations on data that is stored in files. Hadoop is suitable for map-reduce operations. It is possible to run Hadoop on EC2 and S3.

Edit tutorial

Comment on This Data Unit