Přeskočit na obsah

Hadoop

Hadoop is a popular open-source framework for distributed storage and distributed processing of large volumes of data, especially where the processing involves a MapReduce algorithm. Unlike traditional cluster resources in MetaCentrum, which assume just-in-time stage-in of data before processing, the Hadoop environment is intended for mid- to long-term storage/collection of data to be processed.
Aside of a large native Hadoop cluster extensible on demand with additional virtual nodes, MetaCentrum can also offer purely virtual Hadoop PaaS clusters for prototyping your solutions prior to large-scale deployment.

How to Enroll

The service is intended for MetaCentrum users and registration is subject to approval:

  1. Register with MetaCentrum (unless you are already a memeber).
  2. Apply for access to Hadoop.

User Documentation and Support

Current Hardware Resources

Currently it is possible to launch Hadoop from within the Metacentrum cloud infrastructure.

Last changed:Fri Aug 12 11:57:28 CEST 2022